One advantage of a large lexical database is that it lets you test large-scale ideas about how languages behave. As a quick experiment this afternoon, I extracted all the colexification patterns from the database, that is, all the cases where one word in a given language is glossed by several distinct words.* It took 20 minutes to download the file, and about the same again to manipulate it with the igraph package in R to produce some cluster visualisations.
*Of course, there are going to be issues with this, particularly the lack of colexification evidence for some languages. The data are only as complete and as good as the dictionaries that went into the database in the first place.
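For anyone curious what the extraction step looks like, here is a minimal sketch. It is not the R/igraph code I actually ran: it uses plain Python, the rows and the names `ROWS`, `colexification_edges` and `clusters` are all made up for illustration, and connected components stand in for whatever clustering igraph would do. The idea is simply to group glosses by word within each language, link every pair of glosses that share a word, and count how many languages support each link.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy rows: (language, word form, gloss).
# These are invented placeholders, not entries from the real database.
ROWS = [
    ("lang_a", "w1", "tree"),
    ("lang_a", "w1", "wood"),
    ("lang_a", "w2", "fire"),
    ("lang_b", "w3", "tree"),
    ("lang_b", "w3", "wood"),
    ("lang_b", "w3", "firewood"),
    ("lang_b", "w4", "sun"),
    ("lang_b", "w4", "day"),
    ("lang_c", "w5", "water"),
]


def colexification_edges(rows):
    """Map each pair of colexified glosses to the number of languages showing it."""
    # Collect the distinct glosses attached to each word within each language.
    glosses = defaultdict(set)
    for language, word, gloss in rows:
        glosses[(language, word)].add(gloss)

    # A word with two or more glosses links every pair of those glosses.
    edge_languages = defaultdict(set)
    for (language, _word), gloss_set in glosses.items():
        for a, b in combinations(sorted(gloss_set), 2):
            edge_languages[(a, b)].add(language)

    return {edge: len(langs) for edge, langs in edge_languages.items()}


def clusters(edges):
    """Group glosses into connected components, a crude stand-in for graph clustering."""
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)

    seen = set()
    result = []
    for start in sorted(neighbours):
        if start in seen:
            continue
        # Depth-first walk over everything reachable from this gloss.
        component = set()
        stack = [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(neighbours[node] - component)
        seen |= component
        result.append(sorted(component))
    return result


edges = colexification_edges(ROWS)
for group in clusters(edges):
    print(group)
```

Words with a single gloss (like the toy "fire" and "water" rows) contribute no edges, which is exactly the gap the footnote worries about: a language with thin dictionary coverage will look as if it has no colexification at all.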