[HTML][HTML] Using hybridization networks to retrace the evolution of Indo-European languages

M Willems, E Lord, L Laforest, G Labelle… - BMC Evolutionary …, 2016 - Springer
M Willems, E Lord, L Laforest, G Labelle, FJ Lapointe, AM Di Sciullo, V Makarenkov
BMC Evolutionary Biology, 2016Springer
Background Curious parallels between the processes of species and language evolution
have been observed by many researchers. Retracing the evolution of Indo-European (IE)
languages remains one of the most intriguing intellectual challenges in historical linguistics.
Most of the IE language studies use the traditional phylogenetic tree model to represent the
evolution of natural languages, thus not taking into account reticulate evolutionary events,
such as language hybridization and word borrowing which can be associated with species …
Background
Curious parallels between the processes of species and language evolution have been observed by many researchers. Retracing the evolution of Indo-European (IE) languages remains one of the most intriguing intellectual challenges in historical linguistics. Most of the IE language studies use the traditional phylogenetic tree model to represent the evolution of natural languages, thus not taking into account reticulate evolutionary events, such as language hybridization and word borrowing which can be associated with species hybridization and horizontal gene transfer, respectively. More recently, implicit evolutionary networks, such as split graphs and minimal lateral networks, have been used to account for reticulate evolution in linguistics.
Results
Striking parallels existing between the evolution of species and natural languages allowed us to apply three computational biology methods for reconstruction of phylogenetic networks to model the evolution of IE languages. We show how the transfer of methods between the two disciplines can be achieved, making necessary methodological adaptations. Considering basic vocabulary data from the well-known Dyen’s lexical database, which contains word forms in 84 IE languages for the meanings of a 200-meaning Swadesh list, we adapt a recently developed computational biology algorithm for building explicit hybridization networks to study the evolution of IE languages and compare our findings to the results provided by the split graph and galled network methods.
Conclusion
We conclude that explicit phylogenetic networks can be successfully used to identify donors and recipients of lexical material as well as the degree of influence of each donor language on the corresponding recipient languages. We show that our algorithm is well suited to detect reticulate relationships among languages, and present some historical and linguistic justification for the results obtained. Our findings could be further refined if relevant syntactic, phonological and morphological data could be analyzed along with the available lexical data.
Springer