Unknown

Dataset Information

0

Testing methods of linguistic homeland detection using synthetic data.


ABSTRACT: Two families of quantitative methods have been used to infer geographical homelands of language families: Bayesian phylogeography and the 'diversity method'. Bayesian methods model how populations may have moved using a phylogenetic tree as a backbone, while the diversity method assumes that the geographical area where linguistic diversity is highest likely corresponds to the homeland. No systematic tests of the performances of the different methods in a linguistic context have so far been published. Here, we carry out performance testing by simulating language families, including branching structures and word lists, along with speaker populations moving in space. We test six different methods: two versions of BayesTraits; the relaxed random walk model of BEAST 2; our own RevBayes implementations of a fixed rate and a variable rates random walk model; and the diversity method. As a result of the tests, we propose a hierarchy of performance of the different methods. Factors such as geographical idiosyncrasies, incomplete sampling, tree imbalance and small family sizes all have a negative impact on performance, but mostly across the board, the performance hierarchy generally being impervious to such factors. This article is part of the theme issue 'Reconstructing prehistoric languages'.

SUBMITTER: Wichmann S 

PROVIDER: S-EPMC8059642 | biostudies-literature | 2021 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Testing methods of linguistic homeland detection using synthetic data.

Wichmann Søren S   Rama Taraka T  

Philosophical transactions of the Royal Society of London. Series B, Biological sciences 20210322 1824


Two families of quantitative methods have been used to infer geographical homelands of language families: Bayesian phylogeography and the 'diversity method'. Bayesian methods model how populations may have moved using a phylogenetic tree as a backbone, while the diversity method assumes that the geographical area where linguistic diversity is highest likely corresponds to the homeland. No systematic tests of the performances of the different methods in a linguistic context have so far been publi  ...[more]

Similar Datasets

| S-EPMC9583843 | biostudies-literature
| S-EPMC8961895 | biostudies-literature
| S-EPMC11649536 | biostudies-literature
| S-EPMC6822750 | biostudies-literature
| S-EPMC9204969 | biostudies-literature
| S-EPMC9395491 | biostudies-literature
| S-EPMC9019745 | biostudies-literature
| S-EPMC11801581 | biostudies-literature
2015-05-20 | E-GEOD-60633 | biostudies-arrayexpress
2023-12-25 | GSE227911 | GEO