ORCID Profile
0000-0001-6765-3813
Current Organisation
The University of Auckland
Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.
Publisher: American Association for the Advancement of Science (AAAS)
Date: 28-07-2023
Abstract: The origins of the Indo-European language family are hotly disputed. Bayesian phylogenetic analyses of core vocabulary have produced conflicting results, with some supporting a farming expansion out of Anatolia ~9000 years before present (yr B.P.), while others support a spread with horse-based pastoralism out of the Pontic-Caspian Steppe ~6000 yr B.P. Here we present an extensive database of Indo-European core vocabulary that eliminates past inconsistencies in cognate coding. Ancestry-enabled phylogenetic analysis of this dataset indicates that few ancient languages are direct ancestors of modern clades and produces a root age of ~8120 yr B.P. for the family. Although this date is not consistent with the Steppe hypothesis, it does not rule out an initial homeland south of the Caucasus, with a subsequent branch northward onto the steppe and then across Europe. We reconcile this hybrid hypothesis with recently published ancient DNA evidence from the steppe and the northern Fertile Crescent.
Publisher: Proceedings of the National Academy of Sciences
Date: 08-2022
Abstract: The Bantu expansion transformed the linguistic, economic, and cultural composition of sub-Saharan Africa. However, the exact dates and routes taken by the ancestors of the speakers of the more than 500 current Bantu languages remain uncertain. Here, we use the recently developed “break-away” geographical diffusion model, specially designed for modeling migrations, with “augmented” geographic information, to reconstruct the Bantu language family expansion. This Bayesian phylogeographic approach with augmented geographical data provides a powerful way of linking linguistic, archaeological, and genetic data to test hypotheses about large language family expansions. We compare four hypotheses: an early major split north of the rainforest a migration through the Sangha River Interval corridor around 2,500 BP a coastal migration around 4,000 BP and a migration through the rainforest before the corridor opening, at 4,000 BP. Our results produce a topology and timeline for the Bantu language family, which supports the hypothesis of an expansion through Central African tropical forests at 4,420 BP (4,040 to 5,000 95% highest posterior density interval), well before the Sangha River Interval was open.
Publisher: Springer Science and Business Media LLC
Date: 10-11-2021
DOI: 10.1038/S41586-021-04108-8
Abstract: The origin and early dispersal of speakers of Transeurasian languages—that is, Japanese, Korean, Tungusic, Mongolic and Turkic—is among the most disputed issues of Eurasian population history 1–3 . A key problem is the relationship between linguistic dispersals, agricultural expansions and population movements 4,5 . Here we address this question by ‘triangulating’ genetics, archaeology and linguistics in a unified perspective. We report wide-ranging datasets from these disciplines, including a comprehensive Transeurasian agropastoral and basic vocabulary an archaeological database of 255 Neolithic–Bronze Age sites from Northeast Asia and a collection of ancient genomes from Korea, the Ryukyu islands and early cereal farmers in Japan, complementing previously published genomes from East Asia. Challenging the traditional ‘pastoralist hypothesis’ 6–8 , we show that the common ancestry and primary dispersals of Transeurasian languages can be traced back to the first farmers moving across Northeast Asia from the Early Neolithic onwards, but that this shared heritage has been masked by extensive cultural interaction since the Bronze Age. As well as marking considerable progress in the three in idual disciplines, by combining their converging evidence we show that the early spread of Transeurasian speakers was driven by agriculture.
Publisher: Pleiades Publishing Ltd
Date: 08-2019
Publisher: The Royal Society
Date: 03-2018
DOI: 10.1098/RSOS.171504
Abstract: The Dravidian language family consists of about 80 varieties (Hammarström H. 2016 Glottolog 2.7 ) spoken by 220 million people across southern and central India and surrounding countries (Steever SB. 1998 In The Dravidian languages (ed. SB Steever), pp. 1–39: 1). Neither the geographical origin of the Dravidian language homeland nor its exact dispersal through time are known. The history of these languages is crucial for understanding prehistory in Eurasia, because despite their current restricted range, these languages played a significant role in influencing other language groups including Indo-Aryan (Indo-European) and Munda (Austroasiatic) speakers. Here, we report the results of a Bayesian phylogenetic analysis of cognate-coded lexical data, elicited first hand from native speakers, to investigate the subgrouping of the Dravidian language family, and provide dates for the major points of ersification. Our results indicate that the Dravidian language family is approximately 4500 years old, a finding that corresponds well with earlier linguistic and archaeological studies. The main branches of the Dravidian language family (North, Central, South I, South II) are recovered, although the placement of languages within these main branches erges from previous classifications. We find considerable uncertainty with regard to the relationships between the main branches.
Publisher: Cold Spring Harbor Laboratory
Date: 07-10-2022
DOI: 10.1101/2022.10.05.510045
Abstract: In a recent study we used an interdisciplinary approach combining linguistics, archaeology and genetics to analyse the Transeurasian languages 1 . Our analysis concluded that the early dispersals of these languages were driven by agriculture. A preprint published on this server presents objections to the Transeurasian hypothesis and its association with farming dispersals 2 . However, close inspection of that text reveals numerous misinterpretations and inconsistencies. In the interest of furthering scientific debate over Transeurasian language and population history, we address the critiques, revising datasets and fine-tuning approaches. The linguistic critique questions the quantity and quality of our datasets. Here we show that the number of surviving cognate sets for Transeurasian is in line with that for well-established language families. In addition, we find that Tian et al.’s failure to reject a core of regularly corresponding cognates in the basic vocabulary creates ground for a consensus about the genealogical relatability of the Transeurasian languages. The archaeological critique attempts a re-analysis of one Bayesian test using re-scored data only for northern China. Over half of the suggested re-scorings contain inconsistencies and it is not explained why the re-analysis retains the original data for sites outside northern China, comprising almost 60% of the total. More importantly, the sweeping claim that there is no evidence supporting the prehistoric migrations analysed in our study is not backed by any discussion of the archaeological record. With respect to genetics, the preprint claims a re-analysis showing that the data ‘do not conclusively support the farming-driven dispersal of Turkic, Mongolian, and Tungusic, nor the two-wave spread of farming to Korea.’ In fact, the only genetic re-analysis presented is limited to s les from Korea and Japan and does not contradict our original conservative modelling of Neolithic in iduals with Hongshan and our Bronze Age ones with Upper Xiajiadian. In sum, in bringing multiple lines of evidence together through triangulation, we gained a more balanced and richer understanding of Transeurasian dispersals than each discipline could provide in idually. Our research doubtless leaves room for improvement but we remain confident that triangulation did not ‘fail’, but rather brought us a step closer to understanding the history of the Transeurasian languages.
Publisher: Public Library of Science (PLoS)
Date: 08-04-2019
Publisher: Research Square Platform LLC
Date: 15-07-2021
DOI: 10.21203/RS.3.RS-255765/V1
Abstract: The origin and early dispersal of speakers of Transeurasian languages, i.e., Japanese, Korean, Tungusic, Mongolic and Turkic, is among the most disputed issues of Eurasian population history. A key problem is the relationship between linguistic dispersals, agricultural expansions and population movements. Here we address this question through ‘triangulating’ genetics, archaeology and linguistics in a unified perspective. We report new, wide-ranging datasets from these disciplines, including the most comprehensive Transeurasian agropastoral and basic vocabulary presented to date, an archaeological database of 255 Neolithic and Bronze Age sites from Northeast Asia, and the first collection of ancient genomes from Korea, the Ryukyu islands and early cereal farmers in Japan, complementing previously published genomes from East Asia. Challenging the traditional ‘Pastoralist Hypothesis’, we show that the common ancestry and primary dispersals of Transeurasian languages can be traced back to the first farmers moving across Northeast Asia from the Early Neolithic onwards, but that this shared heritage has been masked by extensive cultural interaction since the Bronze Age. As well as marking significant progress in the three in idual disciplines, by combining their converging evidence, we show that the early spread of Transeurasian speakers was driven by agriculture.
Publisher: Cold Spring Harbor Laboratory
Date: 19-11-2018
DOI: 10.1101/474296
Abstract: Elaboration of Bayesian phylogenetic inference methods has continued at pace in recent years with major new advances in nearly all aspects of the joint modelling of evolutionary data. It is increasingly appreciated that some evolutionary questions can only be adequately answered by combining evidence from multiple independent sources of data, including genome sequences, s ling dates, phenotypic data, radiocarbon dates, fossil occurrences, and biogeographic range information among others. Including all relevant data into a single joint model is very challenging both conceptually and computationally. Advanced computational software packages that allow robust development of compatible (sub-)models which can be composed into a full model hierarchy have played a key role in these developments. Developing such software frameworks is increasingly a major scientific activity in its own right, and comes with specific challenges, from practical software design, development and engineering challenges to statistical and conceptual modelling challenges. BEAST 2 is one such computational software platform, and was first announced over 4 years ago. Here we describe a series of major new developments in the BEAST 2 core platform and model hierarchy that have occurred since the first release of the software, culminating in the recent 2.5 release. Bayesian phylogenetic inference methods have undergone considerable development in recent years, and joint modelling of rich evolutionary data, including genomes, phenotypes and fossil occurrences is increasingly common. Advanced computational software packages that allow robust development of compatible (sub-)models which can be composed into a full model hierarchy have played a key role in these developments. Developing scientific software is increasingly crucial to advancement in many fields of biology. The challenges range from practical software development and engineering, distributed team coordination, conceptual development and statistical modelling, to validation and testing. BEAST 2 is one such computational software platform for phylogenetics, population genetics and phylodynamics, and was first announced over 4 years ago. Here we describe the full range of new tools and models available on the BEAST 2.5 platform, which expand joint evolutionary inference in many new directions, especially for joint inference over multiple data types, non-tree models and complex phylodynamics.
No related grants have been discovered for Remco Bouckaert.