The lengths of admixture tracts

Mason Liang; Rasmus Nielsen

doi:10.1534/genetics.114.162362

Corresponding author: Mason Liang

Bioinformatics and RNA Biology

57 citations (Scopus)

Abstract

The distribution of admixture tract lengths has received considerable attention, in part because it can be used to infer the timing of past gene flow events between populations. It is commonly assumed that these lengths can be modeled as independently and identically distributed (iid) exponential random variables. This assumption is fundamental for many popular methods that analyze admixture using hidden Markov models. We compare the expected distribution of admixture tract lengths under a number of population-genetic models to the distribution predicted by the Wright-Fisher model with recombination. We show that under the latter model, the assumption of iid exponential tract lengths does not hold for recent or for ancient admixture events and that relying on this assumption can lead to false positives when inferring the number of admixture events. To further investigate the tract-length distribution, we develop a dyadic interval-based stochastic process for generating admixture tracts. This representation is useful for analyzing admixture tract-length distributions for populations with recent admixture, a scenario in which existing models perform poorly.

Original language	English
Journal	Genetics
Volume	197
Issue number	3
Pages (from-to)	953-967
Number of pages	15
ISSN	0016-6731
DOI	https://doi.org/10.1534/genetics.114.162362
Status	Published - 1 Jan 2014

Access to document

10.1534/genetics.114.162362

Other files and links

Link to publication in Scopus

Citation formats

@article{57ba854817e547ef8f335cecc1558b6d,
  title = "The lengths of admixture tracts",
  abstract = "The distribution of admixture tract lengths has received considerable attention, in part because it can be used to infer the timing of past gene flow events between populations. It is commonly assumed that these lengths can be modeled as independently and identically distributed (iid) exponential random variables. This assumption is fundamental for many popular methods that analyze admixture using hidden Markov models. We compare the expected distribution of admixture tract lengths under a number of population-genetic models to the distribution predicted by the Wright-Fisher model with recombination. We show that under the latter model, the assumption of iid exponential tract lengths does not hold for recent or for ancient admixture events and that relying on this assumption can lead to false positives when inferring the number of admixture events. To further investigate the tract-length distribution, we develop a dyadic interval-based stochastic process for generating admixture tracts. This representation is useful for analyzing admixture tract-length distributions for populations with recent admixture, a scenario in which existing models perform poorly.",
  author = "Mason Liang and Rasmus Nielsen",
  year = "2014",
  month = jan,
  day = "1",
  doi = "10.1534/genetics.114.162362",
  language = "English",
  volume = "197",
  pages = "953--967",
  journal = "Genetics",
  issn = "1943-2631",
  publisher = "The Genetics Society of America (GSA)",
  number = "3",
}

TY  - JOUR
T1  - The lengths of admixture tracts
AU  - Liang, Mason
AU  - Nielsen, Rasmus
PY  - 2014/1/1
Y1  - 2014/1/1
N2  - The distribution of admixture tract lengths has received considerable attention, in part because it can be used to infer the timing of past gene flow events between populations. It is commonly assumed that these lengths can be modeled as independently and identically distributed (iid) exponential random variables. This assumption is fundamental for many popular methods that analyze admixture using hidden Markov models. We compare the expected distribution of admixture tract lengths under a number of population-genetic models to the distribution predicted by the Wright-Fisher model with recombination. We show that under the latter model, the assumption of iid exponential tract lengths does not hold for recent or for ancient admixture events and that relying on this assumption can lead to false positives when inferring the number of admixture events. To further investigate the tract-length distribution, we develop a dyadic interval-based stochastic process for generating admixture tracts. This representation is useful for analyzing admixture tract-length distributions for populations with recent admixture, a scenario in which existing models perform poorly.
AB  - The distribution of admixture tract lengths has received considerable attention, in part because it can be used to infer the timing of past gene flow events between populations. It is commonly assumed that these lengths can be modeled as independently and identically distributed (iid) exponential random variables. This assumption is fundamental for many popular methods that analyze admixture using hidden Markov models. We compare the expected distribution of admixture tract lengths under a number of population-genetic models to the distribution predicted by the Wright-Fisher model with recombination. We show that under the latter model, the assumption of iid exponential tract lengths does not hold for recent or for ancient admixture events and that relying on this assumption can lead to false positives when inferring the number of admixture events. To further investigate the tract-length distribution, we develop a dyadic interval-based stochastic process for generating admixture tracts. This representation is useful for analyzing admixture tract-length distributions for populations with recent admixture, a scenario in which existing models perform poorly.
UR  - http://www.scopus.com/inward/record.url?scp=84904260921&partnerID=8YFLogxK
U2  - 10.1534/genetics.114.162362
DO  - 10.1534/genetics.114.162362
M3  - Journal article
C2  - 24770332
AN  - SCOPUS:84904260921
SN  - 1943-2631
VL  - 197
SP  - 953
EP  - 967
JO  - Genetics
JF  - Genetics
IS  - 3
ER  -
