Abstract
Motivation: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in population genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases.
Results: Here we present fastNGSadmix, a tool to quickly and reliably estimate admixture proportions and perform PCA from next-generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth, and corrects for the biases introduced by small reference populations.
Original language | English |
---|---|
Journal | Bioinformatics |
Volume | 33 |
Issue number | 19 |
Pages (from-to) | 3148-3150 |
Number of pages | 3 |
ISSN | 1367-4803 |
DOI | |
Status | Published - 1 Oct 2017 |