fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample

Emil Jørsboe; Kristian Ebbesen Hanghøj; Anders Albrechtsen

doi:10.1093/bioinformatics/btx474

Emil Jørsboe^*, Kristian Ebbesen Hanghøj, Anders Albrechtsen

^*Corresponding author for this work

Bioinformatics and RNA Biology

11 Citations (Scopus)

Abstract

Motivation: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in population genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases.

Results: Here we present fastNGSadmix, a tool to quickly and reliably estimate admixture proportions and perform PCA from next-generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth, and corrects for the biases introduced by small reference populations.

Original language	English
Journal	Bioinformatics
Volume	33
Issue number	19
Pages (from-to)	3148-3150
Number of pages	3
ISSN	1367-4803
DOI	https://doi.org/10.1093/bioinformatics/btx474
Status	Published - 1 Oct 2017

Access to document

10.1093/bioinformatics/btx474

btx474.pdf: Publisher's published version, 186 KB

https://academic.oup.com/bioinformatics/article-pdf/33/19/3148/25164913/btx474.pdf

Other files and links

Link to publication in Scopus

Citation formats

@article{8f9959dd3b754a97971660253c255c83,
  title = "fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample",
  abstract = "Motivation: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in population genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases. Results: Here we present fastNGSadmix, a tool to quickly and reliably estimate admixture proportions and perform PCA from next-generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth, and corrects for the biases introduced by small reference populations.",
  author = "J{\o}rsboe, Emil and Hangh{\o}j, {Kristian Ebbesen} and Albrechtsen, Anders",
  year = "2017",
  month = oct,
  day = "1",
  doi = "10.1093/bioinformatics/btx474",
  language = "English",
  volume = "33",
  pages = "3148--3150",
  journal = "Bioinformatics",
  issn = "1367-4803",
  publisher = "Oxford University Press",
  number = "19",
}

TY  - JOUR
T1  - fastNGSadmix
T2  - admixture proportions and principal component analysis of a single NGS sample
AU  - Jørsboe, Emil
AU  - Hanghøj, Kristian Ebbesen
AU  - Albrechtsen, Anders
PY  - 2017/10/1
Y1  - 2017/10/1
N2  - Motivation: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in population genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases. Results: Here we present fastNGSadmix, a tool to quickly and reliably estimate admixture proportions and perform PCA from next-generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth, and corrects for the biases introduced by small reference populations.
AB  - Motivation: Estimation of admixture proportions and principal component analysis (PCA) are fundamental tools in population genetics. However, applying these methods to low- or mid-depth sequencing data without taking genotype uncertainty into account can introduce biases. Results: Here we present fastNGSadmix, a tool to quickly and reliably estimate admixture proportions and perform PCA from next-generation sequencing data of a single individual. The analyses are based on genotype likelihoods of the input sample and a set of predefined reference populations. The method has high accuracy, even at low sequencing depth, and corrects for the biases introduced by small reference populations.
UR  - http://www.scopus.com/inward/record.url?scp=85030685978&partnerID=8YFLogxK
U2  - 10.1093/bioinformatics/btx474
DO  - 10.1093/bioinformatics/btx474
M3  - Journal article
C2  - 28957500
AN  - SCOPUS:85030685978
SN  - 1367-4803
VL  - 33
SP  - 3148
EP  - 3150
JO  - Bioinformatics
JF  - Bioinformatics
IS  - 19
ER  -
