Bayesian analysis of genetic association across tree-structured routine healthcare data in the UK Biobank

Adrian Cortes, Calliope A. Dendrou, Allan Motyer, Luke Jostins, Damjan Vukcevic, Alexander Dilthey, Peter Donnelly, Stephen Leslie, Lars Fugger, Gil McVean*

*Corresponding author for this work
25 Citations (Scopus)

Abstract

Genetic discovery from the multitude of phenotypes extractable from routine healthcare data can transform understanding of the human phenome and accelerate progress toward precision medicine. However, a critical question when analyzing high-dimensional and heterogeneous data is how best to interrogate increasingly specific subphenotypes while retaining statistical power to detect genetic associations. Here we develop and employ a new Bayesian analysis framework that exploits the hierarchical structure of diagnosis classifications to analyze genetic variants against UK Biobank disease phenotypes derived from self-reporting and hospital episode statistics. Our method displays a more than 20% increase in power to detect genetic effects over other approaches and identifies new associations between classical human leukocyte antigen (HLA) alleles and common immune-mediated diseases (IMDs). By applying the approach to genetic risk scores (GRSs), we show the extent of genetic sharing among IMDs and expose differences in disease perception or diagnosis with potential clinical implications.

Original languageEnglish
JournalNature Genetics
Volume49
Issue number9
Pages (from-to)1311-1318
Number of pages8
ISSN1061-4036
DOIs
Publication statusPublished - 2017

Fingerprint

Dive into the research topics of 'Bayesian analysis of genetic association across tree-structured routine healthcare data in the UK Biobank'. Together they form a unique fingerprint.

Cite this