Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes

Tune Hannes Pers; Niclas Tue Hansen; Kasper Lage Hansen; Pernille Koefoed; Piotr Dworzynski; Martin Lee Miller; Tracey J Flint; Erling Mellerup; Henrik Dam; Ole A Andreassen; Srdjan Djurovic; Ingrid Melle; Anders D Børglum; Thomas Werge; Shaun Purcell; Manuel A Ferreira; Irene Kouskoumvekaki; Christopher Workman; Torben Hansen; Ole Mors; Søren Brunak

doi:10.1002/gepi.20580

Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes

Tune Hannes Pers, Niclas Tue Hansen, Kasper Lage Hansen, Pernille Koefoed, Piotr Dworzynski, Martin Lee Miller, Tracey J Flint, Erling Mellerup, Henrik Dam, Ole A Andreassen, Srdjan Djurovic, Ingrid Melle, Anders D Børglum, Thomas Werge, Shaun Purcell, Manuel A Ferreira, Irene Kouskoumvekaki, Christopher Workman, Torben Hansen, Ole MorsSøren Brunak

28 Citations (Scopus)

Abstract

Meta-analyses of large-scale association studies typically proceed solely within one data type and do not exploit the potential complementarities in other sources of molecular evidence. Here, we present an approach to combine heterogeneous data from genome-wide association (GWA) studies, protein-protein interaction screens, disease similarity, linkage studies, and gene expression experiments into a multi-layered evidence network which is used to prioritize the entire protein-coding part of the genome identifying a shortlist of candidate genes. We report specifically results on bipolar disorder, a genetically complex disease where GWA studies have only been moderately successful. We validate one such candidate experimentally, YWHAH, by genotyping five variations in 640 patients and 1,377 controls. We found a significant allelic association for the rs1049583 polymorphism in YWHAH (adjusted P = 5.6e-3) with an odds ratio of 1.28 [1.12-1.48], which replicates a previous case-control study. In addition, we demonstrate our approach's general applicability by use of type 2 diabetes data sets. The method presented augments moderately powered GWA data, and represents a validated, flexible, and publicly available framework for identifying risk genes in highly polygenic diseases. The method is made available as a web service at

Original language	English
Journal	Genetic Epidemiology
Volume	35
Issue number	5
Pages (from-to)	318-332
Number of pages	5
ISSN	0741-0395
DOIs	https://doi.org/10.1002/gepi.20580
Publication status	Published - Jul 2011

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1002/gepi.20580

Cite this

Pers, T. H., Hansen, N. T., Hansen, K. L., Koefoed, P., Dworzynski, P., Miller, M. L., Flint, T. J., Mellerup, E., Dam, H., Andreassen, O. A., Djurovic, S., Melle, I., Børglum, A. D., Werge, T., Purcell, S., Ferreira, M. A., Kouskoumvekaki, I., Workman, C., Hansen, T., ... Brunak, S. (2011). Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes. Genetic Epidemiology, 35(5), 318-332. https://doi.org/10.1002/gepi.20580

Pers, TH, Hansen, NT, Hansen, KL, Koefoed, P, Dworzynski, P, Miller, ML, Flint, TJ, Mellerup, E, Dam, H, Andreassen, OA, Djurovic, S, Melle, I, Børglum, AD, Werge, T, Purcell, S, Ferreira, MA, Kouskoumvekaki, I, Workman, C, Hansen, T, Mors, O & Brunak, S 2011, 'Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes', Genetic Epidemiology, vol. 35, no. 5, pp. 318-332. https://doi.org/10.1002/gepi.20580

@article{0e10d0fd01a74000860ab5b548d26449,

title = "Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes",

abstract = "Meta-analyses of large-scale association studies typically proceed solely within one data type and do not exploit the potential complementarities in other sources of molecular evidence. Here, we present an approach to combine heterogeneous data from genome-wide association (GWA) studies, protein-protein interaction screens, disease similarity, linkage studies, and gene expression experiments into a multi-layered evidence network which is used to prioritize the entire protein-coding part of the genome identifying a shortlist of candidate genes. We report specifically results on bipolar disorder, a genetically complex disease where GWA studies have only been moderately successful. We validate one such candidate experimentally, YWHAH, by genotyping five variations in 640 patients and 1,377 controls. We found a significant allelic association for the rs1049583 polymorphism in YWHAH (adjusted P = 5.6e-3) with an odds ratio of 1.28 [1.12-1.48], which replicates a previous case-control study. In addition, we demonstrate our approach's general applicability by use of type 2 diabetes data sets. The method presented augments moderately powered GWA data, and represents a validated, flexible, and publicly available framework for identifying risk genes in highly polygenic diseases. The method is made available as a web service at",

author = "Pers, {Tune Hannes} and Hansen, {Niclas Tue} and Hansen, {Kasper Lage} and Pernille Koefoed and Piotr Dworzynski and Miller, {Martin Lee} and Flint, {Tracey J} and Erling Mellerup and Henrik Dam and Andreassen, {Ole A} and Srdjan Djurovic and Ingrid Melle and B{\o}rglum, {Anders D} and Thomas Werge and Shaun Purcell and Ferreira, {Manuel A} and Irene Kouskoumvekaki and Christopher Workman and Torben Hansen and Ole Mors and S{\o}ren Brunak",

note = "{\textcopyright} 2011 Wiley-Liss, Inc.",

year = "2011",

month = jul,

doi = "10.1002/gepi.20580",

language = "English",

volume = "35",

pages = "318--332",

journal = "Genetic Epidemiology",

issn = "0741-0395",

publisher = "JohnWiley & Sons, Inc.",

number = "5",

}

TY - JOUR

T1 - Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes

AU - Pers, Tune Hannes

AU - Hansen, Niclas Tue

AU - Hansen, Kasper Lage

AU - Koefoed, Pernille

AU - Dworzynski, Piotr

AU - Miller, Martin Lee

AU - Flint, Tracey J

AU - Mellerup, Erling

AU - Dam, Henrik

AU - Andreassen, Ole A

AU - Djurovic, Srdjan

AU - Melle, Ingrid

AU - Børglum, Anders D

AU - Werge, Thomas

AU - Purcell, Shaun

AU - Ferreira, Manuel A

AU - Kouskoumvekaki, Irene

AU - Workman, Christopher

AU - Hansen, Torben

AU - Mors, Ole

AU - Brunak, Søren

PY - 2011/7

Y1 - 2011/7

N2 - Meta-analyses of large-scale association studies typically proceed solely within one data type and do not exploit the potential complementarities in other sources of molecular evidence. Here, we present an approach to combine heterogeneous data from genome-wide association (GWA) studies, protein-protein interaction screens, disease similarity, linkage studies, and gene expression experiments into a multi-layered evidence network which is used to prioritize the entire protein-coding part of the genome identifying a shortlist of candidate genes. We report specifically results on bipolar disorder, a genetically complex disease where GWA studies have only been moderately successful. We validate one such candidate experimentally, YWHAH, by genotyping five variations in 640 patients and 1,377 controls. We found a significant allelic association for the rs1049583 polymorphism in YWHAH (adjusted P = 5.6e-3) with an odds ratio of 1.28 [1.12-1.48], which replicates a previous case-control study. In addition, we demonstrate our approach's general applicability by use of type 2 diabetes data sets. The method presented augments moderately powered GWA data, and represents a validated, flexible, and publicly available framework for identifying risk genes in highly polygenic diseases. The method is made available as a web service at

AB - Meta-analyses of large-scale association studies typically proceed solely within one data type and do not exploit the potential complementarities in other sources of molecular evidence. Here, we present an approach to combine heterogeneous data from genome-wide association (GWA) studies, protein-protein interaction screens, disease similarity, linkage studies, and gene expression experiments into a multi-layered evidence network which is used to prioritize the entire protein-coding part of the genome identifying a shortlist of candidate genes. We report specifically results on bipolar disorder, a genetically complex disease where GWA studies have only been moderately successful. We validate one such candidate experimentally, YWHAH, by genotyping five variations in 640 patients and 1,377 controls. We found a significant allelic association for the rs1049583 polymorphism in YWHAH (adjusted P = 5.6e-3) with an odds ratio of 1.28 [1.12-1.48], which replicates a previous case-control study. In addition, we demonstrate our approach's general applicability by use of type 2 diabetes data sets. The method presented augments moderately powered GWA data, and represents a validated, flexible, and publicly available framework for identifying risk genes in highly polygenic diseases. The method is made available as a web service at

U2 - 10.1002/gepi.20580

DO - 10.1002/gepi.20580

M3 - Journal article

C2 - 21484861

SN - 0741-0395

VL - 35

SP - 318

EP - 332

JO - Genetic Epidemiology

JF - Genetic Epidemiology

IS - 5

ER -

Meta-analysis of heterogeneous data sources for genome-scale identification of risk genes in complex phenotypes

Abstract

UN SDGs

Access to Document

Fingerprint

Cite this