Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes

H Bjørn Nielsen; Mathieu Almeida; Agnieszka Juncker; Simon Rasmussen; Junhua Li; Shinichi Sunagawa; Damian Rafal Plichta; Laurent Gautier; Anders G. Pedersen; Emmanuelle Le Chatelier; Eric Pelletier; Ida Bonde; Trine Nielsen; Chaysavanh Manichanh; Manimozhiyan Arumugam; Jean-Michel Batto; Marcelo Bertalan Quintanilha dos Santos; Nikolaj Blom; Natalia Borruel; Kristoffer Sølvsten Burgdorf; Fouad Boumezbeur; Francesc Casellas; Joël Doré; Piotr Dworzynski; Francisco Guarner; Torben Hansen; Falk Hildebrand; Rolf Sommer Kaas; Sean Kennedy; Karsten Kristiansen; Jens Roat Kultima; Pierre Léonard; Florence Levenez; Ole Lund; Bouziane Moumen; Denis Le Paslier; Nicolas Pons; Oluf Borbye Pedersen; Edi Prifti; Junjie Qin; Jeroen Raes; Søren Johannes Sørensen; Julien Tap; Sebastian Tims; David Ussery; Takuji Yamada; Pierre Renault; Thomas Sicheritz-Pontén; Peer Bork; Jun Wang; MetaHIT Consortium

doi:10.1038/nbt.2939

Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes

H Bjørn Nielsen, Mathieu Almeida, Agnieszka Juncker, Simon Rasmussen, Junhua Li, Shinichi Sunagawa, Damian Rafal Plichta, Laurent Gautier, Anders G. Pedersen, Emmanuelle Le Chatelier, Eric Pelletier, Ida Bonde, Trine Nielsen, Chaysavanh Manichanh, Manimozhiyan Arumugam, Jean-Michel Batto, Marcelo Bertalan Quintanilha dos Santos, Nikolaj Blom, Natalia Borruel, Kristoffer Sølvsten BurgdorfFouad Boumezbeur, Francesc Casellas, Joël Doré, Piotr Dworzynski, Francisco Guarner, Torben Hansen, Falk Hildebrand, Rolf Sommer Kaas, Sean Kennedy, Karsten Kristiansen, Jens Roat Kultima, Pierre Léonard, Florence Levenez, Ole Lund, Bouziane Moumen, Denis Le Paslier, Nicolas Pons, Oluf Borbye Pedersen, Edi Prifti, Junjie Qin, Jeroen Raes, Søren Johannes Sørensen, Julien Tap, Sebastian Tims, David Ussery, Takuji Yamada, Pierre Renault, Thomas Sicheritz-Pontén, Peer Bork, Jun Wang, MetaHIT Consortium

454 Citationer (Scopus)

Abstract

Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.

Originalsprog	Engelsk
Tidsskrift	Nature Biotechnology
Vol/bind	32
Udgave nummer	8
Sider (fra-til)	822–828
Antal sider	7
ISSN	1087-0156
DOI	https://doi.org/10.1038/nbt.2939
Status	Udgivet - 1 aug. 2014

FN’s Verdensmål

Dette resultat bidrager til følgende verdensmål

Adgang til dokumentet

10.1038/nbt.2939

Citationsformater

Nielsen, H. B., Almeida, M., Juncker, A., Rasmussen, S., Li, J., Sunagawa, S., Plichta, D. R., Gautier, L., Pedersen, A. G., Le Chatelier, E., Pelletier, E., Bonde, I., Nielsen, T., Manichanh, C., Arumugam, M., Batto, J.-M., dos Santos, M. B. Q., Blom, N., Borruel, N., ... MetaHIT Consortium (2014). Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes. Nature Biotechnology, 32(8), 822–828. https://doi.org/10.1038/nbt.2939

Nielsen, HB, Almeida, M, Juncker, A, Rasmussen, S, Li, J, Sunagawa, S, Plichta, DR, Gautier, L, Pedersen, AG, Le Chatelier, E, Pelletier, E, Bonde, I, Nielsen, T, Manichanh, C, Arumugam, M, Batto, J-M, dos Santos, MBQ, Blom, N, Borruel, N, Burgdorf, KS, Boumezbeur, F, Casellas, F, Doré, J, Dworzynski, P, Guarner, F, Hansen, T, Hildebrand, F, Kaas, RS, Kennedy, S, Kristiansen, K, Kultima, JR, Léonard, P, Levenez, F, Lund, O, Moumen, B, Le Paslier, D, Pons, N, Pedersen, OB, Prifti, E, Qin, J, Raes, J, Sørensen, SJ, Tap, J, Tims, S, Ussery, D, Yamada, T, Renault, P, Sicheritz-Pontén, T, Bork, P, Wang, J & MetaHIT Consortium 2014, 'Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes', Nature Biotechnology, bind 32, nr. 8, s. 822–828. https://doi.org/10.1038/nbt.2939

@article{34a8c77b105b4de6aa31826afb6e7652,

title = "Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes",

abstract = "Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.",

author = "Nielsen, {H Bj{\o}rn} and Mathieu Almeida and Agnieszka Juncker and Simon Rasmussen and Junhua Li and Shinichi Sunagawa and Plichta, {Damian Rafal} and Laurent Gautier and Pedersen, {Anders G.} and {Le Chatelier}, Emmanuelle and Eric Pelletier and Ida Bonde and Trine Nielsen and Chaysavanh Manichanh and Manimozhiyan Arumugam and Jean-Michel Batto and {dos Santos}, {Marcelo Bertalan Quintanilha} and Nikolaj Blom and Natalia Borruel and Burgdorf, {Kristoffer S{\o}lvsten} and Fouad Boumezbeur and Francesc Casellas and Jo{\"e}l Dor{\'e} and Piotr Dworzynski and Francisco Guarner and Torben Hansen and Falk Hildebrand and Kaas, {Rolf Sommer} and Sean Kennedy and Karsten Kristiansen and Kultima, {Jens Roat} and Pierre L{\'e}onard and Florence Levenez and Ole Lund and Bouziane Moumen and {Le Paslier}, Denis and Nicolas Pons and Pedersen, {Oluf Borbye} and Edi Prifti and Junjie Qin and Jeroen Raes and S{\o}rensen, {S{\o}ren Johannes} and Julien Tap and Sebastian Tims and David Ussery and Takuji Yamada and Pierre Renault and Thomas Sicheritz-Pont{\'e}n and Peer Bork and Jun Wang and {MetaHIT Consortium}",

year = "2014",

month = aug,

day = "1",

doi = "10.1038/nbt.2939",

language = "English",

volume = "32",

pages = "822–828",

journal = "Nature Biotechnology",

issn = "1087-0156",

publisher = "nature publishing group",

number = "8",

}

TY - JOUR

T1 - Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes

AU - Nielsen, H Bjørn

AU - Almeida, Mathieu

AU - Juncker, Agnieszka

AU - Rasmussen, Simon

AU - Li, Junhua

AU - Sunagawa, Shinichi

AU - Plichta, Damian Rafal

AU - Gautier, Laurent

AU - Pedersen, Anders G.

AU - Le Chatelier, Emmanuelle

AU - Pelletier, Eric

AU - Bonde, Ida

AU - Nielsen, Trine

AU - Manichanh, Chaysavanh

AU - Arumugam, Manimozhiyan

AU - Batto, Jean-Michel

AU - dos Santos, Marcelo Bertalan Quintanilha

AU - Blom, Nikolaj

AU - Borruel, Natalia

AU - Burgdorf, Kristoffer Sølvsten

AU - Boumezbeur, Fouad

AU - Casellas, Francesc

AU - Doré, Joël

AU - Dworzynski, Piotr

AU - Guarner, Francisco

AU - Hansen, Torben

AU - Hildebrand, Falk

AU - Kaas, Rolf Sommer

AU - Kennedy, Sean

AU - Kristiansen, Karsten

AU - Kultima, Jens Roat

AU - Léonard, Pierre

AU - Levenez, Florence

AU - Lund, Ole

AU - Moumen, Bouziane

AU - Le Paslier, Denis

AU - Pons, Nicolas

AU - Pedersen, Oluf Borbye

AU - Prifti, Edi

AU - Qin, Junjie

AU - Raes, Jeroen

AU - Sørensen, Søren Johannes

AU - Tap, Julien

AU - Tims, Sebastian

AU - Ussery, David

AU - Yamada, Takuji

AU - Renault, Pierre

AU - Sicheritz-Pontén, Thomas

AU - Bork, Peer

AU - Wang, Jun

AU - MetaHIT Consortium

PY - 2014/8/1

Y1 - 2014/8/1

N2 - Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.

AB - Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.

U2 - 10.1038/nbt.2939

DO - 10.1038/nbt.2939

M3 - Journal article

C2 - 24997787

SN - 1087-0156

VL - 32

SP - 822

EP - 828

JO - Nature Biotechnology

JF - Nature Biotechnology

IS - 8

ER -

Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes

Abstract

FN’s Verdensmål

Adgang til dokumentet

Fingeraftryk

Citationsformater