TY - JOUR
T1 - Genome-wide detection of chromosomal rearrangements, indels, and mutations in circular chromosomes by short read sequencing
AU - Skovgaard, Ole
AU - Bak, Mads
AU - Løbner-Olesen, Anders
AU - Tommerup, Niels
PY - 2011/8
Y1 - 2011/8
N2 - Whole-genome sequencing (WGS) with new short-read sequencing technologies has recently been applied for genome-wide identification of mutations. Genomic rearrangements have, however, often remained undetected by WGS, and additional analyses are required for their detection. Here, we have applied a combination of WGS and genome copy number analysis, for the identification of mutations that suppress the growth deficiency imposed by excessive initiations from the Escherichia coli origin of replication, oriC. The E. coli chromosome, like the majority of bacterial chromosomes, is circular, and DNA replication is initiated by assembling two replication complexes at the origin, oriC. These complexes then replicate the chromosome bidirectionally toward the terminus, ter. In a population of growing cells, this results in a copy number gradient, so that origin-proximal sequences are more frequent than origin-distal sequences. Major rearrangements in the chromosome are, therefore, readily identified by changes in copy number, i.e., certain sequences become over- or under-represented. Of the eight mutations analyzed in detail here, six were found to affect a single gene only, one was a large chromosomal inversion, and one was a large chromosomal duplication. The latter two mutations could not be detected solely by WGS, validating the present approach for identification of genomic rearrangements. We further suggest the use of copy number analysis in combination with WGS for validation of newly assembled bacterial chromosomes.
AB - Whole-genome sequencing (WGS) with new short-read sequencing technologies has recently been applied for genome-wide identification of mutations. Genomic rearrangements have, however, often remained undetected by WGS, and additional analyses are required for their detection. Here, we have applied a combination of WGS and genome copy number analysis, for the identification of mutations that suppress the growth deficiency imposed by excessive initiations from the Escherichia coli origin of replication, oriC. The E. coli chromosome, like the majority of bacterial chromosomes, is circular, and DNA replication is initiated by assembling two replication complexes at the origin, oriC. These complexes then replicate the chromosome bidirectionally toward the terminus, ter. In a population of growing cells, this results in a copy number gradient, so that origin-proximal sequences are more frequent than origin-distal sequences. Major rearrangements in the chromosome are, therefore, readily identified by changes in copy number, i.e., certain sequences become over- or under-represented. Of the eight mutations analyzed in detail here, six were found to affect a single gene only, one was a large chromosomal inversion, and one was a large chromosomal duplication. The latter two mutations could not be detected solely by WGS, validating the present approach for identification of genomic rearrangements. We further suggest the use of copy number analysis in combination with WGS for validation of newly assembled bacterial chromosomes.
U2 - 10.1101/gr.117416.110
DO - 10.1101/gr.117416.110
M3 - Journal article
C2 - 21555365
SN - 1088-9051
VL - 21
SP - 1388
EP - 1393
JO - Genome Research
JF - Genome Research
IS - 8
ER -