TY - JOUR
T1 - Identification and classification of conserved RNA secondary structures in the human genome.
AU - Pedersen, Jakob Skou
AU - Bejerano, Gill
AU - Siepel, Adam
AU - Rosenbloom, Kate
AU - Lindblad-Toh, Kerstin
AU - Lander, Eric S
AU - Kent, Jim
AU - Miller, Webb
AU - Haussler, David
N1 - Keywords: 3' Untranslated Regions; Animals; Chickens; Computational Biology; Conserved Sequence; Dogs; Genome; Genome, Human; Humans; Mice; MicroRNAs; Nucleic Acid Conformation; Rats; Sequence Analysis, RNA; Tetraodontiformes; Zebrafish
PY - 2006
Y1 - 2006
N2 - The discoveries of microRNAs and riboswitches, among others, have shown functional RNAs to be biologically more important and genomically more prevalent than previously anticipated. We have developed a general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functional RNAs encoded in the human genome and used it to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs. At a loose threshold for acceptance, this search resulted in a set of 48,479 candidate RNA structures. This screen finds a large number of known functional RNAs, including 195 miRNAs, 62 histone 3'UTR stem loops, and various types of known genetic recoding elements. Among the highest-scoring new predictions are 169 new miRNA candidates, as well as new candidate selenocysteine insertion sites, RNA editing hairpins, RNAs involved in transcript auto regulation, and many folds that form singletons or small functional RNA families of completely unknown function. While the rate of false positives in the overall set is difficult to estimate and is likely to be substantial, the results nevertheless provide evidence for many new human functional RNAs and present specific predictions to facilitate their further characterization.
AB - The discoveries of microRNAs and riboswitches, among others, have shown functional RNAs to be biologically more important and genomically more prevalent than previously anticipated. We have developed a general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functional RNAs encoded in the human genome and used it to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs. At a loose threshold for acceptance, this search resulted in a set of 48,479 candidate RNA structures. This screen finds a large number of known functional RNAs, including 195 miRNAs, 62 histone 3'UTR stem loops, and various types of known genetic recoding elements. Among the highest-scoring new predictions are 169 new miRNA candidates, as well as new candidate selenocysteine insertion sites, RNA editing hairpins, RNAs involved in transcript auto regulation, and many folds that form singletons or small functional RNA families of completely unknown function. While the rate of false positives in the overall set is difficult to estimate and is likely to be substantial, the results nevertheless provide evidence for many new human functional RNAs and present specific predictions to facilitate their further characterization.
U2 - 10.1371/journal.pcbi.0020033
DO - 10.1371/journal.pcbi.0020033
M3 - Journal article
C2 - 16628248
SN - 1553-734X
VL - 2
SP - e33
JO - P L o S Computational Biology (Online)
JF - P L o S Computational Biology (Online)
IS - 4
ER -