TY - JOUR
T1 - The identification and functional annotation of RNA structures conserved in vertebrates
AU - Seemann, Ernst Stefan
AU - Mirza, Aashiq Hussain
AU - Hansen, Claus
AU - Bang-Berthelsen, Claus H
AU - Garde, Christian
AU - Christensen-Dalsgaard, Mikkel
AU - Torarinsson, Elfar
AU - Yao, Zizhen
AU - Workman, Christopher T
AU - Pociot, Flemming
AU - Nielsen, Henrik
AU - Tommerup, Niels
AU - Ruzzo, Walter L.
AU - Gorodkin, Jan
N1 - Published by Cold Spring Harbor Laboratory Press.
PY - 2017/8
Y1 - 2017/8
N2 - Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality.
AB - Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality.
U2 - 10.1101/gr.208652.116
DO - 10.1101/gr.208652.116
M3 - Journal article
C2 - 28487280
SN - 1088-9051
VL - 27
SP - 1371
EP - 1383
JO - Genome Research
JF - Genome Research
ER -