Annotation of the protein coding regions of the equine genome

Matthew S. Hestand, Theodore S. Kalbfleisch, Stephen J. Coleman, Zheng Zeng, Jinze Liu, Ludovic Antoine Alexandre Orlando, James N. MacLeod

    14 Citations (Scopus)
    83 Downloads (Pure)

    Abstract

    Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced mRNA from a pool of forty-three different tissues. From these, we derived the structures of 68,594 transcripts. In addition, we identified 301,829 positions with SNPs or small indels within these transcripts relative to EquCab2. Interestingly, 780 variants extend the open reading frame of the transcript and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross-species transcriptional and genomic comparisons.

    Original language: English
    Article number: e0124375
    Journal: PLOS ONE
    Volume: 10
    Issue number: 6
    Number of pages: 13
    ISSN: 1932-6203
    DOI
    Status: Published - 2015
