Unknown

Dataset Information

0

Advancing Trypanosoma brucei genome annotation through ribosome profiling and spliced leader mapping.


ABSTRACT: Since the initial publication of the trypanosomatid genomes, curation has been ongoing. Here we make use of existing Trypanosoma brucei ribosome profiling data to provide evidence of ribosome occupancy (and likely translation) of mRNAs from 225 currently unannotated coding sequences (CDSs). A small number of these putative genes correspond to extra copies of previously annotated genes, but 85% are novel. The median size of these novels CDSs is small (81 aa), indicating that past annotation work has excelled at detecting large CDSs. Of the unique CDSs confirmed here, over half have candidate orthologues in other trypanosomatid genomes, most of which were not yet annotated as protein-coding genes. Nonetheless, approximately one-third of the new CDSs were found only in T. brucei subspecies. Using ribosome footprints, RNA-Seq and spliced leader mapping data, we updated previous work to definitively revise the start sites for 414 CDSs as compared to the current gene models. The data pointed to several regions of the genome that had sequence errors that altered coding region boundaries. Finally, we consolidated this data with our previous work to propose elimination of 683 putative genes as protein-coding and arrive at a view of the translatome of slender bloodstream and procyclic culture form T. brucei.

SUBMITTER: Parsons M 

PROVIDER: S-EPMC4644489 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Advancing Trypanosoma brucei genome annotation through ribosome profiling and spliced leader mapping.

Parsons Marilyn M   Ramasamy Gowthaman G   Vasconcelos Elton J R EJ   Jensen Bryan C BC   Myler Peter J PJ  

Molecular and biochemical parasitology 20150801 2


Since the initial publication of the trypanosomatid genomes, curation has been ongoing. Here we make use of existing Trypanosoma brucei ribosome profiling data to provide evidence of ribosome occupancy (and likely translation) of mRNAs from 225 currently unannotated coding sequences (CDSs). A small number of these putative genes correspond to extra copies of previously annotated genes, but 85% are novel. The median size of these novels CDSs is small (81 aa), indicating that past annotation work  ...[more]

Similar Datasets

2015-12-22 | GSE72463 | GEO
2015-12-22 | E-GEOD-72463 | biostudies-arrayexpress
| S-EPMC1852752 | biostudies-literature
| S-EPMC1409817 | biostudies-literature
2010-06-25 | GSE22571 | GEO