Dataset Information

RNA sequencing reveals a diverse and dynamic repertoire of the Xenopus tropicalis transcriptome over development.


ABSTRACT: The Xenopus embryo has provided key insights into fate specification, the cell cycle, and other fundamental developmental and cellular processes, yet a comprehensive understanding of its transcriptome is lacking. Here, we used paired-end RNA sequencing (RNA-seq) to explore the transcriptome of Xenopus tropicalis in 23 distinct developmental stages. We determined expression levels of all genes annotated in RefSeq and Ensembl and showed for the first time on a genome-wide scale that, despite a general state of transcriptional silence in the earliest stages of development, approximately 150 genes are transcribed prior to the midblastula transition. In addition, our splicing analysis uncovered more than 10,000 novel splice junctions at each stage and revealed that many known genes have additional unannotated isoforms. Furthermore, we used Cufflinks to reconstruct transcripts from our RNA-seq data and found that ~13.5% of the final contigs are derived from novel transcribed regions, both within introns and in intergenic regions. We then developed a filtering pipeline to separate protein-coding transcripts from noncoding RNAs and identified a confident set of 6686 noncoding transcripts in 3859 genomic loci. Since the current reference genome, XenTro3, consists of hundreds of scaffolds instead of full chromosomes, we also performed de novo reconstruction of the transcriptome using Trinity and uncovered hundreds of transcripts that are missing from the genome. Collectively, our data will not only aid in completing the assembly of the Xenopus tropicalis genome but will also serve as a valuable resource for gene discovery and for unraveling the fundamental mechanisms of vertebrate embryogenesis.

SUBMITTER: Tan MH 

PROVIDER: S-EPMC3530680 | BioStudies | 2013-01-01

REPOSITORIES: biostudies

Similar Datasets

2012-05-03 | E-GEOD-37452 | ArrayExpress
2015-01-01 | S-EPMC4562602 | BioStudies
2010-01-01 | S-EPMC2994648 | BioStudies
2010-01-01 | S-EPMC3055250 | BioStudies
2011-01-01 | S-EPMC3144230 | BioStudies
2011-01-01 | S-EPMC3247858 | BioStudies
2007-01-01 | S-EPMC1890556 | BioStudies
2017-01-01 | S-EPMC5322306 | BioStudies
2014-01-01 | S-EPMC3911580 | BioStudies
2012-01-01 | S-EPMC3532188 | BioStudies