Transcriptomics

Dataset Information

Comprehensive identification of long non-coding RNAs expressed during zebrafish embryogenesis [RNA_seq]


ABSTRACT: Long non-coding RNAs (lncRNAs) comprise a diverse class of transcripts that structurally resemble mRNAs but do not encode proteins. Recent genome-wide studies in human and mouse have annotated lncRNAs expressed in cell lines and adult tissues, but a systematic analysis of lncRNAs expressed during vertebrate embryogenesis has been elusive. To identify lncRNAs with potential functions in vertebrate embryogenesis, we performed a time series of RNA-Seq experiments at eight stages during early zebrafish development. We reconstructed 56,535 high-confidence transcripts in 28,912 loci, recovering the vast majority of expressed RefSeq transcripts, while identifying thousands of novel isoforms and expressed loci. We defined a stringent set of 1,133 non-coding multi-exonic transcripts expressed during embryogenesis. These include long intergenic ncRNAs (lincRNAs), intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, and precursors for small RNAs (sRNAs). Zebrafish lncRNAs share many of the characteristics of their mammalian counterparts: relatively short length, low exon number, low expression, and conservation levels comparable to introns. Subsets of lncRNAs carry chromatin signatures characteristic of genes with developmental functions. The temporal expression profile of lncRNAs revealed two novel properties: lncRNAs are expressed in narrower time windows than protein-coding genes and are specifically enriched in early-stage embryos. In addition, several lncRNAs show tissue-specific expression and distinct subcellular localization patterns. Integrative computational analyses associated individual lncRNAs with specific pathways and functions, ranging from cell cycle regulation to morphogenesis. Our study provides the first comprehensive identification of lncRNAs in a vertebrate embryo and forms the foundation for future genetic, genomic and evolutionary studies.

RNA-Seq for 8 zebrafish developmental stages, 2 lanes for each stage (3 for shield).
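The sequencing layout stated in the summary (8 stages, 2 lanes per stage, 3 for shield) can be tallied with a minimal sketch like the one below. Only "shield" is named in this record; the other stage labels (`stage_1` ... `stage_7`) are hypothetical placeholders, not the actual stage names, and the variable names are illustrative.

```python
# Lane layout as described in the dataset summary:
# 8 developmental stages, 2 lanes each, except shield with 3 lanes.
N_STAGES = 8
DEFAULT_LANES = 2
LANES_OVERRIDE = {"shield": 3}

# Placeholder labels: the record names only "shield"; the rest are assumed.
stages = ["shield"] + [f"stage_{i}" for i in range(1, N_STAGES)]

# Per-stage lane count, falling back to the default of 2.
lanes_per_stage = {s: LANES_OVERRIDE.get(s, DEFAULT_LANES) for s in stages}

total_lanes = sum(lanes_per_stage.values())
print(total_lanes)
```

Under these assumptions the record implies 17 lanes in total, which may be a useful sanity check against the number of run files retrieved from the repositories.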

ORGANISM(S): Danio rerio  

SUBMITTER: Albin Sandelin, Joshua Z Levin, John L Rinn, Alexander F Schier, Andrea Pauli, Eivind Valen, Michael F Lin, Nadine L Vastenhouw, Manuel Garber, Aviv Regev

PROVIDER: E-GEOD-32898 | ArrayExpress | 2011-11-17

SECONDARY ACCESSION(S): GSE32898, SRP009426, PRJNA154389

REPOSITORIES: GEO, ArrayExpress, ENA

Publications

Systematic identification of long noncoding RNAs expressed during zebrafish embryogenesis.

Pauli A, Valen E, Lin MF, Garber M, Vastenhouw NL, Levin JZ, Fan L, Sandelin A, Rinn JL, Regev A, Schier AF

Genome Research, 2011-11-22, issue 3


Long noncoding RNAs (lncRNAs) comprise a diverse class of transcripts that structurally resemble mRNAs but do not encode proteins. Recent genome-wide studies in humans and the mouse have annotated lncRNAs expressed in cell lines and adult tissues, but a systematic analysis of lncRNAs expressed during vertebrate embryogenesis has been elusive. To identify lncRNAs with potential functions in vertebrate embryogenesis, we performed a time-series of RNA-seq experiments at eight stages during early ze...

Similar Datasets

2011-11-17 | E-GEOD-32899 | ArrayExpress
2015-05-15 | E-GEOD-54964 | ArrayExpress
2015-05-15 | E-GEOD-54966 | ArrayExpress
2015-05-15 | E-GEOD-54968 | ArrayExpress
2014-06-01 | E-GEOD-48321 | ArrayExpress
2012-03-01 | GSE27722 | GEO
2011-11-17 | E-GEOD-32900 | ArrayExpress
2012-03-01 | E-GEOD-27722 | ArrayExpress
2013-09-12 | E-GEOD-50783 | ArrayExpress
| PRJNA146503 | ENA