Unknown

Dataset Information

0

Computational identification of putative lincRNAs in mouse embryonic stem cell.


ABSTRACT: As the regulatory factors, lncRNAs play critical roles in embryonic stem cells. And lincRNAs are most widely studied lncRNAs, however, there might still might exist a large member of uncovered lncRNAs. In this study, we constructed the de novo assembly of transcriptome to detect 6,701 putative long intergenic non-coding transcripts (lincRNAs) expressed in mouse embryonic stem cells (ESCs), which might be incomplete with the lack coverage of 5' ends assessed by CAGE peaks. Comparing the TSS proximal regions between the known lincRNAs and their closet protein coding transcripts, our results revealed that the lincRNA TSS proximal regions are associated with the characteristic genomic and epigenetic features. Subsequently, 1,293 lincRNAs were corrected at their 5' ends using the putative lincRNA TSS regions predicted by the TSS proximal region prediction model based on genomic and epigenetic features. Finally, 43 putative lincRNAs were annotated by Gene Ontology terms. In conclusion, this work provides a novel catalog of mouse ESCs-expressed lincRNAs with the relatively complete transcript length, which might be useful for the investigation of transcriptional and post-transcriptional regulation of lincRNA in mouse ESCs and even mammalian development.

SUBMITTER: Liu H 

PROVIDER: S-EPMC5054606 | biostudies-literature | 2016 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Computational identification of putative lincRNAs in mouse embryonic stem cell.

Liu Hui H   Lyu Jie J   Liu Hongbo H   Gao Yang Y   Guo Jing J   He Hongjuan H   Han Zhengbin Z   Zhang Yan Y   Wu Qiong Q  

Scientific reports 20161007


As the regulatory factors, lncRNAs play critical roles in embryonic stem cells. And lincRNAs are most widely studied lncRNAs, however, there might still might exist a large member of uncovered lncRNAs. In this study, we constructed the de novo assembly of transcriptome to detect 6,701 putative long intergenic non-coding transcripts (lincRNAs) expressed in mouse embryonic stem cells (ESCs), which might be incomplete with the lack coverage of 5' ends assessed by CAGE peaks. Comparing the TSS proxi  ...[more]

Similar Datasets

| S-EPMC7876754 | biostudies-literature
| S-EPMC5732849 | biostudies-literature
| S-EPMC2928534 | biostudies-literature
2007-07-11 | GSE7800 | GEO
| S-EPMC1851713 | biostudies-literature
| S-EPMC5260497 | biostudies-literature
2019-10-22 | PXD008964 | Pride
| S-EPMC3075915 | biostudies-literature