Transcriptomics

Dataset Information

0

Targeted RT-PCR assays spanning unannotated splice junctions sequenced by Roche 454.


ABSTRACT: The ENCODE projects seeks to identify and characterize functional elements in the human genome. Throughout the scale-up phase of ENCODE, the transcriptome group has generate Long RNA-Seq, Small RNA-Seq, Cap-Analysis of Gene Expression (CAGE), and RNA-PET short read data on the Illumina platform for ~ 40 different human primary and transformed cell lines in replicate. From these data several high-resolution and discrete features/elements have been mined out (5’ caps, splice junctions, polyadenylation sites, small RNAs, etc…). However, because these data are obtained from short-read data, we have only limited “connectivity” information. For example, from the long RNA-Seq data, which was sequenced in mate-pair fashion with average insert sizes ~ 200 bp, we know that the sequence from mate 1 is physically linked to the sequence in mate 2. We don’t know the sequence in between and we don’t know how this mate-pair is connected to other mate-pairs in the context of longer transcripts in vivo. To date, this information is gleaned from models generated in silico: In our case, by the program Cufflinks. Consequently, we have a collection of transcript models exhibiting a vast array of local complexity assembled from short read data that need to be experimentally tested. Alternatively, one can “cut to the chase” and use a more raw/elemental form of the data as a basis for additional experimentation to clone out the longer sequences generated in vivo. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf

ORGANISM(S): Homo sapiens

PROVIDER: GSE38886 | GEO | 2012/06/26

SECONDARY ACCESSION(S): PRJNA169392

REPOSITORIES: GEO

Similar Datasets

2012-06-26 | E-GEOD-38886 | biostudies-arrayexpress
| phs001255 | dbGaP
| PRJNA209488 | ENA
2022-05-30 | E-MTAB-11351 | biostudies-arrayexpress
2022-05-31 | PXD033870 | Pride
2020-01-23 | PXD014844 | Pride
2010-01-25 | GSE19986 | GEO
2010-01-24 | E-GEOD-19986 | biostudies-arrayexpress
2012-04-02 | E-GEOD-36025 | biostudies-arrayexpress
| PRJEB1813 | ENA