Dataset Information


The Mapping of Predicted Triplex DNA:RNA in the Drosophila Genome Reveals a Prominent Location in Development- and Morphogenesis-Related Genes.

ABSTRACT: Double-stranded DNA is able to form triple-helical structures by accommodating a third nucleotide strand. A nucleic acid triplex occurs according to Hoogsteen rules that predict the stability and affinity of the third strand bound to the Watson-Crick duplex. The "triplex-forming oligonucleotide" (TFO) can be a short sequence of RNA that binds to the major groove of the targeted duplex only when this duplex presents a sequence of purine or pyrimidine bases in one of the DNA strands. Many nuclear proteins are known to bind triplex DNA or DNA:RNA, but their biological functions are unexplored. We identified sequences that are capable of engaging as the "triplex-forming oligonucleotide" in both the pre-lncRNA and pre-mRNA collections of Drosophila melanogaster These motifs were matched against the Drosophila genome in order to identify putative sequences of triplex formation in intergenic regions, promoters, and introns/exons. Most of the identified TFOs appear to be located in the intronic region of the analyzed genes. Computational prediction of the most targeted genes by TFOs originating from pre-lncRNAs and pre-mRNAs revealed that they are restrictively associated with development- and morphogenesis-related gene networks. The refined analysis by Gene Ontology enrichment demonstrates that some individual TFOs present genome-wide scale matches that are located in numerous genes and regulatory sequences. The triplex DNA:RNA computational mapping at the genome-wide scale suggests broad interference in the regulatory process of the gene networks orchestrated by TFO RNAs acting in association simultaneously at multiple sites.

SUBMITTER: Pasquier C 

PROVIDER: S-EPMC5499136 | BioStudies | 2017-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC5591256 | BioStudies
2010-01-01 | S-EPMC2935506 | BioStudies
2013-01-01 | S-EPMC3711454 | BioStudies
2009-01-01 | S-EPMC2796241 | BioStudies
2009-01-01 | S-EPMC2730491 | BioStudies
1000-01-01 | S-EPMC1181241 | BioStudies
1000-01-01 | S-EPMC147161 | BioStudies
1000-01-01 | S-EPMC6182420 | BioStudies
2012-01-01 | S-EPMC3401312 | BioStudies
2018-01-01 | S-EPMC5824356 | BioStudies