Unknown

Dataset Information

0

Detection and analysis of spliced chimeric mRNAs in sequence databanks.


ABSTRACT: We have developed a databank screening procedure, the In Silico Trans-splicing Retrieval System (ISTReS), to identify heterologous, spliced mRNAs with potential origin from chromosomal translocations, mRNA trans-splicing and multi-locus transcription. A parsing algorithm to screen cDNA versus genome Blast outputs was implemented. Key filtering criteria were Blast scores of > or =300, match lengths of > or =95% of the query sequences, junction of the two partners at exon-exon borders and concordant 'sense/sense' reading orientation. ISTReS was validated by the successful identification of bona fide chromosomal translocation-derived fusion transcripts in the HGI and RefSeq databanks. The performance of ISTReS was verified against recently identified chimeric antisense transcripts, where it revealed essentially no independent proof of antisense transcription and absence of exon-exon borders at the chimeric join, consistent with an artefactual origin. Analysis of the UNIGENE database revealed 21 742 chimeric sequences overall that correspond to approximately 1% of the database transcripts. Novel FOP-Rho GAP and methionyl tRNA synthetase-advillin chimeric mRNAs with the canonical features of heterologous-genes spliced-transcripts were identified among 246 chimeras from the RefSeq databank. This suggests a frequency of canonically-spliced chimeras of approximately 1% of all the hybrid sequences in current databanks. These findings demonstrate the efficiency of ISTReS and the overall feasibility of sequence/structure-based strategies to search for chimeric mRNAs candidate to derive from the splicing of heterologous transcripts.

SUBMITTER: Romani A 

PROVIDER: S-EPMC150249 | biostudies-literature | 2003 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detection and analysis of spliced chimeric mRNAs in sequence databanks.

Romani Antonello A   Guerra Emanuela E   Trerotola Marco M   Alberti Saverio S  

Nucleic acids research 20030201 4


We have developed a databank screening procedure, the In Silico Trans-splicing Retrieval System (ISTReS), to identify heterologous, spliced mRNAs with potential origin from chromosomal translocations, mRNA trans-splicing and multi-locus transcription. A parsing algorithm to screen cDNA versus genome Blast outputs was implemented. Key filtering criteria were Blast scores of > or =300, match lengths of > or =95% of the query sequences, junction of the two partners at exon-exon borders and concorda  ...[more]

Similar Datasets

| S-EPMC3531304 | biostudies-literature
| S-EPMC33275 | biostudies-literature
| S-EPMC361064 | biostudies-other
| S-EPMC3650844 | biostudies-literature
| S-EPMC44185 | biostudies-other
| S-EPMC7851563 | biostudies-literature
| S-EPMC6538341 | biostudies-literature