Transcriptomics

Dataset Information

0

Application of RNA-seq for single nucleotide variation identification patients with hereditary diseases: analysis of trasncriptomic data and perspective use


ABSTRACT: Background: Various DNA sequencing methods, including particular genes sequencing, whole genome sequencing or exome sequencing, are presently used to identify single nucleotide variations (SNVs) associated with inherited diseases. Most of the known mutations and polymorphisms associated with hereditary diseases are localized in the coding regions and cause an amino-acid substitution, appearance of new stop codon or open reading frame shifts. These types of mutations can be identified using transcriptome sequencing. Full transcriptome sequencing is widely used to study diseases. The results of such investigations are usually available in public databases. This data could be tapped for identification of novel mutations and disease-associated variants. In this study, we are analyzing the potential use of full transcriptome sequencing for SNV identification, using RNA-seq of myocardium of patients with HCMP as an example. The goal of the current study is development of pipeline for SNV identification from RNA-seq data and application of this pipeline to identification of putative pathogenically significant SNV. Results: The algorithm of identification of SNV based on transcriptomic sequencing data has been developed. The algorithm was evaluated and the optimal quality threshold was determined based on allelic discrimination for the rs397516037 mutation (MYBPC3 c.3697C>T) among patients. A quality threshold of 75 was selected, at which false-positive and false-negative results for the rs397516037 variant were excluded. 43587 total SNVs with 75 and higher quality was identified in 48 transcriptomes of HCMP patients. In order to further verify the results obtained using the algorithm, 8 SNVs were selected in the MYBPC3 and MYH7 genes. Each of these 8 SNVs was confirmed by Sanger sequencing. Conclusions: The developed method has proven its effectiveness in identifying putative pathogenic variants in RNA-seq data. Its use will be especially useful in the context of large volumes of accumulated transcriptomic data that have not been analyzed in this way before. However, it is necessary to consider the limitations and scope of this method. The study was conducted on the basis of a disease and mode of inheritance are well suited for this method, but in other cases its applicability should be assessed individually.

ORGANISM(S): Homo sapiens

PROVIDER: GSE273325 | GEO | 2025/06/04

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2013-06-01 | E-MTAB-1159 | biostudies-arrayexpress
| EGAS00001000927 | EGA
2024-06-24 | GSE218122 | GEO
2016-01-12 | PXD003317 | Pride
2014-01-08 | GSE53876 | GEO
2012-08-07 | GSE37524 | GEO
2024-01-24 | GSE253950 | GEO
2022-05-25 | GSE179055 | GEO
2024-08-22 | GSE183348 | GEO
2024-08-22 | GSE183301 | GEO