Unknown

Dataset Information

0

Genome-wide detection of human variants that disrupt intronic branchpoints.


ABSTRACT: Pre-messenger RNA splicing is initiated with the recognition of a single-nucleotide intronic branchpoint (BP) within a BP motif by spliceosome elements. Forty-eight rare variants in 43 human genes have been reported to alter splicing and cause disease by disrupting BP. However, until now, no computational approach was available to efficiently detect such variants in massively parallel sequencing data. We established a comprehensive human genome-wide BP database by integrating existing BP data and generating new BP data from RNA sequencing of lariat debranching enzyme DBR1-mutated patients and from machine-learning predictions. We characterized multiple features of BP in major and minor introns and found that BP and BP-2 (two nucleotides upstream of BP) positions exhibit a lower rate of variation in human populations and higher evolutionary conservation than the intronic background, while being comparable to the exonic background. We developed BPHunter as a genome-wide computational approach to systematically and efficiently detect intronic variants that may disrupt BP recognition. BPHunter retrospectively identified 40 of the 48 known pathogenic BP variants, in which we summarized a strategy for prioritizing BP variant candidates. The remaining eight variants all create AG-dinucleotides between the BP and acceptor site, which is the likely reason for missplicing. We demonstrated the practical utility of BPHunter prospectively by using it to identify a novel germline heterozygous BP variant of STAT2 in a patient with critical COVID-19 pneumonia and a novel somatic intronic 59-nucleotide deletion of ITPKB in a lymphoma patient, both of which were validated experimentally. BPHunter is publicly available from https://hgidsoft.rockefeller.edu/BPHunter and https://github.com/casanova-lab/BPHunter.

SUBMITTER: Zhang P 

PROVIDER: S-EPMC9636908 | biostudies-literature | 2022 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome-wide detection of human variants that disrupt intronic branchpoints.

Zhang Peng P   Philippot Quentin Q   Ren Weicheng W   Lei Wei-Te WT   Li Juan J   Stenson Peter D PD   Palacín Pere Soler PS   Colobran Roger R   Boisson Bertrand B   Zhang Shen-Ying SY   Puel Anne A   Pan-Hammarström Qiang Q   Zhang Qian Q   Cooper David N DN   Abel Laurent L   Casanova Jean-Laurent JL  

Proceedings of the National Academy of Sciences of the United States of America 20221028 44


Pre-messenger RNA splicing is initiated with the recognition of a single-nucleotide intronic branchpoint (BP) within a BP motif by spliceosome elements. Forty-eight rare variants in 43 human genes have been reported to alter splicing and cause disease by disrupting BP. However, until now, no computational approach was available to efficiently detect such variants in massively parallel sequencing data. We established a comprehensive human genome-wide BP database by integrating existing BP data an  ...[more]

Similar Datasets

| S-EPMC10655562 | biostudies-literature
| S-EPMC4315302 | biostudies-literature
2014-10-01 | GSE53328 | GEO
2014-10-01 | GSE53327 | GEO
2014-10-01 | GSE53326 | GEO
2014-10-01 | E-GEOD-53328 | biostudies-arrayexpress
2014-10-01 | E-GEOD-53326 | biostudies-arrayexpress
2014-10-01 | E-GEOD-53327 | biostudies-arrayexpress
| S-EPMC5880247 | biostudies-literature
| S-EPMC4370647 | biostudies-literature