Unknown

Dataset Information

0

Accurate assignment of disease liability to genetic variants using only population data.


ABSTRACT:

Purpose

The growing size of public variant repositories prompted us to test the accuracy of pathogenicity prediction of DNA variants using population data alone.

Methods

Under the a priori assumption that the ratio of the prevalence of variants in healthy population vs that in affected populations form 2 distinct distributions (pathogenic and benign), we used a Bayesian method to assign probability to a variant belonging to either distribution.

Results

The approach, termed Bayesian prevalence ratio (BayPR), accurately parsed 300 of 313 expertly curated CFTR variants: 284 of 296 pathogenic/likely pathogenic variants in 1 distribution and 16 of 17 benign/likely benign variants in another. BayPR produced an area under the receiver operating characteristic curve of 0.99 for 103 functionally confirmed missense CFTR variants, which is equal to or exceeds 10 commonly used algorithms (area under the receiver operating characteristic curve range = 0.54-0.99). Application of BayPR to expertly curated variants in 8 genes associated with 7 Mendelian conditions led to the assignment of a disease-causing probability of ≥80% to 1350 of 1374 (98.3%) pathogenic/likely pathogenic variants and of ≤20% to 22 of 23 (95.7%) benign/likely benign variants.

Conclusion

Irrespective of the variant type or functional effect, the BayPR approach provides probabilities of pathogenicity for DNA variants responsible for Mendelian disorders using only the variant counts in affected and unaffected population samples.

SUBMITTER: Collaco JM 

PROVIDER: S-EPMC10280479 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accurate assignment of disease liability to genetic variants using only population data.

Collaco Joseph M JM   Raraigh Karen S KS   Betz Joshua J   Aksit Melis Atalar MA   Blau Nenad N   Brown Jordan J   Dietz Harry C HC   MacCarrick Gretchen G   Nogee Lawrence M LM   Sheridan Molly B MB   Vernon Hilary J HJ   Beaty Terri H TH   Louis Thomas A TA   Cutting Garry R GR  

Genetics in medicine : official journal of the American College of Medical Genetics 20211130 1


<h4>Purpose</h4>The growing size of public variant repositories prompted us to test the accuracy of pathogenicity prediction of DNA variants using population data alone.<h4>Methods</h4>Under the a priori assumption that the ratio of the prevalence of variants in healthy population vs that in affected populations form 2 distinct distributions (pathogenic and benign), we used a Bayesian method to assign probability to a variant belonging to either distribution.<h4>Results</h4>The approach, termed  ...[more]

Similar Datasets

| S-EPMC9729100 | biostudies-literature
| S-EPMC3909108 | biostudies-literature
| S-EPMC4932600 | biostudies-literature
| S-EPMC7507196 | biostudies-literature
| S-EPMC8266611 | biostudies-literature
| S-EPMC2625398 | biostudies-literature
| S-EPMC10495273 | biostudies-literature
| S-EPMC4377079 | biostudies-literature
| S-EPMC9388528 | biostudies-literature
| S-EPMC5139035 | biostudies-literature