Unknown

Dataset Information

0

Bayesian estimation of gene constraint from an evolutionary model with gene features.


ABSTRACT: Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, shet. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

SUBMITTER: Zeng T 

PROVIDER: S-EPMC10245655 | biostudies-literature | 2023 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bayesian estimation of gene constraint from an evolutionary model with gene features.

Zeng Tony T   Spence Jeffrey P JP   Mostafavi Hakhamanesh H   Pritchard Jonathan K JK  

bioRxiv : the preprint server for biology 20240410


Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ∼25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate infe  ...[more]

Similar Datasets

| S-EPMC10312940 | biostudies-literature
| S-EPMC7017863 | biostudies-literature
| S-EPMC3622139 | biostudies-literature
| S-EPMC6731579 | biostudies-literature
| S-EPMC3038348 | biostudies-literature
| S-EPMC6697484 | biostudies-literature
| S-EPMC11783320 | biostudies-literature
| S-EPMC10403175 | biostudies-literature
| S-EPMC5605759 | biostudies-literature
| S-EPMC4308717 | biostudies-literature