Dataset Information


Quantifying deleterious effects of regulatory variants.

ABSTRACT: The majority of genome-wide association study (GWAS) risk variants reside in non-coding DNA sequences. Understanding how these sequence modifications lead to transcriptional alterations and cell-to-cell variability can help unravel genotype-phenotype relationships. Here, we describe a computational method, dubbed CAPE, which calculates the likelihood that a genetic variant deactivates an enhancer by disrupting the binding of transcription factors (TFs) in a given cellular context. CAPE learns sequence signatures associated with putative enhancers identified in large-scale sequencing experiments (such as ChIP-seq or DNase-seq) and models the change in enhancer signature upon a single nucleotide substitution. CAPE accurately identifies causative cis-regulatory variation, including expression quantitative trait loci (eQTLs) and DNase I sensitivity quantitative trait loci (dsQTLs), in a tissue-specific manner, with precision superior to several currently available methods. The presented method can be trained on any tissue-specific dataset of enhancers and known functional variants and applied to prioritize disease-associated variants in the corresponding tissue.
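
ILLUSTRATION: The idea of modeling "the change in enhancer signature upon a single nucleotide substitution" can be sketched with a toy k-mer scoring scheme. This is not CAPE's implementation. It assumes, purely for illustration, that a learned sequence signature can be reduced to a table of per-k-mer weights; the names kmer_score, substitution_delta, and toy_weights, along with the weight values, are hypothetical.

```python
from typing import Dict


def kmer_score(seq: str, weights: Dict[str, float], k: int) -> float:
    """Sum the weights of every k-mer in seq; unseen k-mers contribute 0."""
    return sum(weights.get(seq[i:i + k], 0.0) for i in range(len(seq) - k + 1))


def substitution_delta(seq: str, pos: int, alt: str,
                       weights: Dict[str, float], k: int) -> float:
    """Score change when the base at 0-based position pos is replaced by alt."""
    if not 0 <= pos < len(seq):
        raise ValueError("pos is outside the sequence")
    if len(alt) != 1:
        raise ValueError("alt must be a single nucleotide")
    mutated = seq[:pos] + alt + seq[pos + 1:]
    # Only k-mers overlapping pos can change, so score a window around it.
    start = max(0, pos - k + 1)
    end = min(len(seq), pos + k)
    return (kmer_score(mutated[start:end], weights, k)
            - kmer_score(seq[start:end], weights, k))


# Hypothetical weights: positive values mark k-mers treated as enhancer-like.
toy_weights = {"TGAC": 1.5, "GACT": 1.0, "TGCC": -0.5}
ref = "AATGACTCA"
delta = substitution_delta(ref, pos=4, alt="C", weights=toy_weights, k=4)
print(delta)
```

In this toy setting, a strongly negative delta means the substitution removes k-mers that the signature associates with enhancer activity. A real model would learn its weights from tissue-specific data and would need calibration against known functional variants before the score could be read as a likelihood.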


PROVIDER: S-EPMC5389506 | BioStudies | 2017-01-01

REPOSITORIES: biostudies

Similar Datasets

2017-01-01 | S-EPMC5509100 | BioStudies
2012-01-01 | S-EPMC3501342 | BioStudies
2019-01-01 | S-EPMC6405142 | BioStudies
2017-01-01 | S-EPMC5351933 | BioStudies
2018-01-01 | S-EPMC6265487 | BioStudies
2021-01-01 | S-EPMC7875769 | BioStudies
2017-01-01 | S-EPMC5741056 | BioStudies
2019-01-01 | S-EPMC6382991 | BioStudies
2017-01-01 | S-EPMC5499808 | BioStudies
2019-01-01 | S-EPMC6631995 | BioStudies