Genomics

Dataset Information

A systematic evaluation of the design, orientation, and sequence context dependencies of massively parallel reporter assays


ABSTRACT: Enhancers play important roles in evolution and disease. However, traditional assays to test enhancers are low throughput and not scalable to the >100,000 enhancers in the human genome. To better prioritize variants associated with disease and to study the role of enhancers, our group and others developed massively parallel reporter assays (MPRAs), which functionally screen thousands of sequences for regulatory activity in parallel. Although MPRAs have been applied to address diverse questions in gene regulation, there has been no systematic comparison of how differences in experimental design influence findings, making it difficult to interpret results and compare between groups. Here, we screen a library of 2,440 sequences, representing candidate liver enhancers and controls, in HepG2 cells for regulatory activity using nine different approaches (including conventional episomal, STARR-seq, and lentiviral MPRA designs). We identify subtle but significant differences in the resulting measurements that correlate with epigenetic and sequence-level features. We also test this library in both orientations with respect to the promoter, validating en masse that enhancer activity is robustly independent of orientation. Finally, we develop and apply a novel method to assemble and functionally test libraries of the same putative enhancers as 192-mers, 354-mers, and 678-mers, and observe surprisingly large differences in functional activity. This work provides a framework for the experimental design of high-throughput reporter assays, suggesting that the extended sequence context of tested elements, and to a lesser degree the precise assay, influence MPRA results.

ORGANISM(S): Homo sapiens

PROVIDER: GSE142696 | GEO | 2019/12/30

REPOSITORIES: GEO

Similar Datasets

2020-09-30 | GSE157430 | GEO
2013-03-15 | E-GEOD-33367 | biostudies-arrayexpress
2022-07-01 | GSE202564 | GEO
2021-07-24 | GSE180714 | GEO
2023-01-19 | GSE195901 | GEO
2023-01-19 | GSE195902 | GEO
2023-01-19 | GSE222160 | GEO
2023-01-19 | GSE194087 | GEO
2012-02-26 | E-GEOD-31982 | biostudies-arrayexpress
2020-12-04 | GSE156857 | GEO