Genomics

Dataset Information

Recurrent Integration of Human Papillomavirus Genomes at Transcriptional Regulatory Hubs


ABSTRACT: Oncogenic human papillomavirus (HPV) genomes are often integrated into host chromosomes in HPV-associated cancers. HPV genomes are integrated either as a single copy or as tandem repeats of viral DNA, with or without interspersed host DNA. Integration occurs frequently in common fragile sites susceptible to tandem repeat formation, and the flanking or interspersed host DNA often contains transcriptional enhancer elements. When co-amplified with the viral genome, these enhancers can form super-enhancer-like elements that drive high viral oncogene expression. Here, we compiled highly curated datasets of HPV integration sites in cervical (CESC) and head and neck squamous cell carcinoma (HNSCC) cancers and assessed the number of breakpoints, viral transcriptional activity, and host genome copy number at each insertion site. Tumors frequently contained multiple distinct HPV integration sites, but often only one “driver” site that expressed viral RNA. Since common fragile sites and active enhancer elements are cell-type specific, we mapped these regions in cervical cell lines using FANCD2 and Brd4/H3K27ac ChIP-seq, respectively. Large enhancer clusters, or super-enhancers, were also defined using the Brd4/H3K27ac ChIP-seq dataset. HPV integration breakpoints were enriched at both FANCD2-associated fragile sites and enhancer-rich regions, and frequently showed adjacent focal DNA amplification in CESC samples. We identified recurrent integration “hotspots” that were enriched for super-enhancers, some of which function as regulatory hubs for cell-identity genes. We propose that during persistent infection, extrachromosomal HPV minichromosomes associate with these transcriptional epicenters, and that accidental integration at these sites could promote viral oncogene expression and carcinogenesis.

ORGANISM(S): Homo sapiens

PROVIDER: GSE183048 | GEO | 2021/09/01

REPOSITORIES: GEO

Similar Datasets

2018-11-15 | GSE122512 | GEO
2023-04-19 | GSE195631 | GEO
2021-02-03 | GSE141101 | GEO
2022-06-30 | GSE197121 | GEO
2010-07-01 | E-GEOD-5049 | biostudies-arrayexpress
2007-06-12 | GSE5049 | GEO
2023-08-08 | GSE240391 | GEO
| EGAS00001000599 | EGA
2014-11-01 | E-GEOD-62912 | biostudies-arrayexpress
2021-07-28 | GSE144293 | GEO