Project description:Oncogenic human papillomavirus (HPV) genomes are often integrated into host chromosomes in HPV-associated cancers. HPV genomes are integrated either as a single copy, or as tandem repeats of viral DNA interspersed with, or without, host DNA. Integration occurs frequently in common fragile sites susceptible to tandem repeat formation, and the flanking or interspersed host DNA often contains transcriptional enhancer elements. When co-amplified with the viral genome, these enhancers can form super-enhancer-like elements that drive high viral oncogene expression. Here, we compiled highly curated datasets of HPV integration sites in cervical (CESC) and head and neck squamous cell carcinoma (HNSCC) cancers and assessed the number of breakpoints, viral transcriptional activity, and host genome copy number at each insertion site. Tumors frequently contained multiple distinct HPV integration sites, but often only one “driver” site that expressed viral RNA. Since common fragile sites and active enhancer elements are cell-type specific, we mapped these regions in cervical cell lines using FANCD2 and Brd4/H3K27ac ChIP-seq, respectively. Large enhancer clusters, or super-enhancers, were also defined using the Brd4/H3K27ac ChIP-seq dataset. HPV integration breakpoints were enriched at both FANCD2-associated fragile sites, and enhancer-rich regions, and frequently showed adjacent focal DNA amplification in CESC samples. We identified recurrent integration “hotspots” that were enriched for super-enhancers, some of which function as regulatory hubs for cell-identity genes. We propose that during persistent infection, extrachromosomal HPV minichromosomes associate with these transcriptional epicenters, and accidental integration could promote viral oncogene expression and carcinogenesis.
Project description:PeCa is a rare carcinoma in developed countries, but it presents higher incidence rates in South America, Asia, and Africa, where limited economic and social conditions play a large impact leading to delay in diagnosis, and treatment initiation. The infection by HPV is a the risk factors and can occur through the canonical HPV/p53/RB1 pathway mediated by the E2/E6/E7 viral oncoproteins. During the transformation process, HPV inserts its genetic material into host Integration Sites (IS), affecting coding genes and miRNAs. In penile cancer (PeCa) there is limited data on the miRNAs. Considering the high frequency of HPV infection in PeCa patients in Northeast Brazil, global miRNA expression profiling was performed in high-risk HPV-associated PeCa. A panel of differentially expressed miRNAs (miRDE) was successfully constructed using 22 PeCa tissues and five non-tumor penile tissues.
Project description:Human papillomavirus (HPV) integration is a critical step in cervical cancer development, while the oncogenic mechanism in genome-wide transcriptional level is still poorly understood. In this study, we employed integrative analysis on multi-omics data of cervical cancer cell lines. Through HPV integration detection, super enhancer (SE) identification, SE-associated gene expression and extrachromosomal DNA (ecDNA) investigation, we aimed to explore the genome-wide transcriptional influence of HPV integration. We identified 5 high-ranking cellular super enhancers generated by HPV integration (the HPV breakpoint induced cellular super enhancers, BP-cSE), leading to intra-chromosomal and inter-chromosomal regulations of chromosomal genes. The pathway analysis showed the dysregulated chromosomal genes were correlated to cervical cancer associated pathways. Importantly, we demonstrated that BP-cSE existed in the HPV-host ecDNA, explaining above transcription alterations. Our results suggest that HPV integration generates cellular super enhancers and functions as ecDNA to regulate unconstraint transcription, expanding the tumorigenic mechanism of HPV integration and providing insights of developing new diagnostic and therapeutic strategies.
Project description:We sequenced DNA from a bulk of Col x Ler F2 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. For nanopore sequencing of gDNA from 1,000 pooled seedlings, 10-day-old seedlings were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:We sequenced DNA from the leaves of ten Col x Ler F1 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. These data were used as a negative control for COmapper, as no crossover sites were expected to be detected. For nanopore sequencing of gDNA from leaves, leaves from 10 5-week-old plants were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:Genomic DNA from 55 wild type Col x Ler F2 individuals was extracted using the CTAB method. Equal amounts of DNA from these 55 plants were pooled into two groups (pool 1 = 4 plants; pool 2 = 51 plants), and nine micrograms of gDNA from each pool was used to generate Nanopore sequencing libraries with the Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced independently using PromethION (BGI, Hong Kong).
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:E18 mouse brain single cell profiling using the 10x Genomics Chromium instrument workflow with either Illumina short read sequencing for the standard gene profiling and Nanopore PromethION long read sequencing for isoform profiling.
Project description:ChIP-Seq for H3K27 trimethylation was performed for two HPV-positive and two HPV-negative squamous cell carcinoma cell lines. The data served two purposes. First, the data were used as an example implementation of our novel ChIP-Seq Peak Prioritization pipeline, PePr. We have developed the PePr pipeline, a ChIP-Seq Peak Prioritization pipeline that accounts for the variation among replicates and peak location relative to a gene. We show, using a transcription factor dataset (which exhibited small variation among samples), that PePr performs favorably compared to commonly used peak callers and that it achieves balanced sensitivity and specificity. We also show, using histone modification data (which exhibited larger variation among samples), that PePr can improve the detection of differential H3K27me3 regions compared with a common current approach. Using data from ChIP-Seq and gene expression experiments performed in parallel on the same samples, we show that the incorporation of functional annotations can improve the prioritization of functional sites. Secondly, the data were used to assess real differences in the genome-wide H3K27me3 profiles between HPV-positive and HPV-negative carcinoma cell lines. Careful analysis and integration of the data with DNA methylation and gene expression data performed on the same cell lines demonstrated striking differences exist. ChIP-Seq for H3K27 trimethylation was performed for two HPV-positive and two HPV-negative squamous cell carcinoma (SCC) cell lines. Input DNA was also sequenced for each sample to serve as a control. The goal was to determine overall differences in H3K27me3 patterns observed between the HPV-positive and HPV-negative SCC cell lines.