Metabolomics,Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

Whole Genome Bisulfite Sequencing by ENCODE/HAIB

ABSTRACT: This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu). This track was produced as part of the ENCODE project. It reports the percentage of DNA molecules that exhibit cytosine methylation. In general, DNA methylation within a gene's promoter is associated with gene silencing, and DNA methylation within the exons and introns of a gene is associated with gene expression. Proper regulation of DNA methylation is essential during development and aberrant DNA methylation is a hallmark of cancer. DNA methylation status was assayed with Whole Genome Bisulfite Sequencing (WGBS). Genomic DNA was sheared by sonication, end-repaired and then ligated to methylated sequencing adapters. The library fragments were treated with sodium bisulfite and amplified by PCR to convert every unmethylated cytosine to a thymine while leaving methylated cytosines intact. The sequenced fragments were aligned to a bisulfite-converted reference genome. For each assayed cytosine, the number of sequencing reads covering that C and the percentage of those reads that were methylated were reported. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf DNA methylation at cytosines across the genome was assayed with Whole Genome Bisulfite Sequencing (WGBS). WGBS was performed on cell lines grown by ENCODE production groups. WGBS was carried out by the Myers production group at the HudsonAlpha Institute for Biotechnology. Isolation of Genomic DNA: Genomic DNA was isolated from each cell line using the QIAGEN DNeasy Blood & Tissue Kit according to the instructions provided by the manufacturer. DNA concentrations for each genomic DNA preparation were determined using fluorescent DNA-binding dye and a fluorometer (Invitrogen Quant-iT dsDNA High Sensitivity Kit and Qubit Fluorometer). Typically, 2 µg of genomic DNA is used to make WGBS libraries. WGBS Library Construction and Sequencing: WGBS library construction started with sonication of genomic DNA on a Covaris S2 instrument. Sheared ends were then repaired and blunted with DNA polymerase I, T4 DNA polymerase and T4 polynucleotide kinase in the presence of dATP, dGTP and dTTP. After end repair, Klenow exo- DNA Polymerase was used to add an adenosine as a 3' overhang. Next, a methylated version of the Illumina paired-end adapters was ligated onto the DNA. Adapter-ligated 400 bp genomic DNA fragments were selected using a 2% agarose SizeSelect E-gel. The selected adapter-ligated fragments were treated with sodium bisulfite using the Zymo Research EZ DNA Methylation Gold Kit, which converts unmethylated cytosines to uracils and leaves methylated cytosines unchanged. Bisulfite-treated DNA was amplified in a final PCR reaction which was optimized to uniformly amplify diverse fragment sizes and sequence contexts in the same reaction. During this final PCR reaction, uracils were copied as thymines, resulting in a thymine in the PCR products wherever an unmethylated cytosine existed in the genomic DNA. These libraries were then sequenced with an Illumina HiSeq 2000 according to the manufacturer's recommendations as paired-end 50 bp reads. Libraries were sequenced to a depth of 600 million aligned reads. Data Analysis: To analyze the sequence data, Bismark (Krueger and Andrews, 2011) was used to align sequences reads. Generally, each read went through a conversion of Cs to Ts and was then aligned to fully converted plus and minus strands of the hg19 build of the human genome. A few custom refinements were made to the Bismark program. Since these libraries were made in a directional orientation with the first read always being C-poor, we skipped unnecessary alignments to impossible orientations. We also implemented a more stringent uniqueness filter, only allowing reads that have one acceptable alignment (based on default Bowtie parameters) across both strands. Once reads were aligned, the percent methylation was calculated for each cytosine using the original sequence reads. The percent methylation and number of reads is reported for each CpG in the wgEncodeHaibMethylWgbsXXXXCpg.bigBed file and for each non CpG cytosine in the wgEncodeHaibMethylWgbsXXXXNoncpg.bigBed file.

ORGANISM(S): Homo sapiens

SUBMITTER: UCSC ENCODE DCC

PROVIDER: E-GEOD-40832 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

ACCESS DATA

Similar Datasets

Project description:This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Florencia Pauli mailto:fpauli@hudsonalpha.org). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu). This track is produced as part of the ENCODE project. The track reports the percentage of DNA molecules that exhibit cytosine methylation at specific CpG dinucleotides. In general, DNA methylation within a gene's promoter is associated with gene silencing, and DNA methylation within the exons and introns of a gene is associated with gene expression. Proper regulation of DNA methylation is essential during development and aberrant DNA methylation is a hallmark of cancer. DNA methylation status is assayed at more than 500,000 CpG dinucleotides in the genome using Reduced Representation Bisulfite Sequencing (RRBS). Genomic DNA is digested with the methyl-insensitive restriction enzyme MspI, small genomic DNA fragments are purified by gel electrophoresis, and then used to construct an Illumina sequencing library. The library fragments are treated with sodium bisulfite and amplified by PCR to convert every unmethylated cytosine to a thymidine while leaving methylated cytosines intact. The sequenced fragments are aligned to a customized reference genome sequence and for each assayed CpG we report the number of sequencing reads covering that CpG and the percentage of those reads that are methylated. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf DNA methylation at CpG sites was assayed with a modified version of Reduced Representation Bisulfite Sequencing (RRBS; Meissner et al., 2008). RRBS was performed on cell lines grown by many ENCODE production groups. The production group that grew the cells and isolated genomic DNA is indicated in the "obtainedBy" field of the metadata. When a cell type was provided by more than one lab, the data for the cells from only one lab are displayed in the table above. However, the data for every cell type from every lab is available from the Downloads page. RRBS was carried out by the Myers production group at the HudsonAlpha Institute for Biotechnology. Isolation of genomic DNA Genomic DNA is isolated from biological replicates of each cell line using the QIAGEN DNeasy Blood & Tissue Kit according to the instructions provided by the manufacturer. DNA concentrations for each genomic DNA preparation are determined using fluorescent DNA binding dye and a fluorometer (Invitrogen Quant-iT dsDNA High Sensitivity Kit and Qubit Fluorometer). Typically, 1 µg of DNA is used to make an RRBS library; however, we have also had success in making libraries with 200 ng genomic DNA from rare or precious samples. RRBS library construction and sequencing RRBS library construction starts with MspI digestion of genomic DNA , which cuts at every CCGG regardless of methylation status. Klenow exo- DNA Polymerase is then used to fill in the recessed end of the genomic DNA and add an adenosine as a 3prime overhang. Next, a methylated version of the Illumina paired-end adapters is ligated onto the DNA. Adapter ligated genomic DNA fragments between 105 and 185 basepairs are selected using agarose gel electrophoresis and Qiagen Qiaquick Gel Extraction Kit. The selected adapter-ligated fragments are treated with sodium bisulfite using the Zymo Research EZ DNA Methylation Gold Kit, which converts unmethylated cytosines to uracils and leaves methylated cytosines unchanged. Bisulfite treated DNA is amplified in a final PCR reaction which has been optimized to uniformly amplify diverse fragment sizes and sequence contexts in the same reaction. During this final PCR reaction uracils are copied as thymines resulting in a thymine in the PCR products wherever an unmethylated cytosine existed in the genomic DNA. The sample is now ready for sequencing on the Illumina sequencing platform. These libraries were sequenced with an Illumina Genome Analyzer IIx according to the manufacturer's recommendations. Data analysis To analyze the sequence data, a reference genome is created that contains only the 36 base pairs adjacent to every MspI site and every C in those sequences is changed to T. A converted sequence read file is then created by changing each C in the original sequence reads to a T. The converted sequence reads are aligned to the converted reference genome, and only reads that map uniquely to the reference genome are kept. Once reads are aligned the percent methylation is calculated for each CpG using the original sequence reads. The percent methylation and number of reads is reported for each CpG.

Project description:Cytosine methylation in the genome of Drosophila melanogaster has been elusive and controversial: methylcytosine has been detected at very low levels in early embryos, but the genomic location and function of methylation has not been established. We have mapped cytosine methylation genomewide in Stage 5 Drosophila embryo DNA by combining immuno-enrichment for 5-methylcytosine, bisulfite conversion, and deep sequencing. Unlike methylation patterns observed in other eukaryotic species, methylation in Drosophila is punctate and highly strand-asymmetrical; we confirmed this by direct PCR amplification and sequencing of bisulfite-converted DNA. Despite the locally asymmetric nature of methylation, large-scale patterns of methylation are symmetric. Methylated regions make up ~1% of the genome, and within these regions methylation of individual cytosines averages 2-10%. Methylation is concentrated in specific 5-base sequence motifs that are CA- and CT-rich but depleted of guanine. It is depleted from promoters, coding sequences, and most retrotransposons, and enriched in introns and in certain simple sequence repeats containing the commonly methylated motifs. Comparison with available gene expression data indicates that methylation in a gene is associated with lower expression; the X chromosome, which is subject to gene dosage compensation, is more densely methylated than the autosomes. This study firmly establishes the presence of cytosine methylation in Drosophila; the temporal overlap of methylation with the maternal-zygotic transition raises the possibility that methylation participates in the transition to zygotic gene expression. To enrich for rare cytosine methylation in Drosophila at embryonic Stage 5 (2-3 hours post-fertilization), we enriched sonicated Stage 5 genomic DNA for methylcytosine by immunoprecipitation with antibody to 5-methylcytosine. The immunoprecipitated DNA was then bisulfite converted and Illumina sequenced to obtain direct evidence for the presence of methylation. The presence and extent of DNA methylation was confirmed by Illumina sequencing of bisulfite-converted PCR amplicons.

Dataset Information

Whole Genome Bisulfite Sequencing by ENCODE/HAIB

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets