Transcriptomics,Genomics

Dataset Information

242

U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line


ABSTRACT: U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30x genomic sequence coverage using a novel 50-base mate paired strategy with a 1.4kb mean insert library. A total of 1,014,984,286 mate-end and 120,691,623 single-end two-base encoded reads were generated from five slides. All data were aligned using a custom designed tool called BFAST, allowing optimal color space read alignment and accurate identification of DNA variants. The aligned sequence reads and mate pair information identified 35 interchromosomal translocation events, 1,315 structural variations (>100bp), 191,743 small (<21bp) insertions and deletions (indels), and 2,384,470 single nucleotide variations (SNVs). Among these observations, the known homozygous mutation in PTEN was robustly identified, and genes involved in cell adhesion were overrepresented in the mutated gene list. Data were compared to 219,187 heterozygous single nucleotide polymorphisms assayed by Illumina 1M Duo genotyping array to assess accuracy: 93.83% of all SNPs were reliably detected at filtering thresholds that yield greater than 99.99% sequence accuracy. Protein coding sequences were disrupted predominantly in this cancer cell line due to small indels, large deletions and translocations. In total, 512 genes were homozygously mutated, including 154 by SNVs, 178 by small indels, 145 by large microdeletions and 35 by interchromosomal translocations to reveal a highly mutated cell line genome. Of the small homozygously mutated variants, 8 SNVs and 99 indels were novel events not present in dbSNP. These data demonstrate that routine generation of broad cancer genome sequence is possible outside of genome centers. The sequence analysis of U87MG provides an unparalleled level of mutational resolution compared to any cell line to date. Whole genome sequencing of the U87MG brain cancer cell line using the AB SOLiD3 sequencer and genotyping using the Illumina Human1M-Duov3 DNA Analysis BeadChip

ORGANISM(S): Homo sapiens  

SUBMITTER: Stanley F Nelson   Michael J Clark 

PROVIDER: E-GEOD-19986 | ArrayExpress | 2010-01-24

SECONDARY ACCESSION(S): SRP001699GSE19986PRJNA120147

REPOSITORIES: GEO, ArrayExpress, ENA

altmetric image

Publications

U87MG decoded: the genomic sequence of a cytogenetically aberrant human cancer cell line.

Clark Michael James MJ   Homer Nils N   O'Connor Brian D BD   Chen Zugen Z   Eskin Ascia A   Lee Hane H   Merriman Barry B   Nelson Stanley F SF  

PLoS genetics 20100129 1


U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30x genomic sequence coverage using a novel 50-base mate paired strategy with a 1.4kb mean insert library. A total of 1,014,984,286 mate-end and 120,691,623 single-end two-base encoded reads were generated from fi  ...[more]

Similar Datasets

2015-01-01 | S-EPMC4418901 | BioStudies
2011-01-01 | S-EPMC3106329 | BioStudies
2015-01-01 | S-EPMC4309431 | BioStudies
2016-01-01 | S-EPMC5812775 | BioStudies
2015-01-01 | S-EPMC4300219 | BioStudies
2016-01-01 | S-EPMC4977481 | BioStudies
2016-06-30 | E-GEOD-72617 | ArrayExpress
2019-01-01 | S-EPMC6397097 | BioStudies
2017-01-01 | S-EPMC5204340 | BioStudies
2018-01-01 | S-EPMC5885007 | BioStudies