Unknown

Dataset Information

0

Structural variant analysis of a cancer reference cell line sample using multiple sequencing technologies.


ABSTRACT:

Background

The cancer genome is commonly altered with thousands of structural rearrangements including insertions, deletions, translocation, inversions, duplications, and copy number variations. Thus, structural variant (SV) characterization plays a paramount role in cancer target identification, oncology diagnostics, and personalized medicine. As part of the SEQC2 Consortium effort, the present study established and evaluated a consensus SV call set using a breast cancer reference cell line and matched normal control derived from the same donor, which were used in our companion benchmarking studies as reference samples.

Results

We systematically investigated somatic SVs in the reference cancer cell line by comparing to a matched normal cell line using multiple NGS platforms including Illumina short-read, 10X Genomics linked reads, PacBio long reads, Oxford Nanopore long reads, and high-throughput chromosome conformation capture (Hi-C). We established a consensus SV call set of a total of 1788 SVs including 717 deletions, 230 duplications, 551 insertions, 133 inversions, 146 translocations, and 11 breakends for the reference cancer cell line. To independently evaluate and cross-validate the accuracy of our consensus SV call set, we used orthogonal methods including PCR-based validation, Affymetrix arrays, Bionano optical mapping, and identification of fusion genes detected from RNA-seq. We evaluated the strengths and weaknesses of each NGS technology for SV determination, and our findings provide an actionable guide to improve cancer genome SV detection sensitivity and accuracy.

Conclusions

A high-confidence consensus SV call set was established for the reference cancer cell line. A large subset of the variants identified was validated by multiple orthogonal methods.

SUBMITTER: Talsania K 

PROVIDER: S-EPMC9746098 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structural variant analysis of a cancer reference cell line sample using multiple sequencing technologies.

Talsania Keyur K   Shen Tsai-Wei TW   Chen Xiongfong X   Jaeger Erich E   Li Zhipan Z   Chen Zhong Z   Chen Wanqiu W   Tran Bao B   Kusko Rebecca R   Wang Limin L   Pang Andy Wing Chun AWC   Yang Zhaowei Z   Choudhari Sulbha S   Colgan Michael M   Fang Li Tai LT   Carroll Andrew A   Shetty Jyoti J   Kriga Yuliya Y   German Oksana O   Smirnova Tatyana T   Liu Tiantain T   Li Jing J   Kellman Ben B   Hong Karl K   Hastie Alex R AR   Natarajan Aparna A   Moshrefi Ali A   Granat Anastasiya A   Truong Tiffany T   Bombardi Robin R   Mankinen Veronnica V   Meerzaman Daoud D   Mason Christopher E CE   Collins Jack J   Stahlberg Eric E   Xiao Chunlin C   Wang Charles C   Xiao Wenming W   Zhao Yongmei Y  

Genome biology 20221213 1


<h4>Background</h4>The cancer genome is commonly altered with thousands of structural rearrangements including insertions, deletions, translocation, inversions, duplications, and copy number variations. Thus, structural variant (SV) characterization plays a paramount role in cancer target identification, oncology diagnostics, and personalized medicine. As part of the SEQC2 Consortium effort, the present study established and evaluated a consensus SV call set using a breast cancer reference cell  ...[more]

Similar Datasets

| S-EPMC11245320 | biostudies-literature
| S-EPMC5411774 | biostudies-literature
| S-EPMC11370364 | biostudies-literature
| S-EPMC9510867 | biostudies-literature
2008-06-12 | E-GEOD-2723 | biostudies-arrayexpress
| S-EPMC8219666 | biostudies-literature
| S-EPMC8456427 | biostudies-literature
| S-EPMC7760597 | biostudies-literature
2005-06-03 | GSE2723 | GEO
| S-EPMC11687947 | biostudies-literature