Dataset Information

Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny.


ABSTRACT:

Summary

Copy number variation is an important and abundant source of variation in the human genome and has been associated with a number of diseases, especially cancer. Massively parallel next-generation sequencing allows copy number profiling at fine resolution. Such efforts, however, have met with mixed success, with setbacks arising partly from the lack of reliable analytical methods for the diverse challenges posed by the many experimental designs and study goals in genetic studies. In cancer genomics, detection of somatic copy number changes and profiling of allele-specific copy number (ASCN) are complicated by experimental biases and artifacts, as well as by normal cell contamination and cancer subclone admixture. Furthermore, careful statistical modeling is needed to reconstruct tumor phylogeny from both somatic ASCN changes and single nucleotide variants. Here we describe a flexible computational pipeline, MARATHON, which integrates multiple related statistical software tools for copy number profiling and downstream analyses in disease genetic studies.

Availability and implementation

MARATHON is publicly available at https://github.com/yuchaojiang/MARATHON.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Urrutia E 

PROVIDER: S-EPMC6248831 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

Publications

Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny.

Eugene Urrutia, Hao Chen, Zilu Zhou, Nancy R. Zhang, Yuchao Jiang

Bioinformatics (Oxford, England), 2018-06-01, issue 12


Similar Datasets

| S-EPMC4481700 | biostudies-literature
| S-EPMC9392257 | biostudies-literature
| S-EPMC6171490 | biostudies-literature
| S-EPMC11621028 | biostudies-literature
| S-EPMC10688497 | biostudies-literature
| S-EPMC9882003 | biostudies-literature
| S-EPMC7451135 | biostudies-literature
| S-EPMC8267662 | biostudies-literature
| S-EPMC7331695 | biostudies-literature
| S-EPMC4344483 | biostudies-literature