Unknown

Dataset Information

0

Reconstruction of clonal trees and tumor composition from multi-sample sequencing data.


ABSTRACT:

Motivation

DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor.

Results

We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an integer linear programming solution to the VAF factorization problem in the case of error-free data and extend this solution to real data with a probabilistic model for errors. The resulting AncesTree algorithm is better able to identify ancestral relationships between individual mutations than existing approaches, particularly in ultra-deep sequencing data when high read counts for mutations yield high confidence VAFs.

Availability and implementation

An implementation of AncesTree is available at: http://compbio.cs.brown.edu/software.

SUBMITTER: El-Kebir M 

PROVIDER: S-EPMC4542783 | biostudies-literature | 2015 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reconstruction of clonal trees and tumor composition from multi-sample sequencing data.

El-Kebir Mohammed M   Oesper Layla L   Acheson-Field Hannah H   Raphael Benjamin J BJ  

Bioinformatics (Oxford, England) 20150601 12


<h4>Motivation</h4>DNA sequencing of multiple samples from the same tumor provides data to analyze the process of clonal evolution in the population of cells that give rise to a tumor.<h4>Results</h4>We formalize the problem of reconstructing the clonal evolution of a tumor using single-nucleotide mutations as the variant allele frequency (VAF) factorization problem. We derive a combinatorial characterization of the solutions to this problem and show that the problem is NP-complete. We derive an  ...[more]

Similar Datasets

| S-EPMC6936074 | biostudies-literature
| S-EPMC8215921 | biostudies-literature
| S-EPMC11480863 | biostudies-literature
| S-EPMC7468132 | biostudies-literature
| S-EPMC4062932 | biostudies-literature
| S-EPMC11483135 | biostudies-literature
| S-EPMC8419336 | biostudies-literature
| S-EPMC11661639 | biostudies-literature
| S-EPMC5606061 | biostudies-literature
| S-EPMC4593588 | biostudies-literature