Unknown

Dataset Information

0

Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations.


ABSTRACT: Clonal populations accumulate mutations over time, resulting in different haplotypes. Deep sequencing of such a population in principle provides information to reconstruct these haplotypes and the frequency at which the haplotypes occur. However, this reconstruction is technically not trivial, especially not in clonal systems with a relatively low mutation frequency. The low number of segregating sites in those systems adds ambiguity to the haplotype phasing and thus obviates the reconstruction of genome-wide haplotypes based on sequence overlap information.Therefore, we present EVORhA, a haplotype reconstruction method that complements phasing information in the non-empty read overlap with the frequency estimations of inferred local haplotypes. As was shown with simulated data, as soon as read lengths and/or mutation rates become restrictive for state-of-the-art methods, the use of this additional frequency information allows EVORhA to still reliably reconstruct genome-wide haplotypes. On real data, we show the applicability of the method in reconstructing the population composition of evolved bacterial populations and in decomposing mixed bacterial infections from clinical samples.

SUBMITTER: Pulido-Tamayo S 

PROVIDER: S-EPMC4652744 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Frequency-based haplotype reconstruction from deep sequencing data of bacterial populations.

Pulido-Tamayo Sergio S   Sánchez-Rodríguez Aminael A   Swings Toon T   Van den Bergh Bram B   Dubey Akanksha A   Steenackers Hans H   Michiels Jan J   Fostier Jan J   Marchal Kathleen K  

Nucleic acids research 20150518 16


Clonal populations accumulate mutations over time, resulting in different haplotypes. Deep sequencing of such a population in principle provides information to reconstruct these haplotypes and the frequency at which the haplotypes occur. However, this reconstruction is technically not trivial, especially not in clonal systems with a relatively low mutation frequency. The low number of segregating sites in those systems adds ambiguity to the haplotype phasing and thus obviates the reconstruction  ...[more]

Similar Datasets

| S-EPMC8042772 | biostudies-literature
| S-EPMC3439898 | biostudies-other
| S-EPMC6931272 | biostudies-literature
2013-11-03 | E-GEOD-48592 | biostudies-arrayexpress
| S-EPMC8735729 | biostudies-literature
2013-11-03 | GSE48592 | GEO
| S-EPMC4180835 | biostudies-literature
| S-EPMC4132706 | biostudies-literature
| S-EPMC8504635 | biostudies-literature