Unknown

Dataset Information

0

FaNDOM: Fast nested distance-based seeding of optical maps.


ABSTRACT: Optical mapping (OM) provides single-molecule readouts of fluorescently labeled sequence motifs on long fragments of DNA, resolved to nucleotide-level coordinates. With the advent of microfluidic technologies for analysis of DNA molecules, it is possible to inexpensively generate long OM data ( >150 kbp) at high coverage. In addition to scaffolding for de novo assembly, OM data can be aligned to a reference genome for identification of genomic structural variants. We introduce FaNDOM (Fast Nested Distance Seeding of Optical Maps)-an optical map alignment tool that greatly reduces the search space of the alignment process. On four benchmark human datasets, FaNDOM was significantly (4-14×) faster than competing tools while maintaining comparable sensitivity and specificity. We used FaNDOM to map variants in three cancer cell lines and identified many biologically interesting structural variants, including deletions, duplications, gene fusions and gene-disrupting rearrangements. FaNDOM is publicly available at https://github.com/jluebeck/FaNDOM.

SUBMITTER: Raeisi Dehkordi S 

PROVIDER: S-EPMC8134938 | biostudies-literature | 2021 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

FaNDOM: Fast nested distance-based seeding of optical maps.

Raeisi Dehkordi Siavash S   Luebeck Jens J   Bafna Vineet V  

Patterns (New York, N.Y.) 20210503 5


Optical mapping (OM) provides single-molecule readouts of fluorescently labeled sequence motifs on long fragments of DNA, resolved to nucleotide-level coordinates. With the advent of microfluidic technologies for analysis of DNA molecules, it is possible to inexpensively generate long OM data ( > 150 kbp) at high coverage. In addition to scaffolding for <i>de novo</i> assembly, OM data can be aligned to a reference genome for identification of genomic structural variants. We introduce FaNDOM (F  ...[more]

Similar Datasets

| S-EPMC5026249 | biostudies-literature
| S-EPMC4576710 | biostudies-literature
| S-EPMC2812899 | biostudies-literature
| S-EPMC6508699 | biostudies-literature
| S-EPMC11638338 | biostudies-literature
| S-EPMC7766091 | biostudies-literature
| S-EPMC9666547 | biostudies-literature
| S-EPMC9763367 | biostudies-literature
| S-EPMC6043419 | biostudies-literature
| S-EPMC8696110 | biostudies-literature