
Dataset Information


Unsupervised detection of fragment length signatures of circulating tumor DNA using non-negative matrix factorization.


ABSTRACT: Sequencing of cell-free DNA (cfDNA) is currently being used to detect cancer by searching for both mutational and non-mutational alterations. Recent work has shown that the length distribution of cfDNA fragments from a cancer patient can inform tumor load and type. Here, we propose non-negative matrix factorization (NMF) of fragment length distributions as a novel and completely unsupervised method for studying fragment length patterns in cfDNA. Using shallow whole-genome sequencing (sWGS) of cfDNA from a cohort of patients with metastatic castration-resistant prostate cancer (mCRPC), we demonstrate that NMF accurately infers the true tumor fragment length distribution as an NMF component, and that the sample weights of this component correlate with circulating tumor DNA (ctDNA) levels (r=0.75). We further demonstrate that using several NMF components enables accurate cancer detection on data from various early-stage cancers (AUC = 0.96). Finally, we show that NMF, when applied across genomic regions, can be used to discover fragment length signatures associated with open chromatin.
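
The idea of factorizing fragment length distributions can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the two Gaussian-shaped "signatures" (peaks at 167 bp and 145 bp), the sample count, and the helper names `nmf` and `peak` are assumptions made for illustration, and the solver is the standard Lee-Seung multiplicative update for a least-squares loss, which may differ from the one used in the paper.

```python
import numpy as np

def nmf(V, k, n_iter=2000, seed=0, eps=1e-12):
    """Factor a non-negative matrix V (samples x length bins) as W @ H
    using Lee-Seung multiplicative updates (Frobenius loss)."""
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k)) + eps   # sample weights
    H = rng.random((k, m)) + eps   # fragment length signatures
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    # Rescale so every signature sums to 1 (a length distribution);
    # the product W @ H is unchanged by this rescaling.
    scale = H.sum(axis=1)
    return W * scale[None, :], H / scale[:, None]

# Synthetic cohort (illustrative assumption, not data from the study).
lengths = np.arange(100, 221)  # fragment lengths in bp

def peak(mu, sd):
    p = np.exp(-0.5 * ((lengths - mu) / sd) ** 2)
    return p / p.sum()

normal_sig = peak(167, 10)  # assumed background signature
tumor_sig = peak(145, 10)   # assumed shorter tumor-derived signature

rng = np.random.default_rng(1)
tumor_frac = rng.uniform(0.0, 0.5, size=30)  # simulated ctDNA fraction per sample
V = np.outer(1 - tumor_frac, normal_sig) + np.outer(tumor_frac, tumor_sig)

W, H = nmf(V, k=2)

# Treat the component with the shorter mean fragment length as tumor-like.
mean_len = H @ lengths
tumor_comp = int(np.argmin(mean_len))
weights = W[:, tumor_comp] / W.sum(axis=1)

# Correlation between the inferred weights and the simulated tumor fraction.
r = np.corrcoef(weights, tumor_frac)[0, 1]
print(f"mean lengths per component: {mean_len.round(1)}, r = {r:.2f}")
```

Each row of `V` is one sample's normalized fragment length histogram, each row of `H` is an inferred signature, and each row of `W` gives that sample's weights. NMF solutions are not unique, so the recovered signatures can be mixtures of the simulated ones even when the weights track the tumor fraction well.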

SUBMITTER: Renaud G 

PROVIDER: S-EPMC9363120 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature



Similar Datasets

| S-EPMC4948782 | biostudies-literature
| S-EPMC10165836 | biostudies-literature
| S-EPMC3479143 | biostudies-literature
| S-EPMC5746986 | biostudies-literature
| S-EPMC11924690 | biostudies-literature
| S-EPMC6559648 | biostudies-literature
| S-EPMC10009971 | biostudies-literature
| S-EPMC6483061 | biostudies-literature
| S-EPMC8044499 | biostudies-literature
| S-EPMC7355241 | biostudies-literature