Ontology highlight
ABSTRACT:
SUBMITTER: Shen H
PROVIDER: S-EPMC10883418 | biostudies-literature | 2024 Jan
REPOSITORIES: biostudies-literature
Shen Hongru H Liu Jilei J Chen Kexin K Li Xiangchun X
Briefings in bioinformatics 20240101 2
We present a language model Affordable Cancer Interception and Diagnostics (ACID) that can achieve high classification performance in the diagnosis of cancer exclusively from using raw cfDNA sequencing reads. We formulate ACID as an autoregressive language model. ACID is pretrained with language sentences that are obtained from concatenation of raw sequencing reads and diagnostic labels. We benchmark ACID against three methods. On testing set subjected to whole-genome sequencing, ACID significan ...[more]