Genomics

Dataset Information

0

Harnessing Bronchoalveolar Lavage Metagenomics for Distinguishing Lung Cancer from Pulmonary Infectious Diseases


ABSTRACT: Recent progress in unbiased metagenomic next-generation sequencing (mNGS) allows simultaneous examination of microbial and host genetic material in a single test. Leveraging affordable bronchoalveolar lavage fluid (BALF) mNGS data, we employed machine learning to create a diagnostic approach distinguishing lung cancer from pulmonary infections, conditions prone to misdiagnosis in clinical settings. This prospective study analyzed BALF-mNGS data from lung cancer and pulmonary infection patients, delineating differences in DNA/RNA microbial composition, bacteriophage abundances, and host responses, including gene expression, transposable element levels, immune cell composition, and tumor fraction derived from copy number variation (CNV). Integrating these metrics into a host/microbe metagenomics-driven machine learning model (Model VI) demonstrated robustness, achieving an AUC of 0.87 (95% CI = 0.857-0.883), sensitivity = 73.8%, and specificity = 84.5% in the training cohort, and an AUC of 0.831 (95% CI = 0.819-0.843), sensitivity = 67.1%, and specificity = 94.4% in the validation cohort for distinguishing lung cancer from pulmonary infections. The application of a rule-in and rule-out strategy-based composite predictive model significantly enhances accuracy (ACC) in distinguishing between lung cancer and tuberculosis (ACC=0.913), fungal infection (ACC=0.955), and bacterial infection (ACC=0.836). These findings highlight the potential of cost-effective mNGS-based analysis as a valuable tool for early differentiation between lung cancer and pulmonary infections, offering significant benefits through a single comprehensive testing.

ORGANISM(S): Homo sapiens

PROVIDER: GSE252118 | GEO | 2024/01/08

REPOSITORIES: GEO

Similar Datasets

2023-12-18 | GSE244210 | GEO
2024-01-26 | PXD046731 | Pride
2015-05-17 | E-GEOD-66499 | biostudies-arrayexpress
2018-10-01 | GSE104251 | GEO
2019-01-17 | GSE99997 | GEO
2019-05-30 | PXD012645 | Pride
2008-12-31 | GSE14245 | GEO
2013-07-12 | E-GEOD-48787 | biostudies-arrayexpress
2024-01-16 | MTBLS6990 | MetaboLights
2015-05-17 | GSE66499 | GEO