Dataset Information

Efficient and accurate detection of viral sequences at single-cell resolution reveals novel viruses perturbing host gene expression.


ABSTRACT: More than 300,000 mammalian virus species are estimated to cause disease in humans. They inhabit human tissues such as the lungs, blood, and brain and often remain undetected. Efficient and accurate detection of viral infection is vital to understanding its impact on human health and to making accurate predictions to limit adverse effects, such as future epidemics. The increasing use of high-throughput sequencing methods in research, agriculture, and healthcare provides an opportunity for the cost-effective surveillance of viral diversity and investigation of virus-disease correlation. However, existing methods for identifying viruses in sequencing data either rely on, and are therefore limited to, reference genomes or cannot retain single-cell resolution through cell barcode tracking. We introduce a method that accurately and rapidly detects viral sequences in bulk and single-cell transcriptomics data based on highly conserved amino acid domains, which enables the detection of RNA viruses covering up to 10^12 virus species. The analysis of viral presence and host gene expression in parallel at single-cell resolution allows for the characterization of host viromes and the identification of viral tropism and host responses. We applied our method to identify novel viruses in rhesus macaque PBMC data that display cell type specificity and whose presence correlates with altered host gene expression.

SUBMITTER: Luebbert L 

PROVIDER: S-EPMC10760059 | biostudies-literature | 2023 Dec

REPOSITORIES: biostudies-literature

Publications

Efficient and accurate detection of viral sequences at single-cell resolution reveals putative novel viruses perturbing host gene expression.

Luebbert L, Sullivan DK, Carilli M, Hjörleifsson KE, Winnett AV, Chari T, Pachter L

bioRxiv: the preprint server for biology, 2025-01-02


There are an estimated 300,000 mammalian viruses from which infectious diseases in humans may arise. They inhabit human tissues such as the lungs, blood, and brain and often remain undetected. Efficient and accurate detection of viral infection is vital to understanding its impact on human health and to make accurate predictions to limit adverse effects, such as future epidemics.

Similar Datasets

| S-EPMC3663743 | biostudies-literature
| S-EPMC8266615 | biostudies-literature
| S-EPMC8945605 | biostudies-literature
| S-EPMC10749771 | biostudies-literature
2019-09-09 | GSE126213 | GEO
2019-07-10 | GSE124309 | GEO
| S-EPMC8883245 | biostudies-literature
| S-EPMC8826084 | biostudies-literature
2023-10-13 | GSE225614 | GEO
| S-EPMC4784055 | biostudies-literature