Dataset Information

Identification of emerging viral genomes in transcriptomic datasets of alfalfa (Medicago sativa L.).


ABSTRACT: BACKGROUND: Publicly available transcriptomic datasets have become a valuable tool for the discovery of new pathogens, particularly viruses. In this study, several coding-complete viral genomes not previously found or experimentally confirmed in alfalfa were identified in plant datasets retrieved from the NCBI Sequence Read Archive (SRA). METHODS: Publicly available Medicago spp. transcriptomic datasets were retrieved from the NCBI SRA database. The raw reads were first mapped to the reference genomes of Medicago sativa and Medicago truncatula. The unmapped reads were then aligned to the NCBI viral genome database and assembled de novo using the SPAdes tool. When possible, assemblies were experimentally confirmed using 5'/3' RACE and RT-PCR. RESULTS: Twenty-three different viruses were identified in the analyzed datasets, several of which were emerging viruses not reported in alfalfa prior to this study. Among them were two strains of cnidium vein yellowing virus, lychnis mottle virus and Cactus virus X, for which coding-complete genomic sequences were obtained by de novo assembly. CONCLUSIONS: The results improve our knowledge of the diversity and host range of viruses infecting alfalfa, provide essential tools for their diagnostics and characterization, and demonstrate the utility of transcriptomic datasets for the discovery of new pathogens.
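
The host-subtraction logic described in METHODS can be illustrated with a toy sketch. It is not the authors' pipeline: the record does not name the read mapper or its parameters, so exact k-mer sharing stands in for alignment here, and de novo assembly is not modelled at all. The function names, the k-mer length and the sequences in the usage note are all hypothetical.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(refs, k):
    """Map each k-mer to the name of the first reference that contains it."""
    index = {}
    for name, seq in refs.items():
        for kmer in kmers(seq, k):
            index.setdefault(kmer, name)
    return index


def first_hit(read, index, k):
    """Return the reference name of the first indexed k-mer in read, or None."""
    for i in range(len(read) - k + 1):
        name = index.get(read[i:i + k])
        if name is not None:
            return name
    return None


def screen_reads(reads, host_refs, viral_refs, k=8):
    """Toy stand-in for 'map to host, then align the rest to viral references'.

    Returns (number of host reads, per-virus read counts, unassigned reads).
    Assumption: sharing one exact k-mer counts as a match, which is far
    cruder than a real aligner.
    """
    host_index = build_index(host_refs, k)
    viral_index = build_index(viral_refs, k)
    host_reads = 0
    viral_hits = Counter()
    unassigned = []
    for read in reads:
        # Step 1: discard reads that look like host sequence.
        if first_hit(read, host_index, k) is not None:
            host_reads += 1
            continue
        # Step 2: compare the remaining reads against known viral references.
        virus = first_hit(read, viral_index, k)
        if virus is not None:
            viral_hits[virus] += 1
        else:
            unassigned.append(read)
    return host_reads, dict(viral_hits), unassigned
```

Called with one made-up host sequence, one made-up viral sequence and three reads (one cut from each reference plus one unrelated read), the function counts one host read, one viral read and one unassigned read. In a real analysis the unassigned fraction is where an assembler such as SPAdes could recover genomes that have no close reference.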

SUBMITTER: Jiang P 

PROVIDER: S-EPMC6902351 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8428138 | biostudies-literature
| S-EPMC7460629 | biostudies-literature
| S-EPMC1148778 | biostudies-other
| S-EPMC4961700 | biostudies-other
| S-EPMC6489223 | biostudies-literature
2022-03-31 | GSE197482 | GEO