Rapidly expanding genetic diversity and host range of the Circoviridae viral family and other Rep encoding small circular ssDNA genomes.
ABSTRACT: The genomes of numerous circoviruses and distantly related circular ssDNA viruses encoding a rolling circle replication initiator protein (Rep) have been characterized from the tissues of mammals, fish, insects, plants (geminivirus and nanovirus), in human and animal feces, in an algae cell, and in diverse environmental samples. We review the genome organization, phylogenetic relationships and initial prevalence studies of cycloviruses, a proposed new genus in the Circoviridae family. Viral fossil rep sequences were also recently identified integrated on the chromosomes of mammals, frogs, lancelets, crustaceans, mites, gastropods, roundworms, placozoans, hydrozoans, protozoans, land plants, fungi, algae, and phytoplasma bacterias and their plasmids, reflecting the very wide past host range of rep bearing viruses. An ancient origin for viruses with Rep-encoding small circular ssDNA genomes, predating the diversification of eukaryotes, is discussed. The cellular hosts and pathogenicity of many recently described rep-containing circular ssDNA genomes remain to be determined. Future studies of the virome of single cell and multi-cellular eukaryotes are likely to further extend the known diversity and host-range of small rep-containing circular ssDNA viral genomes.
Project description:A virus with a circular Rep-encoding single-stranded DNA (ssDNA) (CRESS-DNA) genome (PmCV-1) was isolated from Penaeus monodon shrimps in Vietnam. The gene structure of the 1,777-nucleotide (nt) genome was similar to that of circoviruses and cycloviruses, but the nucleic acid and protein sequence identities to these viruses were very low.
Project description:Circoviruses are highly prevalent porcine and avian pathogens. In recent years, novel circular ssDNA genomes have recently been detected in a variety of fecal and environmental samples using deep sequencing approaches. In this study the identification of genomes of novel circoviruses and cycloviruses in feces of insectivorous bats is reported. Pan-reactive primers were used targeting the conserved rep region of circoviruses and cycloviruses to screen DNA bat fecal samples. Using this approach, partial rep sequences were detected which formed five phylogenetic groups distributed among the Circovirus and the recently proposed Cyclovirus genera of the Circoviridae. Further analysis using inverse PCR and Sanger sequencing led to the characterization of four new putative members of the family Circoviridae with genome size ranging from 1,608 to 1,790 nt, two inversely arranged ORFs, and canonical nonamer sequences atop a stem loop.
Project description:Nanoviruses are a family of plant viruses that possess a genome of multiple circular single-stranded DNA (ssDNA) components and are strikingly similar in their replication mode to the plant geminiviruses and to the circoviruses that infect birds or mammals. These viruses multiply by rolling circle replication using virus-encoded multifunctional replication initiator proteins (Rep proteins) that catalyze the initiation of replication on a double-stranded DNA (dsDNA) intermediate and the resolution of the ssDNA into circles. Here we report the solution NMR three-dimensional structure of the endonuclease domain from the master Rep (M-Rep) protein of faba bean necrotic yellows virus (FBNYV), a representative of the nanoviruses. The domain comprises amino acids 2-95 (M-Rep2-95), and its global fold is similar to those previously described for the gemini- and circovirus Rep endonuclease domains, consisting of a central 5-stranded antiparallel beta-sheet covered on one side by an alpha-helix and irregular loops and on the other, more open side of the domain, by an alpha-helix containing the catalytic tyrosine residue (the catalytic helix). Longer domain constructs extending to amino acids 117 and 124 were also characterized. They contain an additional alpha-helix, are monomeric, and exhibit catalytic activity indistinguishable from that of M-Rep2-95. The binding site for the catalytic metal was identified by paramagnetic broadening and maps to residues on the exposed face of the central beta-sheet. A comparison with the previously determined Rep endonuclease domain structures of tomato yellow leaf curl Sardinia virus (TYLCSV), a geminivirus, and that of porcine circovirus type 2 (PCV2) Rep allows the identification of a positively charged surface that is most likely involved in dsDNA binding, and reveals common features shared by all endonuclease domains of nanovirus, geminivirus, and circovirus Rep proteins.
Project description:Viral metagenomics has recently revealed the ubiquitous and diverse nature of single-stranded DNA (ssDNA) viruses that encode a conserved replication initiator protein (Rep) in the marine environment. Although eukaryotic circular Rep-encoding ssDNA (CRESS-DNA) viruses were originally thought to only infect plants and vertebrates, recent studies have identified these viruses in a number of invertebrates. To further explore CRESS-DNA viruses in the marine environment, this study surveyed CRESS-DNA viruses in various marine invertebrate species. A total of 27 novel CRESS-DNA genomes, with Reps that share less than 60.1% identity with previously reported viruses, were recovered from 21 invertebrate species, mainly crustaceans. Phylogenetic analysis based on the Rep revealed a novel clade of CRESS-DNA viruses that included approximately one third of the marine invertebrate associated viruses identified here and whose members may represent a novel family. Investigation of putative capsid proteins (Cap) encoded within the eukaryotic CRESS-DNA viral genomes from this study and those in GenBank demonstrated conserved patterns of predicted intrinsically disordered regions (IDRs), which can be used to complement similarity-based searches to identify divergent structural proteins within novel genomes. Overall, this study expands our knowledge of CRESS-DNA viruses associated with invertebrates and explores a new tool to evaluate divergent structural proteins encoded by these viruses.
Project description:Viral metagenomics of feces collected from 58 Peruvian children with unexplained diarrhea revealed several small circular ssDNA genomes. Two genomes related to sequences previously reported in feces from chimpanzees and other mammals and recently named smacoviruses were characterized and then detected by PCR in 1.7 % (1/58) and 19 % (11/58) of diarrheal samples, respectively. Another three genomes from a distinct small circular ssDNA viral group provisionally called pecoviruses encoded Cap and Rep proteins with <35 % identity to those in related genomes reported in human, seal, porcine and dromedary feces. Pecovirus DNA was detected in 15.5 % (9/58), 5.9 % (3/51) and 3 % (3/100) of fecal samples from unexplained diarrhea in Peru, Nicaragua and Chile, respectively. Feces containing these ssDNA genomes also contained known human enteric viral pathogens. The cellular origins of these circular ssDNA viruses, whether human cells, ingested plants, animals or fungal foods, or residents of the gut microbiome, are currently unknown.
Project description:Mycoviruses are viruses that infect fungi and have the potential to control fungal diseases of crops when associated with hypovirulence. Typically, mycoviruses have double-stranded (ds) or single-stranded (ss) RNA genomes. No mycoviruses with DNA genomes have previously been reported. Here, we describe a hypovirulence-associated circular ssDNA mycovirus from the plant pathogenic fungus Sclerotinia sclerotiorum. The genome of this ssDNA virus, named Sclerotinia sclerotiorum hypovirulence-associated DNA virus 1 (SsHADV-1), is 2166 nt, coding for a replication initiation protein (Rep) and a coat protein (CP). Although phylogenetic analysis of Rep showed that SsHADV-1 is related to geminiviruses, it is notably distinct from geminiviruses both in genome organization and particle morphology. Polyethylene glycol-mediated transfection of fungal protoplasts was successful with either purified SsHADV-1 particles or viral DNA isolated directly from infected mycelium. The discovery of an ssDNA mycovirus enhances the potential of exploring fungal viruses as valuable tools for molecular manipulation of fungi and for plant disease control and expands our knowledge of global virus ecology and evolution.
Project description:BACKGROUND: In addition to vertical transmission, organisms can also acquire genes from other distantly related species or from their extra-chromosomal elements (plasmids and viruses) via horizontal gene transfer (HGT). It has been suggested that phages represent substantial forces in prokaryotic evolution. In eukaryotes, retroviruses, which can integrate into host genome as an obligate step in their replication strategy, comprise approximately 8% of the human genome. Unlike retroviruses, few members of other virus families are known to transfer genes to host genomes. RESULTS: Here we performed a systematic search for sequences related to circular single-stranded DNA (ssDNA) viruses in publicly available eukaryotic genome databases followed by comprehensive phylogenetic analysis. We conclude that the replication initiation protein (Rep)-related sequences of geminiviruses, nanoviruses and circoviruses have been frequently transferred to a broad range of eukaryotic species, including plants, fungi, animals and protists. Some of the transferred viral genes were conserved and expressed, suggesting that these genes have been coopted to assume cellular functions in the host genomes. We also identified geminivirus-like and parvovirus-like transposable elements in genomes of fungi and lower animals, respectively, and thereby provide direct evidence that eukaryotic transposons could derive from ssDNA viruses. CONCLUSIONS: Our discovery extends the host range of circular ssDNA viruses and sheds light on the origin and evolution of these viruses. It also suggests that ssDNA viruses act as an unforeseen source of genetic innovation in their hosts.
Project description:Single-stranded (ss) DNA viruses are a major component of the earth virome. In particular, the circular, Rep-encoding ssDNA (CRESS-DNA) viruses show high diversity and abundance in various habitats. By combining sequence similarity network and phylogenetic analyses of the replication proteins (Rep) belonging to the HUH endonuclease superfamily, we show that the replication machinery of the CRESS-DNA viruses evolved, on three independent occasions, from the Reps of bacterial rolling circle-replicating plasmids. The CRESS-DNA viruses emerged via recombination between such plasmids and cDNA copies of capsid genes of eukaryotic positive-sense RNA viruses. Similarly, the rep genes of prokaryotic DNA viruses appear to have evolved from HUH endonuclease genes of various bacterial and archaeal plasmids. Our findings also suggest that eukaryotic polyomaviruses and papillomaviruses with dsDNA genomes have evolved via parvoviruses from CRESS-DNA viruses. Collectively, our results shed light on the complex evolutionary history of a major class of viruses revealing its polyphyletic origins.
Project description:Replication initiation of nanoviruses, plant viruses with a multipartite circular single-stranded DNA genome, is triggered by the master Rep (M-Rep) protein. To enable the study of interactions between M-Rep and viral or host factors involved in replication, we designed oligohistidine-tagged variants of the nanovirus Faba bean necrotic yellows virus (FBNYV) M-Rep protein that allow affinity purification of enzymatically active M-Rep from plant tissue. The tagged M-Rep protein was able to initiate replication of its cognate and other FBNYV DNAs in Nicotiana benthamiana leaf disks and plants. The replicon encoding the tagged M-Rep protein multiplied and moved systemically in FBNYV-infected Vicia faba plants and was transmitted by the aphid vector of the virus. Using the tagged M-Rep protein, we demonstrated the in planta interaction between wild-type M-Rep and its tagged counterpart. Such a tagged and fully functional replication initiator protein will have bearings on the isolation of protein complexes from plants.
Project description:BACKGROUND:Viruses strongly influence microbial population dynamics and ecosystem functions. However, our ability to quantitatively evaluate those viral impacts is limited to the few cultivated viruses and double-stranded DNA (dsDNA) viral genomes captured in quantitative viral metagenomes (viromes). This leaves the ecology of non-dsDNA viruses nearly unknown, including single-stranded DNA (ssDNA) viruses that have been frequently observed in viromes, but not quantified due to amplification biases in sequencing library preparations (Multiple Displacement Amplification, Linker Amplification or Tagmentation). METHODS:Here we designed mock viral communities including both ssDNA and dsDNA viruses to evaluate the capability of a sequencing library preparation approach including an Adaptase step prior to Linker Amplification for quantitative amplification of both dsDNA and ssDNA templates. We then surveyed aquatic samples to provide first estimates of the abundance of ssDNA viruses. RESULTS:Mock community experiments confirmed the biased nature of existing library preparation methods for ssDNA templates (either largely enriched or selected against) and showed that the protocol using Adaptase plus Linker Amplification yielded viromes that were ±1.8-fold quantitative for ssDNA and dsDNA viruses. Application of this protocol to community virus DNA from three freshwater and three marine samples revealed that ssDNA viruses as a whole represent only a minor fraction (<5%) of DNA virus communities, though individual ssDNA genomes, both eukaryote-infecting Circular Rep-Encoding Single-Stranded DNA (CRESS-DNA) viruses and bacteriophages from the Microviridae family, can be among the most abundant viral genomes in a sample. DISCUSSION:Together these findings provide empirical data for a new virome library preparation protocol, and a first estimate of ssDNA virus abundance in aquatic systems.