Project description:Although numerous metagenome, amplicon sequencing-based studies have been conducted to date to characterize marine microbial communities, relatively few have employed full metagenome shotgun sequencing to obtain a broader picture of the functional features of these marine microbial communities. Moreover, most of these studies only performed sporadic sampling, which is insufficient to understand an ecosystem comprehensively. In this study, we regularly conducted seawater sampling along the northeastern Pacific coast of Japan between March 2012 and May 2016. We collected 213 seawater samples and prepared size-based fractions to generate 454 subsets of samples for shotgun metagenome sequencing and analysis. We also determined the sequences of 16S rRNA (n = 111) and 18S rRNA (n = 47) gene amplicons from smaller sample subsets. We thereafter developed the Ocean Monitoring Database for time-series metagenomic data ( http://marine-meta.healthscience.sci.waseda.ac.jp/omd/ ), which provides a three-dimensional bird's-eye view of the data. This database includes results of digital DNA chip analysis, a novel method for estimating ocean characteristics such as water temperature from metagenomic data. Furthermore, we developed a novel classification method that includes more information about viruses than that acquired using BLAST. We further report the discovery of a large number of previously overlooked (TAG)n repeat sequences in the genomes of marine microbes. We predict that the availability of this time-series database will lead to major discoveries in marine microbiome research.
Project description:We report data associated with the identification of three polyhydroxyalkanoate synthase genes (phaC) isolated from the marine bacteria metagenome of Aaptos aaptos marine sponge in the waters of Bidong Island, Terengganu, Malaysia. Our data describe the extraction of bacterial metagenome from sponge tissue, measurement of purity and concentration of extracted metagenome, polymerase chain reaction (PCR)-mediated amplification using degenerate primers targeting Class I and II phaC genes, sequencing at First BASE Laboratories Sdn Bhd, and phylogenetic analysis of identified and known phaC genes. The partial nucleotide sequences were aligned, refined, compared with the Basic Local Alignment Search Tool (BLAST) databases, and released online in GenBank. The data include the identified partial putative phaC and their GenBank accession numbers, which are Rhodocista sp. phaC (MF457754), Pseudomonas sp. phaC (MF437016), and an uncultured bacterium AR5-9d_16 phaC (MF457753).
Project description:Anaerobic ammonium-oxidizing (anammox) bacteria are responsible for a significant portion of the loss of fixed nitrogen from the oceans, making them important players in the global nitrogen cycle. To date, marine anammox bacteria found in marine water columns and sediments worldwide belong almost exclusively to the 'Candidatus Scalindua' species, but the molecular basis of their metabolism and competitive fitness is presently unknown. We applied community sequencing of a marine anammox enrichment culture dominated by 'Candidatus Scalindua profunda' to construct a genome assembly, which was subsequently used to analyse the most abundant gene transcripts and proteins. In the S. profunda assembly, 4756 genes were annotated, and only about half of them showed the highest identity to the only other anammox bacterium of which a metagenome assembly had been constructed so far, the freshwater 'Candidatus Kuenenia stuttgartiensis'. In total, 2016 genes of S. profunda could not be matched to the K. stuttgartiensis metagenome assembly at all, and a similar number of genes in K.stuttgartiensis could not be found in S. profunda. Most of these genes did not have a known function but 98 expressed genes could be attributed to oligopeptide transport, amino acid metabolism, use of organic acids and electron transport. On the basis of the S. profunda metagenome, and environmental metagenome data, we observed pronounced differences in the gene organization and expression of important anammox enzymes, such as hydrazine synthase (HzsAB), nitrite reductase (NirS) and inorganic nitrogen transport proteins. Adaptations of Scalindua to the substrate limitation of the ocean may include highly expressed ammonium, nitrite and oligopeptide transport systems and pathways for the transport, oxidation, and assimilation of small organic compounds that may allow a more versatile lifestyle contributing to the competitive fitness of Scalindua in the marine realm.
Project description:We report 11 bacterial draft genome sequences and 38 metagenome-assembled genomes (MAGs) from marine phytoplankton exometabolite enrichments. The genomes and MAGs represent marine bacteria adapted to the metabolite environment of phycospheres, organic matter-rich regions surrounding phytoplankton cells, and are useful for exploring functional and taxonomic attributes of phytoplankton-associated bacterial communities.
Project description:Viruses have a profound influence on both the ecology and evolution of marine plankton, but the genetic diversity of viral assemblages, particularly those in deeper ocean waters, remains poorly described. Here we report on the construction and analysis of a viral metagenome prepared from below the euphotic zone in a temperate, eutrophic bay of coastal California.We purified viruses from approximately one cubic meter of seawater collected from 200 m depth in Monterey Bay, CA. DNA was extracted from the virus fraction, sheared, and cloned with no prior amplification into a plasmid vector and propagated in E. coli to produce the MBv200m library. Random clones were sequenced by the Sanger method. Sequences were assembled then compared to sequences in GenBank and to other viral metagenomic libraries using BLAST analyses.Only 26% of the 881 sequences remaining after assembly had significant (E?0.001) BLAST hits to sequences in the GenBank nr database, with most being matches to bacteria (15%) and viruses (8%). When BLAST analysis included environmental sequences, 74% of sequences in the MBv200m library had a significant match. Most of these hits (70%) were to microbial metagenome sequences and only 0.7% were to sequences from viral metagenomes. Of the 121 sequences with a significant hit to a known virus, 94% matched bacteriophages (Families Podo-, Sipho-, and Myoviridae) and 6% matched viruses of eukaryotes in the Family Phycodnaviridae (5 sequences) or the Mimivirus (2 sequences). The largest percentages of hits to viral genes of known function were to those involved in DNA modification (25%) or structural genes (17%). Based on reciprocal BLAST analyses, the MBv200m library appeared to be most similar to viral metagenomes from two other bays and least similar to a viral metagenome from the Arctic Ocean.Direct cloning of DNA from diverse marine viruses was feasible and resulted in a distribution of virus types and functional genes at depth that differed in detail, but were broadly similar to those found in surface marine waters. Targeted viral analyses are useful for identifying those components of the greater marine metagenome that circulate in the subcellular size fraction.
Project description:Most current approaches to analyse metagenomic data rely on reference genomes. Novel microbial communities extend far beyond the coverage of reference databases and de novo metagenome assembly from complex microbial communities remains a great challenge. Here we present a novel experimental and bioinformatic framework, metaSort, for effective construction of bacterial genomes from metagenomic samples. MetaSort provides a sorted mini-metagenome approach based on flow cytometry and single-cell sequencing methodologies, and employs new computational algorithms to efficiently recover high-quality genomes from the sorted mini-metagenome by the complementary of the original metagenome. Through extensive evaluations, we demonstrated that metaSort has an excellent and unbiased performance on genome recovery and assembly. Furthermore, we applied metaSort to an unexplored microflora colonized on the surface of marine kelp and successfully recovered 75 high-quality genomes at one time. This approach will greatly improve access to microbial genomes from complex or novel communities.
Project description:When a bacterial genome is compared to the metagenome of an environment it inhabits, most genes recruit at high sequence identity. In free-living bacteria (for instance marine bacteria compared against the ocean metagenome) certain genomic regions are totally absent in recruitment plots, representing therefore genes unique to individual bacterial isolates. We show that these Metagenomic Islands (MIs) are also visible in bacteria living in human hosts when their genomes are compared to sequences from the human microbiome, despite the compartmentalized structure of human-related environments such as the gut. From an applied point of view, MIs of human pathogens (e.g. those identified in enterohaemorragic Escherichia coli against the gut metagenome or in pathogenic Neisseria meningitidis against the oral metagenome) include virulence genes that appear to be absent in related strains or species present in the microbiome of healthy individuals. We propose that this strategy (i.e. recruitment analysis of pathogenic bacteria against the metagenome of healthy subjects) can be used to detect pathogenicity regions in species where the genes involved in virulence are poorly characterized. Using this approach, we detect well-known pathogenicity islands and identify new potential virulence genes in several human pathogens.
Project description:Nucleo-cytoplasmic large DNA viruses are doubled stranded DNA viruses capable of infecting eukaryotic cells. Since the discovery of Mimivirus and Pandoravirus, there has been no doubt about their extraordinary features compared to "classic" viruses. Recently, we reported the expansion of the proposed family Pithoviridae, with the description of Cedratvirus and Orpheovirus, two new viruses related to Pithoviruses. Studying the major capsid protein of Orpheovirus, we detected a homologous sequence in a mine drainage metagenome. The in-depth exploration of this metagenome, using the MG-Digger program, enabled us to retrieve up to 10 contigs with clear evidence of viral sequences. Moreover, phylogenetic analyses further extended our screening with the discovery in another marine metagenome of a second virus closely related to Orpheovirus IHUMI-LCC2. This virus is a misidentified virus confused with and annotated as a Rickettsiales bacterium. It presents a partial genome size of about 170 kbp.
Project description:Marine salterns are composed of several shallow ponds with a salinity gradient, from seawater to salt saturation, with gradually changing microbial populations. Here, we report the metagenome sequencing of the prokaryotic microbiota of two ponds with 13% and 33% salinity from a saltern in Santa Pola, Spain.
Project description:BACKGROUND: The enormous database of microbial DNA generated from the Sargasso Sea metagenome provides a unique opportunity to locate genes participating in different biosynthetic pathways and to attempt to understand the relationship and evolution of those genes. In this article, an analysis of the Sargasso Sea metagenome is made with respect to the seven genes of the tryptophan pathway. RESULTS: At least 5% of all the genes that are related to amino acid biosynthesis are tryptophan (trp) genes. Many contigs and scaffolds contain whole or split operons that are similar to previously analyzed trp gene organizations. Only two scaffolds discovered in this analysis possess a different operon organization of tryptophan pathway genes than those previously known. Many marine organisms lack an operon-type organization of these genes or have mini-operons containing only two trp genes. In addition, the trpB genes from this search reveal that the dichotomous division between trpB_1 and trpB_2 also occurs in organisms from the Sargasso Sea. One cluster was found to contain trpB sequences that were closely related to each other but distinct from most known trpB sequences. CONCLUSION: The data show that trp genes are widely dispersed within this metagenome. The novel organization of these genes and an unusual group of trpB_1 sequences that were found among some of these Sargasso Sea bacteria indicate that there is much to be discovered about both the reason for certain gene orders and the regulation of tryptophan biosynthesis in marine bacteria.