Unknown

Dataset Information

0

Soil microbiome dataset from Guanica dry forest in Puerto Rico generated by shotgun sequencing.


ABSTRACT: Guanica dry forest (GDF), located in the southwest area or region of Puerto Rico, is among the most preserved subtropical dry forests in the world [1]. To describe the taxonomic diversity and functional profiles of this environment, metagenomic DNA was extracted from a metagenomic library generated from the GDF. The DNA was shotgun-sequenced using Illumina and analyzed using the MG-RAST server. The diversity profile revealed that the most abundant domain was Bacteria (97.8%) followed by Archaea (1.12%), Eukaryota (1.02%) and Viruses (0.03%). Out of the 50 phyla present, the most abundant was Proteobacteria (41.6%) followed by Actinobacteria (18.7%) and Acidobacteria (7.06%). Moreover, a total of 213 orders, 384 families and 791 genus were identified. The functional profile showed abundance of genes related to Carbohydrates (13.16%), Clustering-based subsystems (13.0%), Amino Acids and Derivatives (9.9%) and Protein Metabolism (8.24%). Furthermore, more specific grouping showed that NULL (21.5%) was the most abundant function group, followed by Plant-Prokaryote DOE project (6.05%), Protein biosynthesis (4.82%), Central carbohydrate metabolism (3.98%), DNA repair (2.72%) and Resistance to antibiotics and toxic compounds (2.66%). This dataset is useful in bioprospecting studies with application in biomedical sciences, biotechnology and microbial, population and applied ecology fields.

SUBMITTER: Sotomayor-Mena RG 

PROVIDER: S-EPMC6926141 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Soil microbiome dataset from Guanica dry forest in Puerto Rico generated by shotgun sequencing.

Sotomayor-Mena Roberto G RG   Rios-Velazquez Carlos C  

Data in brief 20191203


Guanica dry forest (GDF), located in the southwest area or region of Puerto Rico, is among the most preserved subtropical dry forests in the world [1]. To describe the taxonomic diversity and functional profiles of this environment, metagenomic DNA was extracted from a metagenomic library generated from the GDF. The DNA was shotgun-sequenced using Illumina and analyzed using the MG-RAST server. The diversity profile revealed that the most abundant domain was Bacteria (97.8%) followed by Archaea  ...[more]

Similar Datasets

| S-EPMC6247408 | biostudies-literature
| S-EPMC5740410 | biostudies-literature
| S-EPMC4778678 | biostudies-literature
| S-EPMC6461016 | biostudies-literature
| PRJEB26500 | ENA
| PRJEB23349 | ENA
| S-EPMC5417066 | biostudies-literature