Unknown

Dataset Information

0

Predicted structural proteome of Sphagnum divinum and proteome-scale annotation.


ABSTRACT:

Motivation

Sphagnum-dominated peatlands store a substantial amount of terrestrial carbon. The genus is undersampled and under-studied. No experimental crystal structure from any Sphagnum species exists in the Protein Data Bank and fewer than 200 Sphagnum-related genes have structural models available in the AlphaFold Protein Structure Database. Tools and resources are needed to help bridge these gaps, and to enable the analysis of other structural proteomes now made possible by accurate structure prediction.

Results

We present the predicted structural proteome (25 134 primary transcripts) of Sphagnum divinum computed using AlphaFold, structural alignment results of all high-confidence models against an annotated nonredundant crystallographic database of over 90,000 structures, a structure-based classification of putative Enzyme Commission (EC) numbers across this proteome, and the computational method to perform this proteome-scale structure-based annotation.

Availability and implementation

All data and code are available in public repositories, detailed at https://github.com/BSDExabio/SAFA. The structural models of the S. divinum proteome have been deposited in the ModelArchive repository at https://modelarchive.org/doi/10.5452/ma-ornl-sphdiv.

SUBMITTER: Davidson RB 

PROVIDER: S-EPMC10463551 | biostudies-literature | 2023 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicted structural proteome of Sphagnum divinum and proteome-scale annotation.

Davidson Russell B RB   Coletti Mark M   Gao Mu M   Piatkowski Bryan B   Sreedasyam Avinash A   Quadir Farhan F   Weston David J DJ   Schmutz Jeremy J   Cheng Jianlin J   Skolnick Jeffrey J   Parks Jerry M JM   Sedova Ada A  

Bioinformatics (Oxford, England) 20230801 8


<h4>Motivation</h4>Sphagnum-dominated peatlands store a substantial amount of terrestrial carbon. The genus is undersampled and under-studied. No experimental crystal structure from any Sphagnum species exists in the Protein Data Bank and fewer than 200 Sphagnum-related genes have structural models available in the AlphaFold Protein Structure Database. Tools and resources are needed to help bridge these gaps, and to enable the analysis of other structural proteomes now made possible by accurate  ...[more]

Similar Datasets

| S-EPMC3205055 | biostudies-literature
| S-EPMC2831211 | biostudies-literature
| S-EPMC5390313 | biostudies-literature
| S-EPMC11296891 | biostudies-literature
| S-EPMC9135334 | biostudies-literature
| S-EPMC3285560 | biostudies-literature
| S-EPMC9487898 | biostudies-literature