Unknown

Dataset Information

0

Structome: a tool for the rapid assembly of datasets for structural phylogenetics.


ABSTRACT:

Summary

Protein structures carry signal of common ancestry and can therefore aid in reconstructing their evolutionary histories. To expedite the structure-informed inference process, a web server, Structome, has been developed that allows users to rapidly identify protein structures similar to a query protein and to assemble datasets useful for structure-based phylogenetics. Structome was created by clustering ∼94% of the structures in RCSB PDB using 90% sequence identity and representing each cluster by a centroid structure. Structure similarity between centroid proteins was calculated, and annotations from PDB, SCOP, and CATH were integrated. To illustrate utility, an H3 histone was used as a query, and results show that the protein structures returned by Structome span both sequence and structural diversity of the histone fold. Additionally, the pre-computed nexus-formatted distance matrix, provided by Structome, enables analysis of evolutionary relationships between proteins not identifiable using searches based on sequence similarity alone. Our results demonstrate that, beginning with a single structure, Structome can be used to rapidly generate a dataset of structural neighbours and allows deep evolutionary history of proteins to be studied.

Availability and implementation

Structome is available at: https://structome.bii.a-star.edu.sg.

SUBMITTER: Malik AJ 

PROVIDER: S-EPMC10692761 | biostudies-literature | 2023

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structome: a tool for the rapid assembly of datasets for structural phylogenetics.

Malik Ashar J AJ   Langer Desiree D   Verma Chandra S CS   Poole Anthony M AM   Allison Jane R JR  

Bioinformatics advances 20231003 1


<h4>Summary</h4>Protein structures carry signal of common ancestry and can therefore aid in reconstructing their evolutionary histories. To expedite the structure-informed inference process, a web server, Structome, has been developed that allows users to rapidly identify protein structures similar to a query protein and to assemble datasets useful for structure-based phylogenetics. Structome was created by clustering ∼94% of the structures in RCSB PDB using 90% sequence identity and representin  ...[more]

Similar Datasets

| S-EPMC3561365 | biostudies-literature
| S-EPMC7475046 | biostudies-literature
| S-EPMC6764373 | biostudies-literature
| S-EPMC4467657 | biostudies-literature
| S-EPMC3299638 | biostudies-literature
| S-EPMC5552126 | biostudies-literature
| S-EPMC2925852 | biostudies-other
| S-EPMC10098151 | biostudies-literature
| S-EPMC7020997 | biostudies-literature
| PRJEB78482 | ENA