Unknown

Dataset Information

0

PSICalc: a novel approach to identifying and ranking critical non-proximal interdependencies within the overall protein structure.


ABSTRACT:

Motivation

AlphaFold has been a major advance in predicting protein structure, but still leaves the problem of determining which sub-molecular components of a protein are essential for it to carry out its function within the cell. Direct coupling analysis predicts two- and three-amino acid contacts, but there may be essential interdependencies that are not proximal within the 3D structure. The problem to be addressed is to design a computational method that locates and ranks essential non-proximal interdependencies within a protein involving five or more amino acids, using large, multiple sequence alignments (MSAs) for both globular and intrinsically unstructured proteins.

Results

We developed PSICalc (Protein Subdomain Interdependency Calculator), a laptop-friendly, pattern-discovery, bioinformatics software tool that analyzes large MSAs for both structured and unstructured proteins, locates both proximal and non-proximal inter-dependent sites, and clusters them into pairwise (second order), third-order and higher-order clusters using a k-modes approach, and provides ranked results within minutes. To aid in visualizing these interdependencies, we developed a graphical user interface that displays these subdomain relationships as a polytree graph. To demonstrate, we provide examples of both proximal and non-proximal interdependencies documented for eukaryotic topoisomerase II including between the unstructured C-terminal domain and the N-terminal domain.

Availability and implementation

https://github.com/jdeweeselab/psicalc-package.

Supplementary information

Supplementary data are available at Bioinformatics Advances online.

SUBMITTER: Townsley TD 

PROVIDER: S-EPMC9710643 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

PSICalc: a novel approach to identifying and ranking critical non-proximal interdependencies within the overall protein structure.

Townsley Thomas D TD   Wilson James T JT   Akers Harrison H   Bryant Timothy T   Cordova Salvador S   Wallace T L TL   Durston Kirk K KK   Deweese Joseph E JE  

Bioinformatics advances 20220818 1


<h4>Motivation</h4>AlphaFold has been a major advance in predicting protein structure, but still leaves the problem of determining which sub-molecular components of a protein are essential for it to carry out its function within the cell. Direct coupling analysis predicts two- and three-amino acid contacts, but there may be essential interdependencies that are not proximal within the 3D structure. The problem to be addressed is to design a computational method that locates and ranks essential no  ...[more]

Similar Datasets

| S-EPMC4924125 | biostudies-literature
| S-EPMC4227039 | biostudies-literature
| S-EPMC6516864 | biostudies-literature
| S-EPMC11906859 | biostudies-literature
| S-EPMC11172036 | biostudies-literature
| S-EPMC2650257 | biostudies-literature
| S-EPMC9839634 | biostudies-literature
| S-EPMC7882882 | biostudies-literature
| S-EPMC4576026 | biostudies-literature
2015-09-07 | PXD002440 | Pride