Unknown

Dataset Information

0

Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction.


ABSTRACT: Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To determine the importance of each MSA's feature at the inter-residue level, we added an attention layer to the deep neural network. We show that combining four MSAs of different E-value cutoffs improved the model prediction performance as compared to single E-value MSA features. A further improvement was observed when an attention layer was used and even more when additional prediction tasks of bond angle predictions were added. The improvement of distance predictions were successfully transferred to achieve better protein tertiary structure modeling.

SUBMITTER: Jain A 

PROVIDER: S-EPMC8027171 | biostudies-literature | 2021 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction.

Jain Aashish A   Terashi Genki G   Kagaya Yuki Y   Maddhuri Venkata Subramaniya Sai Raghavendra SR   Christoffer Charles C   Kihara Daisuke D  

Scientific reports 20210407 1


Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To d  ...[more]

Similar Datasets

| S-EPMC7831258 | biostudies-literature
| S-EPMC5820155 | biostudies-literature
| S-EPMC6324825 | biostudies-literature
| S-EPMC8616805 | biostudies-literature
| S-EPMC9881607 | biostudies-literature
| S-EPMC3287581 | biostudies-literature
| S-EPMC6237422 | biostudies-literature
| S-EPMC8204903 | biostudies-literature
| S-EPMC7703788 | biostudies-literature
| S-EPMC7180065 | biostudies-literature