Unknown

Dataset Information

0

Deep Learning Empowers the Discovery of Self-Assembling Peptides with Over 10 Trillion Sequences.


ABSTRACT: Self-assembling of peptides is essential for a variety of biological and medical applications. However, it is challenging to investigate the self-assembling properties of peptides within the complete sequence space due to the enormous sequence quantities. Here, it is demonstrated that a transformer-based deep learning model is effective in predicting the aggregation propensity (AP) of peptide systems, even for decapeptide and mixed-pentapeptide systems with over 10 trillion sequence quantities. Based on the predicted AP values, not only the aggregation laws for designing self-assembling peptides are derived, but the transferability relation among the APs of pentapeptides, decapeptides, and mixed pentapeptides is also revealed, leading to discoveries of self-assembling peptides by concatenating or mixing, as consolidated by experiments. This deep learning approach enables speedy, accurate, and thorough search and design of self-assembling peptides within the complete sequence space of oligopeptides, advancing peptide science by inspiring new biological and medical applications.

SUBMITTER: Wang J 

PROVIDER: S-EPMC10625107 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Deep Learning Empowers the Discovery of Self-Assembling Peptides with Over 10 Trillion Sequences.

Wang Jiaqi J   Liu Zihan Z   Zhao Shuang S   Xu Tengyan T   Wang Huaimin H   Li Stan Z SZ   Li Wenbin W  

Advanced science (Weinheim, Baden-Wurttemberg, Germany) 20230925 31


Self-assembling of peptides is essential for a variety of biological and medical applications. However, it is challenging to investigate the self-assembling properties of peptides within the complete sequence space due to the enormous sequence quantities. Here, it is demonstrated that a transformer-based deep learning model is effective in predicting the aggregation propensity (AP) of peptide systems, even for decapeptide and mixed-pentapeptide systems with over 10 trillion sequence quantities.  ...[more]

Similar Datasets

| S-EPMC9844539 | biostudies-literature
| S-EPMC8302598 | biostudies-literature
| S-EPMC10014862 | biostudies-literature
| S-EPMC9814929 | biostudies-literature
| S-EPMC7586918 | biostudies-literature
| S-EPMC5574881 | biostudies-literature
| S-EPMC10790182 | biostudies-literature
| S-EPMC5388898 | biostudies-literature
| S-EPMC4803621 | biostudies-literature
| S-EPMC3466390 | biostudies-literature