Unknown

Dataset Information

0

ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs.


ABSTRACT: Extracelluar matrix (ECM) proteins create complex networks of macromolecules which fill-in the extracellular spaces of living tissues. They provide structural support and play an important role in maintaining cellular functions. Identification of ECM proteins can play a vital role in studying various types of diseases. Conventional wet lab-based methods are reliable; however, they are expensive and time consuming and are, therefore, not scalable. In this research, we propose a sequence-based novel machine learning approach for the prediction of ECM proteins. In the proposed method, composition of k-spaced amino acid pair (CKSAAP) features are encoded into a classifiable latent space (LS) with the help of deep latent space encoding (LSE). A comprehensive ablation analysis is conducted for performance evaluation of the proposed method. Results are compared with other state-of-the-art methods on the benchmark dataset, and the proposed ECM-LSE approach has shown to comprehensively outperform the contemporary methods.

SUBMITTER: Al-Saggaf UM 

PROVIDER: S-EPMC8552119 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

altmetric image

Publications

ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs.

Al-Saggaf Ubaid M UM   Usman Muhammad M   Naseem Imran I   Moinuddin Muhammad M   Jiman Ahmad A AA   Alsaggaf Mohammed U MU   Alshoubaki Hitham K HK   Khan Shujaat S  

Frontiers in bioengineering and biotechnology 20211014


Extracelluar matrix (ECM) proteins create complex networks of macromolecules which fill-in the extracellular spaces of living tissues. They provide structural support and play an important role in maintaining cellular functions. Identification of ECM proteins can play a vital role in studying various types of diseases. Conventional wet lab-based methods are reliable; however, they are expensive and time consuming and are, therefore, not scalable. In this research, we propose a sequence-based nov  ...[more]

Similar Datasets

| S-EPMC7188683 | biostudies-literature
| S-EPMC8928959 | biostudies-literature
| S-EPMC7647693 | biostudies-literature
| S-EPMC3146527 | biostudies-literature
| S-EPMC1863424 | biostudies-literature
| S-EPMC10965421 | biostudies-literature
| S-EPMC3478286 | biostudies-literature
| S-EPMC7273372 | biostudies-literature
| S-EPMC7859348 | biostudies-literature
| S-EPMC4469302 | biostudies-literature