Unknown

Dataset Information

0

Using Machine Learning to Identify Biomarkers Affecting Fat Deposition in Pigs by Integrating Multisource Transcriptome Information.


ABSTRACT: Fat deposition in pigs is not only closely related to pig production efficiency and pork quality but also an ideal model for human obesity. Transcriptome sequencing is widely used to study fat deposition. However, due to small sample sizes, high false positive rates, and poor consistency of results from different studies, new strategies are urgently needed. Machine learning, a new analysis method, can effectively fit complex data and accurately identify samples and genes. In this study, 36 samples of adipose tissue, muscle tissue, and liver tissue were collected from Songliao black pigs and Landrace pigs, and the mRNA of all the samples was sequenced. In addition, we collected transcriptome data for 64 samples in the GEO database from four different sources. After standardization and imputation of missing values in the data set comprising 100 samples, traditional differential expression analysis was carried out, and different numbers of expressed genes were selected as features for the training model of eight machine learning methods. In the 1000 replications of fourfold cross validation with 100 samples, AdaBoost performed best, with an average prediction accuracy greater than 93% and the highest mean area under the curve in predicting the high- and low-fat content groups among the eight ML methods. According to their performance-based ranks inferred by AdaBoost, 12 genes related to fat deposition were identified; among them, FASN and APOD were specifically expressed in adipose tissue, and APOA1 was specifically expressed in the liver, which could be important candidate biomarkers affecting fat deposition.

SUBMITTER: Liu H 

PROVIDER: S-EPMC9413214 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using Machine Learning to Identify Biomarkers Affecting Fat Deposition in Pigs by Integrating Multisource Transcriptome Information.

Liu Huatao H   Xing Kai K   Jiang Yifan Y   Liu Yibing Y   Wang Chuduan C   Ding Xiangdong X  

Journal of agricultural and food chemistry 20220811 33


Fat deposition in pigs is not only closely related to pig production efficiency and pork quality but also an ideal model for human obesity. Transcriptome sequencing is widely used to study fat deposition. However, due to small sample sizes, high false positive rates, and poor consistency of results from different studies, new strategies are urgently needed. Machine learning, a new analysis method, can effectively fit complex data and accurately identify samples and genes. In this study, 36 sampl  ...[more]

Similar Datasets

| S-EPMC11743517 | biostudies-literature
| S-EPMC4388518 | biostudies-literature
2024-09-30 | GSE270124 | GEO
| S-EPMC7431984 | biostudies-literature
| S-EPMC9138122 | biostudies-literature
2018-10-15 | E-MTAB-1434 | biostudies-arrayexpress
| S-EPMC4968439 | biostudies-literature
| S-EPMC4431873 | biostudies-literature
| S-EPMC3499287 | biostudies-literature
| S-EPMC5639760 | biostudies-literature