<HashMap><database>biostudies-literature</database><scores/><additional><omics_type>Unknown</omics_type><volume>8(4)</volume><submitter>Kelly RS</submitter><funding>National Institute of Allergy and Infectious Diseases</funding><funding>National Heart, Lung, and Blood Institute</funding><funding>National Centers for Advancing Translational Sciences</funding><funding>NHLBI NIH HHS</funding><funding>U.S. Department of Defense</funding><pubmed_abstract>To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of age three asthma status in 411 three year olds (&lt;i>n&lt;/i> = 59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART) study. The standard PLS-DA approach had impressive accuracy for the prediction of age three asthma with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (&lt;i>p&lt;/i> for difference &lt; 0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, a role for dysregulated arginine metabolism and several exogenous metabolites that deserve further investigation as potential causative agents. As the BN model outperformed the PLS-DA model in both accuracy and decreased risk of overfitting, it may therefore represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.</pubmed_abstract><journal>Metabolites</journal><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC6316795</full_dataset_link><repository>biostudies-literature</repository><pubmed_title>Partial Least Squares Discriminant Analysis and Bayesian Networks for Metabolomic Prediction of Childhood Asthma.</pubmed_title><pmcid>PMC6316795</pmcid><funding_grant_id>1R01HL123915-01</funding_grant_id><funding_grant_id>U54TR001012</funding_grant_id><funding_grant_id>U01HL091528</funding_grant_id><funding_grant_id>1R01HL123546-01A1</funding_grant_id><funding_grant_id>T32 HL007427</funding_grant_id><funding_grant_id>5T32AI007306-30</funding_grant_id><funding_grant_id>R01 HL139634</funding_grant_id><funding_grant_id>P01 HL132825</funding_grant_id><funding_grant_id>W81XWH-17-1-0533</funding_grant_id><funding_grant_id>5R01HL123915-05</funding_grant_id><pubmed_authors>Kelly RS</pubmed_authors><pubmed_authors>Lee-Sarwar KA</pubmed_authors><pubmed_authors>Lasky-Su J</pubmed_authors><pubmed_authors>Litonjua AA</pubmed_authors><pubmed_authors>Chu SH</pubmed_authors><pubmed_authors>Virkud YV</pubmed_authors><pubmed_authors>Weiss ST</pubmed_authors><pubmed_authors>McGeachie MJ</pubmed_authors><pubmed_authors>Huang M</pubmed_authors><pubmed_authors>Kachroo P</pubmed_authors></additional><is_claimable>false</is_claimable><name>Partial Least Squares Discriminant Analysis and Bayesian Networks for Metabolomic Prediction of Childhood Asthma.</name><description>To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of age three asthma status in 411 three year olds (&lt;i>n&lt;/i> = 59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART) study. The standard PLS-DA approach had impressive accuracy for the prediction of age three asthma with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (&lt;i>p&lt;/i> for difference &lt; 0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, a role for dysregulated arginine metabolism and several exogenous metabolites that deserve further investigation as potential causative agents. As the BN model outperformed the PLS-DA model in both accuracy and decreased risk of overfitting, it may therefore represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.</description><dates><release>2018-01-01T00:00:00Z</release><publication>2018 Oct</publication><modification>2021-03-18T08:53:43Z</modification><creation>2019-03-26T22:36:08Z</creation></dates><accession>S-EPMC6316795</accession><cross_references><pubmed>30360514</pubmed><doi>10.3390/metabo8040068</doi></cross_references></HashMap>