<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Wu W</submitter><funding>NCATS NIH HHS</funding><funding>NCRR NIH HHS</funding><funding>NIDA NIH HHS</funding><funding>NHLBI NIH HHS</funding><funding>NIGMS NIH HHS</funding><pagination>1280-8</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC4038417</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>133(5)</volume><pubmed_abstract>BACKGROUND:Previous studies have identified asthma phenotypes based on small numbers of clinical, physiologic, or inflammatory characteristics. However, no studies have used a wide range of variables using machine learning approaches. OBJECTIVES:We sought to identify subphenotypes of asthma by using blood, bronchoscopic, exhaled nitric oxide, and clinical data from the Severe Asthma Research Program with unsupervised clustering and then characterize them by using supervised learning approaches. METHODS:Unsupervised clustering approaches were applied to 112 clinical, physiologic, and inflammatory variables from 378 subjects. Variable selection and supervised learning techniques were used to select relevant and nonredundant variables and address their predictive values, as well as the predictive value of the full variable set. RESULTS:Ten variable clusters and 6 subject clusters were identified, which differed and overlapped with previous clusters. Patients with traditionally defined severe asthma were distributed through subject clusters 3 to 6. Cluster 4 identified patients with early-onset allergic asthma with low lung function and eosinophilic inflammation. Patients with later-onset, mostly severe asthma with nasal polyps and eosinophilia characterized cluster 5. Cluster 6 asthmatic patients manifested persistent inflammation in blood and bronchoalveolar lavage fluid and exacerbations despite high systemic corticosteroid use and side effects. Age of asthma onset, quality of life, symptoms, medications, and health care use were some of the 51 nonredundant variables distinguishing subject clusters. These 51 variables classified test cases with 88% accuracy compared with 93% accuracy with all 112 variables. CONCLUSION:The unsupervised machine learning approaches used here provide unique insights into disease, confirming other approaches while revealing novel additional phenotypes.</pubmed_abstract><journal>The Journal of allergy and clinical immunology</journal><pubmed_title>Unsupervised phenotyping of Severe Asthma Research Program participants using expanded lung data.</pubmed_title><pmcid>PMC4038417</pmcid><funding_grant_id>HL69149</funding_grant_id><funding_grant_id>M01 RR03186</funding_grant_id><funding_grant_id>HL69167</funding_grant_id><funding_grant_id>R01GM087694</funding_grant_id><funding_grant_id>M01RR07122</funding_grant_id><funding_grant_id>R01-HL69174</funding_grant_id><funding_grant_id>U10 HL109257</funding_grant_id><funding_grant_id>HL69349</funding_grant_id><funding_grant_id>R01 HL069349</funding_grant_id><funding_grant_id>R01 HL069149</funding_grant_id><funding_grant_id>U10 HL109250</funding_grant_id><funding_grant_id>U10 HL109172</funding_grant_id><funding_grant_id>M01 RR007122</funding_grant_id><funding_grant_id>R01 HL069167</funding_grant_id><funding_grant_id>M01 RR003186</funding_grant_id><funding_grant_id>R01 HL069155</funding_grant_id><funding_grant_id>HL69130</funding_grant_id><funding_grant_id>R01 HL069130</funding_grant_id><funding_grant_id>HL69174</funding_grant_id><funding_grant_id>R01 HL069174</funding_grant_id><funding_grant_id>M01 RR018390</funding_grant_id><funding_grant_id>R01 HL069170</funding_grant_id><funding_grant_id>HL69170</funding_grant_id><funding_grant_id>HL69116</funding_grant_id><funding_grant_id>UL1 TR000005</funding_grant_id><funding_grant_id>P30 DA035778</funding_grant_id><funding_grant_id>HL69155</funding_grant_id><funding_grant_id>HL087665</funding_grant_id><funding_grant_id>UL1 TR000427</funding_grant_id><funding_grant_id>R01 GM087694</funding_grant_id><funding_grant_id>U10 HL109164</funding_grant_id><funding_grant_id>R01 HL069116</funding_grant_id><funding_grant_id>R01 HL087665</funding_grant_id><pubmed_authors>Erzurum S</pubmed_authors><pubmed_authors>Curran-Everett D</pubmed_authors><pubmed_authors>Israel E</pubmed_authors><pubmed_authors>Wenzel SE</pubmed_authors><pubmed_authors>Busse WW</pubmed_authors><pubmed_authors>Bleecker E</pubmed_authors><pubmed_authors>Castro M</pubmed_authors><pubmed_authors>Chung KF</pubmed_authors><pubmed_authors>Wu W</pubmed_authors><pubmed_authors>Moore W</pubmed_authors><pubmed_authors>Calhoun WJ</pubmed_authors><pubmed_authors>Gaston B</pubmed_authors></additional><is_claimable>false</is_claimable><name>Unsupervised phenotyping of Severe Asthma Research Program participants using expanded lung data.</name><description>BACKGROUND:Previous studies have identified asthma phenotypes based on small numbers of clinical, physiologic, or inflammatory characteristics. However, no studies have used a wide range of variables using machine learning approaches. OBJECTIVES:We sought to identify subphenotypes of asthma by using blood, bronchoscopic, exhaled nitric oxide, and clinical data from the Severe Asthma Research Program with unsupervised clustering and then characterize them by using supervised learning approaches. METHODS:Unsupervised clustering approaches were applied to 112 clinical, physiologic, and inflammatory variables from 378 subjects. Variable selection and supervised learning techniques were used to select relevant and nonredundant variables and address their predictive values, as well as the predictive value of the full variable set. RESULTS:Ten variable clusters and 6 subject clusters were identified, which differed and overlapped with previous clusters. Patients with traditionally defined severe asthma were distributed through subject clusters 3 to 6. Cluster 4 identified patients with early-onset allergic asthma with low lung function and eosinophilic inflammation. Patients with later-onset, mostly severe asthma with nasal polyps and eosinophilia characterized cluster 5. Cluster 6 asthmatic patients manifested persistent inflammation in blood and bronchoalveolar lavage fluid and exacerbations despite high systemic corticosteroid use and side effects. Age of asthma onset, quality of life, symptoms, medications, and health care use were some of the 51 nonredundant variables distinguishing subject clusters. These 51 variables classified test cases with 88% accuracy compared with 93% accuracy with all 112 variables. CONCLUSION:The unsupervised machine learning approaches used here provide unique insights into disease, confirming other approaches while revealing novel additional phenotypes.</description><dates><release>2014-01-01T00:00:00Z</release><publication>2014 May</publication><modification>2020-10-31T09:14:36Z</modification><creation>2019-03-27T01:29:03Z</creation></dates><accession>S-EPMC4038417</accession><cross_references><pubmed>24589344</pubmed><doi>10.1016/j.jaci.2013.11.042</doi></cross_references></HashMap>