<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Huo J</submitter><funding>National Heart, Lung, and Blood Institute</funding><funding>NHLBI NIH HHS</funding><funding>National Science Foundation</funding><pagination>449-457</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC11577832</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>27(2)</volume><pubmed_abstract>&lt;h4>Purpose&lt;/h4>This study aimed to develop a machine learning-based questionnaire (BASH-GN) to classify obstructive sleep apnea (OSA) risk by considering risk factor subtypes.&lt;h4>Methods&lt;/h4>Participants who met study inclusion criteria were selected from the Sleep Heart Health Study Visit 1 (SHHS 1) database. Other participants from the Wisconsin Sleep Cohort (WSC) served as an independent test dataset. Participants with an apnea hypopnea index (AHI) ≥ 15/h were considered as high risk for OSA. Potential risk factors were ranked using mutual information between each factor and the AHI, and only the top 50% were selected. We classified the subjects into 2 different groups, low and high phenotype groups, according to their risk scores. We then developed the BASH-GN, a machine learning-based questionnaire that consists of two logistic regression classifiers for the 2 different subtypes of OSA risk prediction.&lt;h4>Results&lt;/h4>We evaluated the BASH-GN on the SHHS 1 test set (n = 1237) and WSC set (n = 1120) and compared its performance with four commonly used OSA screening questionnaires, the Four-Variable, Epworth Sleepiness Scale, Berlin, and STOP-BANG. The model outperformed these questionnaires on both test sets regarding the area under the receiver operating characteristic (AUROC) and the area under the precision-recall curve (AUPRC). The model achieved AUROC (SHHS 1: 0.78, WSC: 0.76) and AUPRC (SHHS 1: 0.72, WSC: 0.74), respectively. The questionnaire is available at https://c2ship.org/bash-gn .&lt;h4>Conclusion&lt;/h4>Considering OSA subtypes when evaluating OSA risk may improve the accuracy of OSA screening.</pubmed_abstract><journal>Sleep &amp; breathing = Schlaf &amp; Atmung</journal><pubmed_title>BASH-GN: a new machine learning-derived questionnaire for screening obstructive sleep apnea.</pubmed_title><pmcid>PMC11577832</pmcid><funding_grant_id>R21 HL159661</funding_grant_id><funding_grant_id>2052528</funding_grant_id><funding_grant_id>R21HL159661-01</funding_grant_id><pubmed_authors>Huo J</pubmed_authors><pubmed_authors>Li A</pubmed_authors><pubmed_authors>Quan SF</pubmed_authors><pubmed_authors>Roveda J</pubmed_authors></additional><is_claimable>false</is_claimable><name>BASH-GN: a new machine learning-derived questionnaire for screening obstructive sleep apnea.</name><description>&lt;h4>Purpose&lt;/h4>This study aimed to develop a machine learning-based questionnaire (BASH-GN) to classify obstructive sleep apnea (OSA) risk by considering risk factor subtypes.&lt;h4>Methods&lt;/h4>Participants who met study inclusion criteria were selected from the Sleep Heart Health Study Visit 1 (SHHS 1) database. Other participants from the Wisconsin Sleep Cohort (WSC) served as an independent test dataset. Participants with an apnea hypopnea index (AHI) ≥ 15/h were considered as high risk for OSA. Potential risk factors were ranked using mutual information between each factor and the AHI, and only the top 50% were selected. We classified the subjects into 2 different groups, low and high phenotype groups, according to their risk scores. We then developed the BASH-GN, a machine learning-based questionnaire that consists of two logistic regression classifiers for the 2 different subtypes of OSA risk prediction.&lt;h4>Results&lt;/h4>We evaluated the BASH-GN on the SHHS 1 test set (n = 1237) and WSC set (n = 1120) and compared its performance with four commonly used OSA screening questionnaires, the Four-Variable, Epworth Sleepiness Scale, Berlin, and STOP-BANG. The model outperformed these questionnaires on both test sets regarding the area under the receiver operating characteristic (AUROC) and the area under the precision-recall curve (AUPRC). The model achieved AUROC (SHHS 1: 0.78, WSC: 0.76) and AUPRC (SHHS 1: 0.72, WSC: 0.74), respectively. The questionnaire is available at https://c2ship.org/bash-gn .&lt;h4>Conclusion&lt;/h4>Considering OSA subtypes when evaluating OSA risk may improve the accuracy of OSA screening.</description><dates><release>2023-01-01T00:00:00Z</release><publication>2023 May</publication><modification>2025-04-26T01:54:52.247Z</modification><creation>2025-04-06T10:18:15.303Z</creation></dates><accession>S-EPMC11577832</accession><cross_references><pubmed>35482152</pubmed><doi>10.1007/s11325-022-02629-8</doi></cross_references></HashMap>