Project description:Azoospermia, defined by the absence of sperm in the ejaculate, manifests as obstructive azoospermia (OA) or non-obstructive azoospermia (NOA). Reliable predictive models utilizing biomarkers could aid in clinical decision-making. This study included 352 azoospermia patients, with 152 diagnosed with OA and 200 with NOA. The data were randomly divided into a training set (244 cases) and a validation set (108 cases) for machine learning analysis. The training set was utilized for univariate and multivariate logistic regression to identify key predictors of NOA. Following this, nine machine learning. This study included 352 azoospermia patients, with 152 diagnosed with OA and 200 with NOA. The data were randomly divided into a training set (244 cases) and a validation set (108 cases) for machine learning analysis. The training set was utilized for univariate and multivariate logistic regression to identify key predictors of NOA. Following this, nine machine learning methods were employed to refine the prediction model. A novel nomogram model was developed, and its predictive performance was evaluated using receiver operating characteristic curves, calibration plots, and decision curve analysis. Univariate and multivariate logistic regression analyses identified semen pH and follicle-stimulating hormone (FSH) as positive predictors of NOA, while mean testicular volume (MTV) and inhibin B (INHB) were negatively correlated with NOA. Among nine machine learning methods evaluated, the Gradient Boosting Decision Trees achieved the highest performance with an area under the curve (AUC) of 0.974, whereas Random Forest showed the lowest AUC at 0.953. The nomogram model, incorporating these four factors, demonstrated robust predictive performance with AUCs of 0.984 in the training set and 0.976 in the validation set. Calibration and decision curve analysis confirmed the model's accuracy and clinical utility. Optimal cut-off points for biomarkers were identified: FSH at 7.50 IU/L (AUC = 0.96), INHB at 43.45 pg/ml (AUC = 0.95), MTV at 9.92 ml (AUC = 0.91), and semen pH at 6.95 (AUC = 0.71). The novel nomogram model incorporating FSH, INHB, MTV, and pH effectively predicts NOA in patients. This model offers a valuable tool for personalized diagnosis and management of azoospermia.

Project description:BackgroundSurveillance is universally recommended for non-small cell lung cancer (NSCLC) patients treated with curative-intent radiotherapy. High-quality evidence to inform optimal surveillance strategies is lacking. Machine learning demonstrates promise in accurate outcome prediction for a variety of health conditions. The purpose of this study was to utilise readily available patient, tumour, and treatment data to develop, validate and externally test machine learning models for predicting recurrence, recurrence-free survival (RFS) and overall survival (OS) at 2 years from treatment.MethodsA retrospective, multicentre study of patients receiving curative-intent radiotherapy for NSCLC was undertaken. A total of 657 patients from 5 hospitals were eligible for inclusion. Data pre-processing derived 34 features for predictive modelling. Combinations of 8 feature reduction methods and 10 machine learning classification algorithms were compared, producing risk-stratification models for predicting recurrence, RFS and OS. Models were compared with 10-fold cross validation and an external test set and benchmarked against TNM-stage and performance status. Youden Index was derived from validation set ROC curves to distinguish high and low risk groups and Kaplan-Meier analyses performed.FindingsMedian follow-up time was 852 days. Parameters were well matched across training-validation and external test sets: Mean age was 73 and 71 respectively, and recurrence, RFS and OS rates at 2 years were 43% vs 34%, 54% vs 47% and 54% vs 47% respectively. The respective validation and test set AUCs were as follows: 1) RFS: 0·682 (0·575-0·788) and 0·681 (0·597-0·766), 2) Recurrence: 0·687 (0·582-0·793) and 0·722 (0·635-0·81), and 3) OS: 0·759 (0·663-0·855) and 0·717 (0·634-0·8). Our models were superior to TNM stage and performance status in predicting recurrence and OS.InterpretationThis robust and ready to use machine learning method, validated and externally tested, sets the stage for future clinical trials entailing quantitative personalised risk-stratification and surveillance following curative-intent radiotherapy for NSCLC.FundingA full list of funding bodies that contributed to this study can be found in the Acknowledgements section.

Project description:ObjectivesPrognostication of neurologic status among survivors of in-hospital cardiac arrests remains a challenging task for physicians. Although models such as the Cardiac Arrest Survival Post-Resuscitation In-hospital score are useful for predicting neurologic outcomes, they were developed using traditional statistical techniques. In this study, we derive and compare the performance of several machine learning models with each other and with the Cardiac Arrest Survival Post-Resuscitation In-hospital score for predicting the likelihood of favorable neurologic outcomes among survivors of resuscitation.DesignAnalysis of the Get With The Guidelines-Resuscitation registry.SettingSeven-hundred fifty-five hospitals participating in Get With The Guidelines-Resuscitation from January 1, 2001, to January 28, 2017.PatientsAdult in-hospital cardiac arrest survivors.InterventionsNone.Measurements and main resultsOf 117,674 patients in our cohort, 28,409 (24%) had a favorable neurologic outcome, as defined as survival with a Cerebral Performance Category score of less than or equal to 2 at discharge. Using patient characteristics, pre-existing conditions, prearrest interventions, and periarrest variables, we constructed logistic regression, support vector machines, random forests, gradient boosted machines, and neural network machine learning models to predict favorable neurologic outcome. Events prior to October 20, 2009, were used for model derivation, and all subsequent events were used for validation. The gradient boosted machine predicted favorable neurologic status at discharge significantly better than the Cardiac Arrest Survival Post-Resuscitation In-hospital score (C-statistic: 0.81 vs 0.73; p < 0.001) and outperformed all other machine learning models in terms of discrimination, calibration, and accuracy measures. Variables that were consistently most important for prediction across all models were duration of arrest, initial cardiac arrest rhythm, admission Cerebral Performance Category score, and age.ConclusionsThe gradient boosted machine algorithm was the most accurate for predicting favorable neurologic outcomes in in-hospital cardiac arrest survivors. Our results highlight the utility of machine learning for predicting neurologic outcomes in resuscitated patients.

Project description:BackgroundOxidative stress process plays a key role in aging and cancer; however, currently, there is paucity of machine-learning model studies investigating the relationship between oxidative stress and prognosis of elderly patients with esophageal squamous cancer (ESCC).MethodsThis study included elderly patients with ESCC who underwent curative ESCC resection surgery continuously from January 2013 to December 2020 and were stratified into the training and external validation cohorts. Using Cox stepwise regression analysis based on Akaike information criterion, the relationship between oxidative stress biomarkers and prognosis was explored, and a geriatric ESCC-related oxidative stress score (OSS) was constructed. To construct a predictive model for 3-year overall survival (OS), machine-learning strategies including decision tree (DT), random forest (RF), and support vector machine (SVM) were employed. These machine-learning strategies play a key role in data mining and pattern recognition tasks. Each model was tested in the external validation cohort through 1000 resampling iterations. Validation was conducted using receiver operating characteristic area under the curve (AUC) and calibration plots.ResultsThe training cohort and validation cohort consisted of 340 and 145 patients, respectively. In the training cohort, the 3-year OS rate for patients was 59.2%. We constructed the OSS based on systemic oxidative stress biomarkers using the training cohort. The study found that pathological N stage, pathological T stage, tumor histological type, lymphovascular invasion, CEA, OSS, CA 19 - 9, and the amount of bleeding were the most important factors influencing the 3-year OS. These eight important features were included in training the RF, DT, and SVM and trained on the training cohort and validated cohort, respectively. In the training cohort, the RF model demonstrated the highest predictive performance with an AUC of 0.975 (0.962-0.987), while the DT model is 0.784 (0.739-0.830) and the SVM is 0.879 (0.843-0.916). In the external validation cohort, the RF model again exhibited the highest performance with an AUC of 0.791 (0.717-0.864), compared to the DT model with an AUC of 0.717 (0.640-0.794) and 0.779 (0.702-0.856) in SVM.ConclusionsThe random forest clinical prediction model constructed based on OSS can effectively predict the prognosis of elderly patients with ESCC after curative surgery.

Dataset Information

Comparison of nomogram and machine‐learning methods for predicting the survival of non‐small cell lung cancer patients

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets