Project description:BackgroundPrognostic prediction for surgical treatment of gastric cancer remains valuable in clinical practice. This study aimed to develop survival models for postoperative gastric cancer patients.MethodsEleven thousand seventy-five patients from the Surveillance, Epidemiology, and End Results (SEER) database were included, and 122 patients from the Chinese database were used for external validation. The training cohort was created to create three separate models, including Cox regression, RSF, and DeepSurv, using data from the SEER database split into training and test cohorts with a 7:3 ratio. Test cohort was used to evaluate model performance using c-index, Brier scores, calibration, and the area under the curve (AUC). The new risk stratification based on the best model will be compared with the AJCC stage on the test and Chinese cohorts using decision curve analysis (DCA), the net reclassification index (NRI), and integrated discrimination improvement (IDI).ResultsIt was discovered that the DeepSurv model predicted postoperative gastric cancer patients' overall survival (OS) with a c-index of 0.787; the area under the curve reached 0.781, 0.798, 0.868 at 1-, 3- and 5- years, respectively; the Brier score was below 0.25 at different time points; showing an advantage over the Cox and RSF models. The results are also validated in the China cohort. The calibration plots demonstrated good agreement between the DeepSurv model's forecast and actual results. The NRI values (test cohort: 0.399, 0.288, 0.267 for 1-, 3- and 5-year OS prediction; China cohort:0.399, 0.288 for 1- and 3-year OS prediction) and IDI (test cohort: 0.188, 0.169, 0.157 for 1-, 3- and 5-year OS prediction; China cohort: 0.189, 0.169 for 1- and 3-year OS prediction) indicated that the risk score stratification performed significantly better than the AJCC staging alone (P < 0.05). DCA showed that the risk score stratification was clinically useful and had better discriminative ability than the AJCC staging. Finally, an interactive native web-based prediction tool was constructed for the survival prediction of patients with postoperative gastric cancer.ConclusionIn this study, a high-performance prediction model for the postoperative prognosis of gastric cancer was developed using DeepSurv, which offers essential benefits for risk stratification and prognosis prediction for each patient.
Project description:BackgroundThe currently available prediction models, such as the Cox model, were too simplistic to correctly predict the outcome of gastric adenocarcinoma patients. This study aimed to develop and validate survival prediction models for gastric adenocarcinoma patients using the deep learning survival neural network.MethodsA total of 14,177 patients with gastric adenocarcinoma from the Surveillance, Epidemiology, and End Results (SEER) database were included in the study and randomly divided into the training and testing group with a 7:3 ratio. Two algorithms were chosen to build the prediction models, and both algorithms include random survival forest (RSF) and a deep learning based-survival prediction algorithm (DeepSurv). Also, a traditional Cox proportional hazard (CoxPH) model was constructed for comparison. The consistency index (C-index), Brier score, and integrated Brier score (IBS) were used to evaluate the model's predictive performance. The accuracy of predicting survival at 1, 3, 5, and 10 years was also assessed using receiver operating characteristic curves (ROC), calibration curves, and area under the ROC curve (AUC).ResultsGastric adenocarcinoma patients were randomized into a training group (n = 9923) and a testing group (n = 4254). DeepSurv showed the best performance among the three models (c-index: 0.772, IBS: 0.1421), which was superior to that of the traditional CoxPH model (c-index: 0.755, IBS: 0.1506) and the RSF with 3-year survival prediction model (c-index: 0.766, IBS: 0.1502). The DeepSurv model produced superior accuracy and calibrated survival estimates predicting 1-, 3- 5- and 10-year survival (AUC: 0.825-0.871).ConclusionsA deep learning algorithm was developed to predict more accurate prognostic information for gastric cancer patients. The DeepSurv model has advantages over the CoxPH and RSF models and performs well in discriminative performance and calibration.
Project description:BackgroundThe aim was to develop a personalized survival prediction deep learning model for cervical adenocarcinoma patients and process personalized survival prediction.MethodsA total of 2501 cervical adenocarcinoma patients from the surveillance, epidemiology and end results database and 220 patients from Qilu hospital were enrolled in this study. We created our deep learning (DL) model to manipulate the data and evaluated its performance against four other competitive models. We tried to demonstrate a new grouping system oriented by survival outcomes and process personalized survival prediction by using our DL model.ResultsThe DL model reached 0.878 c-index and 0.09 Brier score in the test set, which was better than the other four models. In the external test set, our model achieved a 0.80 c-index and 0.13 Brier score. Thus, we developed prognosis-oriented risk grouping for patients according to risk scores computed by our DL model. Notable differences among groupings were observed. In addition, a personalized survival prediction system based on our risk-scoring grouping was developed.ConclusionsWe developed a deep neural network model for cervical adenocarcinoma patients. The performance of this model proved to be superior to other models. The results of external validation supported the possibility that the model can be used in clinical work. Finally, our survival grouping and personalized prediction system provided more accurate prognostic information for patients than traditional FIGO stages.
Project description:PurposeMachine learning (ML) is a strong candidate for making accurate predictions, as we can use large amount of data with powerful computational algorithms. We developed a ML based model to predict survival of patients with colorectal cancer (CRC) using data from two independent datasets.Materials and methodsA total of 364,316 and 1,572 CRC patients were included from the Surveillance, Epidemiology, and End Results (SEER) and a Korean dataset, respectively. As SEER combines data from 18 cancer registries, internal validation was done using 18-Fold-Cross-Validation then external validation was performed by testing the trained model on the Korean dataset. Performance was evaluated using area under the receiver operating characteristic curve (AUROC), sensitivity and positive predictive values.ResultsClinicopathological characteristics were significantly different between the two datasets and the SEER showed a significant lower 5-year survival rate compared to the Korean dataset (60.1% vs. 75.3%, p < 0.001). The ML-based model using the Light gradient boosting algorithm achieved a better performance in predicting 5-year-survival compared to American Joint Committee on Cancer stage (AUROC, 0.804 vs. 0.736; p < 0.001). The most important features which influenced model performance were age, number of examined lymph nodes, and tumor size. Sensitivity and positive predictive values of predicting 5-year-survival for classes including dead or alive were reported as 68.14%, 77.51% and 49.88%, 88.1% respectively in the validation set. Survival probability can be checked using the web-based survival predictor (http://colorectalcancer.pythonanywhere.com).ConclusionML-based model achieved a much better performance compared to staging in individualized estimation of survival of patients with CRC.
Project description:Background: Postoperative mortality occurs in 1-2% of patients undergoing major inpatient surgery. The currently available prediction tools using summaries of intraoperative data are limited by their inability to reflect shifting risk associated with intraoperative physiological perturbations. We sought to compare similar benchmarks to a deep-learning algorithm predicting postoperative 30-day mortality.Methods: We constructed a multipath convolutional neural network model using patient characteristics, co-morbid conditions, preoperative laboratory values, and intraoperative numerical data from patients undergoing surgery with tracheal intubation at a single medical centre. Data for 60 min prior to a randomly selected time point were utilised. Model performance was compared with a deep neural network, a random forest, a support vector machine, and a logistic regression using predetermined summary statistics of intraoperative data.Results: Of 95 907 patients, 941 (1%) died within 30 days. The multipath convolutional neural network predicted postoperative 30-day mortality with an area under the receiver operating characteristic curve of 0.867 (95% confidence interval [CI]: 0.835-0.899). This was higher than that for the deep neural network (0.825; 95% CI: 0.790-0.860), random forest (0.848; 95% CI: 0.815-0.882), support vector machine (0.836; 95% CI: 0.802-870), and logistic regression (0.837; 95% CI: 0.803-0.871).Conclusions: A deep-learning time-series model improves prediction compared with models with simple summaries of intraoperative data. We have created a model that can be used in real time to detect dynamic changes in a patient's risk for postoperative mortality.
Project description:BackgroundMachine learning (ML) achieves better predictions of postoperative mortality than previous prediction tools. Free-text descriptions of the preoperative diagnosis and the planned procedure are available preoperatively. Because reading these descriptions helps anesthesiologists evaluate the risk of the surgery, we hypothesized that deep learning (DL) models with unstructured text could improve postoperative mortality prediction. However, it is challenging to extract meaningful concept embeddings from this unstructured clinical text.ObjectiveThis study aims to develop a fusion DL model containing structured and unstructured features to predict the in-hospital 30-day postoperative mortality before surgery. ML models for predicting postoperative mortality using preoperative data with or without free clinical text were assessed.MethodsWe retrospectively collected preoperative anesthesia assessments, surgical information, and discharge summaries of patients undergoing general and neuraxial anesthesia from electronic health records (EHRs) from 2016 to 2020. We first compared the deep neural network (DNN) with other models using the same input features to demonstrate effectiveness. Then, we combined the DNN model with bidirectional encoder representations from transformers (BERT) to extract information from clinical texts. The effects of adding text information on the model performance were compared using the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC). Statistical significance was evaluated using P<.05.ResultsThe final cohort contained 121,313 patients who underwent surgeries. A total of 1562 (1.29%) patients died within 30 days of surgery. Our BERT-DNN model achieved the highest AUROC (0.964, 95% CI 0.961-0.967) and AUPRC (0.336, 95% CI 0.276-0.402). The AUROC of the BERT-DNN was significantly higher compared to logistic regression (AUROC=0.952, 95% CI 0.949-0.955) and the American Society of Anesthesiologist Physical Status (ASAPS AUROC=0.892, 95% CI 0.887-0.896) but not significantly higher compared to the DNN (AUROC=0.959, 95% CI 0.956-0.962) and the random forest (AUROC=0.961, 95% CI 0.958-0.964). The AUPRC of the BERT-DNN was significantly higher compared to the DNN (AUPRC=0.319, 95% CI 0.260-0.384), the random forest (AUPRC=0.296, 95% CI 0.239-0.360), logistic regression (AUPRC=0.276, 95% CI 0.220-0.339), and the ASAPS (AUPRC=0.149, 95% CI 0.107-0.203).ConclusionsOur BERT-DNN model has an AUPRC significantly higher compared to previously proposed models using no text and an AUROC significantly higher compared to logistic regression and the ASAPS. This technique helps identify patients with higher risk from the surgical description text in EHRs.
Project description:IntroductionThe postoperative recurrence of gastric cancer has a significant impact on the overall prognosis of patients. Therefore, accurately predicting the postoperative recurrence of gastric cancer is crucial.MethodsThis retrospective study gathered data from 2,813 gastric cancer patients who underwent radical surgery between 2011 and 2017 at two medical centers. Follow-up was extended until May 2023, and cases were categorized as recurrent or non-recurrent based on postoperative outcomes. Clinical pathological information and imaging data were collected for all patients. A new deep learning signature (DLS) was generated using pretreatment CT images, based on a pre-trained baseline (a customized Resnet50), for predicting postoperative recurrence. The deep learning fusion signature (DLFS) was created by combining the score of DLS with the weighted values of identified clinical features. The predictive performance of the model was evaluated based on discrimination, calibration, and clinical usefulness. Survival curves were plotted to investigate the differences between DLFS and prognosis.ResultsIn this study, 2813 patients with gastric cancer (GC) were recruited and allocated into training, internal validation, and external validation cohorts. The DLFS was developed and assessed for its capability in predicting the risk of postoperative recurrence. The DLFS exhibited excellent performance with AUCs of 0.833 (95% CI, 0.809-0.858) in the training set, 0.831 (95% CI, 0.792-0.871) in the internal validation set, and 0.859 (95% CI, 0.806-0.912) in the external validation set, along with satisfactory calibration across all cohorts (P>0.05). Furthermore, the DLFS model significantly outperformed both the clinical model and DLS (P<0.05). High-risk recurrent patients exhibit a significantly poorer prognosis compared to low-risk recurrent patients (P<0.05).ConclusionsThe integrated model developed in this study, focusing on GC patients undergoing radical surgery, accurately identifies cases at high risk of postoperative recurrence and highlights the potential of DLFS as a prognostic factor for GC patients.
Project description:Objective To compare the performance of a deep learning survival network with the tumor, node, and metastasis (TNM) staging system in survival prediction and test the reliability of individual treatment recommendations provided by the network. Methods In this population-based cohort study, we developed and validated a deep learning survival model using consecutive cases of newly diagnosed stage I to IV esophageal cancer between January 2004 and December 2015 in a Surveillance, Epidemiology, and End Results (SEER) database. The model was externally validated in an independent cohort from Fujian Provincial Hospital. The C statistic was used to compare the performance of the deep learning survival model and TNM staging system. Two other deep learning risk prediction models were trained for treatment recommendations. A Kaplan–Meier survival curve was used to compare survival between the population that followed the recommended therapy and those who did not. Results A total of 9069 patients were included in this study. The deep learning network showed more promising results in predicting esophageal cancer-specific survival than the TNM stage in the internal test dataset (C-index=0.753 vs. 0.638) and external validation dataset (C-index=0.687 vs. 0.643). The population who received the recommended treatments had superior survival compared to those who did not, based on the internal test dataset (hazard ratio, 0.753; 95% CI, 0.556-0.987; P=0.042) and the external validation dataset (hazard ratio, 0.633; 95% CI, 0.459-0.834; P=0.0003). Conclusion Deep learning neural networks have potential advantages over traditional linear models in prognostic assessment and treatment recommendations. This novel analytical approach may provide reliable information on individual survival and treatment recommendations for patients with esophageal cancer.
Project description:BackgroundPatients in the intensive care unit (ICU) are often in critical condition and have a high mortality rate. Accurately predicting the survival probability of ICU patients is beneficial to timely care and prioritizing medical resources to improve the overall patient population survival. Models developed by deep learning (DL) algorithms show good performance on many models. However, few DL algorithms have been validated in the dimension of survival time or compared with traditional algorithms.MethodsVariables from the Early Warning Score, Sequential Organ Failure Assessment Score, Simplified Acute Physiology Score II, Acute Physiology and Chronic Health Evaluation (APACHE) II, and APACHE IV models were selected for model development. The Cox regression, random survival forest (RSF), and DL methods were used to develop prediction models for the survival probability of ICU patients. The prediction performance was independently evaluated in the MIMIC-III Clinical Database (MIMIC-III), the eICU Collaborative Research Database (eICU), and Shanghai Pulmonary Hospital Database (SPH).ResultsForty variables were collected in total for model development. 83 943 participants from 3 databases were included in the study. The New-DL model accurately stratified patients into different survival probability groups with a C-index of >0.7 in the MIMIC-III, eICU, and SPH, performing better than the other models. The calibration curves of the models at 3 and 10 days indicated that the prediction performance was good. A user-friendly interface was developed to enable the model's convenience.ConclusionsCompared with traditional algorithms, DL algorithms are more accurate in predicting the survival probability during ICU hospitalization. This novel model can provide reliable, individualized survival probability prediction.
Project description:ImportanceSelection of antiseizure medications (ASMs) for epilepsy remains largely a trial-and-error approach. Under this approach, many patients have to endure sequential trials of ineffective treatments until the "right drugs" are prescribed.ObjectiveTo develop and validate a deep learning model using readily available clinical information to predict treatment success with the first ASM for individual patients.Design, setting, and participantsThis cohort study developed and validated a prognostic model. Patients were treated between 1982 and 2020. All patients were followed up for a minimum of 1 year or until failure of the first ASM. A total of 2404 adults with epilepsy newly treated at specialist clinics in Scotland, Malaysia, Australia, and China between 1982 and 2020 were considered for inclusion, of whom 606 (25.2%) were excluded from the final cohort because of missing information in 1 or more variables.ExposuresOne of 7 antiseizure medications.Main outcomes and measuresWith the use of the transformer model architecture on 16 clinical factors and ASM information, this cohort study first pooled all cohorts for model training and testing. The model was trained again using the largest cohort and externally validated on the other 4 cohorts. The area under the receiver operating characteristic curve (AUROC), weighted balanced accuracy, sensitivity, and specificity of the model were all assessed for predicting treatment success based on the optimal probability cutoff. Treatment success was defined as complete seizure freedom for the first year of treatment while taking the first ASM. Performance of the transformer model was compared with other machine learning models.ResultsThe final pooled cohort included 1798 adults (54.5% female; median age, 34 years [IQR, 24-50 years]). The transformer model that was trained using the pooled cohort had an AUROC of 0.65 (95% CI, 0.63-0.67) and a weighted balanced accuracy of 0.62 (95% CI, 0.60-0.64) on the test set. The model that was trained using the largest cohort only had AUROCs ranging from 0.52 to 0.60 and a weighted balanced accuracy ranging from 0.51 to 0.62 in the external validation cohorts. Number of pretreatment seizures, presence of psychiatric disorders, electroencephalography, and brain imaging findings were the most important clinical variables for predicted outcomes in both models. The transformer model that was developed using the pooled cohort outperformed 2 of the 5 other models tested in terms of AUROC.Conclusions and relevanceIn this cohort study, a deep learning model showed the feasibility of personalized prediction of response to ASMs based on clinical information. With improvement of performance, such as by incorporating genetic and imaging data, this model may potentially assist clinicians in selecting the right drug at the first trial.