Dataset Information


Weekly dengue forecasts in Iquitos, Peru; San Juan, Puerto Rico; and Singapore.

ABSTRACT:

BACKGROUND: Predictive models can serve as early warning systems and can be used to forecast the future risk of various infectious diseases. Conventionally, regression and time series models are used to forecast dengue incidence from dengue surveillance data (e.g., case counts) and weather data. However, these models may be limited by their assumptions and by the number of predictors they can include. Machine learning (ML) methods are designed to work with a large number of predictors and thus offer an appealing alternative. Here, we compared the performance of ML algorithms with that of regression models in predicting dengue cases and outbreaks 4 to 12 weeks in advance. Because many countries lack sufficient health surveillance infrastructure, we also evaluated the contribution of dengue surveillance and weather data to the predictive power of these models.

METHODS: We developed ML, regression, and time series models to forecast weekly dengue case counts and outbreaks in Iquitos, Peru; San Juan, Puerto Rico; and Singapore from 1990 to 2016. Forecasts were generated using available weekly dengue surveillance and weather data. We evaluated the agreement between model forecasts and actual dengue observations using mean absolute error (MAE) and the Matthews correlation coefficient (MCC).

RESULTS: For near-term predictions of weekly case counts using surveillance data, ML models had 21% and 33% less error than regression and time series models, respectively. However, using weather data only, ML models did not demonstrate a practical advantage. When forecasting weekly dengue outbreaks 12 weeks in advance, ML models achieved a maximum MCC of 0.61.

CONCLUSIONS: Our results identified 2 scenarios in which ML models are advantageous over regression models: 1) predicting weekly dengue case counts 4 weeks ahead when dengue surveillance data are available and 2) predicting weekly dengue outbreaks 12 weeks ahead when dengue surveillance data are unavailable. Given the advantages of ML models, dengue early warning systems may be improved by the inclusion of these models.
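
The two evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the case counts are made-up illustrative values, and the outbreak rule (a week counts as an outbreak when cases exceed a fixed `OUTBREAK_THRESHOLD`) is a hypothetical stand-in, since the record does not state how outbreaks were defined.

```python
import math


def mean_absolute_error(observed, forecast):
    """Average absolute difference between observed and forecast counts."""
    if len(observed) != len(forecast) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(o - f) for o, f in zip(observed, forecast)) / len(observed)


def matthews_corrcoef(observed, forecast):
    """MCC for two equal-length sequences of booleans (outbreak / no outbreak)."""
    if len(observed) != len(forecast) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    tp = tn = fp = fn = 0
    for o, f in zip(observed, forecast):
        if o and f:
            tp += 1
        elif not o and not f:
            tn += 1
        elif f:
            fp += 1
        else:
            fn += 1
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, MCC is reported as 0 when the denominator is 0.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


# Illustrative weekly case counts (invented for this example, not from the study).
observed_cases = [12, 30, 45, 20, 8]
forecast_cases = [10, 25, 50, 22, 9]

# Hypothetical outbreak rule: a week is an outbreak if cases exceed this value.
OUTBREAK_THRESHOLD = 25

mae = mean_absolute_error(observed_cases, forecast_cases)
mcc = matthews_corrcoef(
    [c > OUTBREAK_THRESHOLD for c in observed_cases],
    [c > OUTBREAK_THRESHOLD for c in forecast_cases],
)
print(f"MAE = {mae:.2f} cases/week, MCC = {mcc:.2f}")
```

MAE is reported in the same units as the case counts, so lower is better. MCC ranges from -1 to 1, where 1 is perfect agreement on outbreak weeks and 0 is no better than chance.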


PROVIDER: S-EPMC7567393 | BioStudies | 2020-01-01

REPOSITORIES: biostudies

Similar Datasets

2016-01-01 | S-EPMC4764515 | BioStudies
1000-01-01 | S-EPMC5010413 | BioStudies
2020-01-01 | S-EPMC7595636 | BioStudies
2020-01-01 | S-EPMC7384612 | BioStudies
2020-01-01 | S-EPMC7537891 | BioStudies
2016-01-01 | S-EPMC4816319 | BioStudies
2019-01-01 | S-EPMC6932763 | BioStudies
2018-01-01 | S-EPMC6283552 | BioStudies
2020-01-01 | S-EPMC7654779 | BioStudies
1000-01-01 | S-EPMC5131307 | BioStudies