Introduction

Dengue is an acute infectious disease with Aedes mosquitoes as the main vector (Guzman et al. 2016; Guzman and Harris 2015). The World Health Organization (WHO) reported that dengue was one of the top ten global health threats in 2019. Almost 4 billion people in 128 countries were at risk of dengue (Brady et al. 2012). A study estimated that since 2013, roughly 390 million people have been infected with dengue, of which 96 million had clinical symptoms every year (Bhatt et al. 2013). Since the COVID-19 epidemic, the incidence of dengue fever has risen sharply in many countries, like Singapore. In the past, dengue was mostly confined to tropical and subtropical regions, but now its impact rapidly expanded to the world. Therefore, the establishment of an accurate and early dengue prediction system has been an issue that many scholars have paid more and more attention to.

The machine-learning methods have a non-parametric and non-linear modeling structure (Scavuzzo et al. 2018). They are independent of a priori standards of variable relations and adapted to high-dimensional data, which improve the model’s predictive ability with unprecedented accuracy (Bi et al. 2019; DeGregory et al. 2018; Heo et al. 2019). Machine learning methods have been used to successfully predict the incidence of infectious diseases (Kane et al. 2014; Abbasi and Goldenholz 2019; Jiang et al. 2018).

The application of machine learning to predict dengue incidence has also seen a sharp rise. Salim et al. used the support vector machine (SVM) to successfully predict the dengue outbreak based on meteorological factors in Selangor Malaysia (Salim et al. 2021). Zhao et al. developed the random forest (RF) to estimate weekly dengue cases in Colombia for the next 12 weeks using climatic and socioeconomic factors (Zhao et al. 2020). Recently, as a branch of machine learning, deep learning methods such as convolutional neural network (CNN) and long short-term memory (LSTM) have attracted widespread attention in solving various problems (Yousef et al. 2023; Ahmad et al. 2022; Chopra et al. 2022; Khan et al. 2022). Nguyen et al. used CNN and LSTM algorithms to predict dengue based on climate data in Vietnam (Nguyen et al. 2022). Xu et al. proposed an LSTM model with historical dengue cases and climatic factors, which was effective in predicting monthly dengue cases in 20 cities in mainland China (Xu et al. 2020). Although these models were successfully used to predict dengue incidence, the model and the predictors in different studies were not consistent. Hence, we conducted a systematic review to summarize machine learning methods and predictors in previous literature for predicting dengue incidence based on machine learning methods.

The contribution of this review:

This systematic review aims to review all the published literature on machine learning models for dengue fever prediction. It provides a basis for developing more robust dengue early warning systems in the future by summarizing the characteristics of predictor variables and prediction methods.

Methods

Search strategy

This systematic review was conducted in compliance with the recommendations of the PRISMA guidelines. We searched PubMed, ScienceDirect, and Web of Science databases for articles published up to July 2023. ((Machine learning OR deep learning OR Ensemble OR RF OR SVM OR LASSO OR LSTM) AND (dengue)) was used as the search term.

Eligibility criteria

The eligibility criteria were employed to select suitable papers in the search results in order to make the selected articles as complete and uniform as possible. Three inclusion criteria were as follows: first, this review included only peer-reviewed journal articles published in English full text/pdf, making the information gained authoritative. Second, papers had to incorporate the forecast of dengue. Third, papers must include the machine learning method. Three exclusion criteria were as follows: first of all, abstract-only reports and duplicate titles were omitted. Moreover, papers containing unspecified or suspected dengue were excluded. In addition, papers reporting dengue infection in nonhuman cases were also removed.

Outcome

The outcomes were the incidence rate or cases of dengue as well as dengue serotype (e.g., dengue fever (DF) and dengue hemorrhagic fever (DHF)). The outcomes were converted into the categorical variable according to whether the incidence rate or cases of dengue exceeded a certain threshold when evaluating whether an outbreak of dengue occurred.

Data extraction and analysis

The relevant data were extracted by two independent investigators (Lanlan Fang and Wan Hu): author, year of publication, study area, study period, outcome, predictors, methods used, model validation, evaluation index, and optimal model. Any disagreement was resolved by discussion and consensus with a senior reviewer (Guixia Pan) if needed.

The above key information was extracted from the included studies to make a table. Descriptive statistics were performed based on the characteristics of the predictors and models (including model validation techniques and evaluation metrics).

Results

Literature search

We found 4,517 records in the search and deleted 932 duplicates. The initial search result was 3,585 articles, of which 1022 were considered potentially relevant and required further reading of the abstract for screening. After reading the full text of 126 articles, it was found that 23 of them fully met the eligibility criteria. The screening process is shown in Fig. 1. Basic information of 23 studies including author (year), study area (period), outcome, predictors, method used, model validation, evaluation index, and optimal model are presented in Table 1.

Fig. 1
figure 1

Flowchart of literature search

Table 1 The basic information about 23 studies involved in the review of machine-learning-based prediction models for dengue incidence

Study characteristics

Figure 2 summarizes the types of predictors used in the 23 included articles according to Table 1. 95.7% of the studies utilized meteorological factors. 60.9% of the studies incorporated historical dengue data. 17.4% of the studies considered environmental factors. 17.4% of the studies took into account socio-economic factors. 8.7% of the studies utilized vector surveillance data. 13.0% of the studies considered internet search data. Figure 3 summarizes the combination of predictors used in the 23 included articles. 7 articles (30.4%) were based exclusively on meteorological predictors. 15 articles (65.2%) used a combination of meteorological and other types of factors. Of these studies, 21.7% used meteorological factors and historical dengue cases. 8.7% used a combination of meteorological factors, historical dengue cases, and environmental factors. 8.7% considered meteorological factors, historical dengue cases, and socioeconomic factors. 8.7% incorporated meteorological factors, historical dengue cases, and internet search data. 4.3% considered meteorological and environmental factors. 4.3% used meteorological, environmental, and socioeconomic factors. 8.7% integrated meteorological factors, historical dengue cases, vector monitoring data, and socioeconomic factors. Only one article (4.3%) did not involve meteorological factors, which was based on historical dengue cases and internet search data.

Fig. 2
figure 2

The types of predictors used in the 23 included articles. * Since multiple predictors may be used in a single article, the sum of the factors did not equal 23 and the sum of the percentages did not equal 100

Fig. 3
figure 3

The combinations of predictors used in the 23 included articles

Table 2 lists the specific predictors that appeared in the prediction models of the 23 studies. Among meteorological factors, the majority of studies utilized temperature (20, 87.0%), rainfall (20, 87.0%), and relative humidity (14, 60.9%). Some studies also included wind speed (4, 17.4%), atmospheric pressure (2, 8.7%), evaporation (2, 8.7%), and diurnal temperature range (DTR) (1, 4.3%). In terms of historical dengue data, 43.5% of the studies utilized weekly data, 13.0% used monthly data, and 8.7% used yearly data. Environmental factors included enhanced vegetation index (EVI) (2, 8.7%), normalized difference vegetation index (NDVI) (2, 8.7%), southern oscillation index (SOI) (1, 4.3%), and flood occurrence (1, 4.3%). Socio-economic factors consisted of the population density (3, 13.0%), Gini index (1, 4.3%), education coverage rate (1, 4.3%), and monthly garbage collection quantity (1, 4.3%). Vector surveillance data included the Aedes aegypti index (1, 4.3%) and the Breteau index (1, 4.3%). In addition, internet search data comprised Tweets (1, 4.3%), Baidu search queries (1, 4.3%), and Google search queries (1, 4.3%).

Table 2 The predictors that appeared in the prediction models of the 23 studies (number, percentage)

Figure 4A summarizes the techniques of model validation. 37.5% employed split-sample validation, 29.2% used cross-validation, 25.0% utilized out-of-sample validation, and 8.3% employed retrospective validation. Figure 4B summarizes the model evaluation metrics. These included mean absolute error (MAE) (21.2%), root mean square error (RMSE) (18.2%), accuracy (18.2%), sensitivity (6.1%), specificity (6.1%), area under curve (AUC) (9.1%), mean absolute percentage error (MAPE) (3.0%), mean percentage error (MPE) (3.0%), Theil’s coefficient (3.0%), Matthew Correlation Coefficient (MCC) (3.0%), R-squared (3.0%), mean squared error (MSE) (3.0%), and mean (3.0%). Figure 4C summarizes the optimal models judged based on evaluation metrics in each article. The best models include SVM (6, 26.1%), LSTM (5, 21.7%), RF (4, 17.4%), LASSO (2, 8.7%), ensemble model (2, 8.7%), and other machine learning methods (4, 17.4%).

Fig. 4
figure 4

Model validation techniques (A) and model evaluation metrics (B) and the optimal models judged based on evaluation metrics in each article (C). * Since multiple techniques and evaluation metrics may be used in a single article, the sum of techniques and evaluation metrics did not equal 23

Literature review

We respectively presented the application of SVM, LSTM, RF, LASSO, and ensemble models in the context of dengue epidemiology.

The SVM can solve classification and regression prediction (Huang et al. 2018, Noble 2006). A total of 6/23 articles identified SVM as the best model, of which 4 were for classification and 2 for regression. For classification prediction, Althouse et al. defined the binary outcome as 1 for high-incidence periods and 0 for other periods. The AUC of the SVM model was 0.906 in Singapore and 0.960 in Bangkok (Althouse et al. 2011). Kesorn et al. developed an SVM with an RBF kernel to predict high-prevalence areas in central Thailand with the highest accuracy (96.26%) compared to other models (Kesorn et al. 2015). When Stolerman et al. identified the occurrence of dengue epidemic years, the accuracy of SVM with RBF kernel (91%) was higher than the accuracy of SVM with linear kernel (82%) (Stolerman et al. 2019). Salim et al. used several machine learning methods to assess whether dengue outbreaks would occur, with SVM (linear kernel) providing the best predictions (Salim et al. 2021). For regression prediction, dengue case counts or incidence is a continuous dependent variable. Mustaffa developed an SVM model with the RBF kernel to predict the dengue case counts. The MSE and accuracy of this model were 0.0063 and 86.84% (Mustaffa). Guo et al implemented a support vector regression (SVR) with a linear kernel function to predict the weekly number of dengue cases. (Guo et al. 2017). The R-squared of this model was 0.99, and it achieved the best performance compared to other forecasting techniques.

Recently, deep learning methods have also been increasing in dengue prediction. CNN is widely used to process image data (Salehi et al. 2023) and LSTM is applied to deal with time series problems (Houdt et al. 2020). A total of 5/23 papers had the best performance of the LSTM algorithm compared to other ML tools. Mussumeci and Codeco fitted the models (LSTM, RF, and LASSO) to forecast the weekly dengue incidence, and the LSTM model was the best with an MPE of 0.04 (Mussumeci and Codeco Coelho 2020). Xu et al. developed an LSTM-TL model to effectively predict monthly dengue cases in 20 cities in mainland China (Xu et al. 2020). This model (RMSE=0.91) outperformed other algorithms (LSTM, BPNN, SVM, and GBM). Li et al. fitted an LSTM model for forecasting the number of dengue cases using rainfall, temperature, relative humidity, mean normalized difference vegetation index (NDVI), and historical dengue cases in Brazil (Li et al. 2022). Li further improved this algorithm and constructed the LSTM-ATT model, which has higher prediction performance than RF and LSTM (Li 2022). Nguyen et al. used the deep learning models (CNN, LSTM, LSTM-ATT) for forecasting dengue fever in Vietnam (Nguyen et al. 2022), which also revealed that LSTM-ATT outperformed the base LSTM and CNN.

A total of 4/23 papers had the best performance of the RF compared to other models. Benedum et al. developed an RF algorithm with the best performance to predict dengue case counts (regression) and outbreaks (classification) in 4 to 12 weeks (Benedum et al. 2020). Zhao et al. developed national and departmental RF models and an ANN model to estimate weekly dengue cases for the next 12 weeks in Colombia.

The RF model trained on national data (MAE=24.56) was better than the RF model trained on departmental data (MAE=26.76) and ANN (MAE=25.25) (Zhao et al. 2020). Carvajal et al predicted dengue incidence in Manila based on weather factors and their corresponding lagged effect. The RF model with delayed meteorological effects (MAE=0.15) showed the best predictive accuracy compared to GBM and other statistical models (Carvajal et al. 2018). Gupta et al predicted the dengue case counts in San Juan and Iquitos via various machine learning algorithms. RF produced better results than DT, KNN, SVR, and GNB (Gupta et al. 2023).

A total of 2/23 papers had the best performance of the LASSO algorithm compared to other models. Shi et al. developed a three-month real-time dengue forecast system with the LASSO-derived model. This model forecasted the weekly incidence of dengue and utilized multiple data streams that were updated weekly, including historical dengue case data, climate data, vector surveillance data, and temporal data. The MAPE was the smallest compared with SARIMA, and step-down linear regression (Shi et al. 2016). Chen et al. developed a novel framework based on LASSO regression for producing a spatiotemporal dengue forecast at a neighborhood-level spatial resolution to distinguish between high and low-risk areas (Chen et al. 2018).

The ensemble model combines different models to improve their performance and robustness (Haq et al. 2022). A total of 2/23 papers with the ensemble model were performed. Buczak et al. proposed an ensemble model created by combining three disparate types of component models, which was the best in predicting the total dengue case counts and peak height for Iquitos, Peru (Buczak et al. 2018). McGough et al. developed a high-accuracy ensemble model for predicting the year of dengue outbreaks in Brazil using meteorological factors and historical dengue cases (McGough et al. 2021).

Discussion

Here, we conducted a systematic review to summarize the forecast of dengue incidence based on machine learning methods. We found that 95.65% of the studies incorporated meteorological factors, with the majority utilizing temperature, rainfall, and relative humidity in their machine-learning prediction models. Among the 23 articles, the models most frequently selected as optimal were SVM with 6 articles (26.1%) and LSTM with 5 articles (21.7%).

Machine learning methods are data-driven forecasting methods. In the selection of independent variables, as long as a certain variable can improve the prediction accuracy of the model, it can be introduced into the model regardless of whether it is a risk factor for dengue incidence (Jamshidi et al. 2019; Heo et al. 2019; Peiffer-Smadja et al. 2020). In this review, the predictive variables included not only traditional risk factors that affect the infection of dengue but also other factors that help capture the trend of dengue incidence (i.e., historical dengue data and internet search data (Spratt et al. 2013). First, 95.65% of the articles included meteorological factors, which indicated that meteorological factors were not negligible in predicting the incidence of dengue. Common meteorological indicators included temperature, rainfall, and relative humidity (Wu et al. 2018b, Xiang et al. 2017). In addition, environmental factors (i.e., EVI and NDVI were also considered in some studies (Li et al. 2022) (Zhao et al. 2020), Second, 60.9% of the articles included historical dengue cases. As dengue is an infectious disease, historical cases had a significant impact on the current dengue epidemic. Thus, historical dengue data should be included as a predictor (Ramadona et al. 2016; Jain et al. 2019). Third, 17.4% of studies have also taken into account socioeconomic factors such as population and the Gini index. Fourth, to improve the accuracy of prediction as much as possible, internet search data (i.e., Google Trends, Baidu Index, and Twitter), as currently an emerging predictor, have proven to be a good complement to traditional surveillance data (Wu et al. 2018a). In general, although a wide variety of predictors were incorporated into machine learning models, meteorological factors were non-negligible in the prediction model.

In our review, the SVM was the most frequent (6, 26.1%) optimal model in the 23 included studies. The SVM algorithm has the capability of handling small sample sets, controlling overfitting, and dealing with nonlinear relationships, making it suitable for dengue incidence prediction. Looking at the performance metrics of the developed SVM models, Kesorn et al. obtained the highest accuracy of 96.26 with SVM (RBF) (Kesorn et al. 2015). Althouse et al. reported that the AUC of the SVM model was the highest at 0.960 in Bangkok (Althouse et al. 2011). With the advancement of deep learning methods, LSTM is also becoming increasingly popular, with a total of 5/23 articles being the optimal model. The LSTM has the unique advantage of being able to capture long-term dependencies, predict the incidence of dengue fever at different time intervals, remember important patterns and trends in historical dengue data, and learn complex patterns (nonlinearity and interactions) (Sagheer and Kotb 2019). The LSTM-ATT developed by Nguyen et al. performed the best among multiple deep learning techniques, with an average ranking of 1.60 for RMSE, and 1.95 for MAE (Nguyen et al. 2022). In addition, other algorithms have some unique advantages. The RF can rank the importance of predictors, and Carvajal et al. found that relative humidity, rainfall, and temperature were the best predictors (Carvajal et al. 2018). The LASSO can screen for valuable factors to solve serious collinearity problems, thereby improving the accuracy of predictions (Ranstam and Cook 2018) (Wang et al. 2018). The ensemble model combined the predictions of multiple basic models to obtain a more accurate and robust final prediction. Therefore, in actual prediction, we should not only rely on one method but use multiple methods to fit multiple models and select the model with the best prediction effect according to evaluation criteria.

This review had several limitations. Firstly, our review only included peer-reviewed literature from selected databases and some dengue outbreaks may not have been recorded; therefore, our results should be interpreted with caution. Secondly, most cases of dengue were asymptomatic; thus, there may be some patients with dengue that have not been confirmed and the actual number of cases may be much higher than recorded. Thirdly, because the model evaluation indicators of 17 papers were not the same and different regions varied largely, we cannot use a unified quantitative indicator to determine which machine learning method was the best. Fourthly, in the prediction of dengue incidence, we only discussed machine learning methods. While we did not pay attention to some improved time-series models proven to have higher accuracy.

Conclusions

We reviewed the 23 articles where machine learning methods were successfully applied to predict dengue incidence and confirmed that machine learning methods were attractive enough. The SVM was the most frequent model for dengue prediction with good predictive performance. With the development of deep learning, LSTM might be a more promising model for dengue prediction. More importantly, the meteorological factors including temperature, rainfall, and relative humidity could not be ignored in the prediction model.