1 Introduction

Geospatial data analytics play an essential role in prevention and control of various diseases. Geospatial data model consists of landscape, climate and elevation and so on [24, 26, 29]. This type of model is also used to develop the risk maps for the areas which are likely to get more infection compare to other neighboring locations. Image processing is also used to process the geospatial data [17, 20, 26]. In general, image processing methods are widely used in many applications it includes healthcare, restorations and enhancements, movie maker, color code processing, pattern recognition, robot vision and remote sensing. Nowadays, image processing plays a vital role in various applications to make the decisions and solve various issues in day-to-day life. In general, image processing methods have been used in medical fields for disease diagnosis, clinical care and other healthcare services. Image enhancement is widely used to improve the quality of the image generated from various sources. In general, image enhancement methods are also called as image editors. Image processing software’s and tools are used to manipulate the images [43]. For example, image enhancement is often used to increase and decrease the contrast of a digitally stored image. Image contrast plays a vital role in extracting important features for decision making in various fields it includes disease diagnosis, change detection, feature selection and pattern mining. Image contrast is considered as an essential key to deciding the image quality. Image contrast is determined by the variation between the brightness and visual color of two different objects [44]. Nowadays, various image enhancement algorithms are developed to change the contrast of an image. Image enhancement is also used to alter the images with advanced filters and feature extraction methods. In other words, the essential role of an image enhancement technique is to modify or alter the existing image for a given task. It is observed that various image enhancement methods are available to modify a digital image without destroying its originality and features. Geospatial data is collected from various satellites and remote sensing software’s to analyze the spatial distribution of various features. In this paper, ArcGIS software is used to analyze the geospatial data to model the malaria outbreak.

This paper focused on the responsibility of multi-criteria decision analysis in geographical information systems under multi-attribute decision analysis. Our proposed spatial multi criteria decision analysis (SMCDA) is used to identify a suitable land area for the occurrence of malaria. In addition, the proposed geo-spatial based multi-criteria decision analysis is used to make an effective control strategy for the outbreak of malaria in Vellore, Tamil Nadu, India. Data of malaria were obtained from the Directorate of Health Services, National Informatics Centre, Vellore with the official permission from Director of Health Services, Tamil Nadu. Arc GIS 10.2 is used to model the suitability of malaria in Vellore [15]. Weighted overlay and inverse distance weighting techniques are used to find the suitable land parcels. U.S. National Weather Service has invented a deterministic method called IDW at 1972. Inverse Distance Weighting (IDW) is a type of deterministic method, it assigns the weight values based on the neighborhood location and its values [21]. IDW method is dissimilar to GPR and Kriging approaches. Kriging is a type of interpolation method to model probability distribution based on the mean and covariance. Recently, many researchers from various environments have used Kriging to model the emerging diseases. For example, Olaf Berke et al., (2004) have used Kriging for choropleth disease mapping [3].

In addition to choropleth disease, more number of spreading diseases is spatially correlated to the neighborhood locations. For example, vector-borne and zoonotic diseases occur based on the presence of animal hosts, pathogens and susceptible human populations overlap [36]. The minimum temperature and specific humidity are most often considered as significant parameters to model the swine flu [8]. Similarly, Xiao H et al., 2013 have identified dryness, low wind speed and high-pressure are the significant factors for swine flu outbreak in Changsha, China [45]. In this paper the proposed SMCDA model uses demographical data, weather & climate data and socio economic variables to model the malaria outbreak in Vellore district, Tamil Nadu, India. We conclude that our spatial multi criteria decision making model outperform well for developing decision making rules in public health. Fig 1 represents the traditional decision making process. The structure of the study is described as follows: section 1 and section 2 describe the recent work in habitat suitability model. Section 3 describes the study area and data source. The proposed framework is represented in section 4. Results are discussed in section 5. Finally, section 6 concludes the work.

Fig. 1
figure 1

Traditional decision making process

2 Related work

Multi criteria decision making (MCDM) is used to solve various decision making problems in complex environments. Qualitative and quantitative evaluation criteria’s are used in the MCDM model to find the compromise solution to solve any kind of problem. Nowadays, decision making plays a vital role in public health and human life. For example, new emerging diseases such as Ebola and Zika are considered as major issues in human health. In addition, reemerging diseases such as Malaria, Dengue and Chikungunya are also considered as significant problem in all over the world. In recent years, many researchers are used MCDM models to prevent and control of emerging and reemerging diseases [2, 9, 32, 37]. In order to select best alternative from multiple alternatives with various criteria’s, the following MCDM models are widely used such as Grey System Theory (GST), Analytic Goal Programming (GP), Network Process (ANP), Multi-Attribute Value Theory (MAVT), Analytic Hierarchy Process (AHP), Data Envelopment Analysis (DEA) and many others. Thomas L. Saaty has originally developed AHP model to solve composite decision making problems. In general, AHP often used to formulate a decision in contractor prequalification [1].

ANP works as similar to AHP but the difference is AHP always form a hierarchy with an end whereas ANP often forms a network [22]. DEA is often used to monitor various functions of an organization. For example, DEA is used recently to schedule and monitor the bridge maintenance functions [35]. Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is used to make a better decision based on the geometric distance between the events [40]. In addition, Grey System Theory (GST) is also considered as an effective solution and it is developed based on the differential equations to evaluate decisions. Recently, hybrid decision-making model is developed based on the above mentioned MCDM models [14]. Opricovic has originally developed VIKOR (VIsekriterijumska optimizacija iKOmpromisno Resenje) MCDM method to identify compromise solution from various contradictory criteria’s [34]. VIKOR method identifies the individual regret values and compromises solution to achieve better decision [42]. Nowadays, there is a need to develop a decision-making model that process and solve uncertainty, especially in biology and medicine. In general, disease diagnosis and epidemiological analysis have a high level of uncertainty and ambiguity [4]. For example, various diseases may have the different impact on different people. In addition, due to the level of immunization every individual may feel different symptoms for the same disease. This variation would degrade the performance of disease diagnosis based on symptoms [18, 19, 28]. AHP method is used in this paper to solve imprecision and uncertainty problems in decision making. Figure 2 represents the classification of decision making and number of publications in each type.

Fig. 2
figure 2

Classification of decision making and number of publications in each type

Spatial Multi-criteria decision analysis (SMCDA) approach combines the Multi-criteria decision analysis (MCDA) and geographic information system (GIS) to model the spatially explicit and implicit information and to make an effective decision under different scenarios and different environment. SMCDA is widely used in many application it include vector born and air born diseases [13, 25, 27], not only disease modeling but also land suitability modeling for hillside development [6, 30, 31, 41], flood monitoring [46], Moose modeling [11], wild goat (Capra aegagrus) modeling in a mountainous arid area [39], United state rivers modeling [12], Asian elephant [38], walleye modeling in the Sandusky River [10] and Grassland bird [16].

3 Data source and study area

The climatic conditions for each block in Vellore district is collected from the Vellore agriculture department. Malaria incidence data is collected from National Informatics Center (NIC), India. Table 1 depicts the climatic and number of infected people for each block in Vellore district. Figure 3 depicts the raster file for the number of infected (malaria) individuals from August 2009 to July 2010 in the study area of Vellore district; it is the one of the states in Tamil Nadu, India. Figure 4 represents the weather data used in the proposed geo-spatial analytical framework.

Table 1 Rank assignment based on significant variables
Fig. 3
figure 3

Malaria incidence data from August 2009–July 2010

Fig. 4
figure 4

Weather data

4 Proposed framework

The Geo-spatial data model is used to model the disease prevalence and spatial concentration of disease risk, which is not given any absolute solution for the decision makers to control the disease. Disease control strategies must be considered not only the spatial information like landscape, weather and climate, but also spatially explicit information like socioeconomic variable, population density, behavior and natural habits of the people. There is need to develop an integrated approach that consists of spatially implicit and explicit information to identity an effective control strategy that prevents and control of emerging infectious disease in a traditional world.

Decision making is a set of organized procedures used to analyze the complex problems in any environment. In order to visualize the suitable area to get malaria, the proposed geospatial analytical framework is used with IDW method. Proposed geospatial analytical framework is represented in Fig. 5. Multi-criteria decision analysis is a decision-making framework that allows the policy makers to make an effective decision with qualitatively and quantitatively. There is a need to develop an efficient decision-making system that controls the spreading diseases. Malaria is one of the emerging diseases worldwide; the root cause of malaria is weather & climate condition of the study area. The climate condition is often called as spatially implicit information, traditional decision-making models do not use the spatially implicit information it most often uses spatially explicit information such as socioeconomic, natural habits of the people. A geographic information system (GIS) is an emerging technology worldwide that mainly focused on gathering, storing, manipulating, managing and analyzing all types of geographic data (weather & climate). Geographical Information Systems (GIS) is used to map and visualize the epidemic spread in a particular location. In recent years, a number of technologies has developed to find the suitable areas for the occurrence of emerging epidemic diseases. Remote Sensing and Geographical Information Systems (GIS) are the familiar techniques used to map the epidemic spread and spatial modeling of emerging diseases.

Fig. 5
figure 5

Proposed geospatial analytical framework

Spatial statistics and spatial autoregressive algorithms use spatial weight matrix to find the relationship between the neighborhood locations. Figure 6 represents an example of contiguity matrix. The spatial location and its weight matrix are shown in Fig. 7.

Fig. 6
figure 6

Contiguity matrix

Fig. 7
figure 7

Example spatial location

The standardized weight matrix W  for the Fig. 6 can be represented as follows,

$$ {W}^{\ast }=\left[\begin{array}{ccccc}\hfill 0\hfill & \hfill 1\hfill & \hfill 1\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 1\hfill & \hfill 0\hfill & \hfill 1\hfill & \hfill 1\hfill & \hfill 0\hfill \\ {}\hfill 1\hfill & \hfill 1\hfill & \hfill 0\hfill & \hfill 1\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 1\hfill & \hfill 1\hfill & \hfill 0\hfill & \hfill 1\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 1\hfill & \hfill 0\hfill \end{array}\right]=\left[\begin{array}{ccccc}\hfill 0\hfill & \hfill 1/2\hfill & \hfill 1/2\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 1/3\hfill & \hfill 0\hfill & \hfill 1/3\hfill & \hfill 1/3\hfill & \hfill 0\hfill \\ {}\hfill 1/3\hfill & \hfill 1/3\hfill & \hfill 0\hfill & \hfill 1/3\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 1/3\hfill & \hfill 1/3\hfill & \hfill 0\hfill & \hfill 1/3\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 1\hfill & \hfill 0\hfill \end{array}\right]=\left[\begin{array}{ccccc}\hfill 0\hfill & \hfill 0.5\hfill & \hfill 0.5\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0.33\hfill & \hfill 0\hfill & \hfill 0.33\hfill & \hfill 0.33\hfill & \hfill 0\hfill \\ {}\hfill 0.33\hfill & \hfill 0.33\hfill & \hfill 0\hfill & \hfill 0.33\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0.33\hfill & \hfill 0.33\hfill & \hfill 0\hfill & \hfill 0.33\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 1\hfill & \hfill 0\hfill \end{array}\right] $$

5 Result and discussion

5.1 Weight calculation based on Paired comparison analysis (PCA)

Weights are used to rank the significant parameters to get malaria. The initial weights are calculated using paired comparison analysis. The paired comparison analysis is significantly used to rank the parameters when choices are quite different from one another. In addition, paired comparison analysis is also used when decision criteria’s are subjective. Paired Comparison Analysis is an effective solution when decision options are relative to one another. Thus the proposed geospatial based analytical framework uses Paired Comparison Analysis to compute the initial rank. The calculated rank is depicted in Table 1. It is observed that temperature plays a major role among the population aged above 40 and 20–40 respectively. In addition, wind speed, rainfall and population density are also impact more on malaria transmission. Paired comparison and calculated weights are tabulated in Tables 2 and 3 respectively.

Table 2 Paired wise comparison
Table 3 Weight calculation

5.2 Weighted overlay analysis

Figure 8 represents the raster images of various climatic conditions such as maximum and minimum temperature and humidity, rainfall, wind speed and demographical variables of malaria outbreak in Vellore, Tamil Nadu, India. Table 3 depicts the influence of various parameters to model the malaria outbreak. The results generated from Table 3 states that population and rainfalls have the highest influence on malaria outbreak. The influence parameters are used to develop the habitat suitability model for malaria in Vellore, Tamil Nadu, India. Figure 8 represents the raster images of climatic conditions like maximum and minimum of temperature and humidity, rainfall, wind speed. The raster images also contain population, demographical data of malaria in Vellore district. Figure 9 represents a habitat suitability model for malaria in Vellore, Tamil Nadu, India.

Fig. 8
figure 8figure 8

Raster images

Fig. 9
figure 9

Habitat suitability model

Fig. 10
figure 10

Geo-locations suitable for malaria outbreak

Fig. 11
figure 11

Malaria incidence based on male and female

Fig. 12
figure 12

Malaria incidence on various locations

Fig. 13
figure 13

Prediction of malaria incidence

Fig. 14
figure 14

Malaria incidence based on population density and rainfall

5.3 Habitat suitability model based on inverse distance weighting (IDW)

Habitat suitability model is used to identify the land parcels where the malaria outbreak is more likely to happen. Inverse Distance Weighting (IDW) and Weighted Overlay functions are primarily used to find the most suitable spatial locations for the occurrence of the malaria epidemic [23, 47]. Interpolation refers to the property of a point j, Pj, depend upon the measured values for each locations j. Interpolating methods are used analyze the randomly distributed data in the geospatial environment. Interpolating methods are classified as the global method and local method [5, 33]. Global methods use known values to find the unknown value whereas in local methods the unknown value is identified based on the specified number of nearest neighbors [7].

Inverse Distance Weighting is used to find the property at each location, it is given by

$$ {P}_j=\frac{\raisebox{1ex}{${\sum}_{k=1}^n{P}_k$}\!\left/ \!\raisebox{-1ex}{${D}_{j k}^m$}\right.}{\raisebox{1ex}{${\sum}_{k=1}^n1$}\!\left/ \!\raisebox{-1ex}{${D}_{j k}^m$}\right.} $$

W

Pj :

Property at location j.

Pk :

Property at sampled location k.

Dij :

Distance between J to K.

n:

Number of sampled location.

m:

Inverse distance weighting power.

here,

5.4 Reclassification

Reclassification is used to classify or change the old raster values to alternative values. Reclassification is also used to classify many input raster values to the desired values within the range. Reclassification always applied to the entire cell within a zone. The resultant habitat suitability model developed with the help of inverse distance weighting and paired comparison analysis is represented in Fig. 10. ArcGIS 10.2 software is primarily used to develop the habitat suitability model for malaria in Vellore, Tamil Nadu, India. As shown in Fig. 10, there is high suitability index present in the Vellore and its neighborhood locations like Alangayam, Madanur, Gudiyatham, Solignghur, Walajapet, Arakonam. Some areas are less suitability index like Kandili, Jolarpet, Thimiri, Nemili, Natrampali. The geospatial results generated from the proposed habitat suitability model is used for the government to take better decisions against the spread of malaria in Vellore, Tamil Nadu, India. Figures 11 and 12 represent the malaria incidence based on male and female, and various locations respectively. Figure 14 represents the significance of malaria incidence based on population density and rainfall. Figure 13 represents the prediction of malaria incidence based on various independent variables such as Age 21–40, Temperature, Age > 40, Wind speed, Pop Density, Rainfall, Humidity, Age < 20, Smoking, Alcoholic, Diabetics and Wheezing. Table 4 depicts the suitability index count for the habitat suitability model. Tables 5 and 6 represent the influence of various parameters on malaria.

Table 4 Suitability Index
Table 5 Various parameters influence on malaria (1)
Table 6 Various parameters influence on malaria (2)LID

6 Conclusion

Spatial habitat suitability model is developed based on the climatic conditions and social habits of the people who are living in Vellore, Tamil Nadu, India. Inverse Distance Weighting (IDW) function and weighted overlay function are used to predict the suitable locations that most likely affect by malaria. In general, the geospatial data model is used to model the disease prevalence and spatial concentration of disease risk, which is not given any absolute solution for the decision makers to control the disease. Disease control strategies must be considered not only the spatial information like landscape, weather and climate, but also spatially explicit information like socioeconomic variable, population density, behavior and natural habits of the people. There is need to develop an integrated approach that consists of spatially implicit and explicit information to identity an effective control strategy that prevents and control of emerging infectious disease in a traditional world. Decision making is a set of organized procedures used to analyze the complex problems in an organization. Multi-criteria decision analysis is a decision-making framework that allows the policy makers to make an effective decision with qualitatively and quantitatively. There is a need to develop an efficient decision making system that controls the spreading diseases. Malaria is one of the emerging diseases worldwide; the root cause of malaria is weather & climate condition of the study area. The climate condition is often called as spatially implicit information, traditional decision making models does not use the spatially implicit information it most often uses spatially explicit information such as socio economic, natural habits of the people. Spatial Multi-criteria decision analysis (SMCDA) approach combines the Multi-criteria decision analysis and geographic information system (GIS) to model the spatially explicit and implicit information and to make an effective decision under different scenarios and different environment. In this paper, geo-spatial based multi-criteria decision analysis approach is developed to model the malaria outbreak. The future work of this manuscript is to develop various habitat suitability model for emerging diseases.