Introduction

Growth and development of vectors are the main causes of transmission of vector-borne diseases. It is always considered that suitable breeding sites for vector carrying insects is the main factor in transmission of such type of diseases. Hence, it is essential to know what is vector and how diseases are transmitted. Vectors are living organisms that can spread infectious diseases between individuals or from animals to humans, i.e. transfer virus and pathogens from one infected person or animal to another, causing serious health problems in human populations. These vectors may be bloodsucking insects like mosquitoes, which swig disease-generating microbes and virus during a blood meal from an infected host, i.e. animal or human and later inject this infected blood into a new host during their subsequent blood meal (WHO 2017). Therefore, vector-borne diseases are not possible till it is transmitted by any vector from an infected host to another. Excluding mosquitoes, others vectors are ticks, sand-flies, flies, fleas, triatomine bugs and some freshwater aquatic insects. Chikungunya, dengue fever and dengue haemorrhagic fever, Japanese encephalitis, Kala-azar, filariasis, and malaria are considered as the main vector-borne diseases.

The spatial distribution of vector-borne diseases is determined by different dynamic factors like demography, environment and social factors. Unexpected urbanization, urban growth and climatic factors including climate change, variation in temperature and rainfall, variation in moisture contents can influence vector breeding and growth (Morin et al. 2013; Hii et al. 2016). Moreover, various environmental factors such as changes in local land use, local sanitation conditions, drainage facility and improper management of generated waste in and around households can also affect the transmission of vector-borne diseases due to availability of suitable sites for breeding and can render large populations in towns and cities at risk of viral disease spread by mosquitoes (Rattanarithikul et al. 1995; Mushinzimana et al. 2006; Ayele et al. 2012).

As far as Kolkata Municipal Corporation is considered, malaria and dengue are found as major vector-borne diseases. The proportion of Chikungunya and filariasis is limited and found in a little frequency. Japanese encephalitis and Kala-azar are rarely found here (Chatterjee 2017). The prevalence of malaria is much more in comparison to dengue (Sharma et al. 2014). Although the degree of malaria cases has been reducing from 96,909 in 2010 to 4769 in 2017 with no more deaths from 2011, the cases of dengue have also fallen down from 3546 in 2005 to 662 in 2017 although the deaths in dengue cases reoccur year after year. Malaria and dengue are an ever-seen and common public health problem in Kolkata. Anopheles stephensi is the chief source of malaria transmission in Kolkata and dengue is transmitted by Aedes aegypti (Aedes albopictus).

Recently, KMC has launched different initiatives to reduce vector-borne diseases from the capital city including establishment of mosquito research laboratory, establishment of dengue detection centre, dissemination of dengue report through SMS Alert, disease surveillance system strengthening, plying of speedboats along canals for destruction of mosquito larvae, larvicidal spray along canals using rowing-boats as transport, formation of 21 rapid-action teams for vector control, efficient non-medical staff assigned with vector control responsibilities etc. Along with such type of initiatives some micro-level initiatives are also taken including localised vector detection centres, multilingual leaflets, multi-coloured flex-banners, multi-coloured hoardings, awareness meeting, comprehensive booklet, documentary film, auto-miking and house visit etc. for awakening the residents to lessen the extent of risk of vector-borne diseases.

But the best prevention method of vector-borne disease is to detect and destruct considerable suitable breeding site of vector-carrying parasites. Recently, remote sensing (RS) and geographical information system (GIS) prove their efficiency in detecting and recognising the suitable breeding sites of vector-carrying parasites, susceptible risk zones and area under prone to risk using geospatial technique and spatial analyst tool (Nazri et al. 2012; Ahmad et al. 2017; Ali and Ahmad 2018, 2019). The use of spatial tools in GIS for public health is a significant technique to find spatial association and integration (Richardson et al. 2013). Remote sensing and geographical information system provides up-to-date information on environmental parameters of a particular region or location that influence the vector-borne disease transmissions (Nazri et al. 2009). Instead of considering one single factor, the result becomes weighted on considering multiple factors (Wondim et al. 2017; Ali and Ahmad 2018). Hence, the present study punctuates multi-criteria decision making (MCDM) tool for considering the parameters and making decision towards susceptibility of vector-borne diseases and mapping of risk zonation. The fundamental functioning principle of any MCDM process is same: selection of weighted criteria, selection of alternatives or sub-criteria, selection of aggregation, methods, and ultimately rank of alternatives based on weights or outranking (Majumder 2015). Decision making is based on various data relating to the study problem. It has been suggested that 80% of data used by decision makers are geographical, i.e. spatial in nature (Rikalovic et al. 2014). Decision problems that involve geographical data are denoted as geographical or spatial decision problems (Malczewski 2004).

Several research works have been carried out using RS and GIS in malaria susceptibility, dengue risk zonation and analysis of other vector-borne diseases over time (Nazri et at. 2012; Sarfraz et al. 2014; Tu et al. 2014; Ahmad et al. 2017; Wondim et al. 2017; Ali and Ahmad 2018). Digital elevation model, satellite data and location of GPS superior spatial analysis were utilised for current environment strategy in urban areas (Dongus et al. 2009). Geospatial technology was applied to identify and demarcate malaria-susceptible areas (Delgado et al. 2011). Satellite images with various resolutions, i.e. IRS LISS I, LISS II and WiFS were utilised to recognise suitable vector habitats and susceptible areas (Sharma et al. 1996; Palaniyandi 2004; Palaniyandi and Mariappan 2012). It is always required to rely on authentic and accurate data sets for vector-borne diseases’ control and management to predict susceptible areas and project risk mapping (Robert et al. 2003; Palaniyandi 2013). Mosquito-producing fields were identified in California through correlating Anopheles larva density with reflectance of canopy growth in early season using Landsat TM imagery and GIS technique (Wood et al. 1992). Detection of many vector diseases is directly not possible through field visit and observations, thus Multi-criteria analysis (MCA) along with GIS and AHP technique was applied by integrating various thematic layers for malaria risk mapping West Singhbhum district of the Jharkhand, India (Ahmad et al. 2017). Various socio-economic, geographic and epidemiological factors were integrated to familiar with malarial hotspots (Qayum et al. 2015). Moreover, there are different studies in different geographical locations of earth which have shown the capability of RS and GIS technique in detecting vector-borne diseases (Nakhapakorn and Tripathi 2005; Rochon et al. 2010; Hongoh et al. 2011; Khormi and Kumar 2011; Yadav et al. 2012; Walker et al. 2013; Nazri et al. 2016; Ali and Ahmad 2018, 2019).

The present study also highlights the application of RS data and GIS technique along with analytic hierarchy process as a multi-criteria decision-making tool for identifying susceptible areas and hotspot mapping of vector-borne disease. Thus, present study aimed to use GIS tools and other primary and secondary data for measuring the weight of various environmental, demographic and epidemic factors to make interrelationships towards vector-borne disease and to map susceptible areas. Such mapping of VBDs can provide assistance to health authorities to understand spatial distribution under risk as well as temporal outbreak of such diseases. Risk maps have prepared for public health and decision making because this will help in preventing measure and also provide guideline for implementing control programs and preparing health facilities based on spatial information of area under risk.

Study area

Kolkata is the capital of the Indian state of West Bengal which is located on the east bank of River Hooghly. As a typical riverine city, Kolkata in earlier days was surrounded by marshes, tidal creeks, mangroves, swamps, and wetlands. But now all these have changed with time. Kolkata Municipal corporation (KMC) is the largest urban agglomeration of West Bengal which is located in UTM Zone 45°N with geographical extension of 22°28ʹ 00ʹʹN–22°37ʹ30ʹʹN and 88°14ʹ30ʹʹE–88°25ʹ30ʹʹE (Fig. 1). The KMC has an area of 205.07 Km2 which is divided into 16 administrative Boroughs and, respectively, 144 wards. The mean elevation of the city is 1.5 m–16 m above MSL. Many parts of the city were originally wetlands which were reclaimed over the time to house the bourgeoning population. The annual mean temperature over Kolkata Municipal Corporation is 26.8 °C or 80.24 °F. Monthly mean temperature ranges between 19 and 32 °C. Summers are hot and humid with temperature 34 °C, during May and June the temperatures often exceed 40 °C. Winter or mild winter lasts for highly 2 and half month with lowest temperature between 9 and 11 °C in December and January. The annual average rainfall is near about 1600 mm and highest rainfall occurs during the month of August. According to latest population data (DSH 2014-15) the city has a population of 4,496,694 and 1,024,928 households. The population density of KMC is about 24,306 persons per km2 where, household density ranges between 755 and 23,237 households per km2 by making KMC as India’s third largest metropolitan city as well as the world’s eighth largest urban agglomeration.

Fig. 1
figure 1

Location of the study area

Materials and method

Data sources

Different types of satellite data were collected looking towards requirement of the study. Landsat 8 OLI and TIRS data were used for deriving Land Surface Temperature (LST) which is an important factor for growing disease-carrying mosquitoes. The Landsat OLI and TIRS, 30 m resolution data with UTM projection by default (Path: Row 138/44) were collected from United States Geological Survey (USGS) earth explorer portal and thermal bands were obtained looking towards purpose of use. The Sentinel-L2A data were utilised for preparing Land Use Land Cover (LULC), Normalized Difference Vegetation Index (NDVI) and Normalized Difference Moisture Index (NDMI). Sentinel-L2A data (45QXE & 45QXF) with 10 m and 20 m spatial resolution along with WGS 1984 projection by default were collected from Sentinel Hub EO browser (Table 1).

Table 1 Sources of data collection and their use in mapping of vector-borne diseases

Excluding satellite data, GPS field location, information from Google Earth, demographic data and epidemic data were also used in this study. The demographic data including population, household and literacy were collected from District Statistical Handbook, Kolkata 2015. The densities were calculated by dividing with respective area. To consider suitable breeding sites for vector-borne transmitting agents, GPS locations were collected which were further processed and accuracy measured in Google Earth. These locations include waste bins, open vats of waste, compactor stations and water bodies. These locations were used for proximate analysis. Furthermore, spatial epidemic data were collected from KMC and interpolation was run to determine area under high and low risk. Finally, all these input layers were processed in GIS environment with equal projections and cell size for further processing and analysis.

Study plan

Multi-criteria decision making (MCDM) is considered in present study to analyse susceptibility of vector-borne diseases. The multi-criteria decision making or multi-criteria decision analysis is a decision-making method that helps in structuring, measuring, and judging a decision problem. As a part of MCDA, it is always required to select multi-criteria and prefer more weightage criteria through different techniques. Analytical hierarchy process (AHP) is a widely used MCDA method. Hence, AHP was used in present study. It is a multi-step plan including selection of relevant criteria/factors, data inputs, data processing, analysis, and making accurate decision. Kolkata Municipal Corporation was chosen to carry out such type of study, because the prevalence of vector-borne diseases like dengue and malaria is seen every year from early August to late October and claims many lives year after year. Thus, proper susceptibility zonation of VBDs can enhance better surveillance to reduce the impacts.

The methodology to carry out the study is presented in the following figure (Fig. 2). The important factors (considered as decision criteria) that associated to VBDs occurrence were chosen looking towards the aim of the study. First of all, 13 associated factors were chosen and included them into three categories, i.e. environmental factors, demographic, and epidemic factors and factors related to suitable breeding site. To cheek the decision accuracy, Consistency Ratio (CR) of each selected decision factor was calculated using Saaty’s method. For spatial analysis, the spatial database of each factor was created in GIS Environment. GIS integration was run using weighted linear combination (WLC) to prepare spatial susceptibility map of VBDs.

Fig. 2
figure 2

Chart showing the methodology used for analysing VBD-susceptible zonation in KMC

Determination of decision criteria: data selection and preparation

Environmental factors

Land use land cover (LULC)

Different type of land cover has significant role in providing sites for suitable breeding environment. The incidence of vector-borne disease like chikungunya and malaria is related with water bodies and land under low area (Sheela et al. 2017). Alterations of land-use type including impoundments, dams, irrigation and draining systems that create shelters for the carrying agents have potential effect in transmitting vector-borne diseases like malaria, dengue, and filariasis (Norris 2004). Thus, preparation of LULC map is crucial to classify major area under different land cover and their uses to understand those areas susceptible to mosquitoes’ breeding site.

The LULC map was prepared using Sentinel-2 dataset. Primarily band combination was performed with Near Infrared (Band 8), Red (Band 4), and Green (Band 3) to make false color composite (FCC) infrared. Secondly, object-based signature file was created to run supervised classification and finally, the entire area was classified into five major land-use types and susceptible risk was considered based on classified category (Fig. 3a).

Fig. 3
figure 3

Maps of four selected environmental factors which were taken as decision criteria in susceptibility analysis of VBDs; a land use land cover, b normalized difference moisture index, c normalized difference vegetation index and d land surface temperature

Normalized difference moisture index (NDMI)

The Moisture Index was calculated using NIR and SWIR Bands of Sentinel-2. Many variables are interrelated and correlated with each other. Like the wet areas have high moisture content and concomitantly high vegetation. NIR is useful for classifying the vegetation and SWIR is good for measuring the moisture content of soil and vegetation, i.e. the reflectance in the NIR band is influenced by the leaf’s internal structure and the SWIR reflects the changes in vegetation water content. In SWIR, a darker area highlights higher water content. The following equation was used to estimate moisture content;

$${\text{NDMI}}=\left( {{\text{NIR}} - {\text{SWIR}}} \right)/\left( {{\text{NIR}}+{\text{SWIR}}} \right),$$

where NIR is for the Near infrared (Band 8A) and SWIR is the Short-Wave Infrared (Band 11). For Sentinel-2 data, these bands are with 20 m spatial resolution. NDMI have a values ranging from − 1 to + 1, where − 1 indicates very bad moisture level and + 1 indicates very high moisture level (Fig. 3b).

Normalized difference vegetation index (NDVI)

Mosquito-borne diseases were found in highest proportion in areas with low land and lowest forest cover (Sheela et al. 2017). Many researchers emphasized that NDVI is a significant factor for analysing mosquito-borne diseases. Moreover, in present day deforestation seems to be associated with the growth of mosquito and rise in mosquito-borne disease transmission (Norris 2004). Vegetation indices, specially based on the NDVI, the habitat suitability for different species of mosquitoes can be described (Kleinschmidt et al. 2000; Brown et al. 2008; Lourenço et al. 2011). The mosquito population is positively correlated with Normalized Difference Vegetation Index (Gemperli et al. 2006). The amount of precipitation can also be linked with vegetation index. The green vegetation is related with higher amounts of precipitation and is considered as suitable environment for mosquito habitats (Brown et al. 2008).

To know the vegetation index (VI) of KMC, NDVI was calculated (Fig. 3c). It was used for quantifying the green vegetation. NDVI normalizes the green leaf scattering in the near-infrared wavelength and chlorophyll absorption in the red wavelength. Hence, near-infrared and Red bands are required. Band 8 and Band 4 of Sentinel-2 dataset were used for this purpose and NDVI was calculated using the following equation;

$${\text{NDVI}}=\left( {{\text{NIR}} - {\text{RED}}} \right)/\left( {{\text{NIR}}+{\text{RED}}} \right),$$

where NIR is Band 8 and RED is the Band 4. As per Sentinel-2 EO products guideline, the value of NDVI ranges from − 1 to 1. Negative values (values approaching − 1) represent water bodies. Values close to zero (i.e. − 0.1 to + 0.1) mostly represent barren areas of rock, sand, or snow. Low but positive values represent shrub and grassland (mostly it ranges from 0.2 to 0.4) and high positive values indicate dense forest.

Land surface temperature (LST)

Land surface and air temperature affect the feeding behaviour of the mosquito and play an important role in the growth of mosquito larva (Sandru 2014). The survival and the life cycle of vectors are also effected by land surface temperature. Sandru (2014) found that mosquito survival is low at extreme temperatures, i.e. very high or very low temperature is harmful for mosquito’s growth. The optimum temperature in which mosquito can grow and survive is found between 28 °C and 32 °C and temperatures < 5 °C and or > 40 °C are deadly for the mosquitoes.

The land surface temperature of KMC was derived from TIRS 1 (Band 10) of Landsat 8 dataset (Fig. 3d). To calculate LST using thermal infrared sensor data, first, the conversion of digital number to TOA (Top of Atmosphere) radiance is required. OLI and TIRS datasets can be converted to TOA spectral radiance using the radiance rescaling factor which is provided in metadata file of Landsat dataset. Thus following equation was used:

$${L_\lambda }={M_L} \times {\text{ }}{Q_{{\text{cal}}}}+{\text{ }}{A_L}$$

where Lλ = TOA spectral radiance (Watts/(m2*srad*µm)), ML = band-specific multiplicative rescaling factor taken from metadata (RADIANCE_MULT_BAND_x, x is the thermal band), Qcal = quantized and calibrated standard product pixel value and AL = band-specific additive rescaling factor taken from metadata (RADIANCE_ADD_BAND_x, x is the thermal band).

Now, the top of atmospheric brightness temperature has to calculate from the reflectance value using the equation:

$${\text{BT}}=\frac{{K2}}{{{\text{In}}~\left( {\frac{{K1}}{{L \lambda}}+1} \right)}} - {\text{273}}.{\text{15,}}$$

where BT = atmospheric brightness temperature in Kelvin, which is further subtracted by 273.15 to calculate the degree Celsius, K2 = band-specific thermal conversion constant (K2_CONSTANT_BAND_x, x is the band number), K1 = band-specific thermal conversion constant (K1_CONSTANT_BAND_x, x is the band number), Lλ = TOA spectral radiance.

Finally, brightness temperature was converted to land surface temperature using the following equation:

$${\text{LST}}=\frac{{{\text{BT}}}}{{(1~+~\left( {\lambda ~ \times ~BT/c2} \right)~}} \times {\text{Ln}}\left( e \right)$$

where BT = brightness temperature, λ = wavelength of emitted radiance, c2 = h × c/s = 1.4388 × 102 m K = 14,388 µm (h, c, and s are constant), e = emissivity which was calculated from near-infrared and Red band of the same dataset.

Demographic and epidemic factors

A densely populated area creates a higher chance of experiencing an epidemiological outbreak even if the vector house index is low in that area. The respective vector does not have to travel far to search for its victims (Sandru 2014). High human population density and poor water supply are regarded as major contributors to dengue epidemics (Gubler 2004; Barreto and Teixeira 2008). An unexpected range of human population densities between 3000 and 7000 person/km2 in rural villages is prone to high dengue outbreaks (Schmidt et al. 2011). The risk of contracting West Nile disease is increased by living in low-density housing in the presence of vegetation (Sandru 2014). Household density, housing types and surrounding built-up environment have offered vital insights into the epidemiology of dengue fever and potential risk (Braga et al. 2010).

Literacy is highly related with human awareness about diseases transmission. High literate population may be well aware about the precautions and consequences of transmissible diseases and hence have low risk in contrast with low literate population. Research showed that people well conscious about the cause of malaria and found that awareness increased with the increasing literacy status and knowledge about mosquito-borne diseases were also significantly better among literate than illiterate population (Rasania et al. 2002).

Based on literature survey, the above three demographic factors (population density, household density and literacy rate) were chosen in susceptibility analysis of VBDs. Data on population and literacy were collected from Bureau of Applied Economics & Statistics, Department of Statistics & Programme Implementation, Govt. of West Bengal. First of all, area of each respective wards of KMC was calculated using spatial statistical tool in GIS environment. The total number of population and household of each ward were then divided with respective area and density of population and household was calculated. The following formula was used to calculate population and household density;

$$Dx=~\frac{{Nx}}{{Ax}}$$

where Dx is the density of population or Household, Nx is the total residing population or households and Ax is the area of respective ward in Km2. Choropleth is technically a thematic areal map in which areas are shaded in color or patterned in proportion. Choropleth technique was used to prepare the literacy, population and household density map in five categories from very low to very high using natural break in GIS environment (Fig. 4a–c).

Fig. 4
figure 4figure 4figure 4

Selected factors related with demographic and epidemic factors; a population density, b household density, c literate population, d reported dengue cases and e reported malaria cases

Epidemiology is the analysis of the distribution of disease conditions and public health. Dengue and malaria are considered as an endemic disease instead of pandemic as these are confined in certain places of the earth, i.e. tropical region due to availability of temperature and moisture which offers best breeding sites for the disease-carrying host. For the epidemic parameters, cases of dengue and malaria for the year of 2012–2015 were collected for spatial analysis. Dengue and malaria are main vector-borne diseases in KMC, other VBDs including West Nile Virus, Lyme disease, Japanese Encephalitis, Chikungunya etc. are not seen here. Only 2 cases of Japanese Encephalitis (JE) were found during 2014. Thus, only malaria and dengue cases were considered for present work. For spatial distribution of dengue and malaria cases, spatial interpolation was used based on given inputs on recorded dengue and malaria cases (Fig. 4d, e).

Factors related to suitable breeding sites

Places with garbage and waste accumulation, stagnant sources of water, drains, stagnant canal with garbage accumulation, in and around community waste bins and open waste dumping vats are considered as the most appropriate breeding sites for mosquitoes. Improper handling and management of waste and their unscientific disposal cause adverse impact on all components of the environment and human health. Unscientific disposal of wastes here and there and constructing built-up areas can be the reason for creating suitable sites for breeding. Disposal of garbage and dumping these into drains can also be causes of breeding grounds for mosquitoes. Hence much attention is to be paid to provide appropriate treatment and disposal of waste generated in the built-up area. In an evolving urban policy, priority is to be given to the installation of safe treatment and disposal facility of waste especially, sewage, sullage, and solid waste (Sheela et al. 2017).

Wastes tend to accumulate near settlements, offers breeding sites to vectors like rodents, insects and animals (Cointreau 2006). Dumping of wastes and poorly managed landfills affect the health, quality of life and impact on local environment and livelihood. The storage of waste inside the house, kitchen, room and nearby waste bins and delay of refuse trucks to collect such waste can create disease-carrying pathogens such as vector, rodent and vermin having high risk of disease spread like malaria, dengue and other viral diseases (Abdellah and Balla 2013). The vector-borne disease transmission is strongly connected with location of breeding site (Carter et al. 2000). Transmissible diseases like malaria and dengue are known to be highly prevalent around specific mosquito breeding sites and they can normally be transmitted only within certain distances from the breeding sites. The range of dispersal is typically between a few hundred meters and a kilometre, but rarely exceeds 2–3 km (Kobbe et al. 2006). Land disposal facilities add to global warming and subsequently vector-borne disease abundance and pathogen survival. Proximity with landfills blurs the lines between occupational impacts on scavengers and environmental health impacts on surrounding communities (Sabesan et al. 2001; Cointreau 2006).

Keeping in mind these circumstances, four factors were chosen which are associated for providing suitable breeding sites. These factors including distance from waste bins, distance from open vats, distance from compactor stations and distance from water bodies were selected for proximity analysis. This operation was performed in two steps, i.e. GPS-based record of such locations throughout the study area was gathered during a 45 days field survey and these points were plotted in Google earth to export and create point shape files. Finally, multiple ring buffers of waste bins, distance from open vats, distance from compactor stations and distance from water bodies were created with a common interval of either 200 m or 300 m based on their extension for proximity analysis (Fig. 5a–d).

Fig. 5
figure 5

Selected factors relating with suitable breeding sites responsible for vectors’ growth and diseases’ transmission; a proximity to waste bins, b proximity to open vats, c proximity to compactor stations and d proximity to water bodies

Multi-criteria decision approach

Decision-making approaches are gaining much importance as a potential tool for assessment of multifaceted real problems due to their innate ability to judge different options. Decisions about susceptibility analysis of VBDs typically involved the assessment of multiple criteria according to study objectives (Carver 1991; Eldin and Sui 2003; Malczewski 2004). Analytic Hierarchy Process (AHP) is one of the more commonly applied MCDM approach (Lootsma and Schuijt 1997). AHP allows decision makers to structure their decision criteria into a hierarchy of sub-criteria or alternatives which can then be analysed individually. Thus, in first step, the aim was defined to identify susceptible areas of vector-borne diseases. In second step, the hierarchy was constructed by the main criteria to reach the defined aim. In the present study, 13 criteria were selected looking towards their importance in VBDs susceptibility. Then in third step, selected criteria were divided into alternatives for further analysis. Then, pair-comparison matrix (PCM) was established and weights were calculated subsequently in next steps.

AHP is essentially based on pair-wise comparisons matrix of the defined criteria which are used to establish the weight to calculate the importance or performance scores for selected criteria and alternatives (Syamsuddin and Hwang 2009). PCM is a relative ranking-based matrix table used to calculate the weight value of all sub-criteria under a selected criterion in decision making process. As all the selected criteria will not have the same significance for a particular instance, hence, the significance rank of all alternatives is complimentary and divides the same alternatives in the matrix (Saaty 1990; Saaty and Vargees 2001). The AHP employs an underlying scale with values from 1 to 9 to rank the relative importance for two criteria or alternatives (Table 2). Here, extreme significance is indicated by nine and equal significance by one between criteria of the matrix (Saaty 1990; Malczewski 1999). The pair-wise comparison matrix mainly has the criteria of reciprocity which is arithmetically denoted as 1/RC, R rank of C criteria in pairwise comparison matrix (Saaty 2012). PCM gives the weights of each criterion with comparison to all others. Once the comparison rank is fitted, the weightage has to be calculated for each respective alternative to judge the consistency for considering into decision. After constructing the pairwise matrix, relative weights/eigenvectors are calculated using following equation;

Table 2 Ranking scale of analytic hierarchy process adopted from Saaty to establish PCM
$$Ax{\text{ }}={\text{ }}{\lambda _{\text{max} }}X$$

where A is the comparison matrix of n criteria (i.e. priority matrix), X is the Eigenvector of n criteria (i.e. priority vector) and λmax is the Eigenvalue.

Calculated through,

$$\left( {{w_1} \times C{i_1}} \right){\text{ }}+{\text{ }}\left( {{w_2} \times C{i_2}} \right){\text{ }}+{\text{ }} \ldots {\text{ }}\left( {{w_n} \times C{i_n}} \right)$$
$${\lambda _{\text{max} }}={\text{ }}Ax1/Wx1,{\text{ }}Ax2/Wx2 \ldots {\text{ }}Axn/Wxn$$

where w1wn are the weights of alternatives, Ci1Cin are the ranks of alternatives and λmax is the Eigenvalue.

Once the weights are calculated it is essential to check the consistency in result. As the numeric ranks are derived from the subjective preferences of individuals, it is highly possible to have some personal bias in making the final matrix of judgments. For this purpose, AHP always offers a measure of the consistency of PCM by calculating the consistency ratio (CR). Saaty (1980) provides the calculated RI value for matrices of different sizes of alternatives as given here (Table 3). The ratio is designed in such a way that if the value of the ratio exceeds 0.10, it will be considered as inconsistent for judgments and value 0 as perfectly consistent, while values 0 or close to 0 (i.e. 0.01 or 0.03) are highly acceptable. Consistency ratio (CR) was calculated through the help of CI and RI. Mathematically, CR is expressed as;

Table 3 Standard value of random index (RI) given by Saaty to measure consistency in judgement
$${\text{CR}}={\text{CI}}/{\text{RI}}$$

CI was calculated by putting the value from above estimation by applying the following simple equation;

$${\text{CI}}=\frac{{{\lambda _{\text{max} }} - ~n}}{{n - 1}}$$

CI = consistency index, λmax = average of ΣW1Wn. RI depends on the number of elements being compared (i.e. number of alternatives in PCM) and if the value ranges from 0 to 0.09, the matrix will be considered as reasonably consistent and may continue the process of decision making using AHP.

Many researchers successfully used AHP as a multi-criteria decision making approach in different fields of study. Analytic hierarchy process was utilised to assess the various non-monetary criteria (Velasquez and Hester 2013). The combination of multi-criteria decision analysis and cost-benefit analysis was used for risk analysis in transport infrastructure appraisals (Ambrasaite et al. 2011). The Fuzzy multi-criteria decision-making approach and GIS were used for siting landfill in south Texas (Chang et al. 2008). The MCDM approach was used in combination with GIS to define risk zones of malaria epidemic in the central highlands of Madagascar (Rakotomanana et al. 2007). MCDM based analytic hierarchy process was applied for analysing criteria weight sensitivity to the study of spatial sensitivity in suitability evaluation (Chen et al. 2010). Several more studies applied analytic hierarchy process for analysing susceptibility, susceptibility, sensitivity and suitability (Syamsuddin and Hwang 2009; Tavares et al. 2011; Gorsevski et al. 2012; Rikalovic et al. 2014; Tu et al. 2014; Qayum et al. 2015; Eskandari et al. 2016; Nazri et al. 2016; Guler and Omralıoglu 2017; Ali and Ahmad 2018, 2019).

Susceptibility analysis of VBDs using weighted overlay analysis

Susceptibility analysis for vector-borne diseases was estimated using weighted overlay analysis (WOA). It is considered as an effective technique to resolve spatial complexity for susceptibility analysis (Kuria et al. 2011). AHP was applied to decide the more significant factors in the selected hierarchy of different inputs (Parimala and Lopez 2012). Thus, the selected thematic layers were integrated in GIS environment using linear weighted combination (Girvan et al. 2003). Susceptibility analysis of VBDs was determined based on weight calculated through AHP. All selected causative raster layers were defined with same cell size and put respective weight to combine into a single susceptibility index. The following equation was considered;

$${\text{SA}}=\mathop \sum \limits_{{j=1}}^{n} \left( {WjXj} \right)$$

where Wj is the weight value of decision factors j, Xj is the selected raster input and n is the number of selected decision criteria.

Results and discussion

The prevalence of vector-borne diseases in Kolkata Municipal Corporation is seen year after year, although the frequencies of occurrence and numbers of death have reduced. Thus, effort was made to carry out a spatial analysis on susceptibility zonation of such vectors and vector-borne diseases. In this regard, a complex decision-making process was carried out through multi-criteria selecting, rating, weighting, and analysing. A total 13 factors were selected which were pre-categorised into three decision parameters. These factors are environmental parameter, demographic and epidemic parameter and parameter related to suitable breeding site. Thus, the hierarchy was constructed with three decision parameters, 13 factors and 65 alternatives or sub-factors. Primarily, after calculating the weight value of each factors and sub-factors by AHP, single layer of susceptibility map was created for each of three parameters and finally VBDs susceptible map of KMC was created using overlay of each of this single layer in GIS environment.

VBDs susceptible zones based on environmental factor

Environmental factor is related with surrounding environmental and physical conditions. Four indicators were selected as environmental factor, including land use land cover (LULC), normalized difference moisture index (NDMI), normalized difference vegetation index (NDVI) and land surface temperature (LST). Thus, a total 20 alternatives were given risk value in a scale from 1 to 5, (5 defines very high susceptible zones and 1 defines very low susceptible zones toward VBDs) based on the scale of importance adopted after Saaty which ranges from 1 to 9. Table 4 shows 20 alternatives which were attributed by different importance from 1 (equal important towards VBDs outbreak) to 9 (Extremely important towards VBDs outbreak) and subsequently risk values were arranged to define which alternatives has greater susceptibility to VBDs (Table 4).

Table 4 Intensity of importance and respective risk value of selected environmental factors

The main function of AHP is pairwise comparison based on their relative importance which helps in measuring quantitative judgment in new fields (Saaty 1980). Thus, choice-based intensity of importance was given to particular alternatives looking for their association towards vector-borne diseases and pairwise comparison matrix were formed to obtain the weightage value, to check the result and error of decision making and empirical bias while assigning ranks in respective field, the Consistency Index (CI) and Consistency Ratio (CR) were considered. The weights of all 20 alternatives along with CI and CR are summarized here (Table 5). Saaty suggested that if the ratio of consistency and index for the resultant random matrix is found > 0.1, the decision-making process and selection of rank will considered as inconsistent (Saaty and Vargas 2012). The result shows that the CR of all selected factors was always calculated < 0.1 (0.0503 for LULC, 0.0313 for NDMI, 0.0152 for NDVI and 0.0375 for LST). Thus, it is may considered that the selection of rank was always in acceptable choice when pairwise comparison matrix was established. Now, to derive final output layer in GIS environment, the comparison matrix of selected four factors was established (Table 6). Finally, the VBDs’ susceptible zones were extracted through weight-based overlay of the chosen environmental factors using the GIS tool (Fig. 6). The result shows that the proportion of very high and very low susceptible zones-to-VBDs is limited and maximum area comes under moderate susceptible zones.

Table 5 Pairwise comparison matrix along with consistency ratio of four environmental factors
Table 6 Calculation of weight value of environmental factors for overlay analysis
Fig. 6
figure 6

Susceptibility analysis of VBDs based on selected environmental factors

VBDs’ susceptible zones based on demographic and epidemic factor

Demographic factor is related with population, household, level of education, etc. for vector-borne diseases assessment and susceptible zonation. Demographic factors are important because humans have high potential for transmission of such kind of diseases. In this regards, three causative indicators were taken as demographic parameters based on availability of data, including population density (PD), household density (HD) and number of literate population (NLP). Along with demographic parameter, previous epidemic records were also considered. The frequency of malaria outbreak was much greater than dengue cases during last year. Here, two more indicators were taken, i.e. reported malaria cases (RMCs) and reported dengue cases (RDCs). Thus, a total 25 alternatives were given risk value in a scale from 1 to 5, based on the scale of importance of intensity which ranges from 1 to 9. Table 7 shows 25 alternatives which were rated by different importance from 1 to 9 and subsequently risk values were put to define which sub-factor has more susceptibility to VBDs (Table 7).

Table 7 Intensity of importance and respective risk value of demographic and epidemic factors

The choice-based intensity of importance was given to particular alternative looking at their connection towards VBDs. The pairwise comparison matrix was constructed to obtain the weight value and to check the result and error of decision making and empirical bias while assigning ranks in respective field. The weights of all 25 alternatives along with their CI and CR are summarized (Table 8). The result reveals that consistency ratio of all selected indicators was found between 0.01 and 0.03 (0.0241, 0.0221, 0.0104, 0.0275 and 0.0320 for PD, HP, NLP, RDCs and RMCs, respectively). Each alternative was arranged with risk scale on the basis of weight calculated from PCM. Now, to derive final output layer, the comparison matrix of selected five factors was prepared (Table 9). GIS integration was created to design susceptible zones of VBDs based on demographic and epidemic parameter (Fig. 7).

Table 8 Pairwise comparison matrix along with consistency ratio of five demographic and epidemic factors
Table 9 Calculation of weight value of demographic and epidemic parameters for overlay analysis
Fig. 7
figure 7

Susceptibility analysis of VBDs based on demographic and epidemic parameter

VBDs' susceptible zones based on suitable breeding sites

The growth of vectors is associated with garbage damping. Wet and watery places near waste bins and open vats are highly suitable places for breeding sites (Fig. 8). Thus, proximity to spatial location of waste bins and open vats was taken and proximity to nearer locations was considered as very high to high susceptible to VBDs and vice versa. Along with these factors, proximity to compactor station was also taken into consideration because it was found during field visit that the liquid excreta is gathered into the drain after the waste compaction process which was left uncovered and this may offer suitable breeding sites for mosquitoes. One more factor, i.e. proximity to water bodies was also considered based on field visit. Thus, total of four indicators were taken as factors related to suitable breeding sites, including proximity to waste bins (PWBs), proximity to open vats (POVs), proximity to compactor stations (PCSs) and proximity to water bodies (PWTBs). Thus, a total 20 alternatives were given risk value based on same scale as used with other factors (Table 10).

Fig. 8
figure 8

Some considerable areas of suitable sites for growing vector-borne disease transmitting parasites which were identified during field visit (a) and b unclean drain in Diamond harbar Road, c stagnant water in drain near Nabapally, d drain in residential areas at Garden Reach filled with waste water and other solid waste from households, e dumped areas in Talata near Sealdah, f stagnant water body dumped with mixed waste in Tollygunge (the plastic carry bags, bottle and other such type container provides suitable places for growing diseases carrying parasites) and f author during discussion with a local respondent in Makaltala village near Dhapa dump site

Table 10 Intensity of importance and respective risk value of selected factors related to suitable breeding sites

After assigning choice-based intensity of importance to particular alternatives towards VBDs, PCM was constructed to obtain the weight value for checking the result. The weights of all 4 indicators along with their CI and CR are summarized below (Table 11). The result reveals that the CR values of all selected factors, i.e. PWBs, POVs, PCSs and PWTBs were calculated as 0.0511, 0.0371, 0.0508, and 0.0311, respectively. It was found that the CR was always < 0.1 for all the selected factors. Finally, same as the other factors, the VBDs’ susceptible zone was designed through constructing comparison matrix of selected four indicators (Table 12). The weight-based overlay of the chosen factors related to suitable breeding sites was derived using the spatial tool (Fig. 9). The result shows that areas circulating nearest to waste bins, open vats, compactor stations and water bodies come under more susceptible zones and areas farer to these locations are come under least-risk zones.

Table 11 Pairwise comparison matrix along with consistency ratio of four parameters related to suitable breeding sites
Table 12 Calculation of weight value of parameters related to suitable breeding sites for overlay analysis
Fig. 9
figure 9

Susceptibility analysis of VBDs based on parameters related to suitable breeding sites

Spatial susceptibility analysis of VBDs

After preparing individual layers based on selected factors, the final VBDs hotspot zone was created using weighted linear combination (WLC) of these individual layers in GIS environment. The weight of each parameter was assigned using AHP for integrated mapping. For creating spatial susceptibility to VBDs, the following equation was used (Qayum et al. 2015):

$${\text{VBDsH}}={H_{{\text{envp}}}} \times {H_{{\text{demp}}}} \times {H_{{\text{sbs}}}}$$

where VBDsH = vector-borne diseases hotspot; Henvp = hotspot identified based on environmental factors; Hdemp = hotspot identified based on demographic and epidemic factors; Hsbs = hotspot identified based on factors related to suitable breeding sites.

The final VBDs raster layer was created and reclassified according to the susceptible risk zone into 5 class, i.e. Very high (VH), high (H), moderate (M), low (L) and very low (VL) areas (Fig. 10). The areal proportion of susceptible zones to VBDs was estimated that 0.75%, 18.107%, 48.21%, 32.36% and 0.55% area were found under VH, H, M, L and VL susceptible zones to VBDs, respectively. The result of the study shows that about 81% area comes under moderate to low susceptible zones, whereas about 19% area comes under high to very high susceptible zones.

Fig. 10
figure 10

Mapping of vector-borne diseases hotspot

For the out breaking of diseases like water-borne, vector-borne or mosquito-borne are not due to any single factor. Many related factors are responsible for such. The present study tried to use such associated multi-criteria for spatial susceptibility analysis of VBDs. The result revealed that based on environmental factor, areas with water bodies, moisture content index in air > 0.33, vegetation index > 0.43 and surface temperature ranges between 30 and 32 °C has greater susceptibility to VBDs. Based on demographic and epidemic factor, areas with population density > 81,686, household density > 16,070, literate population < 5888, high reported cases of dengue and malaria has more prone to VBDs. Among suitable breeding sites, areas ≤ 200 m from waste bins, ≤ 300 m from open vats, ≤ 300 m from compactor stations and ≤ 200 m from water bodies has high susceptibility to VBDs. Out of total 13 selected criteria in this study, it was found that moisture index, land surface temperature, water bodies and nearness to open waste dumping vats have greater role in outbreak of vector-borne diseases.

Accuracy measurement

It is a very crucial task in spatial analysis, to evaluate the result with actual ground. As per KMC report in last year from January to October more than 600 dengue and more than 2000 malaria cases were reported in KMC out of which 6 were death in dengue. So, a field survey was carried out during June and July of 2018 in different wards of each 16 Borough of KMC. Looking towards surrounding settings and respondent’s response on structured questions, different GPS locations were recorded. These locations were further verified in Google Earth open software and export as point locations. After sampled categorization, result found that in many areas of southern, south-eastern and western part has very low prevalence of MBDs, some areas found with no cases of such diseases, while some areas of northern and central parts has high to moderate prevalence of MBDs. Thus, some areas with very high prevalence of such diseases were plotted to merge with susceptibility analysis of MBDs (Fig. 11). The result shows that these reported areas are much uniform with final layer and constantly merged with very high to high zones as derived from spatial susceptibility analysis using geospatial technique and MCDM approach.

Fig. 11
figure 11

Map showing assessment of accuracy (a) places with reported VBDs during last year (b) multi-criteria-based susceptible map of VBDs using geospatial technique

Conclusion

Disease-susceptibility analysis and assessment are required for better surveillance and strategy making to prevent such issues from public health. Susceptibility zonation and mapping are gaining great significant for this requirement. Materials and methods are considered as the base for logical and accepted result for this type of study. Thus looking towards research problem and main objective, geospatial technique and multi-criteria decision-making approach were used for analysing spatial susceptibility of vector-borne diseases in Kolkata Municipal Corporation. Analytic hierarchy process, as a decision-making tool of MCDM approach was selected for ranking and weighting the spatial information. AHP in combination with geographic information system offers useful result for the identification of vulnerable zones. The great advantage of AHP is the pairwise comparison matrix which helps in developing rank-based decision to judge decision criteria. Hence, wrong selection of rank or judgement of any chosen factor will offer consistency ratio of > 0.1, which is not acceptable and make decision process inconsistent. The consistency measurement of present study showed that it was always between the range of 0.01–0.07, it signifies acceptance for decision making based on intensity of importance. So, final overlay was carried out using derived weight of selected factors. The result revealed that maximum portion of study area belongs to moderate susceptible zones followed by low and high, i.e. covering about 48.21% (99.10 km2) area under moderate, 32.36% (66.3 km2) under low and 18.10% (37.6 km2) under high susceptible zones to vector-borne diseases. While very little portion was found under very high and very low with areal extension of 0.75% (1.50 km2) and 0.55% (1.13 km2), respectively. The study result also revealed that out of all selected criteria in this study, moisture index, land surface temperature, water bodies and nearness to open waste dumping vats have greater importance in outbreak of vector-borne diseases.

Finally, the study result was evaluated through locations of previously vector-borne disease-affected areas. The assessment found that highest malaria and dengue recorded areas of previous year are closely associating with high and very high susceptible zones derived from spatial susceptibility analysis. To seek more accurate and precise result, the present research also recommends to highlight on more ground-level information, long and extensive GPS-based field study and comprehensive studies of breeding sites of mosquitoes and vectors. The present study also suggests to use very high resolution satellite data including IKONOS (1 m), QuickBird (60 cm), GeoEye-1 (50 cm), WorldView-1 (50 cm) and WorldView-4 (30 cm) which will provide more accurate result in identification of susceptible areas.