Introduction

In the developing technology of science, the area of remote sensing (RS) provides the advantage of acquiring information about the surface of the earth in the past, present, and also for future. The RS is the knowledge of finding information about the physical features of an area on the earth’s surface through the measurement of reflected radiations from a particular distance by using a satellite or aircraft. The geospatial satellite RS technology plays a dynamic role for RS researchers in many aspects like geo-structural mapping, monitoring of the soil moisture, vegetation, forest fires, road monitoring, floods and droughts, deforestation, LU/LC, or urban planning (Yuan et al. 2020, Weiss et al. 2020, Wellmann et al. 2020, Bauer 2020, Cheng et al. 2020). The LU/LC prediction has been considered as needed and important research in the field of RS. Land cover refers to the earth’s surface cover areas like urban infrastructure, water bodies, bare soil, vegetation, agricultural lands, mountain regions, and forest-covered area. Land use refers to the utilization of the land by people on earth for different socio-economic activities. The information about the LU/LC over several decades around the world has been captured and stored in the database of different satellites and the information has been sensed by scientists for performing research on LU/LC prediction. The importance of LU/LC prediction research is to provide the LU/LC information for the particular area to the government officials, forest department, urban planners, and land resource management to take necessary actions for the protection of the LU/LC environment (Mohan et al. 2021, Vivekananda et al. 2021; Talukdar et al. 2020). From the observations of many researchers, the research on the LU/LC prediction is processed with the following steps that include data acquisition, pre-processing, LU/LC classification, accuracy assessment for validating the classified map, LU/LC change detection, modeling of environmental variables that affecting the LU/LC environment, LU/LC prediction, and the validation of the LU/LC prediction. Different algorithms and techniques have been used by researchers for predicting the LU/LC map for the specific area on the surface of the earth (Navin and Agilandeeswari 2020a, Ridding et al. 2020, Hasan et al. 2020, Li et al. 2020, Alharthi et al. 2020).

In the scientific field of RS, the data was collected through satellite systems or airborne sensors with different spectral, spatial, temporal, and radiometric resolutions. The use of multispectral data has an advantage in providing higher accuracy in the research of LU/LC prediction. A few of the high-resolution multispectral geospatial data used by researchers around the world include Sentinel, Landsat Series, IKONOS, SPOT, REIS (Rapid Eye Earth Imaging System), Linear Imaging Self-Scanning sensors (LISS-III and LISS-IV), Sentinel, Moderate Resolution Imaging Spectroradiometer (MODIS), ASTER Global DEM (Digital Elevation Model), and Cartosat DEM. The other data used for performing the research on LU/LC prediction include aerial photographs, ground survey data, Google Earth Engine (GEE) images, open street maps, and government-dependent data (Kodheli et al. 2020; Feng et al. 2020; Li et al. 2021). The noise, geometric, atmospheric, and other sensor errors present in the raw geospatial satellite images are corrected through the process of pre-processing. The most used techniques by researchers for the rectifying of geospatial satellite images are image de-striping, Quick Atmospheric Correction (QUAC) module, Dark Object Subtraction (DOS) module, Apparent Reflectance Model (ARM), ASCII Coordinate Conversion, georeferencing, FLAASH (Fast Line-of-sight Atmospheric Analysis of Hypercubes) module, orthorectification, wavelet dimensionality reduction, rescaling, histogram equalization resampling, Linear Discriminant Analysis (LDA), F mask method, and Minimum Noise Fraction (MNF) method (Tamiminia et al. 2020; Navin and Agilandeeswari 2020b; Kuchkorov et al. 2020). The features in the geospatial satellite images have been extracted using different unsupervised algorithms and the pixels in each feature help the researchers in performing further processing in obtaining information about the LU/LC classes. The unsupervised optimization technique has been widely used in the process of digital image processing for extracting features from geospatial satellite data. For extracting the features from the geospatial satellite image, optimization algorithms have been introduced for the clustering process. The researchers used the clustering objective as the minimum sum of the squared error for the optimization process. Based on the clustering objective, some of the optimization algorithms used for extracting the features from the geospatial satellite images are Genetic Algorithms (GA), Cuckoo Search Optimization (CSO), Ant Colony Optimization (ACO), Artificial Bee Colony (ABC), Shuffled Frog Leaping Optimization (SFLO), HHO, and Particle Swarm Optimization (PSO) (Sawant and Prabukumar 2020, Sheoran et al. 2020, Thyagharajan and Vignesh 2019, Sheoran et al. 2021, Rodríguez-Esparza et al. 2020, Singh et al. 2020).

The unsupervised classification algorithms are used for classifying the unknown features in an image into clusters or groups without any training process. Some of the most used unsupervised classification algorithms used by researchers in the field of RS are the Gaussian Mixture model, K-means, Fuzzy C-means (FCM), and ISODATA (Iterative Self-Organizing Data Analysis) clustering. The supervised classification algorithms are used for classifying the image by training the known features of an image with the reference image for obtaining good classification accuracy. Some of the most used supervised classification algorithms used by researchers in the field of RS are support vector machine (SVM), Mahalanobis distance, random forest classification (RFC), multivariate adaptive regression spline (MARS), parallelepiped classification, maximum likelihood classification (MLC), spectral angle mapper (SAM), k-nearest neighbor (kNN), and minimum distance to mean classification (Reddy and Kumar 2022, Singh et al. 2021, Alshari and Bharti 2021, Kulkarni et al. 2020, Yashin et al. 2020, Balha et al. 2021). The ground truth data is compared with the LU/LC classified data for calculating accuracy. Based on the accuracy results of the classification algorithms, the LU/LC classification map has been processed for further processing. The LU/LC change detection has been calculated for the classified LU/LC map between different periods (Chughtai et al. 2021; Mishra and Jabin 2020).

Some of the independent variables that provide the impact on LU/LC change include slope, aspect, elevation, precipitation data, hill shade, Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), Normalized Difference Soil Index (NDSI), census data, and distance variables (distance from edge of the forest, road, urban area, agricultural land, and water bodies) (Loganathan et al. 2022, Zeferino et al. 2020, Shafizadeh-Moghadam et al. 2020, Kayet et al. 2021). Based on the inputs of dependent and independent variables of the specific location, the LU/LC prediction was performed. The LU/LC change classification map for the specific study area was considered the dependent variable, and the factors that affect the LU/LC map of the study area were considered the independent variables. In the developing field of science and technology, the RS researchers used different prediction algorithms for finding future LU/LC changes around the world. Some of the prediction methods used by the researchers are Artificial Neural Network (ANN), Markovian and Cellular Automata (MCA), Logistic Regression (LR), Autoregressive Integrated Moving Average Model (ARIMA), Long-Short-Term Memory Neural Network (LSTM), CNN (Convolutional Neural Network), and Back Propagation Neural Network (BPNN) (Mohanrajan and Loganathan 2022, Floreano and Moraes 2021, Said et al. 2021, Seydi et al. 2021, Gavrilovskaya et al. 2021, Sankarrao et al. 2021). The scope of the research is based on the performance of the different algorithms used for different application areas. For providing better predictive results, the ensemble system or multi-classifier system learning-based approach was used by the researchers. The ensemble learning system is the process of combining multiple classification models for improving the model’s accuracy. The use of ensemble learning is to provide a less error rate on regression and high classification accuracy for statistical intelligence problems. The algorithms used in the ensemble learning system have been fused and the results are attained through the voting and averaging-based methods (Dou et al. 2021, Benbriqa et al. 2021; Phan et al. 2021; Kutlug and Colkesen 2021; Hu et al. 2021).

The environmental changes that have been observed in the past and present have been a concern for the future because of the worsening of nature and its impact on human health. The findings of the LU/LC changes for the past, present, and future were used for the appropriate planning and utilization of the LU/LC environment and its resources. So, the area of RS has been a significant tool for understanding the changes that happened in the past and present with the available satellite datasets. In the scientific field of the RS environment, the importance of the LU/LC prediction problem has been analyzed for the time series data by the community of scientists around the world. From the detailed observations, the selection of the satellite data, and the suitable algorithms for extracting the LU/LC classes for different periods is considered the first challenge in performing LU/LC change prediction research. The precise selection of spatial drivers that brings the impact of LU/LC change has been a challenge during the LU/LC prediction analysis. Providing the impact of multi-satellite information of LU/LC features through surface temperature maps by using LU/LC prediction models remains the challenge. The sustainable growth of the LU/LC environment for the time series data requires an accurate classification and prediction map, which was considered the strong motivation for this research. The main objective of our work is to provide the ensemble model of feature extraction-based LU/LC classification and prediction model with less misclassification rate. In our research work, we have proposed the ensemble-based LU/LC prediction model with less misclassification rate.

The main contribution of our research provides advanced research on the LU/LC prediction analysis through an ensemble model of unsupervised and supervised algorithms. The results will assist the government officials of land resource management in taking needed actions in protecting the LU/LC environment and its nature. In our research work, we have used the impact of multi-satellite information through the LISS-III and Landsat satellite images for the area of Javadi Hills, India. Our proposed flow of LU/LC prediction analysis is shown in Fig. 1. The main contributions of our work are as follows:

  1. i.

    An efficient LU/LC classification system has been developed by using the ensemble model of fuzzy HHO and supervised machine learning classifiers through a PCA-based fusion analysis.

  2. ii.

    The AVBaI has been calculated by using the LU/LC indices of the NDVI and Normalized Difference Bareness Index (NDBaI).

  3. iii.

    The E-LU/LC vegetation and bare soil map have been obtained by combining the features of the AVBaI with the results of the LU/LC classification system.

  4. iv.

    The LST map that affects the vegetation and bare soil features of Javadi Hills has been calculated and used as the spatial variable map during the process of LU/LC prediction.

  5. v.

    With the inputs of the E-LU/LC and LST map of Javadi Hills, the EP-LU/LC map has been obtained by using the MC-MLP neural network model.

Fig. 1
figure 1

Proposed ensemble-based LU/LC prediction flow chart

The rest of the paper is organized as follows: the “Area of the study and data acquisition” section presents the area of the study. The methods used for performing the LU/LC prediction are described briefly in the “Materials and methods” section. The experiments and the results of the proposed ensemble models are given in the “Results” section. The detailed discussions of this research work are described in the “Discussion” section. The “Conclusion” section presents the conclusions of this work.

Area of the study and data acquisition

Our research aims to perform the LU/LC prediction for the forest and non-forest-covered region of Javadi Hills, Tamil Nadu, India. The Eastern Ghats extension of Javadi Hills separates the range of the Vellore and Tiruvannamalai districts. It lies between the geographic coordinates of 78.75E12.5N–79.0E12.75N. The GCS-UTM (Geographic Coordinate System-Universal Transverse Mercator) of the 44N projection system was used for extracting the satellite image of Javadi Hills.

The research in Javadi Hills provides information about how much the impact of humans and nature had influenced the LU/LC area for different periods IN the forest and non-forest-covered regions. The thematic view of Javadi Hills was prepared using the shapefiles of India, Tamil Nadu, and Javadi Hills map which has been extracted using DIVA-GIS system software (http://www.diva-gis.org/) and Google Earth Engine platform (https://www.google.com/earth/). The Javadi Hills location map is shown in Fig. 2. Table 1 shows the specification of the geospatial time-series satellite images. The multispectral LISS-III time-series satellite images of Javadi Hills, India, for the years 2012 and 2015 were collected from the Bhuvan RS data of the NRSC, the Indian Geo-platform of ISRO (www.bhuvan.com). The LISS-III time-series satellite images were used for extracting the LU/LC features and the process for generating the LU/LC classification and prediction map of Javadi Hills. The geospatial Landsat (8 and 7) time-series satellite images of Javadi Hills, India, for the years 2012 to 2021 were collected from USGS, USA (https://earthexplorer.usgs.gov). The TIRS Landsat bands of Javadi Hills were used for estimating the LST. The TIRS bands were not present in the LISS-III satellite image. The Landsat NIR, SWIR, TIRS, and RED bands of Javadi Hills were used for estimating the vegetation and bare soil values. The Google Earth Engine data (https://www.google.com/earth/) for the years from 2012 to 2021 were used as the reference data during the process of accuracy assessment.

Fig. 2
figure 2

Javadi Hills — location map

Table 1 Satellite image specification

Materials and methods

In this section, we provided the detailed concepts of geospatial technology followed by pre-processing, feature extraction, supervised classification, fusion, prediction, and evaluation metrics of the proposed model. The geometric, atmospheric, and radiometric corrections were used in our research for correcting the extracted geospatial satellite image. The fuzzy HHO-based unsupervised algorithm was used for extracting the needed features in the pre-processed time-series geospatial satellite image. The supervised machine learning classifiers (SMC) used in our work includes SVM, RF, and ML algorithms. The PCA was used for fusing the results of SMC. The AVBaI has been calculated by using the LU/LC indices of the NDVI and Normalized Difference Bareness Index (NDBaI). The E-LU/LC vegetation and bare soil map have been obtained by combining the features of the AVBaI with the results of the LU/LC classification system. The LST map that affects the vegetation and bare soil features of Javadi Hills has been calculated and used as the spatial variable map during the process of LU/LC prediction. With the inputs of the E-LU/LC and LST map of Javadi Hills, the EP-LU/LC map has been obtained by using the MC-MLP neural network model.

Geospatial technology

In the developing field of geospatial technology, the RS, Geographic Information System (GIS), and Global Positioning System (GPS) play an important part in monitoring, identifying, locating, measuring, and modeling geospatial data. The initial stage of modeling the geospatial satellite data is to collect information from the satellites. The RS is the science of acquiring the needed information about the surface of the earth through aerial photographs or satellites. Based on different applications in the field of RS satellite, image processing is the latest technology used for collecting high-resolution images. There are different satellites operated around the world by different countries with different spectral, spatial, and temporal resolutions. Some of the satellites are Earth Observing-1 (EO-1), Resourcesat-1, and Resourcesat-2, Landsat series, Sentinel 2 missions, Moderate Resolution Imaging Spectroradiometer (MODIS), Rapid Eye Earth Imaging System (REIS), Quick Bird, and Digital Elevation Model (DEM) (Lechner et al. 2020, Radočaj et al. 2020).

The LU/LC prediction has been found in emerging research for more than 50 decades, and currently, it had been considered an important research problem for scientists around the world. The information on the LU/LC prediction helps to insist the concerned government officials, urban planners, and forest departments take action in protecting the LU/LC environment (MohanRajan et al. 2020). In our research work, we have used the multispectral LISS-III images from Resourcesat-1, and Resourcesat-2, Landsat Images from Landsat 8 (Operational Land Imager (OLI) and the Thermal Infrared (TI) Sensor), and Landsat 7 (Enhanced Thematic Mapper Plus (ETM +)). The geospatial satellite data is extracted in the raster format, where each pixel in the data is associated with an exact geographical location, and the value represents the features of the region. For better understanding, we presented the raster format of LISS-III satellite data of Javadi Hills in Fig. 3. The histograms of our raster data provide information about the pixel value distribution as shown in Fig. 4.

Fig. 3
figure 3

Raster format of the subset LISS-III satellite image of Javadi Hills

Fig. 4
figure 4

Histogram representation of LISS-III satellite data of Javadi Hills

Satellite image pre-processing

Pre-processing is considered to be the necessary progression in the field of RS. The geospatial image restoration or rectification is done for correcting the radiometric, geometric, atmospheric, and topographic distortions in the satellite data. The importance of pre-processing is to provide good visibility to the satellite images to attain better accuracy during the further process of LU/LC prediction (Kumar and Jain 2020, Ma et al. 2020). In our research work, we have used pre-processed methods like layer stacking, band rendering, haze reduction, georeferencing, and line drop-out error correction to provide good visibility to the satellite images.

Layer stacking and band rendering

The process of combining the multiple bands of a geospatial satellite image with the same rows and columns into a single targeted multispectral image is said as layer stacking. The images that needed to be stacked should have the same set of coordinates, whereas the spectral and temporal resolutions should be the same (Dez et al. 2021, Rahman et al. 2020). For better representation of the raster data, band rendering of contrast enhancement has been processed based on the presence of the multiband band, single band, and topographic bands of the satellite data (Gonzalez and Yamamoto 2020, Nádudvari et al. 2020).

$$\left({X}_{M},{Y}_{M}\right)=\frac{\sum_{i=1}^{N}\left({x}_{i},{y}_{i}\right)}{N}$$
(1)

Equation (1) represents the calculation of the layer stacking of the images. For each band \(i,\) the \(1\le i \le N\) represents the bands with the same set of pixel coordinate values \(x, y\). When all the inputs have the same coordinates, the values in the images are added and the average value is calculated for representing the multispectral image \(M\) with the same pixel coordinates \(X\) and \(Y\). All the bands (GREEN, RED, NIR, and SWIR) in the LISS-III sensors were stacked and presented as a multispectral satellite image. For better understanding, we presented the layer-stacked and band-rendered multispectral LISS-III satellite data of Javadi Hills in Fig. 5.

Fig. 5
figure 5

Layer stacking and band rendering of LISS-III satellite data of Javadi Hills

Haze reduction

The presence of atmospheric aerosols like cloud shadows, fog, and other darkened substances in the geospatial satellite data is considered to be noisy data. The haze reduction process of atmospheric correction increases the accuracy of the geospatial satellite data visually and statistically (Nazeer et al. 2021, Ilori et al. 2019).

Equation (2) represents the calculation of the haze-removing function. For better understanding, we showed the haze-reduced LISS-III satellite data of Javadi Hills in Fig. 6.

$$I\left(X\right)=J\left(X\right)t\left(X\right)+AL(1-t(X))$$
(2)

where \(I\) denotes the input multispectral image, \(X\) represents the pixel coordinates (x, y) of the input multispectral image, J denotes the scene radiance of the multispectral image, \(AL\) indicates the atmospheric light, and \(t\) denotes the multispectral image transparency.

Fig. 6
figure 6

Haze reduced LISS-III satellite data of Javadi Hills

Georeferencing

Georeferencing is the process of correcting the geometrical coordinates (X, Y) of the geospatial satellite image in the field of RS. The main use of georeferencing is to extract the region of interest (RoI) and to correct the position through each pixel that represents the coordinate or location values in the geospatial satellite data. The shifting, scaling, and rotation transformation function of the first-order derivative helps in resampling the input geospatial data by providing the exact Ground Control Points (GCP) and Digital Number (DN) value of each pixel to the output geospatial satellite data. The Bilinear Interpolation Sampling (BIS) method uses the weighted average of the input DN values to assign the highest weighted value to the output pixel of the geospatial satellite data (Van Ha et al. 2018, Pham et al. 2020).

Equations (3) and (4) show the input and output geometric coordinate system of the geospatial satellite image. Equations (5) and (6) show the equation for the first-order polynomial transformation. Equation (7) shows the equation for the BIS method. For better understanding, we showed the georeferenced RoI LISS-III satellite data of Javadi Hills in Fig. 7. The georeferencing has been performed for extracting the RoI (region of interest) in the geospatial data of Javadi Hills and hence the RoI for our research work falls between 78.80E12.56N and 78.85E12.60N. The size of each pre-processed satellite image of Javadi Hills is 256*200 pixels.

$$x={TF}_{1}\left(X,Y\right)$$
(3)
$$y={TF}_{2}\left(X,Y\right)$$
(4)

where \(x, y\) denote the geometric coordinates of the input geospatial satellite image, \(X,Y\) represent the coordinates of the RoI geospatial image, and \({TF}_{1}, {TF}_{2}\) denote the transformation functions.

$$x={A}_{0}X+{B}_{0}Y+{C}_{0}$$
(5)
$$y={A}_{1}X+{B}_{1}Y+{C}_{1}$$
(6)

where \({A}_{0}\) denotes the pixel width of the geospatial satellite image, \({B}_{0}\) denotes the rotation term, \({C}_{0}\) denotes center most value \(x\) of the upper-left pixel of the geospatial satellite image, \({A}_{1}\) denotes the rotation term, \({B}_{1}\) denotes the negative pixel height of the geospatial satellite image, and \({C}_{1}\) denotes the centermost value \(y\) of the upper left pixel.

$${DN}_{wt}= \frac{\sum_{n=1}^{4}\frac{{R}_{n}}{{{D}_{n}}^{2}}}{\sum_{n=1}^{4}\frac{1}{{{D}_{n}}^{2}}}$$
(7)

where \({DN}_{wt}\) denotes the weighted values of DN, \({R}_{n}\) denotes the neighboring DN values, and \({{D}_{n}}^{2}\) represents the squared distance of the input location in the geospatial satellite image.

Fig. 7
figure 7

Region of interest (RoI) map of Javadi Hills

Line drop-out error correction

Line drop-out error is the sensor error present in the geospatial satellite image. Radiometric correction is the process used for correcting the line drop-out error of the geospatial satellite image. The zero value information in the bands will lead to the line dropout error. The inverse distance weighting (IDW) interpolation method was used method for filling the raster region with no data values. The average of the above and below the line of the no-data values was calculated using the IDW method. By using the gap mask data or by using the nearby known weighted location values, the average values were calculated for the missing value locations (Srivastava et al. 2019; Charrua et al. 2021).

The IDW expression for fixing the line drop-out error in the geospatial satellite image is shown in Eq. (8). For better understanding, we have shown the process of line drop-out error correction in Fig. 8 by using the gap-filled image of Landsat 7 data through the process of IDW.

$${Z}_{j}= \frac{{\sum }_{i}{~}^{ \mathrm {Z}_{i}}\!\left/ \!{~}_{\mathrm {d^{n}}_{ij}}\right.}{{\sum }_{i}{~}^{\mathrm 1}\!\left/ \!{~}_{\mathrm {d^{n}}_{ij}}\right.}$$
(8)

where \({Z}_{i}\) denotes the known data points in the geospatial satellite image, \({Z}_{j}\) represents the unknown data points in the geospatial satellite image, \({d}_{ij}\) mentions the distance measure to the known points, and \(n\) denotes the user-weighted parameter. The gaps that cause the scan line error in the RED, NIR, SWIR, and TIRS bands of the Landsat satellite were corrected by using radiometric correction. Figure 9 represents the overall pre-processed multispectral LISS-III Time Series Satellite Images of Javadi Hills for the years 2012 and 2015. The results derived from the pre-processed Landsat bands are shown in the “Extracting the dependent features through the ensemble LU/LC map of Javadi Hills” section.

Fig. 8
figure 8

Line drop-out error corrected Landsat TIRS image of Javadi Hills

Fig. 9
figure 9

Pre-processed multispectral LISS-III bands of Javadi Hills. a 2012. b 2015

Fuzzy HHO-based supervised machine learning classifiers for extracting the LU/LC features

The unsupervised fuzzy HHO algorithm was used for finding the unknown features from the pre-processed satellite image. The feature-extracted map was used as an input for finding the LU/LC classes by using the supervised machine learning classifiers (SVM, RFC, and MLC). The PCA was used for fusing the results of the supervised machine learning classifiers.

Fuzzy Harris hawks optimization algorithm

Feature extraction is an important process that helps in increasing the accuracy of the classification models by extracting the needed features from the geospatial satellite data. The features are part of the information from the content of the geospatial satellite data. The new set of extracted feature values will be different from the original values. In the field of RS, object-based clustering and optimization methods were used consistently for extracting the features from the geospatial satellite data. The features are unknown and the process of an unsupervised-based optimization algorithm helps in extracting the unknown features in an image into clusters or groups without any training process (Matci and Avdan 2020; Al-Qaisi et al. 2021; Sihwail et al. 2020). In our research work, we have proposed the unsupervised fuzzy HHO technique for the optimal selection of cluster centers for extracting the needed features in the pre-processed LISS-III satellite images.

The swarm intelligence-based optimization algorithm helps the population members by the group information and through each other’s position. Harris hawks are one of the smartest birds in the world. The uniformity and cooperation among these birds were recognized as the best solution to targeting the rabbits. The different views on the attacking and escaping behavior of Harris hawks were observed and mathematically modeled for handling optimization problems. In HHO, we have a group of Hawks as the candidate solutions, and the best-obtained solution is called the rabbit. The exploration and exploitation phase has been employed for performing the HHO (Heidari et al. 2019; Abualigah et al. 2021; Iswisi et al. 2021). In the exploration phase, Harris hawks search for the rabbit (Prey). The gripping eyes of the hawks help them to spot and track the rabbit. In a few cases, it finds difficult to see the rabbit. In this case, the hawks will wait and monitor the place more visibly for tracking the rabbit. For each iteration, all the hawks are considered as the candidate solutions, and the fitness value has been calculated for each hawk based on the planned rabbit. Later, the hawks may wait in some positions for detecting the rabbit through Eq. (9). HHO uses a direct method for calculating the average position of the hawks using the following Eq. (10).

$$X \left(t+1\right)= \left\{\begin{array}{c}{X}_{\mathrm{rand}}\left(t\right)- {r}_{1}\left|{X}_{\mathrm{rand}}\left(t\right)- {2r}_{2}. X\left(t\right)\right| rand\ge 0.5\\ \left({X}_{\mathrm{rabbit}}\left(t\right)- {X}_{M}\left(t\right)\right)- {r}_{3}\left(LB+ {r}_{4}\left(UB-LB\right)\right) rand<0.5\end{array}\right.$$
(9)
$${X}_{M}\left(t\right)= \frac{1}{N} \sum_{i=1}^{N}{X}_{i}\left(t\right)$$
(10)
$$E= {2E}_{0}(1- \frac{t}{T})$$
(11)

In the HHO, the energy coefficient changes the behavior of the hawks from searching to hunting. HHO assumes that the rabbit’s energy is reducing while escaping from the hawks. This assumption has been calculated using Eq. (11). When the escaping energy of the rabbit is more than or equal to 1, the HHO sends the hawks to travel to different locations in search of the rabbit. When the rabbit’s energy got reduced, then the hawks will search the neighborhood for the solution during the exploitation phase. The exploitation stage focuses on the vicinity of the acquired solutions through the hard and soft besieges. In this phase of the exploitation stage, the hawks target the rabbit through its previous position. The hawks monitor and follow the chasing strategy as soon as the rabbit challenges to escape. The two variables, \(r\) and \(\left|E\right|\), have been used for the indication of events that occurred during the hawk’s optimization. The variable \(\left|E\right|\) denotes the escaping energy of the rabbit, and \(r\) denotes the probability of escaping. When \(r<0.5\), the rabbit has a higher chance to escape from the hawk, and \(r\ge 0.5\), there is less chance for the rabbit to escape from the hawk.

$$X \left(t+1\right)= \Delta X\left(t\right)-E\left|J.{X}_{\mathrm{rabbit}}\left(t\right)-X(t)\right|$$
(12)
$$\Delta X\left(t\right)= {X}_{\mathrm{rabbit}}\left(t\right)-X(t)$$
(13)
$$J=2 ( 1- {r}_{5} )$$
(14)

The HHO has been performed based on the four possible tactics, and they are soft besieges, hard besieges, soft besieges with progressive rapid dives, and hard besieges with progressive rapid dives. In soft besieges, the rabbit has some energy to escape from the hawk where \(r\ge 0.5\) and \(\left|E\right|\ge 0.5\). Hence, the hawks make the rabbit lose energy by surrounding it before its plans for the surprise jump. The soft besieges are mathematically formulated in Eqs. (12), (13), and (14). In hard besieges, the rabbit is tired with a low chance of escaping from the hawk where \(r\ge 0.5\) and \(\left|E\right|<0.5\). Hence, the hawks hardly surround the rabbit before the surprise jump. The hard besieges update the next location of the hawk through Eq. (15).

$$X \left(t+1\right)= {X}_{\mathrm{rabbit}}\left(t\right)- E \left|\Delta X\left(t\right)\right|$$
(15)

In soft besiege with progressive rapid dives, the rabbit still has the energy to escape from hawk attack where \(r< 0.5\) but \(\left|E\right| \ge 0.5\). Hence, the hawk makes a clever zigzag movement with several dives around the rabbit before the surprise jump. In this stage of soft besiege, the hawks will update the position in two steps. In the first attempt, the hawks will target the rabbit by calculating the next move. The first step is shown in Eq. (16). Through the comparison of the last dive and the possible results, the decision has been made to either dive or not. If it is not, the hawks will start performing irregular dives while attacking the rabbit, based on the \(LF\) concept. Hence, the quick dive mechanism is formulated and shown in Eq. (17). The levy flight function is formulated in Eq. (18). The position of the Harris hawks in the soft besiege phase is updated and the mathematical expression is shown in Eq. (20).

$$Y= {X}_{\mathrm{rabbit}}\left(t\right)- E \left|{JX}_{\mathrm{rabbit}}\left(t\right)- X(t)\right|$$
(16)
$$Z=Y+S\times LF (\mathrm{D})$$
(17)
$$LF \left(x\right)=0.01 \times \frac{\mu \times \sigma }{{\left|\upsilon \right|}^{\frac{1}{\beta }}}$$
(18)
$$\sigma = \left(\frac{\Gamma \left(1+\upbeta \right)\times \mathrm{sin}(\frac{\pi \beta }{2})}{\Gamma \left(1+\frac{\beta }{2}\right) \times \beta \times {2}^{\frac{\beta -1}{2}}}\right)$$
(19)
$$X \left(t+1\right)= \left\{\begin{array}{c}Y if F\left(Y\right)<F(X\left(t\right))\\ Z if F\left(Z\right)<F(X\left(t\right))\end{array}\right.$$
(20)

In hard besiege with progressive rapid dives, the rabbit does not have sufficient energy to escape where \(r< 0.5\) but \(\left|E\right|< 0.5\). The Harris hawks perform some rapid dives to target and kill the rabbit. The movement of the Harris hawks in the phase of hard besiege is formulated through some levy flight functions if they are failed. The position of the hawks in iteration \(t+1\) has been controlled by Eq. (20). Hence, the position of \(X\) and \(Y\) is updated and shown in Eqs. (21) and (22). The symbolic representation of HHO is shown in Table 2.

Table 2 Meaning of Symbols in HHO
$$Y= {X}_{\mathrm{rabbit}}\left(t\right)- E \left|{JX}_{\mathrm{rabbit}}\left(t\right)- {X}_{M}\left(t\right)\right|$$
(21)
$$Z=Y+S\times LF (\mathrm{D})$$
(22)

The unsupervised FCM or fuzzy membership clustering algorithm is widely used in the field of digital image processing, for extracting information from larger image datasets. Fuzzy membership clustering is considered as a grouping of the data in different clusters where the location of the data in each cluster depends on the degree of fuzzy membership of the data. The membership degree value is in the range of 0 and 1. The similarity between the data in the clusters is identified through the higher or lower membership value. Fuzzy membership clustering is considered an iterative optimization method that minimizes the cost function to reduce the dissimilarities between the clusters (Loganathan and Kaliyaperumal 2016, Valdez et al. 2021, Arora and Meena 2020, Li and Endo 2020). Equation (23) represents the minimization of the cost function in fuzzy membership clustering.

$${J}_{\mathrm{FCM}}= \sum_{j=1}^{N}\sum_{i=1}^{c}{({u}_{ij})}^{m}{\Vert {x}_{j}-{v}_{i}\Vert }^{2}$$
(23)

where \({u}_{ij}\) represents the fuzzy membership of pixel \({x}_{j}\) in the ith cluster, \({v}_{i}\) represents the ith cluster center, \(\Vert .\Vert\) represents the inner product normalization metrics, and m represents the constant value. When the pixels are found to be nearer to the centroid of their clusters, the cost function is reduced and the clusters are allocated with a higher membership value. The lower membership values are allocated to the pixels that are far from the centroid of their clusters. The membership function determines the probability of the pixels fitting into the specific cluster. In this clustering method, the probability merely relies on the distance between the pixel and all the cluster centers in the feature domain. In the theory of fuzzy membership clustering, each cluster center is initialized with random values and then converges to the ideal solution for the \({v}_{i}\) that represents the local minimum point of the cost function. The termination of the fuzzy membership clustering will be processed when there is no change in the cluster center during the iterative process. By using the outputs of the pre-processed image of Javadi Hills, the features in the LISS-III satellite images were extracted by using the proposed fuzzy HHO algorithm. During feature extraction, the unsupervised metaheuristic HHO algorithm was executed for selecting the optimum values from the membership matrix. The main aim of the feature extraction technique using fuzzy HHO is to use the optimum membership matrix for finding the cluster centers and to detect the needed features from the pre-processed satellite image. Each iteration has a set of membership matrices where each of them is considered a member of the HHO population. Each membership matrix represents the clusters evaluated by using the cost function of the fuzzy membership algorithm for determining the cluster centers. From the matrices, the optimal clusters were selected as the optimum membership matrix. The feature extraction-based fuzzy HHO algorithm consists of the following steps:

  1. i.

    Cluster centers were randomly created from the pre-processed LISS-III image, and the membership matrix of each cluster is generated.

  2. ii.

    A membership matrix is considered for each hawk in the HHO population.

  3. iii.

    The Harris hawk and its movement functions are used to update the membership matrix, and they are employed for creating the new cluster centers in the following iterations.

  4. iv.

    The new cluster centers are calculated by using the fuzzy evaluation function. The matrix with the minimum possible error is observed to be the optimal membership matrix.

  5. v.

    The optimal matrix is the position of the rabbit in the HHO algorithm and it is more responsible for pointing the other membership matrices. The optimal matrix is used in the last iteration for finding the optimal cluster centers.

  6. vi.

    The optimal cluster centers can be used for clustering the needed features from the pre-processed LISS-III satellite image.

While executing the fuzzy HHO algorithm, each pre-processed LISS-III image is used separately as an input. The parameters like population size, number of clusters, and the total number of iterations are initialized. A membership matrix has been used as the member of the Harris hawk population, and it is shown in Eq. (24).

$${\mathrm{HHO}}_{i}= {\left[\begin{array}{ccc}{u}_{\mathrm{1,1}}^{i}& \dots & {u}_{1,c}^{i}\\ \dots & \dots & \dots \\ {u}_{n,1}^{i}& \dots & {u}_{n,c}^{i}\end{array}\right]}_{n \times c}$$
(24)

Every membership matrix represented in the above equation has n rows which indicate the total number of satellite image pixels, and c columns which indicate the fuzzy degree of an image pixel to cluster centers. The \({\mathrm{HHO}}_{i}\) represents the member of the HHO algorithm and finds the optimum solution through the membership matrix. According to Eq. (25), every hawk represents a membership matrix of fuzzy clustering for feature extraction. Each component of a fuzzy membership matrix is between 0 and 1, and it is represented in Eq. (26).

$${\mathrm{HHO}}_{i}= {\left[{U}_{n.c}\right]}^{i}$$
(25)
$$0\le {u}_{ij} \le 1$$
(26)

where \(i\) and \(j\) represent the pixel number in a pre-processed LISS-III image and a cluster center in the pre-processed LISS-III image, respectively. Every solution found by a hawk is a membership matrix. According to Eq. (27), the sum of the rows is 1, which signifies the sum of the fuzzy degrees of an image pixel to all the cluster centers is equal to 1.

$$\sum_{j=1}^{c}{u}_{ij}= 1$$
(27)

where \(i\) is the number of pixels in a pre-processed LISS-III image, and \(j\) represents the number of cluster centers, which is denoted as c here. The fuzzy HHO algorithm generates an initial population of cluster centers where the membership matrices were defined to satisfy the conditions of Eq. (28).

$$\mathrm{Pop}= \left\{{\left[{U}_{n.c}\right]}^{1}, {\left[{U}_{n.c}\right]}^{2}, \dots \dots , {\left[{U}_{n.c}\right]}^{n}\right\}$$
(28)

where \(\mathrm{Pop}\) represents the initial population of the membership matrices, which determines the cluster centers and \(n\) represents the number of members in the initialized population. The membership matrices are used for updating the cluster centers in the fuzzy membership clustering technique according to Eq. (29):

$${c}_{i}= \frac{\sum_{k=1}^{n}{u}_{ik}^{m}{I}_{k}}{\sum_{k=1}^{n}{u}_{ik}^{m}}$$
(29)

In the membership matrix, each value of the satellite image pixel at any coordinates is shown by \({I}_{k}\), and the membership matrix or the members of the HHO algorithm has been updated through cluster centers and the position of the membership matrixes by using Eq. (30).

$$S(i)= \sum_{k=1}^{n}\sum_{i=1}^{c}{({u}_{ik})}^{m}\mathrm{sim}\left({I}_{k}, {c}_{i}\right)$$
(30)

where \(\mathrm{sim}\left({I}_{k}, {c}_{i}\right)\) represents the similarity of the satellite image pixel \({I}_{k}\) to a cluster center \({c}_{i}\). The fuzzy HHO algorithm minimizes the target function by selecting the membership matrix optimally, where the cluster centers are optimized. In the HHO algorithm, the relevant equations are used in updating the membership matrix. For instance, Eqs. (31) and (32) show the behavior of the soft besieges and hard besieges implemented on the membership matrices to update them respectively.

$$\mathrm{HHO}\left(t+1\right)= {\mathrm{HHO}}^{* }-\mathrm{HHO}\left(t\right)-E\left|J.{\mathrm{HHO}}^{*}-\mathrm{HHO}(t)\right|$$
(31)
$$\mathrm{HHO}\left(t+1\right)= {\mathrm{HHO}}^{* }-E\left|{\mathrm{HHO}}^{*}-\mathrm{HHO}(t)\right|$$
(32)

where \(\mathrm{HHO}\left(t\right)\) represents the membership matrix at each iteration \(t\), and \(\mathrm{HHO}\left(t+1\right)\) denotes the updated membership matrix. The \({\mathrm{HHO}}^{*}\) represents the optimal membership matrix, where the cluster centers are created for feature extraction. The feature extracted result of the unsupervised fuzzy HHO technique for the years 2012 and 2015 is shown in Fig. 10. The output obtained from the fuzzy HHO technique was used as input for finding the LU/LC classes.

Fig. 10
figure 10

Feature extracted map by using the fuzzy HHO technique. a 2012. b 2015

Supervised machine learning classifiers

Supervised classification is the most often used technique in the field of digital image processing. Supervised learning algorithms are widely used for the classification of satellite data in the area of remote sensing. The supervised classification works on the known knowledge about the image where the user can select the sampled pixels from an image and train each of the pixels with the reference pixels of the training image to attain good classification accuracy (Manoharan et al. 2019; da Silva et al. 2020; Wang et al. 2020). In our research work, we have used RFC, SVM, and MLC-supervised classification algorithms for classifying the optimized geospatial satellite data. The RFC algorithm has been used for both regression and classification problems in the field of digital image processing. In the field of remote sensing, the RF classification method was mostly used in classifying the LU/LC classes from the geospatial satellite image. The RF builds multiple decision trees on the subset of the training data and then obtains the classified results from each decision tree and finally chooses the best solutions using averaging or voting the results. The disadvantage of overfitting is reduced by averaging the results of the decision trees. The selections of features in the training datasets were processed by calculating the Gini index, and this metric has been widely used during the decision tree classification process (Piao et al. 2021; Amoakoh et al. 2021). The Gini index calculates the sum of the square of the probability of each class and the expression is shown in Eq. (33).

$$\mathrm{Gini\;index}=1- \sum\limits_{i=1}^{C}{\left({P}_{i}\right)}^{2}$$
(33)

where \({P}_{i}\) represents the probability of the classified features in the data, and \(C\) represents the total number of classes to be classified.

The SVM-supervised classification has been used for LU/LC classification problems in the field of remote sensing. The ideology of the SVMclassification algorithm is to search the hyperplane and find it in n-dimensional space for classifying the dissimilar data points. The hyperplane helps in the extrication of the decision boundary between the data points. The features in the n-dimensional space depend on the dimensions of the hyperplane. The data points that are nearer to the hyperplane were considered as the support vectors and hence help in maximizing the classifier’s margin (Karimi et al. 2020; Hawash et al. 2021). Equations (34) to (41) show the expressions for describing the process of the SVMclassifier.

$$H: {w}^{t} x+b=0$$
(34)
$${H}_{1}: {w}^{t} x+b=-1$$
(35)
$${H}_{2}: {w}^{t} x+b=1$$
(36)

where \(x\) represents the input column vector, \(w\) represents the weight of the column vector, \(b\) refers to the bias function by which the decision boundary of the hyperplane that separates the data points was represented as \(H\), and the hyperplane \({H}_{1}\) and \({H}_{2}\) moves along the support vectors were represented as \(-1\) and \(+1\) respectively.

$${D}_{1}= \frac{(-1-b)}{\left|w\right|}$$
(37)
$${D}_{2}= \frac{(1-b)}{\left|w\right|}$$
(38)
$$M= \frac{{D}_{1}}{{D}_{2}}$$
(39)
$$M=\left(\frac{\frac{(-1-b)}{\left|w\right|}}{\frac{(1-b)}{\left|w\right|}}\right)$$
(40)
$$M= \frac{2}{\left|w\right|}$$
(41)

where the distance between the hyperplane \({H}_{1}\) and \({H}_{2}\) to its origin was represented as \({D}_{1}\) and \({D}_{2}\), and the formulation of \(M\) shows twice the margin. The maximization of the margin (\(\mathrm{max}\frac{1}{\left|w\right|}\)) through the adjusting parameters (\(w\) and \(b\)) was considered the key objective of the SVM classifier.

The MLC algorithm has been considered the well-known LU/LC classification system in the area of remote sensing. The ML classification system assigns the unknown pixels to the known classes in each band using the training data. The statistics of ML in classifying the LU/LC classes of each band in geospatial data are normally distributed by calculating the probability of LU/LC classes belonging to each pixel (Kumar et al. 2021, Bosquilia and Muller-Karger 2021). The class conditional density calculated during LU/LC classification is shown in Eq. (42). The standard Gaussian distribution function has been used in this research work, where \(\omega\) represents the set of parameters from each class \(j\), \(x\) represents the image data of n bands (feature vector), and \(\sigma\) and \(\mu\) represent the standard deviation and mean of each class \(j\) respectively.

$$p \left(x|{\omega }_{j}\right)= \frac{1}{{\sigma }_{j }\sqrt{2\pi }} {e}^{-\frac{1}{2} {\left(\frac{x- {\mu }_{j}}{{\sigma }_{j}}\right)}^{2}}$$
(42)

Fusion using principal component analysis

Image fusion in the field of satellite image processing combines all spectral and spatial information of multiple satellite images of the same source with similar dimensions into a single image that preferably contains all the needed features from each of the original satellite images. PCA is an efficient way of fusing geospatial satellite images. The PCA method transforms the variables from correlated to uncorrelated for calculating the necessary weights to fuse the multiple images (Manoharan et al. 2018, Javan et al. 2021, Kaur et al. 2021, Batur and Derya 2018). The steps involved in fusing the image using PCA are as follows:

  1. i.

    The multiple images to fuse into a single image are stored in a matrix system by transforming each image into column vectors.

  2. ii.

    Images that are stored in the column matrix are subtracted by their mean and the covariance matrix was calculated.

  3. iii.

    The Eigenvectors and Eigenvalues of the covariance matrix were calculated and normalized in descending order.

  4. iv.

    The Eigenvector that corresponds to the highest Eigenvalues has been used for calculating the weighted value for fusing the multiple images according to Eq. (43).

  5. v.

    The PCA-based fused image will be calculated through the summation of the scaled matrices according to Eq. (44).

    $${Pc}_{1}= \frac{V(1)}{\sum V} , {Pc}_{2}= \frac{V(2)}{\sum V} , {Pc}_{n}= \frac{V(n)}{\sum V}$$
    (43)
    $$Img\;Fuse= {{Pc}_{1}I}_{1}+ {{Pc}_{2}I}_{2}+\dots + {{Pc}_{n}I}_{n}$$
    (44)

where \(I\) represents the input images to be fused, and \(V(n)\) represents the Eigenvectors that correspond to the principal component \({Pc}_{n}\). The principal component analysis has been used for fusing the results of SMC. The SMC used in our work includes SVM, RF, and ML algorithms. The LU/LC classes (high vegetation, scarce vegetation, and bare soil) of Javadi Hills for the years 2012 and 2015 were obtained by using the ensemble model of the fuzzy HHO technique with SMC through PCA.

The fused result for the years 2012 and 2015 is shown in Fig. 11. The algorithm of our ensemble model of fuzzy HHO-based SMC through PCA for extracting the LU/LC classes is presented in Algorithm 1. The overall flow chart of the ensemble model of fuzzy HHO-based SMC through PCA for extracting the LU/LC classes is shown in Fig. 12. We have used the front end of MATLAB 2021a for executing the fuzzy HHO-based feature extraction technique and Python software for executing the ensemble model of SMC algorithms.

Fig. 11
figure 11

LU/LC features of ensemble model of fuzzy HHO-based with SMC through PCA. a 2012. b 2015

Fig. 12
figure 12

Flow chart of the ensemble model of fuzzy HHO-based SMC through PCA

Algorithm 1
figure a

Algorithm to construct the ensemble model of fuzzy Harris hawks optimization-based supervised machine learning classification algorithms through principal component analysis

Adjusted vegetation and bareness LU/LC indices

In the field of RS, the vegetation features of the geospatial data are measured using the vegetation Index, and the bare soil features are measured using the bare soil index. The NDVI has been used for extracting the vegetation features through NIR and RED bands of satellite data. The NIR band reflects the LU/LC features of vegetation strongly and the RED band absorbs the LU/LC features of vegetation for the particular area. The Normalized Difference Bareness Index (NDBaI) has been used for extracting the features of bare soil through the Short Wave Infrared (SWIR) and TIRS bands of satellite data (Useya et al. 2019; Mzid et al. 2021). Equation (45) shows the expression of NDBaI for Landsat 8 data, and Eq. (46) shows the expression of NDBaI for Landsat 7 data. Equation (47) shows the expression of the NDVI.

$${\mathrm{NDBaI}}_{\mathrm{Landsat}\;8}= \frac{\mathrm{SWIR}-\mathrm{TIRS}}{\mathrm{SWIR}+\mathrm{TIRS}}$$
(45)
$${\mathrm{NDBaI}}_{\mathrm{Landsat}\;7}=(\mathrm{RED}+\mathrm{SWIR}-\mathrm{NIR})$$
(46)
$$\mathrm{NDVI}=\frac{\left(\mathrm{NIR}-\mathrm{RED}\right)}{(\mathrm{NIR}+\mathrm{RED})}$$
(47)

where \(\mathrm{SWIR}\), \(\mathrm{RED}\), \(\mathrm{NIR}\), and \(\mathrm{TIRS}\) represent the reflectance values of the SWIR, RED, NIR, and SWIR bands of the Landsat satellite images. By using the outputs of pre-processed Landsat bands, the LU/LC indices have been calculated. The NDVI and NDBaI show the features of LU/LC vegetation and bare soil. In our research work, we have combined the NDVI and NDBaI indices for providing the effective features of LU/LC vegetation and a bare soil map. The equation used for extracting the features of vegetation and bare soil map for the bands of Landsat 8 and 7 are given in Eqs. 48 and 49.

$${\mathrm{AVBaI}}_{\mathrm{Landsat}\;8}=\left(\mathrm{NDVI}+{\mathrm{NDBaL}}_{\mathrm{Landsat}\;8}\right)$$
(48)
$${\mathrm{AVBaI}}_{\mathrm{Landsat}\;7}=\left(\mathrm{NDVI}+{\mathrm{NDBaL}}_{\mathrm{Landsat}\;7}\right)$$
(49)

where \(\mathrm{AVBaI}\) represents the Vegetation and Bareness Index, \(\mathrm{NDVI}\) represents the Normalized Difference Vegetation Index, and \(\mathrm{NDBaI}\) represents the Normalized Difference Bareness Index.

We have used the map algebra module of the python package in the back-end tool of ArcGIS geospatial software for evaluating the LU/LC indices of Javadi Hills. The AVBaI map for the years 2012 and 2015 is shown in Fig. 13. The combined map provides the clarity of vegetation and bare soil regions of satellite data for the area of Javadi Hills. The Vegetation and Bareness Index for the years 2018 and 2021 is also calculated and shown in Fig. 14. The AVBaI map for the years 2018 and 2021 has been reclassified using the Python package through ArcGIS software. The reclassified reference map for 2018 and 2021 is shown in Fig. 21. In our proposed research work, we have used the outputs of the AVBaI map of Javadi Hills for the years 2018 and 2021 as the reference map to validate the predicted LU/LC map of 2018 and 2021.

Fig. 13
figure 13

Adjusted Vegetation and Bareness Index map. a 2012. b 2015

Fig. 14
figure 14

Adjusted Vegetation and Bareness Index map. a 2018. b 2021

Extracting the dependent features through the ensemble LU/LC map of Javadi Hills

The dependent features of the E-LU/LC map define the changes that happened between the LU/LC features of Javadi Hills for the years 2012 and 2015. In our research work, we have analyzed the vegetation and bare soil features of Javadi Hills by using an efficient ensemble model. The dependent E-LU/LC map has been extracted by combining the outputs of the PCA-based fused LU/LC map (fuzzy HHO-based SMC) with the output of the AVBaI maps for the years 2012 and 2015. The ensemble features produce the combination of the pixel values with the associated attribute tables. Each coordinate of the ensemble LU/LC raster map contains the dependent vegetation and bare soil feature values. We have shown the E-LU/LC map in Fig. 15. The E-LU/LC-dependent map for the years 2012 and 2015 has been used as an input for the prediction model for analyzing the future LU/LC changes in the area of Javadi Hills. We have used the front end of python and the back end of ArcGIS geospatial software for combining the ensemble features of LU/LC vegetation and bare soil map.

Fig. 15
figure 15

Ensemble LU/LC Map of Javadi Hills. a 2012. b 2015

Extracting the independent LST features of Javadi Hills

In the field of RS, the skin temperature of the geospatial data was measured using the LST. The hot and cold on the surface of the earth were measured using the LST through the reflection of the radiant energy within the earth’s surface. The TIRS data captured from the satellite were used in measuring the mixture of the hot and cold temperatures of the earth (Mukherjee and Deepika 2020, Sekertekin and Bonafoni 2020). In our research work, the LST has been used as one of the spatial data for analyzing the LU/LC prediction map. The TIRS bands of Landsat 7 and 8 satellite data have been used in our work. Equations (50), (51), and (52) show the calculation of LST for the TIRS bands of the Landsat 7 satellite. Equations (53) to (58) show the calculation of LST for the TIRS bands of the Landsat 8 satellite. Equations (50) and (51) show the conversion of the DN values of each pixel to the brightness temperature through the radiance of the TIRS band. Equation (52) shows the conversion of the values from Kelvin \((K)\) to Celsius \((C)\).

$${L}_{\lambda }= \left(\frac{{LMAX}_{\uplambda }- {LMIN}_{\uplambda }}{QCALMAX-QCALMIN}\right)* \left(QCAL-QCALMIN\right)+{LMIN}_{\uplambda }$$
(50)

where \({L}_{\lambda }\) denotes the spectral radiance in \(\mathrm{Watts}/{(\mathrm{m}}^{2}*{\mathrm{sr}}^{2}*\mathrm{ \mu m})\), \(QCAL\) denotes the quantized calibrated pixel value, and \(QCALMAX\) and \(QCALMIN\) denote the maximum and minimum quantized calibrated pixel value corresponding to the spectral radiance scale \({LMAX}_{\uplambda }\) and \({LMIN}_{\uplambda }\) respectively.

$${T}_{K}= \frac{K2}{\mathrm{ln}\left(\frac{K1}{{L}_{\uplambda }}+1\right)}$$
(51)
$$C={T}_{K}-273.15$$
(52)

where \({T}_{K}\) signifies the satellite temperature effectiveness in Kelvin, and the calibration constants \(K1, K2\) are represented in \(\mathrm{Watts}/{(\mathrm{m}}^{2}*{\mathrm{sr}}^{2}*\mathrm{ \mu m})\). The calibration constant \(K1, K2\) value in Landsat 7 satellite TIRS band is 666.09 and 1282.71.

$${TL}_{\uplambda }=ML* QCAL+AL- {O}_{i}$$
(53)

Equation (53) shows the conversion of TOA spectral radiance (\({TL}_{\uplambda })\) of the TIRS band by using the radiance rescaling factor (\(ML\)) in \(\mathrm{Watts}/{(\mathrm{m}}^{2}*{\mathrm{sr}}^{2}*\mathrm{ \mu m})\), the quantized calibrated pixel value is represented by \(QCAL\), the radiance additive band rescaling factor \(AL,\) and the correction value \({O}_{i}\) of TIRS band in Landsat 8 has been used respectively.

$${BT}_{P}= \frac{K2}{\mathrm{ln}\left(\frac{K1}{{TL}_{\uplambda }}+1\right)}-273.15$$
(54)

Equation (54) shows the conversion of the TOA brightness temperature (\({BT}_{P}\)) in Celsius (\(C\)) from spectral radiance data and the calibration constants \(K1, K2\) are represented in \(\mathrm{Watts}/{(\mathrm{m}}^{2}*{\mathrm{sr}}^{2}*\mathrm{ \mu m})\). The calibration constant K1 and K2 value in Landsat 8 satellite TIRS band is 774.8853 and 1321.0789. Equation (47) shows the expression for \(NDVI\) of the Landsat satellite images.

$$\mathrm{PV}={((\mathrm{NDVI}- {\mathrm{NDVI}}_{\mathrm{min}})/ ({\mathrm{NDVI}}_{\mathrm{max}} - {\mathrm{NDVI}}_{\mathrm{min}} ))}^{2}$$
(55)
$$E=0.004*\mathrm{PV}+0.986$$
(56)

The land surface emissivity (\(E\)) is derived from the results of the \(\mathrm{NDVI}\) values, where \(\mathrm{PV}\) denotes the proportion of vegetation, and \({\mathrm{NDVI}}_{\mathrm{max}}\) and \({\mathrm{NDVI}}_{\mathrm{min}}\) represent the maximum and minimum reflectance values of the \(\mathrm{NDVI}\) image of Landsat 8. The expressions for \(\mathrm{PV}\) and \(E\) are shown in Eqs. (55) and (56). By using the results of Eqs. (54) to (56), the LST is calculated and expressed in Eq. (57).

$$\mathrm{LST}= \frac{{\mathrm{BT}}_{P}}{\left(1+ \left({~}^{\Delta \mathrm P}\!\left/ \!{~}_{\mathrm L}\right. \right)* \mathrm{ln}(E)\right)}$$
(57)
$$c2={~}^{\Delta \mathrm P}\!\left/ \!{~}_{\mathrm L}\right.$$
(58)

where the wavelength of the emitted radiance is represented as \(\lambda\), the Planck’s constant (\(pk)\) value is \(6.626*{10}^{-34}\mathrm{ J s}\), the velocity of light (\(vl\)) value is \(2.998 * 108\mathrm{ m}/\mathrm{s}\), and the Boltzmann constant (\(bc\)) value is \(1.38 * {10}^{-34}\mathrm{ JK}\). The use of independent features is to display the impact of dependent LU/LC changes in the Javadi Hills during the process of LU/LC prediction. The LST map of 2012 and 2015 has been extracted for analyzing the effect of LU/LC features of Javadi Hills. The LST provides the skin temperature of the earth, where the high temperature on LU/LC shows less vegetation and the less temperature on the LU/LC shows high vegetation. The key idea of our research is to provide the impact of LST on the E-LU/LC features of vegetation and bare soil. The LST independent map for the years 2012 and 2015 has been used as an input along with the dependent E-LU/LC features for implementing the prediction model to analyze the future LU/LC changes in the area of Javadi Hills. We have shown the LST-independent map of 2012 and 2015 in Fig. 16. The LST map for the years 2018 and 2021 is also calculated and used as the independent feature map during the LU/LC prediction for the years 2024 and 2027. The LST map for the years 2018 and 2021 is shown in Fig. 17.

Fig. 16
figure 16

LST map of Javadi Hills. a 2012. b 2015

Fig. 17
figure 17

LST map of Javadi Hills. a 2018. b 2021

Ensemble-based LU/LC prediction analysis

The LU/LC prediction has been considered an important study in the file of remote sensing. The importance of LU/LC prediction lies in providing future LU/LC information about the specific region for different periods using the prediction algorithms. Based on the accurate results of the prediction methods, the LU/LC data will be shared with the governmental officials or land resource planners for taking action in protecting the LU/LC environment from great loss and save nature (Anand and Bakimchandra 2020, Bakr et al. 2022). In our research work, we have used MC-MLP neural network model for predicting the LU/LC classes for Javadi Hills, India.

Markovian-Cellular Automata model

In the field of RS environment, the MC analysis is the most used model during the process of LU/LC prediction. The Markovian model was used for calculating the transition probability matrix (TPM) for the LU/LC classes between different periods. The Markovian model is discrete with both spatial and temporal data. The Cellular Automata model represents the nonlinearity and spatially distributed structure for providing the LU/LC patterns from a geospatial satellite image. The Cellular Automata is discrete with space, time, and state and it helps in simulating the spatio-temporal computations (MohanRajan and Loganathan 2021, Huang et al. 2020, Munthali et al. 2020, Armin et al. 2020).

$$Tm={P}_{ij }= \left[\begin{array}{ccc}{P}_{11}& \dots & {P}_{1n}\\ \dots & \dots & \dots \\ {P}_{n1}& \dots & {P}_{nn}\end{array}\right]$$
(59)
$$\left(0\le {P}_{ij }<1 \mathrm{and} \sum_{j=1}^{n}{P}_{ij}=1\right)$$
(60)
$$S\left(t+1\right)={P}_{ij }*S(t)$$
(61)

In Eqs. (59), (60), and (61), the \(Tm\) indicates the TPM, \({P}_{ij }\) indicates the probability of the LU/LC class \(i\) to class \(j\), \(n\) represents the total number of LU/LC classes, \(i,j\) shows the LU/LC class of 1st and 2nd period, \(S\) represents the LU/LC information at time \(t\), and \(S\left(t+1\right)\) refers to the time variants.

Multilayer Perceptron Neural Network

The MLP neural network is used as a prediction model for providing future LU/LC information for a different study area in the area of RS. The MLP follows the idea of the ANN. The idea of MLP investigates the simple representations of the human brain to solve difficult computational problems. The MLP neural network model maps the nonlinearity between the input and the corresponding output vector. In MLP, every neuron node transmits the needed information among them. The input and output processing units of the MLP neural network are interconnected with nodes. As the human brain reads up the rules and provides the results, the MLP also comprises learning rules for providing the results. The initial stage of the MLP neural network model recognizes the data patterns visually or textually. The MLP works with the supervised learning process by training and analyzing the data through the backpropagation method, where the data and its calculations have been performed in a single direction from the input data to the output data. The MLP model has been defined in three different layers and they are input, hidden, and output layers. The user-defined input (Neurons) has been passed in each layer and the inputs are represented as the dependent and independent variables. The computations of the neurons were performed by updating the weights in the hidden layer. Through the selection of the activation functions, the output layers are produced with the minimum error (Kafy et al. 2020, Shooshtari et al. 2020, Bose and Chowdhury 2020). Equations (62), (63), and (64) show the expressions for the MLP neural network.

$$Y=\varphi \left(\sum_{j}^{n}{w}_{j}{x}_{j}+b\right)$$
(62)
$$Y=\varphi \left({w}^{T}x+b\right)$$
(63)
$$\varphi \left(z\right)= \frac{1}{1+ {e}^{-x}}$$
(64)

where the weights adjusted between the input and the hidden layer are represented as \({w}_{j}\), the neurons or user-defined inputs are represented as \({x}_{j}\), the output layer is represented as \(Y\), \(\varphi ,\) and \(b\) representing the activation function and bias value respectively. The activation or sigmoid function \(\varphi\) provides the resulting value (either 0 or 1) of the MLP neural network.

The dependent LU/LC map and independent LST map have been used as input for performing the LU/LC prediction analysis. The LU/LC prediction has been performed using the MC-MLP model. The EP-LU/LC map has been obtained by using the outputs of E-LU/LC and the LST map. The algorithm to implement the ensemble-based LU/LC prediction algorithm is presented in Algorithm 2. The overall flow chart of the ensemble-based LU/LC prediction algorithm is shown in Fig. 20. We have used the QGIS simulation software for our research to show the predicted LU/LC map of Javadi Hills for the years 2018, 2021, 2024, and 2027. The MOLUSCE simulation tool provides the MC-MLP model for forecasting the LU/LC prediction results. The simulation analysis consists of different components like the input module, correlation analysis, change detection, prediction, and validation module. We have used the dependent and independent inputs of Javadi Hills for the years 2012 and 2015 for analyzing the LU/LC prediction for the years 2018, 2021, 2024, and 2027. The result of the EP-LU/LC map using the MC-MLP neural network model for the years 2018, 2021, 2024, and 2027 is shown in Figs. 18 and 19.

Fig. 18
figure 18

Ensemble LU/LC predicted map of Javadi Hills. a 2018. b 2021

Fig. 19
figure 19

Ensemble LU/LC predicted map of Javadi Hills. a 2024. b 2027

Algorithm 2
figure b

Algorithm to implement the ensemble-based LU/LC prediction algorithm

Validation of our proposed ensemble model

The LU/LC predicted results of Javadi Hills were obtained through the ensemble-based prediction model (Fig. 20). The inputs of dependent LU/LC and independent LST map of Javadi Hills for the years 2012 and 2015 have been used for predicting the LU/LC map of Javadi Hills for the years 2018. For acquiring better prediction accuracy, we have used the reclassified AVBaI map of 2018 as the reference map during the process of validation. The same procedure has been executed for the year 2021. The reference map for 2018 and 2021 is shown in Fig. 21. With inputs from predicted 2018 and 2021 maps, we have predicted the LU/LC map for the years 2024 and 2027. We have acquired good validation accuracy for our proposed ensemble model.

Fig. 20
figure 20

Flow chart of the ensemble-based LU/LC prediction model

Fig. 21
figure 21

Reference map of Javadi Hills

Results

In this section, we explained the experimental setup of our research work and also we delivered the results of our proposed model. The experimentation analysis of the ensemble-based fuzzy HHO-based SMC models was presented briefly along with the growth patterns of the LU/LC analysis of the time series data for the Javadi Hills, India.

Experimental setup

In the field of RS, challenging research on LU/LC prediction has been performed in our research work. The geospatial satellite image of LISS-III and Landsat 7 and 8 has been extracted and used for the area of Javadi Hills, India. All our experiments were processed by using MATLAB R2021a, Python, QGIS, and ArcGIS geospatial software on the Intel Xeon processor 2.90 GHz CPU along with 128 GB RAM in Windows 10 (64 bit) environment.

Training data and parameter settings

The parameter values for the HHO were determined after several tests. The objective function for the HHO was defined according to the fuzzy membership clustering method, where the cluster centers within the input data are measured using distance metrics. The parameters defined for each HHO are shown in Table 3. The experiments were conducted on pre-processed LISS-III satellite images of Javadi Hills for the years 2012 and 2015. The size of the tested pre-processed image is 256 × 200 pixels.

Table 3 Parameter setting of each HHO technique

The SMC (support vector machine, random forest, and maximum likelihood) were used for training the results of the fuzzy HHO-based feature extraction technique. The results of each SMC have been used separately and the results were obtained. The PCA has been used for fusing the results of all three SMC algorithms. The training parameters used for the SMC are shown in Tables 4, 5, and 6. The training parameters used for the MLP neural network model are shown in Table 7. The training data has been prepared by using the GEE time series data of Javadi Hills. We have randomly created the training samples with 300 random points (location) through the latitude and longitudinal coordinates of Javadi Hills. The training data has been prepared in the CSV (Comma Separated Values) file format and then converted into the vector shapefile. For a better understanding, the attribute table of a few training data is shown in Table 8. For better understanding, the validation of the Google Earth images associated with the training attribute values of the LU/LC map is shown in Fig. 22. The results of the ensemble model of the fuzzy HHO-based SMC algorithm through PCA have been combined with the results of the AVBaI map. The E-LU/LC map of 2012 and 2015 has been obtained and the accuracy assessment has been performed. The accuracy assessment has been performed for the E-LU/LC vegetation and bare soil feature map by using the test or reference data of GEE time-series images.

Table 4 Parameter setting for the SVM model
Table 5 Parameter setting for the random forest classification model
Table 6 Parameter setting for the maximum likelihood classification model
Table 7 Parameter setting for the Multilayer Perceptron Neural Network
Table 8 Attribute table of the training data values
Fig. 22
figure 22

Validation of Google Earth Map of Javadi Hills

Based on the good accuracy results, we have used the E-LU/LC vegetation and bare soil map of 2012 and 2015 as the input for finding the EP-LU/LC map. The LST-independent spatial variable map provides the impact on the growth of LU/LC vegetation and bare soil in the area of Javadi Hills. We have used the LST map of 2012 and 2015 as the independent map during the process of LU/LC prediction. The less temperature represents the region with high vegetation, and the high temperature represents the region with less vegetation. The AVBaI values range from − 1 to + 0.5. The values obtained were separated into high vegetation, scarce vegetation, and bare soil. All the vegetation and bare soil values have been associated with the LST values. The LST value has been calculated in Celsius. For better understanding, we have shown the few attribute values associated with the dependent and independent features of Javadi Hills in Table 9. The input maps of dependent and independent features with the same geometric coordinates have been used during the process of LU/LC prediction. The MC model has been used for calculating the TPM for the years 2012 to 2021. The transition between each class has been computed and the LU/LC change map has been generated for further analysis of LU/LC prediction. The MLP neural network has been used for modeling the relationship between the dependent and independent maps through the adjustment of different parameters. Once the MLP neural network has been trained, the CA simulation has been performed for identifying the changes that happened during the next state (time-period). The validation of the results has been performed using the reclassified AVBaI map.

Table 9 Values associated with dependent and independent features of Javadi Hills

Accuracy assessment and computational complexity

The accuracy assessment provides information about the strength of the classification algorithms. The accuracy assessment is performed by comparing the reference or ground truth data with the classified results by building the confusion matrix. The confidence level of computing the LU/LC classification and prediction results generally depends on the statistical measure of the accuracy assessment (Li et al. 2020; Alharthi et al. 2020). In our research work, we have used the overall accuracy metrics for analyzing the correctness of the classification and prediction algorithms. The overall accuracy is considered the complete optimal function for the proposed ensemble model. The ultimate goal of our proposed ensemble model is to obtain better classification accuracy for LU/LC classification problems. Concerning the Google Earth Engine time series data, the accuracy assessment has been performed for the results of E-LU/LC vegetation and bare soil map. The random sampling points (pixel location) from the LU/LC images for the periods 2012 and 2015 were validated with the random sampling points of the Google Earth Engine through the confusion matrix. The results of the accuracy assessment are shown in Table 10.

Table 10 LU/LC accuracy assessment for the feature extraction-based LU/LC map of Javadi Hills

The fused results were combined with the Adjusted Vegetation and Bareness Index (AVBaI) map and the final ensemble LU/LC (E-LU/LC) map was achieved for the years 2012 and 2015 with good classification accuracy, with an average of 95.275%. The results of the ensemble model of fuzzy HHO-based SMC through PCA and AVBaI techniques were used for predicting the LU/LC map of Javadi Hills. The E-LU/LC vegetation and bare soil map of 2012 and 2015 have been used as the dependent map, and the LST map has been used as an independent map during the process of LU/LC prediction. The LU/LC prediction has been performed using the ensemble-based MC-MLP neural network. The LU/LC map of 2012 and 2015 has been used as input along with the LST map for determining the LU/LC predicted map for 2018. The EP-LU/LC map of 2018 has been validated with the reference data of reclassified AVBaI map of 2018 to determine the correctness of our proposed model. The same procedure has been followed for predicting the 2021 map. With the available data, we have prepared the reference map for 2018 and 2021 through AVBaI. With inputs of EP-LU/LC map for 2018 and 2021, we have predicted the LU/LC map for the years 2024 and 2027. The validation has not been performed for 2024 and 2027. With the high percentage of correctness of 2018 and 2021, we have projected the results for 2024 and 2027. The validated results of the predicted LU/LC map for the years 2018 and 2021 using the MC-MLP model are presented in Table 11. The result for the ensemble-based MC-MLP model provides good accuracy with 95.616% for 2018 and 95.910% for 2021. The result of the EP-LU/LC map provides an average prediction accuracy of 95.763%.

Table 11 Validation for the EP-LU/LC map

The total running time taken for executing the algorithm has been defined through computational complexity. Each algorithm varies with computational complexity in terms of input data, training data, and tuning parameters. The computational complexity of our proposed algorithm is estimated and shown in Table 12. We have shown the elapsed time for each process of our proposed algorithm. The execution time taken for running our proposed ensemble model is 1110.27 s. The results have been calculated in MATLAB and python by using the function tic-toc. The back-end tool of ArcGIS and QGIS also provides the elapsed time through the inbuilt python package. We have also compared the execution time of our fuzzy HHO algorithm with other optimization algorithms that include CSO, ACO, ABC, and PSO. The overall execution time taken by HHO shows good results when compared with other optimization algorithms, and the overall results along with the accuracies are shown in Table 13.

Table 12 Computational complexity of our proposed ensemble model
Table 13 Comparative analysis of the accuracy and execution time for the fuzzy HHO with other optimization algorithm-based ensemble prediction model

Correlation analysis

The correlation analysis is performed between the spatial variables that affect the LU/LC of the particular area. Pearson correlation analysis provides the statistical relationship between the spatial variables of the LU/LC analysis. The linear association varies from + 1 to − 1, which shows the positive and the negative correlation between the spatial variables. The correlation values help for expressing the strength of the association between the spatial variables (Riva and Nielsen 2021; Khamchiangta and Dhakal 2020). In our research work, we have used the Pearson correlation analysis for displaying the relationship between the LST and AVBaI maps of Javadi Hills, India. Equation (65) provides the expression for calculating the association between the spatial variables.

$$r= \frac{\sum \left({x}_{i}- \overline{x }\right)\left({y}_{i}-\overline{y }\right)}{\sqrt{\sum {\left({x}_{i}- \overline{x }\right)}^{2} \sum {\left({y}_{i}- \overline{y }\right)}^{2}}}$$
(65)

where \(r\) represents the correlation coefficient, \({x}_{i}\) represents the values of the \(x\) variable, \({y}_{i}\) represents the values of the \(y\) variable, \(\overline{x }\) represents the mean values of the \(x\) variable, and \(\overline{y }\) represents the mean values of the \(y\) variable. Pearson correlation analysis was constructed between the spatial LST maps that affect the impact of the LU/LC environment.

For signifying the impact of the LU//LC environment of Javadi Hills, we have constructed the correlation analysis between the LST (2012, 2015, 2018, and 2021), and the AVBaI (2012, 2015, 2018, and 2021). From the different analyses for our study area, we found that the correlation that occurred between the LSTs of 2012 and 2021 provides a strong positive maximum value of 0.929343. The main significance of our research work provides no negative correlation values. Table 14 represents the Pearson correlation analysis of the spatial variables.

Table 14 Pearson correlation analysis of the spatial variables for Javadi Hills, India

Growth pattern of LU/LC analysis

The LU/LC change information has been computed between the two different periods and also the information about the LU/LC change is provided to land resource managers or government officials to take appropriate actions for protecting the LU/LC environment (Chamling and Biswajit 2020, Chemura et al. 2020).

We have used the LU/LC change formula for calculating the area statistics of our study area. Equation (66) provides the expression for calculating the percentage of the LU/LC change (\({P}_{\mathrm{change}}\)) between the LU/LC map of two different periods (\({T}_{1}\) and \({T}_{2}\)).

$${P}_{\mathrm{change}} =\left(\frac{{T}_{2}-{T}_{1}}{{T}_{1}}\right)*100$$
(66)

Our research is based on ensemble-based prediction models for analyzing the LU/LC patterns of Javadi Hills from the period 2012 to 2027. From the deep analysis and validation, we found good classification accuracy for the E-LU/LC vegetation and bare soil map of Javadi Hills for 2012 and 2015. The E-LU/LC vegetation and bare soil map have been used as the dependent feature map during the process of LU/LC prediction analysis using MC-MLP Neural Network model. The LST map has been used as the independent spatial variable map during the LU/LC prediction analysis. Our prediction model has shown a higher percentage of correctness. The TPM has been calculated using the MC model for finding the impact of LU/LC changes on the dependent LU/LC map for the years 2012 to 2021 during the process of LU/LC prediction. With the results of the LU/LC change analysis from the MC model, the MLP neural network has been performed for the input maps, and the predicted results had been analyzed for the area of Javadi Hills. The TPM results for the dependent LU/LC map from 2012 to 2021 are shown in Table 15.

Table 15 Transition probability matrix for LU/LC maps of Javadi Hills during the years 2012–2021

In our research, we have shown the growth patterns for the area of Javadi Hills from the period 2012 to 2027. The area statistics of the LU/LC map is shown in Table 16. The percentage of change that happened in the LU/LC classes is analyzed and shown in Table 17. At the end of our results, we analyzed that for every 3 years (2012 to 2027), the LU/LC change had happened. The continuous change of vegetation from high to low in the area of Javadi Hills from the period of 2012 to 2027 indicates that the necessary actions need to be taken by the land resource planners to take actions in protecting nature from huge loss. We had shown the graph in Fig. 23, which displays the LU/LC changes in the area of Javadi Hills from the years 2012 to 2027.

Table 16 Area statistics of the classified and predicted LU/LC map of Javadi Hills
Table 17 LU/LC growth pattern of Javadi Hills from 2012 to 2027
Fig. 23
figure 23

LU/LC growth pattern of Javadi Hills from the years 2012 to 2027

Discussion

The LU/LC change prediction analysis has been performed for many decades and the importance of this research is to assist the concerned land resource planners or government officials to take necessary actions in the protection of nature from huge loss. From the brief observation by the different researchers in the field of remote sensing, LU/LC change has been analyzed through different types of datasets that include satellite images, aerial images, and the images taken when carrying out the ground truth or field survey. The advantage of using satellite images is to find the LU/LC information for the larger areas, and also for the areas which are inaccessible through airborne or ground surveys. The multispectral images provide the advantage in performing the LU/LC change prediction analysis since it provides a less number of bands than hyperspectral. The execution time and choosing of bands for performing the LU/LC change prediction analysis look easier and more effective when choosing the multispectral bands. The interesting tasks when performing the LU/LC change prediction analysis are as follows:

  1. i.

    The selection of precise satellite systems for a particular location through geographical coordinates is an interesting task.

  2. ii.

    The selection of the classification and prediction algorithms and their implementation through different training data, and the tuning of parameters is the most challenging and interesting part of LU/LC prediction research.

  3. iii.

    Finding the reference data of each pixel in the satellite image to calculate the classification and prediction accuracy is an important part of the LU/LC prediction research.

  4. iv.

    The use of more than one satellite information for one particular location in predicting the LU/LC change remains the challenge.

All the above interesting tasks were performed effectively in this research work. Instead of using one satellite information, this research work provides the information from two satellites that includes LISS-III and Landsat. The satellite images for the area of Javadi Hills have been collected from both satellite systems. LISS-III provides only 4 bands (Green, RED, NIR, and SWIR), and the information from LISS-III has been used for finding the LU/LC classes by using the ensemble model fuzzy HHO-based SMC through PCA analysis. The AVBaI map has been extracted by using the Landsat bands (NIR, SWIR, TIRS, and RED). The TIRS bands are not available in LISS-III and hence the multi-satellite system provides an advantage in using the TIRS bands of the Landsat satellite system. Also, the LST is the only independent spatial variable used for providing the impact of LU/LC changes in Javadi Hills by using the dependent E-LU/LC map. This work also provides the advantage of implementing both unsupervised metaheuristic optimization and supervised machine learning algorithms. We proposed the ensemble model of unsupervised fuzzy HHO-based SMC through PCA and AVBaI for avoiding misclassification errors and to provide good accuracy for the E-LU/LC classification model. Each optimization technique varies with accuracy and execution time. The importance of using the HHO algorithm over other optimization techniques HHO is an effective optimization method with many active and time-varying stages of exploration and exploitation. The searching strategies of hawks provide a flexible structure and high-quality results. The main idea behind the HHO technique is based on the support and chasing style of hawks with their surprise jump. The escaping energy parameters of the HHO have an active randomized time-varying nature for improving and harmonizing the exploratory and exploitive patterns. The importance of HHO displays a sequence of searching tactics and then, it goes for the best movement for catching the rabbits which are said to be the optimum solution. The importance of HHO over other methods is that the Harris hawks can chase the detected rabbits to exhaustion, which helps them to increase their vulnerability. We have also compared the execution time of our fuzzy HHO algorithm with other optimization algorithm-based ensemble prediction model and the results are shown in Table 13. The fuzzy HHO-based ensemble prediction model provides good accuracy results with less execution time. This work also provides an advantage in fusing the results of three effective LU/LC classifiers (SVM, RFC, and MLC) through PCA for providing all the information regarding the LU/LC classes without any loss. The use of PCA provides the advantage of fusing the results without any information loss, and also it helps in reducing the noise level. The AVBaI has been calculated from the vegetation and bareness index map. The results of AVBaI were combined with the fused LU/LC map and it hence shows the importance of using multiple satellite data for one study area.

The use of more than one spatial variable like slope, aspect, hill shade, and distance variables that include distance from the road, forest edge, water bodies, and built-up areas has been overcome by using an LST map in predicting the vegetation and bare-soil map for any study area around the world. The LST map has been used as the only spatial variable since it provides the advantage of finding the impact of LU/LC changes happened in the vegetation and bare soil regions. The LST map has been extracted from Landsat TIRS bands and it displays the mixture of the hot and cold temperatures of the LU/LC of Javadi Hills. The results were measured using Celsius where the high temperature (hot) is considered as the bare soil regions and the less temperature (cold) regions are considered to be the vegetation regions. For better understanding, we have shown the few attribute values associated with the dependent and independent features of Javadi Hills in Table 9. The impact of the known past and present LST on the E-LU/LC map provides the advantage in finding the unknown LU/LC predicted map. The proposed ensemble model will help in finding the LU/LC prediction map through the processes of the MC-MLP neural network model. For indicating the impact of the LU//LC environment of Javadi Hills, we have calculated the Pearson correlation analysis between the LST (2012, 2015, 2018, and 2021) and the AVBaI (2012, 2015, 2018, and 2021) map of Javadi Hills, India. The main importance of our research work provides no negative correlation values. Table 14 represents the Pearson correlation analysis of the spatial variables. The use of the Markovian-Cellular Automata model provides an advantage in finding the transition probabilities between the LU/LC classes of time series data and analyzing the future LU/LC patterns of Javadi Hills, India. The MLP neural network model is used in training the dependent (E-LU/LC map) and independent (LST map) during the process of LU/LC prediction. In our proposed work, we have validated the predicted maps with the reclassified AVBaI maps of 2018 and 2021. The validation results provide good accuracy for the years 2018 and 2021. Instead of validating the single LU/LC predicted map, we have checked and validated the two LU/LC predicted maps to manifest our model effectively. The Google Earth Engine images provide the advantage of finding reference maps during the process of accuracy assessment. Also, the information about the changes that happened in the vegetation and bare soil region of Javadi Hills will assist the concerned land resource planners or forest departments to take necessary actions in protecting the vegetation from a huge loss, especially in the forest-covered regions of the hill stations.

Conclusion

This research provides the growth patterns of LU/LC analysis of Javadi Hills, India. We have used the multispectral LISS-III geospatial satellite images for the periods of 2012 and 2015 as input for analyzing the LU/LC prediction for the years 2018, 2021, 2024, and 2027 for the area of Javadi Hills. The impact of LU/LC change has been analyzed through the LST for the years 2012, 2015, 2018, and 2021 of the Javadi Hills. The preprocessing of the geospatial satellite images has been performed for rectifying the radiometric, atmospheric, geometric, and topographic errors in the geospatial satellite images. The proposed ensemble model of fuzzy HHO-based SMC through PCA has been used for extracting the LU/LC classes from the geospatial LISS-III satellite images of Javadi Hills for the years 2012 and 2015. We have calculated the AVBaI map through the Landsat bands. The AVBaI map obtained from the Landsat satellite images was combined with the results of the fused LU/LC map of the ensemble model. Our proposed ensemble model provides good accuracy for the input E-LU/LC maps of 2012 and 2015 with an average of 95.27%. The time series Google Earth Engine platform has been used as reference data during the processes of accuracy assessment. The results of the dependent E-LU/LC maps (2012 and 2015) and the independent LST maps of Javadi Hills for the years 2012, 2015, 2018, and 2021 have been used during the predictive analysis of the MC-MLP neural network model. We had validated the EP-LU/LC map of 2018 and 2021 with the reclassified AVBaI reference data map. The result provides a good percentage of correctness with an average of 95.763%. The LU/LC change analysis has been calculated for each LU/LC map from 2012 to 2027, and we found that the high vegetation area of Javadi Hills has decreased and the less vegetated areas have increased simultaneously. The LU/LC information about the vegetation changes in our study area will insist the government administrators of land resource management take action for the protection of the LU/LC environment. The future scope of our research is to provide the LU/LC information about the water bodies and urbanized areas by including additional spatial variables like Normalized Difference Water Index and Normalized Difference Built-up Index.