Introduction

Remote sensing is one of the most important methods for analysing the multitemporal changes over a certain period. The land surface observation is important for efficient management of natural resources like water, soil, and vegetation and the climate impact on these resources. However, the degradation of land is considered an important factor affecting sustainable agriculture and continuous monitoring globally is essential for food security (Alshari et al., 2023). The primary causes of this deterioration might be severe natural disasters like floods and landslides, as well as other man-made causes. Thus, timely and accurate information on changes in land cover can assist a variety of undesirable surface changes that might otherwise result in a shortage of natural resources (Jozdani et al., 2019; Twisa & Buchroithner, 2019). Traditionally, land cover change monitoring has been done using on-the-ground observations which is time-consuming (Eugene et al., 2022; Mountrakis & Heydari, 2023). On the other hand, remote sensing is a powerful tool that plays an important role in monitoring, depicting, and managing regular changes with the help of information captured through satellite imagery (Gusain et al., 2019; Taloor et al. 2020). It is useful in many fields, including agriculture, forest monitoring, and weather forecasting. Two methods, namely categorization and change detection, are widely employed to extract vital and precise information regarding changes in the earth’s surface (Mahmoud et al., 2023; Pande, 2022).

Classification simplifies the data for users to analyse spatial patterns and correlations by classifying the pixels into several categories such as water, agriculture, and urban. During the past decade, various classifiers based on machine and deep learning have been developed for land cover changes (Teluguntla et al., 2018; C. Zhang et al., 2019). In past studies, classification algorithms such as random forest classifier (RF) (Alshari et al., 2023; Zerrouki et al., 2019), artificial neural network classifier (ANN) (e Silva et al., 2020; Zerrouki et al., 2019), the k-nearest neighbour classifier (KNN) (Zerrouki et al., 2019), and support vector machine classifier (SVM) (Shao & Lunetta, 2012) have been tested for different applications using multispectral/optical remote sensing data (Singh et al., 2021d; Singh e al., 2022c). Some researchers also developed and improved deep learning-based classifiers such as deep neural network classifier (DNN) (Al-Dousari et al., 2023; Bali & Singla, 2021; Liang et al., 2022), recurrent neural network classifier (RNN) (Chen et al., 2019; Henry et al., 2019), and convolutional neural network classifier (CNN) (S. Kaur et al., 2022; Yao et al., 2020) to improve accuracy, processing speed and applicability.

Different change detection strategies have been used by numerous researchers to report their work on land use and land-cover (LULC) (Lu et al., 20042014). Some change detection methods, including image differencing with a multispectral dataset, are suggested and contrasted with the decision tree algorithm using the PCC methodology (Dahiya et al., 2023). Goswami et al. (2022) represented the machine learning-based change detection method for the estimation of LULC variations. Similarly, Y. Zhang et al. (2022) suggested a continuous change detection approach to track changes in the forest across China by employing an index method and a multispectral dataset. Three categories in total—forest, water, and others—are employed, and the modified map’s accuracy of 86.4% was attained. As a result, the effectiveness of various classifiers depends on the category of classification dataset that is being used. Additionally, the results of the classification process will directly influence the accuracy of subsequent processing like change detection (Boonpook et al., 2023; Zamani et al., 2022). Deep learning is one of the effective approach to resolve the various challenging issues in the processing of the datasets but on the other hand offers the automated training process, less dependent on human interface, improved performance in intricate scenarios, and the absence of stringent requirements for verifying the quality of training data (Felegari et al., 2023; L. Zhang et al., 2016). Figure 1 briefly summarizes the processing of various deep learning models.

Fig. 1
figure 1

Brief overview of various deep learning models

Change detection is a process of measuring the multitemporal changes with respect to the object or particular region by observing it at distinct times (Basheer et al., 2022; A. Singh, 1989). Deep learning has proven an effective approach in the change detection technique due to its numerous benefits, therefore deep learning-based changed detection (DCD) concept has been explored in this study. It is capable of providing seasonal variations accurately with distinct Landsat series dataset and promise to use higher resolution dataset with accurate results. Practically, it helps in recognizing the differences or variations between multitemporal satellite images. Change detection technique serves various application areas such as forest fire, tracking cyclones, deforestation, and urban evolution (Sing et al., 2022a). In previous literature, various change detection methods have emerged which can be further broadly classified into per-pixel and sub-pixel (Esmaeili et al., 2023). Per-pixel means a single pixel belongs to one category of class whereas, sub-pixel means that a single pixel in an image can belong to multiple memberships of multiple class categories. There are various per-pixel-based methods namely image rationing (IR), post-classification comparison (PCC), change vector analysis (CVA), and image differencing (ID) (Farmonov et al., 2023). The choice of change detection technique depends upon the type of earth features under observation. The identification of various land surface changes may be accomplished through change detection techniques. While implementing the change detection, the four criteria must be fulfilled i.e., (a) explicit multi-temporal dataset registration; (b) accurate radiometric and atmospheric evaluation; (c) comparable phenological states between different datasets; and (d) the selection of datasets with the same spatial and spectral resolution (Dibs et al., 2023; Kumar et al., 2022;). Table 1 also represents the various machine learning and deep learning techniques for agricultural land as shown below.

Table 1 Agricultural land with machine learning and deep learning techniques

To effectively monitor land use and record land change resulting from a variety of natural and human-caused changes, including urbanization, drought and climate change, Landsat satellites are equipped with the best ground resolution and spectral bands for agriculture purpose. R. K. Singh, Sethi, and Singh (2021a) uses machine learning algorithm for LULC detection using Landsat series dataset and achieved accuracy below 86%. Similarly, Y. Zhang et al. (2022) suggested a continuous change detection strategy to monitor the forest utilizing a multispectral dataset using an index method over the country of China with three categories and achieved accuracy of 86.4%. To improve the results and to fetch detailed information about LULC, deep learning-based approach is used. It promises to employ better resolution datasets with reliable findings and is capable of presenting seasonal changes using various Landsat series datasets.

The present article aims to evaluate the performance of deep learning-based ENVINet-5 architecture for agriculture using Landsat time series data, i.e., Landsat-7, Landsat-8, and Landsat-9. Therefore, the objectives are framed as follow: (a) implementation of the deep learning-based ENVINet-5 architecture using the Landsat time series dataset, (b) development of the DCD model, and (c) accuracy assessment of the DCD model using the Landsat time series dataset. The outcome of this study allows the efficient estimation of agricultural changes with a state-of-the-art ENVINet-5-based deep learning change detection model. This introduction section is followed by the study area and satellite dataset. Afterwards, the methodology and experimental results have been discussed. At last, the conclusion has been drawn based on the results and discussion.

Study area and satellite dataset

Study area

The District Fatehgarh Sahib, situated in the south-eastern region of Punjab (India) is selected for the study purpose as shown in Fig. 2. This region has geographic coordinates between 30°38′44″ N to 30°38′37″ N in latitude and 76°22′57″ E to 76°22′95″ E in longitude. It is one of the developed districts of Punjab having a geographic area of 1147 km2. It is an agrarian region and most of the area lies in fertile alluvial soil which is good for agricultural cultivation. The Fatehgarh Sahib District’s residents primarily depend on agriculture for their livelihood. It contains a net irrigated area of about 101,000 hectares, whereas the gross irrigated area is 191,000 hectares. The land use and land cover-based classes that are selected for this region include wheat, ripe wheat, fallow land, buildup area, and water. Therefore, deep analysis is required to understand the special effects of human activity on the ecosystem and create plans for sustainable land use and management. As of now, the cost-effective method of monitoring and mapping land cover changes and other agricultural areas is only provided by remote sensing.

Fig. 2
figure 2

Representation of study area (a) Indian map highlights the Punjab state, (b) Punjab map highlights the district Fatehgarh Sahib, and (c) the district Fatehgarh Sahib dataset Landsat (RGB: NIR-Red-Green) bands based natural color images.

Dataset

The multispectral cloud-free datasets namely Landsat-7, Landsat-8, and Landsat-9 have been used for the present research work as shown in Fig. 3. The Landsat-7 dataset was acquired from Nov-2012 to April-2013. The dataset acquisition dates are: (a) 17-Nov-2012, (b) 19-Dec-2012, (c) 04-Jan-2013, (d) 12-Feb-2013, (e) 09-Mar-2013, and (f) 17-Apr-2013. Landsat 8 dataset was acquired from Nov-2017 to April-2018 and the dataset acquisition dates are: (a) 23-Nov-2017, (b) 25-Dec 2017, (c) 10-Jan-2018, (d) 27-Feb-2018, (e) 31-Mar-2018, and (f) 16-Apr-2018. Landsat-9 dataset was acquired from Nov-2022 to April-2023 and the dataset acquisition dates are (a) 13-Nov-2022, (b) 15-Dec-2022, (c) 16-Jan-2023, (d) 01-Feb-2023, (e) 05-Mar-2023, and (f) 06-Apr-2023. As per the availability and to notice the seasonal variations at different time intervals, data from three different sources are used, but the same bands are used for all the datasets.

Fig. 3
figure 3

Landsat-7 datasets (NIR-Red-Green) acquired on November 2012 to April 2013 (a–f), Landsat-8 datasets (NIR-Red-Green) acquired on November 2017 to April 2018 (g-l), and Landsat-9 datasets (NIR-Red-Green) acquired on November 2022 to April 2023 (m–r)

The Landsat-7 dataset has dimensions of 4.04 m × 2.74 m and weights 1973 kg. There are 378 Gbits (around 100 pictures) of solid-state memory on board Landsat-7. The primary equipment on board Landsat-7 is the image sensor known as the Enhanced Thematic Mapper Plus (ETM+), which is a whisk broom scanner. On February 11, 2013, Landsat 8 was launched on an Atlas-V 401 rocket from Vandenberg Air Force Base in California, using an extended payload fairing (EPF) provided by United Launch Alliance, LLC. The Thermal Infrared Sensor (TIRS) and the Operational Land Imager (OLI) are the two scientific instruments that make up the Landsat 8 satellite payload. Seasonal coverage of the whole landmass is offered by these two sensors at three different spatial resolutions: 30 m for visible, NIR, and SWIR; 100 m for thermal; and 15 m for panchromatic.

On September 27, 2021, Landsat-9 was launched from Vandenberg Air Force Base in California using an Atlas V 401 rocket from United Launch Alliance. The Thermal Infrared Sensor 2 (TIRS-2) and Operational Land Imager 2 (OLI-2) are on board Landsat-9, which was constructed by the NASA Goddard Space Flight Center in Greenbelt, Maryland, and Ball Aerospace & Technologies Corporation in Boulder, Colorado, respectively. The two instruments were integrated and the spacecraft was conceived and built by Northrop Grumman. Landsat-9 can distinguish 16,384 distinct wavelength color. In contrast, 8-bit radiometric resolution of Landsat-7 allows it to distinguish just 256, whereas Landsat 8’s 12-bit data allows it to detect 4096 shades. The Landsat series dataset (i.e., Landsat-7, Landsat-8, and Landsat-9) was downloaded from the online portal (https://earthexplorer.usgs.gov/). The Landsat dataset series products offer a temporal resolution of 16 days with a swath width of 185 km. The Landsat 7 has an 8-bit radiometric resolution, whereas, Landsat 8 and 9, it is 12-bit.

During the training and validation phases, Pléiades constellation fine-resolution data, i.e., Pléiades 1A and 1B were applied. Center National d'Etudes Spatiales (CNES), a division of Airbus Defense and Space, oversees this spacecraft. It provides sharp resolution images, which might give more specific information about the area. ERDAS Imagine Earth Resources Data Analysis System (2015) contains an in-built google earth (GE) viewer where a picture may be grounded about the GE. It provides the capacity to link to GE, navigate to a designated place, match images to GE and vice versa and carry out a great deal of recognition and identification finding the remote sensing equipment. The Pléiades system gives data that may be verified using very high-resolution remote sensing. The distance between tiny things, such as buildings and country roads, may be measured using it. It is also used to provide information to detect and identify crop diseases, differentiate between agricultural and plain land.

Methodology

The framework of the proposed methodology is shown in Fig. 4 which includes (a) the pre-processing of the input image (Landsat series dataset), (b) the implementation of the classification model (ENVINet-5), (c) the implementation of change detection technique (DCD), and (d) accuracy assessment.

Fig. 4
figure 4

The framework of ENVINet-5 architecture-based methodology

Preprocessing of an input dataset

The satellite dataset must be pre-processed before applying classification and change detection techniques. It is helpful in the successful elimination of the countless inaccuracies brought by several factors, such as the sun’s position in relation to the landscape, the air’s fluctuating conditions, and errors created by satellite sensors (Jalayer et al., 2023; Nejad et al., 2023). The outcome might change if the errors are not fixed on time. The radiometric correction, which the sensor uses to store the intensity of electromagnetic radiation (EMR) using digital numbers (DN), is eliminated as part of the pre-processing (Kumar & Jain, 2020; Panahi, Khosravi, et al., 2022; Sharifi & Felegari, 2022).

It is necessary to convert these DN into practical units such as reflectance, radiance, etc. The estimation of reflectance includes the conversion of DN data into radiance values. The brightness value of the DN imagery may be calculated by taking into account the maximum and minimum radiance values of each band (Gusain et al., 2019). The value of Xi as radian is computed as shown in Eq. (1).

$${\mathrm{x}}_{\mathrm{i}\uplambda}=\left[\frac{\left(\mathrm{D}{\mathrm{M}}_{\mathrm{i}}\uplambda \right)}{\left(\mathrm{Mgray}\right)}\kern0.5em \times \left(\mathrm{B}{\max}_{\uplambda}-\mathrm{B}{\min}_{\uplambda}\right)\right]+\mathrm{B}{\min}_{\uplambda}$$
(1)

where Bmax is represented as the maximum radiance indicated by the metadata. Bmin is shown as the minimal radiance value, and DMi highlights the digital number of band pixels. The maximum DN value for a given band is represented by MGray. It is important to calculate all the angles, including the solar zenith angle, azimuth angle, and elevation angle. The solar zenith angle, which is the distance between the Sun and its overhead position, is expressed in degrees. Every sensor’s readings contain some amount of noise or inaccuracy. While some of this noise can be eliminated, it is not always viable to do so using radiometric corrections. To eliminate or reduce these effects, radiometric adjustments use atmospheric correction algorithms, which enable precise surface property interpretation. In the present study, spectral reflectance is obtained by using Eq. (2) (Mishra et al., 2010) given below.

$$\mathrm{R}={\mathrm{D}}^2\left({\mathrm{L}}_{\mathrm{r}}-{\mathrm{L}}_{\mathrm{p}}\right)\ \uppi /{\mathrm{t}}_2\left({\mathrm{E}}_0\cos {\uptheta}_{\mathrm{z}}{\mathrm{t}}_{\mathrm{z}}+{\mathrm{E}}_{\mathrm{t}}\right)$$
(2)

where Lr and Lp represent the radiance of a sensor, zenith angle is denoted by θz , D is used to represent the distance among earth and the sun. E0 and Et show the bandpass spectral irradiance. After necessary rectification, the image can be used for additional analysis, such as classification or change detection. The accuracy evaluation process is used to validate the findings produced after implementation. Firstly, layer stacking of the Landsat series dataset is done, which means that all desired bands are stacked together to form a single image. After that, the radiometric corrections are performed on the dataset to remove any form of distortion. However, the Landsat-7 dataset suffers from data gap issues due to the scan line corrector (SLC) failing. These gaps are filled with the help of the direct sampling technique (Ienco et al., 2017; Panahi, Yariyan, et al., 2022). After this, corrective measures are applied to the Landsat 7 dataset.

ENVINet-5 architecture-based classification model

Classification allows the identification or analysis of objects to categorize them into specific sets of classes with the help of a training dataset. Numerous categorization techniques have been developed for land cover mapping in earlier literature. However, past literature assured that deep learning classifiers performed better than machine learning classifiers due to numerous benefits such as being capable of dealing with complex data, rapid model development, and minimal user involvement. In this study, we have proposed a deep learning-based ENVINet-5 architecture that effectively extracts features from the dataset while being suitable for the classification of semi-arid regions. The three-dimensional input shape files such as patch size were fed to the ENVINet-5 model (Kumar et al., 2022; Rasheed & Mahmood, 2023; K. Zhang et al., 2023). Three-dimensional convolutional layers make up the proposed architecture. The first, second and third convolution layers involved the 16, 32, and 64 filters, respectively with the kernel size such as (2,2), and the stride such as (1,1).

Furthermore, two max-pooling layers with kernel size (2,2) have been utilized to decrease the spatial dimension and select the highest pixel value possible from the training dataset. The first dense layer had 64 neurons and the second dense layer had 128 neurons. A certified feature map is produced by the ReLu after it completes each element operation with negative pixels equal to zero. The Softmax activation function was then applied to one output layer. The ENVINet-5 architecture is shown in Fig. 6. The number of epochs, loss function, learning rate, activation function, batch size, optimizer, and other parameters need to be set when using the ENVINet-5 architecture (Mohammadi et al., 2021; Singh et al., 2022b). The values of these parameters can significantly affect the precision of the classification. The parameters and their values are considered as (a) dropout=0.2, (b) learning rate=0.0002, (c) epochs=300, (d) batch size=128, and (e) activation function. To evaluate the performance of the ENVINet-5 architecture, an inbuilt toolbox is required which is provided by Environment for Visualizing Images (ENVI) software version 5.6 (Singh et al., 2021c). The hardware specifications required for the present work include Windows 10 or 11,8-16 GB RAM, NVIDIA Quadro P620 2 GB (4) MDP GFX. A total of 1000 samples are used in the model’s training, out of which 80% are drawn from the training dataset and the rest 20% from the testing dataset. The training-based region of interest (ROI) is selected including a variety of classes such as wheat, water, buildup, fallow land, etc. After the model training, implementation is executed which generates classified maps of ENVINet-5 architecture. Classified maps are further processed to identify the changes in land use and land cover areas with the help of a suitable change detection technique (DCD).

Change detection

The change detection technique is used for identifying differences in an image or object taken on different dates via repeated observation (Dahiya et al., 2022). Numerous researchers have used various change detection approaches in the prior literature, including image differencing, image rationing, PCD, etc. Each method of change detection has benefits and disadvantages, and it is not suitable in all circumstances. However, the PCD technique has proven best among all techniques for the present work. There are numerous benefits of this method such as it can minimize the error caused by sensors and suitable for atmospheric differences. This approach also enables the pixel-by-pixel comparison of images and the classified map quality has a significant impact on accuracy. Numerous researchers have recommended this technique for LULC monitoring, agriculture, forest monitoring (Du et al., 2020; Mohammadi et al., 2021), and snow detection (Singh et al., 2022b).

Accuracy assessment

Accuracy evaluation is a vital stage in the processing of remote sensing data. Its primary goal is to examine how pixels were sampled into the proper land cover groups. By comparing the classifications created for each pixel to the specific land cover conditions found related to the ground truth data, the accuracy of the classified image is assessed. There are several parameters to measure the accuracy such as (a) producer accuracy (PA) which is the probability that reference pixels are correctly classified. It is represented as the compliment of omission error and is calculated as Producer’s Accuracy = 100%-Omission Error. Omission errors are reference locations that were either omitted or left out of the right class on the classed map. (b) User accuracy (UA) refers to the pixels belonging to a specific class category. It is represented as the compliment of commission error and is calculated as User’s Accuracy= 100%-Commission Error. It relates to classified sites that serve as references for locations that were incorrectly included in the appropriate class on the classified map. By looking for inaccurate classifications on the categorized sites, commission mistakes are computed. (c) Overall accuracy (OA) is utilized to determine the overall accuracy map for each class, (d) kappa coefficient (Kc) as demonstrated in Eq. (3) (Mohammadi & Sharifi, 2021; Sood et al., 2020) is defined as the difference between two agreements, namely, actual and expected. The range of the Kappa Coefficient is −1 to 1. A classification that is equal to a random classification is denoted by a value of 0. When the categorization is much poorer than random, a negative value is indicated. When the categorization is substantially better than random, the value is near to 1.

$${\mathrm{K}}_{\mathrm{c}}=\frac{\mathrm{N}\sum_{\mathrm{i}=1}^{\mathrm{r}}{\mathrm{m}}_{\mathrm{i}\mathrm{i}}-\sum_{\mathrm{i}=1}^{\mathrm{r}}\left({\mathrm{Y}}_{\mathrm{i}}{\mathrm{L}}_{\mathrm{i}}\right)}{{\mathrm{N}}^{2\kern0.5em }-\sum_{\mathrm{i}=1}^{\mathrm{r}}\left({\mathrm{Y}}_{\mathrm{i}}{\mathrm{L}}_{\mathrm{i}}\right)}$$
(3)

The term Kc represents the value of the kappa coefficient; N shows the pixel count; r represents the rows and column count of the error’s matrix; mii portrays the observation associated with class I; Yi reflects the real values of class i whereas Li displays the anticipated values of class i. These metrics are useful in assessing a model’s ability to classify or forecast unknown data with accuracy.

Result analysis

In the present work, the Landsat series datasets namely Landsat-7, Landsat-8, and Landsat-9 are used as input data. The Landsat-7 dataset was acquired from November 2012 to April 2013, the Landsat-8 dataset was acquired from November 2017 to April 2018, and the Landsat-9 dataset was acquired from November 2022 to April 2023 as shown in Fig. 4. Afterwards, ENVINet-5 architecture implementation is performed over the Landsat series dataset as shown in Fig. 5. The classification is done which generates classified maps of numerous categories represented as wheat (with green color), water (with blue color), ripe wheat (with yellow color), buildup (with red color), and fallow land (with grey color) as shown in Fig. 6 and classification results have been shown in Fig. 7.

Fig. 5
figure 5

The working flow diagram of ENVINet-5 architecture-based implementation

Fig. 6
figure 6

Landsat-7 datasets classified maps on November 2012 to April 2013 (a–f), Landsat-8 datasets classified maps on November 2017 to April 2018 (g–l), and Landsat-9 datasets classified maps on November 2022 to April 2023 (m–r)

Fig. 7
figure 7

Landsat-7 datasets to Landsat-8 datasets “change/no-change” change maps on November 2017 to April 2018 (a–f), and Landsat-8 datasets to Landsat-9 datasets “change/no-change” change maps on November 2022 to April 2023 (g–l)

Various parameters, including error matrices, producer’s accuracy, user’s accuracy, and kappa value, are used to evaluate the accuracy assessment of the proposed algorithm. The accuracy assessment is done to evaluate the effectiveness of the ENVINet-5 architecture. Fine-resolution data from the Pléiades constellation—Pleiades -1A and Pléiades 1B—were used in the training and validation procedures. The Center National d'Etudes Spatiales (CNES)/Airbus Défense and Space oversees this satellite. It provides high-resolution photos that can give more specific information about the area. The integrated Google Earth (GE) viewer in the ERDAS Imagine Earth Resources Data Analysis System (2015) allows a picture to be anchored around the GE. It provides the capacity to establish a connection with GE, go to a designated place, match images with GE and vice versa, export prints, and carry out synchronizing.

Table 2 shows the accuracy assessment of the Landsat-7 dataset (89.4–90.2%) from Nov 2017 to Dec 2017 and (88–91.2%) from Jan 2018 to April 2018. Similarly, Table 3 reflects the accuracy assessment of the Landsat-8 dataset (86.8–91.4%) from Nov 2022 to Dec 2022 and (88.6–91.2%) from Jan 2023 to April 2023 and Table 4 shows Landsat-9 datasets accuracy of (90–92.8%) from Nov 2012 to Dec 2012 and accuracy of 91–93.2% from Jan 2013 to April 2013. The results show that the ENVINet-5 classifier has achieved an accuracy of above 91% in all the cases and found that the suggested method produces more accurate results for Landsat data.

Table 2 Accuracy assessment of ENVINet-5 architecture of Landsat-7 dataset
Table 3 Accuracy Assessment of ENVINet-5 architecture of Landsat-8 Dataset
Table 4 Accuracy assessment of ENVINet-5 architecture of Landsat-9 dataset

Afterwards, the classified map generated from the Landsat series dataset was used to generate the change maps. Figures 8 and 9 represent the ‘from to change’ map using the Landsat series dataset. The accuracy assessment of the changed map of ENVINet-5 architecture using Landsat-7 and Landsat-8 datasets has been shown in Tables 5 and 6. Similarly, the evaluation of the changed map of ENVINet-5 architecture for Landsat-8 and Landsat-9 datasets has been shown in Tables 7 and 8. The result depicts that ENVINet-5 architecture-based PCD is capable of producing accuracy above 89% (approx.) for changed maps.

Fig. 8
figure 8

Landsat-7 datasets to Landsat-8 datasets “from-to” change maps on November 2017 to April 2018 (a–f), and Landsat-8 datasets to Landsat-9 datasets “from-to” change maps on November 2022 to April 2023 (g–l)

Fig. 9
figure 9

Comparison of accuracy assessment of classified maps computed from (a) Landsat-7 (2012–2013), (c) Landsat-8 (2017–2018), and Landsat-9 (2022–2023).

Table 5 Accuracy assessment (PA, UA, KC) of ENVINet-5 architecture -PCD change map using Lansat-7 and Landsat-8
Table 6 Accuracy assessment (RT, CT, NC) of ENVINet-5 architecture -PCD change map using Landsat-7 and Landsat-8
Table 7 Accuracy assessment (PA, UA, KC) of ENVINet-5 architecture -PCD change map using Lansat-8 and Landsat-9
Table 8 Accuracy assessment (RT, CT, NC) of ENVINet-5 architecture -PCD change map using Landsat-8 and Landsat-9

The representation of the changed area in percentage (%) is computed from the ENVINet-5 architecture using Landsat-7, Landsat-8, and Landsat-9 datasets which are shown in Tables 9 and 10. The results portray that agricultural land has been decreased and converted into urban areas (buildup) which is an alarming situation. ENVINet-5 architecture-based PCD has proven efficient in detecting changes in agricultural land. This work has shown that an ENVINet-5 architecture-based PCD improved the effectiveness of expanding the use of the Landsat series dataset in the detection of land surface changes with a straightforward framework.

Table 9 Representation of change area (in percentage) computed from the ENVINet-5 architecture using Landsat-7 and Landsat-8
Table 10 Representation of change area (in percentage) computed from the ENVINet-5 architecture using Landsat-8 and Landsat-9

In this work, the potential of the ENVINET-5 has been explored on Landsat series datasets, i.e., Landsat-7, Landsat-8, and Landsat-9. The visual interpretation and statistical analysis confirm the effectiveness of the ENVINet-5 in the effective production of the classification maps as shown in Fig. 5. However, it has been noted that the accuracy of the Landsat-9 is only slightly greater than that of the Landsat-7 and Landsat-8. The primary cause of such results may involve the better radiometric resolution (14-bit) offered by the Landsat-9 as compared to the Landsat-8 (12-bit) and Landsat-7 (8-bit) as shown in Figs. 9, 10 and 11. This is one of the important parameters of the sensor which allows the better detection of smaller differences in the energy reflected from the object. It has also been observed that the errors in the results are still high which needs to be improved with the help of the improvement in the preprocessing steps, better feature selection during the classification process, and advanced models for the detection of minor changes or least impacted with the classification outcomes. The Landsat series is one of the important platforms for the detection of long-term changes. Future studies may involve the integration of artificial intelligence (AI) driven models for the accurate monitoring of land cover changes.

Fig. 10
figure 10

Comparison of accuracy assessment of change maps computed from (a) Landsat-7 to Landsat-8 (2012/2013-2017–2018), and (b) Landsat-8 to Landsat-9 (2017/2018–2022/2023)

Fig. 11
figure 11

Graphical representation of “change and no-change” area computed from (a) Landsat-7 to Landsat-8 (2012/2013-2017–2018), and (b) Landsat-8 to Landsat-9 (2017/2018–2022/2023).

Discussion

To evaluate the utility of DCD algorithm over agriculture area, validation has been done using Landsat series dataset. In past literature, the U-Net algorithm has been used over glacier lakes by using Landsat dataset and achieved accuracy of 85.3% (He et al., 2021; Marzvan et al., 2021). Similarly, U-Net classifier is used for irrigation mapping over Montana state from 2000 to 2019 using Landsat dataset (Colligan et al., 2022; Mohammadi et al., 2021). Singh et al. use machine learning algorithm for LULC detection using Landsat series dataset and achieved accuracy below 86%. Similarly, Zhang et al. (2022) suggested a continuous change detection strategy to monitor the forest utilizing a multispectral dataset using an index method over the country of China with three categories and achieved accuracy of 86.4%. As comparative to previous studies, our model performs better than other approaches used. The primary goal of this research is to use proposed algorithm to create an effective and realistic change detection method. Specifically, this algorithm is designed to highlight the agriculture classes and their variations using Landsat series datasets. Deep learning algorithm has been applied to attain state-of-the-art performance on a variety of tasks and is also capable of handling big and complicated data. But, to train, it needs a lot of data and processing power, which makes it computationally costly. In the current study, a deep learning-based change detection (DCD) algorithm has been proposed to detect long-term agricultural changes using the Landsat series datasets (i.e., Landsat-7, Landsat-8, and Landsat-9) during the period 2012 to 2023. The proposed algorithm extracts feature from satellite data according to their spectral and geographic characteristics and identifies seasonal variability.

The visual interpretation and statistical analysis confirm the effectiveness of the ENVINet-5 in the effective production of the classification maps as shown in Tables 1, 2, 3 and Fig. 5. However, it has been noted that the accuracy of the Landsat-9 is only slightly greater than that of the Landsat-7 and Landsat-8. The primary cause of such results may involve the better radiometric resolution (14-bit) offered by the Landsat-9 as compared to the Landsat-8 (12-bit) and Landsat-7 (8-bit) as shown in Figs. 8, 9 and 10. According to the experimental results, vegetation has declined between 2012 and 2023, but buildup land has been expanded to 88.22% between 2012 and 2023 for the Landsat-7 and Landsat-8 datasets. Conversely, fallow land (1–0.59%) and water (3.20–0.05%) make up the degraded area. This research’s primary objective is to categorize agricultural land using a deep learning-based classifier, such as the ENVINet-5 architecture, and use the DCD approach to monitor changes over time. Additionally, it is determined that by utilizing the Landsat series information, the design has the potential to increase accuracy.

Conclusion and future scope

This work evaluates the potential of the DCD architecture over an agricultural area, i.e. Fatehgarh Sahib, Punjab using the Landsat series dataset. The results make it clear that the proposed model can increase the accuracy of LULC categorization and agriculture monitoring. The data is categorized as wheat, ripe wheat, buildup, water, and fallow land. The experimental result concludes that vegetation has decreased from 2012 to 2023, while buildup land has increased up to 88.22% (2012–2023) for Landsat-7 and Landsat-8 datasets. On the other side, degraded area includes water (3.20–0.05%) and fallow land (1–0.59%). The main goal of this research is to implement a deep learning-based classifier, i.e., ENVINet-5 architecture to classify the agricultural land and measure the changes using the DCD technique. It is also concluded that the architecture also has the potential to improve the accuracy using the Landsat series dataset. Although significant progress has been achieved in change detection, there are still many obstacles because of factors including lack of training data and previous knowledge. Moreover, a lot of fundamental problems with remote sensing datasets that haven’t been addressed yet, such as heterogeneous data, multiresolution pictures, and global information of high-resolution and large-scale images. Thus, it is highly recommended to do more research with a greater emphasis on these issues. This study will be helpful in the identification of crop growth, crop yield prediction, precision farming, and crop mapping. Additionally, it is also expected that this study will provide direction and practical guidance to researchers and scientists.