Introduction

Agriculture land is the backbone of the Indian economy and the major source of national income via agriculture and allied activities. Agriculture acts as a supply chain of food products and raw materials for industrial development, commercial activities, and international trade [77]. It has also been observed that in India, since the past few decades, agriculture activities have been continuously decreasing due to urbanization or the growth of other sectors [44]. But it is still high as compared to other countries. It is more important to perform the comprehensive assessment of agriculture with respect to crop production which is essential to meet the demands of the food supply chain [78]. To assess the agricultural land, field observation methods are generally followed which is a time-consuming, expensive and tedious task [3]. Moreover, there is a very rare possibility of continuous monitoring on a daily or weekly basis. Crop mapping and classification are some of the most difficult tasks among agricultural land problems [32]. In agricultural land, the most common approach used for crop monitoring is the digital cameras or field observation for evaluation of the crop yield which may be costly or limited to the small area [12]. Therefore, automatic, consistent and a fast, system are necessary to deliver the precise crop mapping and monitoring over all large scale [4].

Remote sensing via optical or microwave imaging offers a cost-effective way to monitor the land cover changes at a very large scale [29]. The continuous monitoring and assessment of agricultural cropland provide valuable insights into the different agronomic parameters [61, 69]. The use of advanced geospatial technologies is necessary to acquire information related to variations in soil, climate, and other physic-chemical changes [49]. The monitoring of agricultural manufacturing systems follows strong seasonal patterns about the biological life cycle of yields. Every part of these factors is highly variable in time and space dimensions [71].

To monitor the agricultural land, various techniques are available such as change detection, classification, and fusion or pan-sharpening [44, 83]. Change detection procedures are generally used to monitor multi-temporal changes and detailed information can be found in different studies [44, 83]. In change detection procedures, classification is an important element to categorize the different land types based on their similarity score and allow the user to extract the meaningful information in the form of a thematic map [45]. The classification techniques can be categorized as (a) supervised/ unsupervised; (b) parametric/non-parametric, (c) hard/soft, (d) per-pixel/sub-pixel, and (e) object-based classification [44]. Due to the limitations of various satellite sensors, it is not possible to acquire the earth imagery at a very high temporal and spatial resolution simultaneously and there is a requirement to perform the pan-sharpening or fusion of high-resolution and low-resolution datasets. In many cases, pan-sharpening is found to be more effective to improve the classification and change detection results [81].

However, the main focus of the present study is to make a comparative study on various emerging land-use and land-cover classifiers. Generally, supervised classification requires having adequate knowledge about prior information or training data to generate classified images. Whereas, in unsupervised classifiers, there is no requirement of prior information or training data because it classifies the input dataset based on similarity or in the form of clusters [86]. Moreover, semi-supervised classifiers are more preferable due to their less dependency on user’s skills and handling more complex problems [81]. Nowadays, some machine learning or deep learning-based classifiers have become more popular due to their effectiveness in extracting critical information from remotely sensed data. Table 1 summarizes some of the basic and advanced classifiers, especially for agricultural land.

Table 1 Agricultural land with machine learning and deep learning

In the present paper, we address the major developments made into the field of classifiers based on satellite datasets, epically for agricultural land. The aim of the present analysis is on: (a) the recent advancements made in classification methods; and (b) comparative analysis of different strategies to monitor agricultural land. This paper also highlights the different types of satellite sensors available or previously used in agriculture applications and various steps involved in satellite dataset pre-processing as explained in the second section. Afterward, a detailed review of conventional as well as advanced classification models for agriculture applications. At last, we have made the summary and future recommendations for optimal ways to use satellite datasets of agricultural land in sections.

Classification Models

The supervised classification requires the training data to classify the input dataset [34] such as decision tree, minimum distance [39], nearest neighbor (NN) [102], and maximum likelihood classifier (MLC) [14]. Whereas, the unsupervised classifiers divide the spectral information into specified class categories based on statistical information acquired from an image itself [15, 16]. Such as ISO [89] and K-mean [15, 16]. The parameters classifiers like mean-vector and covariance matrix are often generated from training samples [63]. In the case of complicated landscapes, parametric classifiers generate undesirable results such as linear discriminant analysis [63] and MLC [15, 16]. Non-parametric classifiers are generally based on the exclusion of statistical parameters and free to learn with the help of training dataset such as support vector machine (SVM) [74], NN [21], decision tree [21].

Conventional classifiers are generally based on the signatures generated from the training dataset (Table 2). These classifiers generally ignore the mixed pixel information and provide the result based on maximum likelihood [33]. Such as MLC [24], NN, decision tree [14, 36].

Table 2 General convention in agricultural land

Whereas, subpixel classifier offers the combination of partial membership of multiple class categories within a specific pixel [86] such as Fuzzy-set [80, 84], spectral mixture analysis [63] and linear mixture model [87]. The OBC classifiers involved the categorization of pixels based on the spatial relationship with the surrounding pixels [39]. In this paper, we have reviewed different approaches (neural networks, machine learning, and deep learning) with highlighting various features like classification techniques, classifier, sensor category, crop/parameters, and performance accuracy. NN (Neural Networks) are smart tools to derive thematic maps from satellite datasets (Table 2).

Machine Learning-Based Classifiers

The machine learning approach is used to solve large nonlinear problems using datasets from various sources. It enables improved decision-making and knowledgeable procedures in a real-world scenario with minimum dependency on the user’s skill. It provides a flexible and powerful structure for the integration of expert information into the system. The machine learning approaches are broadly used for the accurate measuring of biotic stress for weed detection as well as plant disease in the crop (Table 3). Cai et al. [7] described the utilization of Landsat series spectral data to solve the problem of clouds while implementing the machine learning model and more accurate analysis of the classification process. Coopersmith et al. [10] reported the landowner to hesitant to place sensors due to financial cost, difficulty, and sometimes infeasibility physical visit to the remote location which may be limited by modeling the wetting or drying process through machine learning algorithms. Duro et al. [14] selected a subset of a large amount of drainage basin select for a long-term study land-use and land-cover monitoring.

Table 3 Agricultural land with machine learning (ML)

Deep Learning-Based Classifiers

The deep learning further extends machine learning applications into more depth as well as transforms the dataset using the different function that hierarchically allows data representation, through several levels’ abstractions. A strong benefit of deep learning is feature-based learning that includes the automatics extraction of different features from input dataset Table 4, represents the various deep learning approaches including CaffeNet and convolution neural network (CNN). Kussul et al. [33] utilized the multilevel deep learning architecture for the classification of different land use and land cover types from remotely sensed datasets. In this section, we briefly review relevant deep learning-based models that were originally proposed for visual dataset processing and that are widely used for state-of-the-art research into deep learning in Remote Sensing Dataset. In addition, we mention the latest deep learning developments, which are not yet widely applied to remote sensing but may help create the next generation of its algorithms. Figure 1 gives an overview of the deep learning models we discuss in this section.

Table 4 Agricultural land with Deep Learning
Fig. 1
figure 1

An overview of various deep learning classifiers: a deep convolution neural network (DCNN), b autoencoders, c recurrent neural network (RNN), and d convolution neural network (CNN)

Further, [76] provided a better understanding of the capability of Sentinel-1 dataset radar dataset or images for agricultural land mapping. Ndikumana et al. [53] developed the deep learning model efficiently and perfectly classify cloud, shadow, and land cover in different high-resolution satellite datasets. Moreover, Zhou et al. [102] investigated the suitability and potential of DCNN in the supervised classification of POLSAR (Polarimetric Synthetic Aperture Radar) dataset. Spatial information was naturally employed to terrain classification due to the properties of convolutional networks (Table 5).

Table 5 Deep learning models and its application

Sowing and Harvest of Summer and Winter Crops

Figure 2 represents the Sowing and harvesting of different crops during the summer season (May–October) and winter (October–April) [41]. The phonological stages for each crop’s and Botanical names such as wheat (Triticum aestivum), Barley (Hordeum vulgare), Mustard (Brassica nigra), Berseem (Trifolium alexandrinum), Paddy (Oryza sativa), Corn (Zea mays), Millet (Pennisetum typhodium), Sorghum (Sorghum bicolor) and Sugarcane (Saccharum officinarum) have been acquired from growth guides provided by Punjab State’s Department of Forestry, Agriculture, and established by interviews with neighborhood farmers [33].

Fig. 2
figure 2

An overview of various agricultural production phases

Traditionally, mapping the vegetation of an entire area is a matter of time and requires a demanding field survey. Remotely sensed datasets, especially such as sentinel-2, Landsat-8, and MODIS dataset the classification and monitoring of vegetation can be accomplished more cost-effectively with more detail in less period with the help of machine learning and deep learning approaches (Table 6). Three stages play an important role for vegetation monitoring or mapping of Punjab state’s region such as plantation, growth, and harvest time of crop cycle. In the past, classifiers have proved useful for finding different crop classes such as SVM [77] and KNN [34] for wheat; RF, SVM [77] and NN [101] for barley; RF, KNN and DCNN [34] for mustard; DT, RF and SVM [26] for Berseem and paddy; KNN [7] for corn; RF [4] for millet and sorghum and MLP [33] RF and DCNN [34] for sugarcane.

Table 6 Different machine learning and deep learning approaches related to agriculture informatics

Summary and Conclusion

The main focus of the present analysis is on the recent advancements made in classification methods and comparative studies on different strategies to monitor agricultural land [28]. Agriculture monitoring via remote sensing offers a cost-effective and rapid way. Nowadays, a significant contribution has been in the field of agriculture monitoring via satellite images due to the free data access policy offered by most space organizations [36]. With continuous development in space technology such as high spectral, spatial, and temporal resolutions, more or unexplored information can be warranted in the future [46]. Advanced geospatial classification techniques such as machine learning and deep learning can be more significant to extract important information from agricultural land [58].

From the previous literature, it is apparent that pixel-based methods have certain limitations such as not considering the variations within a pixel which can be effectively solved with the help of sub-pixel-based approaches up to a great extent. There is further existence of variation within a pixel [9]. Most studies on satellite datasets highlighted the performance of object-based classification approaches for different regions such as agriculture areas, urban areas, forests, and wetlands [47]. In the past various years, different studies have been carried out using different emerging classifiers in remote sensing-based agriculture applications [91]. Worked on NN and concluded that NN spontaneously selects the training samples on the contextual information extracted from the target area [34]. Moreover, the spatial distributions of the objects have also been improved and strengthened as it uses multi-scale contextual information [34, 76]. The accuracy in class-category and boundary information has also been improved in NN classified maps [18].

Moreover, the machine learning classifiers such as DT [26], SVM [20], RF [1], MLP [15, 16] and KNN [7] has the potential to improve the classification results in agriculture regions as compared to conventional classifiers [37]. Moreover, machine learning techniques directly study information from small data samples through their features and successively construct a difficult statistical model to make predictions on larger ones [15, 16]. These features come from variables that are involved in classification, namely predicting variables [7]. Such data-driven approaches can enhance the possibilities to adaptively improve the performance of a model by avoiding the problem of over-fitting or under-fitting [1].

On the other hand, the deep learning classifier such as CNN [76], RNN [5] and DCNNs [34] or object-based classification techniques improve the extraction of the agricultural land classes [76]. Within the deep learning approach, convolutional and pooling layers are connected alternatively to simplify the features towards deep and intellectual representations. Typically, the convolutional layer improves the learning procedure through a set of samples or image patches across the dataset [91]. Those weights are shared by different feature maps, in which multiple features are learned with a reduced number of parameters, and an activation function, e.g., rectified linear unit is followed to strengthen the non-linearity of the convolutional operations [62]. The pooling layer involves max-pooling or average-pooling, where the summary statistics of local regions are derived to further enhance the generalization capability.

The advanced methodologies (ML and DL) have the potential to become very important to the monitoring of agricultural land using satellite datasets. To apply these technologies for plant diseases, weed detection, real-time field operations, and soil analysis may become routine operations in close to future agriculture [17, 53]. Moreover, the development and integration of advanced algorithms in classification or change detection procedures may be beneficial to acquire information regarding the different vegetation types over agricultural land. Further, the machine and deep learning-based techniques can also be tested for vegetation monitoring over rugged terrain where remote sensing is highly affected with differential illumination effects in the form of shadow [79].

In this paper, we have systematically reviewed the state-of-art machine learning and deep learning techniques in remote sensing data analysis [67]. The deep learning techniques were originally rooted in machine learning fields for classification and recognition tasks, and they have only recently appeared in the remote sensing and geoscience community [30]. From the five perspectives of (a) supervised/ unsupervised; (b) Parametric/non-parametric, (c) hard/soft, (d) per-pixel/sub-pixel, and (e) object-based classification, we have found that deep learning techniques have had significant successes in the areas of target recognition and scene understanding, i.e., areas that have widely accepted as challenges in recent decades in the remote sensing community because such applications require us to abstract the high-level semantic information from the bottom level features, while the traditional remote sensing methods of feature describing feature extraction classification are shallow models, with which it is extremely difficult or impossible to uncover the high-level representation [8].

In agricultural land, which is an SVM-based technique, the testing on the automatic extraction of human-made objects is not made, and the segmentation accuracy limitation is not resolved [50]. The developed SVM classifier is not suitable for the applications, such as change identification and monitoring of the environment [100]. The classification accuracy is not achieved to the expected limit in the developed multi-spectral dataset by utilization of the SVM and RF classifiers [53]. The classification result is not improved by the DT classifier as the training dataset, and the testing area is limited. The research challenges in the DNN-based classification are, DCNN is not advisable for the classification of multi-sensor and multi-resolution satellite datasets (Singh, Sethi, and Singh, 2021). The developed ANN classifier [79, 82] cannot achieve the expected accuracy in massive distinct region databases and suffer from high computational complexity. However, the research in deep learning is still young and many queries remain unsolved. They are some potentially interesting topics in machine learning and deep learning for remote sensing data analysis such as (a) the total number of training samples [99]; (b) the complexity of remote sensing images [11]; (c) transfer between data sets [96]; (d) depth of deep learning model [104].