Background and Introduction

Land-use maps have become necessary for monitoring changes and managing land resources sustainably. However, such monitoring requires extensive effort, especially in large hardly accessible areas. Therefore, satellite images represent a valuable source of spatial data that can be used to provide helpful information and also investigate various spatial phenomena (Du et al. 2020; Ghasemloo et al. 2022) efficiently and rapidly (especially in the case of large areas) and cost-effectively (Huyck et al. 2014). Furthermore, the development of satellites was accompanied by the evolution of many algorithms used for satellite image classification, retrieving a set of spatial information or using them as input to predict a specific parameter or phenomenon (Lary et al. 2018). The classification of satellite images to obtain land-use land-cover (LULC) maps is one of the most important applications that combine satellite images and algorithms (Talukdar et al. 2020). The demand for such maps has become significant due to their importance in natural resource management (Bouslihim et al. 2021a; Kala and Kumar 2022; Montalván-Burbano et al. 2021). Also, it helps to understand the impact of human activities on the ecosystem (Berihun et al. 2021) or to be used to understand the distribution of another parameter or phenomenon (Aouichaty et al. 2022; John et al. 2022). Therefore, extracting this type of information requires either the application of unsupervised classification methods without relying on samples or reference areas (Alshari and Gawali 2021; Georganos et al. 2018) or the supervised classification methods. The latter is based mainly on representative samples from all groups of interest (e.g., land-use type) in order to classify the values of individual image components (pixel), including spectral reflectance or grayscale values (Nagne et al. 2019; Shaharum et al. 2018).

Htitiou et al. (2019) evaluated the feasibility of phenological data extracted from Sentinel-2 and Landsat-8 using the Random Forest (RF) model. They reported a high accuracy for Sentinel-2 for crop classification in irrigated areas in Morocco. Van Leeuwen et al. (2020) used three different algorithms (RF, Support Vector Machine (SVM), and Artificial Neural Network (ANN)) to produce a LULC map from Sentinel-2 on temporarily inundated areas (Hungary). Results show a high accuracy with slight differences between the three models. Ghayour et al. (2021) compared the performances of five models to generate LULC in western Iran. They report that the SVM model has the highest overall accuracy (OA) for satellite data and more accurate performance for Sentinel-2 than Landsat-8. Results also show that Maximum Likelihood Classification (MLC) and Minimum Distance (MD) models provided the lowest accuracy. In another study from Vietnam, Thanh Noi and Kappas (2017) used Sentinel-2 with three models (RF, SVM, and K-Nearest Neighbor (KNN)) and 14 different sizes of training samples. Results show a high accuracy for all classifications with a slight superiority of SVM. Karan and Samadder (2018) evaluated the performance of six classification techniques to extract land-use maps from the WorldView-2 satellite in India. Results show that only two classifiers (SVM and ANN) produced good results. Other classifiers, such as MLC, MD, and Spectral Angle Mapper, had not produced satisfactory results. All these studies use different satellites and a variety of classification algorithms. Although the findings of these studies are very diverse, overall, they all concur that the SVM model has high accuracy.

This new mission, Landsat-9, is a continuation of efforts to study and understand our planet since the launch of the first Landsat image in 1972. This Landsat-9 satellite has joined its sibling, the Landsat-8 satellite, in orbit, permitting the acquisition of images every eight days. Landsat 9 transmits 14-bit data from OLI-2, which provides a better view of darker areas, such as forests and coastal waters, than the 12-bit data from Landsat 8 OLI. This enhancement in radiometric resolution can also be supported by an additional improvement in spatial resolution based on pan-sharpening methods, which might contribute significantly in object detection and classification.

In the present study, we aim to assess the ability of the recently launched Landsat-9 satellite compared to Sentinel-2 for generating land-use maps based on two of the most popular classification algorithms (RF and SVM). The Landsat-9 images were pan-sharpened to a resolution comparable to that of Sentinel-2. This work was performed in a first area characterized by an urban trend surrounded by agricultural land and a rural area characterized by agricultural land and the Oum Rabii river. This study will help explore the new Landsat-9 product, especially the pan-sharpening effect on land-use map production.

Materials and Methods

Site Description

Two study areas of different characteristics were chosen, each occupying 227 km2 (Fig. 1). The first area (zone 1) represents Settat town in the Casablanca-Settat region, Morocco. This zone has an urban character with diverse land use; for example, we can find some forest areas in the North of the town, surrounded by agricultural and pasture lands. The second area (zone 2) represents a part of Bni Chegdale commune, Fquih Ben Salah province. This area has a rural character with randomly distributed habitats and crop circles; these parcels are irrigated from the Oum Rabii river (located south of the area). In the present study, considering the specificity of each site and for classification purposes, the two areas were combined and considered as a single area to ensure the presence of a variety of land-use types.

Fig. 1
figure 1

Geographical location of selected study areas

Image Processing

Spatial remote sensing is increasingly helpful for LULC mapping in large or hard-to-access areas, using spectral bands from different electromagnetic spectrum wavelengths. However, LULC classification requires high-resolution images of the Earth’s surface, which are not always available. This work compares the performance and evaluates the feasibility of using high-resolution multispectral images for land-use classification. For this purpose, two free-access satellite products with 0% cloud cover were used (Table 1).

Table 1 Landsat-9 and Sentinel-2’s main characteristics

Landsat-9 is a sun-synchronous satellite launched on September 27, 2021, and the images were made available for download in February 2022, resulting in a collaboration between NASA and the United States Geological Survey (USGS). It has an OLI-2 (Operational Land Imager) sensor to capture multispectral images in the visible and near-medium infrared wavelengths and a TIRS-2 (Thermal Infrared Sensor) spectrometer to study the ground temperature. Data from the new Landsat-9 sensor have important technological characteristics regarding spectral resolution, signal quantification level, signal-to-noise ratio, and radiometric resolution (14 bits), compared to the 12 bits for Landsat-8 and the 8-bit resolution for Landsat-7. Therefore, Landsat-9 provides better radiometric signal measurement sensitivity than previous Landsat missions. Furthermore, it improves change recognition, especially in darker areas such as water or dense forest (U.S. Geological Survey 2019).

Two satellite images of the same scene were downloaded from the USGS website (https://earthexplorer.usgs.gov/), captured in the daytime to enable maximum solar irradiation of the Earth’s surface. Landsat-9 data are delivered at L2SP and L1TP correction levels from the T1 collection 2 with a UTM 29 N projection. The spectral bands from visible (blue, green, and red), near infrared (NIR), and short-wave infrared (SWIR1 and SWIR2) of the atmospherically corrected L2SP level with a spatial resolution of 30 m have been layer stacked. These data were spatially enhanced by the panchromatic band of the L1TP level with a resolution of 15 m to increase the Landsat-9 sensor data resolution.

The BROVEY pan-sharpening algorithm (Eqs. 1 and 2) was used for the Landsat-9 bands (Gillespie et al. 1987; Johnson et al. 2012). This algorithm allows the fusing of data from different sensors. It is based on the chromatic transformation of spectral bands that relies on the normalization of the bands from the low-resolution multispectral (MS) image and are subsequently multiplied by the panchromatic (PAN) image to obtain a high spatial resolution multispectral (MSpan) image (Wang et al. 2005; Liu and Moore 1998; Zhang 1999):

$${\mathrm{MS}}_{\mathrm{pan}}=\frac{\mathrm{MS}*\mathrm{PAN}}{I}$$
(1)
$$I=\frac{(0.42*\mathrm{Blue}+0.98 \mathrm{Green}+0.6*\mathrm{Red})}{2}$$
(2)

where I is intensity which is a function of multispectral bands.

The BROVEY pan-sharpening algorithm can produce spectral distortions in the bands used in the classification. We have chosen this algorithm after testing several others based on previous works proving that it effectively improves spatial resolution. In the present work, no spectral distortions were observed in the bands after pan-sharpening.

Sentinel-2 is also a sun-synchronous satellite, placed in orbit since 2015 by the European Space Agency and developed under the Copernicus program. Sentinel-2 products are characterized by systematic observations on a global scale of multispectral images of high spatial and temporal resolution (Table 1). They are widely used in mapping and monitoring detail phenomena, such as detecting spatio-temporal variations in land cover, geochemical/physical variation, vegetation cover, and natural disaster management (Kollert et al. 2021; Ibrahim et al. 2018).

Two images were downloaded from the ESA website (https://scihub.copernicus.eu/) and captured on a very close date to Landsat-9. These data are distributed at the L1C level (top of atmosphere reflectance) with the same global geodetic system as the other data used. The Sentinel-2 images used in this study were acquired under clear sky conditions. The nearest neighbor resampling algorithm was used to unify the spatial resolution of band-2 to 8A, band-11, and band-12. This downscaling procedure improves classification accuracy (Atkinson 2013). The bands used for each image have been marked in bold in Table 1.

Presentation of Classifiers and Training/Testing Dataset

Two well-known machine learning algorithms were tested to evaluate their capability for land-use mapping based on Sentinel-2 and pan-sharpened Landsat-9 images. Random Forest (RF) (Breiman 2001) and Support Vector Machine (SVM) (Meyer and Wien 2015) algorithms have been used extensively to solve classification and regression problems. These models have been applied to several fields, such as discrimination of species composition types (Crabbe et al. 2020), digital soil mapping (Bouslihim et al. 2021b), and crop classification (Fu et al. 2021).

Classification model development requires data representing various land-use classes for training and evaluating classification results. In the present study, we have focused on six classes: agriculture, pasture, forest, water, bare soil, and urban. Sample collection was done by field surveys using GPS (Global Positioning System). In addition, a second validation was done based on Google Earth orthorectified images to verify the quality and position of the samples. Furthermore, samples were collected as points for spatially limited classes (water and forest) or surfaces (polygons) for large surface classes. Figure 2 represents the geographical location of samples used for model training and testing (validation). However, collected samples were not equally sized due to the limited presence of particular classes such as water and forest. To avoid this inequality, we homogenized training samples before model tuning using the minimal value class. Then, 260 samples for each land-use class (1560 samples in total) were used for the RF and SVM model tuning and development.

Fig. 2
figure 2

Distribution of training and testing samples

Classification Accuracy

Classification performances can be checked by a visual control to assess the consistency of the obtained classification with the distribution of the different classes. Also, a statistical verification can be performed based on a confusion matrix and other statistical indices such as precision, recall, and f1 score for model performances in each land-use class. The macro-averaged f1 score, the overall accuracy (OA), and the Kappa coefficient were also used to assess the general model performance. Therefore, 100 independent samples for each land-use class (600 in total) were used to evaluate the classification performance for both models.

Results

Spectral Reflectance Analysis

Spectral reflectance of different classes, including forest, agriculture, pasture, urban, bare soil, and water, was extracted to examine and compare Landsat 9 and Sentinel-2 sensors (Figs. 3 and 4).

Fig. 3
figure 3

Spectral reflectance distribution for training data of different land-use classes across pan-sharpened Landsat-9 (top) and Sentinel-2 (below)

Fig. 4
figure 4

Spectral reflectance curves for training data of different land-use classes across pan-sharpened Landsat-9 (top) and Sentinel-2 (below)

Figure 3 shows the spectral reflectance distribution of training pixels for the pan-sharpened Landsat-9 and Sentinel-2 bands. In all bands, several outliers for the urban area are detected for the first one, followed by bare soil (except for SWIR2) and a few outliers for water at NIR, SWIR1, and SWIR2. Overall, agriculture, forest, and water reflectance are significantly distinguishable from the other classes, especially at NIR, SWIR1, and SWIR2. For the Sentinel-2 image, we observe the presence of outliers for most classes in all available bands. As in the case of Landsat-9, the reflectance of agriculture, bare soil, and water is different from the other classes, especially in the NIR, B6, B7, B8A, and SWIR.B11 bands.

In Fig. 4a (pan-sharpened Landsat-9), the different classes show approximately the same spectral variation for the selected bands. A generally decreasing spectral response in the visible bands can be observed. In addition, a significant increase in reflectance was detected in the NIR range, with the highest value for agriculture records. For the SWIR range, the forest and agriculture show a decrease in reflectance, whereas the other classes (urban, pasture, and bare soil) keep a high spectral response. The water class is characterized by the lowest reflectance in the NIR and SWIR range of the Landsat-9 sensor. Figure 4b (Sentinel-2) shows a large spectral difference, with high spectral separability represented by non-identical reflectance curves from the RED-EDGE to SWIR bands. Agriculture and water always show high and low values in the RED-EDGE.B6 to B8A and NIR, respectively. The reflectance curves are different because of the spectral characteristics of both sensors. The number and spectral range of the bands used, which vary from visible to SWIR, are different. As shown in Table 1, the Sentinel sensor has ten bands, while Landsat has only six bands. The wavelengths also vary; for example, the blue band varies between 0.450 and 0.515 μm for Landsat-9 and between 0.457 and 0.522 μm for Sentinel 2, thus resulting in different reflectance values for each class. Regarding the similarity of the spectral signatures of water and vegetation, we have reported this in the manuscript. Some sampling points were located in areas affected by an algal bloom, causing a green coloration of water, which leads to a slight spectral confusion between vegetation and water.

The obtained results show that the Landsat-9 and Sentinel-2 sensors generally have the same spectral changes but with more details for Sentinel-2 characterized by its high resolution, which could be a crucial factor in the reliability and accuracy of classification results. It will be difficult for the Landsat-9 sensor to distinguish between classes because of their almost identical spectral signatures.

Classifiers Model’s Development

Optimal parameters for model development must be carefully chosen to achieve the best results for the machine learning algorithms. Therefore, two essential parameters must be tuned for the RF model: the mtry and ntree, whereas cost, gamma, and kernel function type are the parameters that usually influence the results of the SVM classifier. Model tuning was performed in the present study to facilitate the selection of the optimal parameters for each model. Furthermore, for RF classifier, a ntree of 500 with a mtry of 2 and 6 was found to be the optimal value for Landsat-9 and Sentinel-2, respectively. While the best SVM model was obtained from a radial model, gamma of 0.5, with a cost of 256 and 128 for Landsat-9 and Sentinel-2, respectively.

Land-Use Classification Results

The best models for RF and SVM were used to generate a distribution map of the different land-use classes. The two areas (zone 1 and zone 2) were combined for a more manageable representation of the results. Table 2 represents the surface and the percentage of the six classes obtained by pan-sharpened Landsat-9 and Sentinel-2 classification using RF and SVM classifiers. Generally, bare soil occupies over half of the total area; pasture takes the second position, followed by the urban area, agriculture, forest, and water, which occupies a small surface.

Table 2 Comparison of the occupied area of different classes

A few important observations were made by comparing classification results. The surface of the water body identified by pan-sharpened Landsat-9 + SVM was significantly larger (3.5 km2), about three times the water estimated by the other classifications. Similarly, the urban area for the pan-sharpened Landsat-9 + SVM combination is the largest with 30.8 km2, and the Sentinel-2 + RF captured a smaller area than the others (24.8 km2). Regarding pasture areas, the classifications obtained by RF (for both images) gave lower values (144.5–147.2 km2) compared to SVM (151.4–153.1 km2). The forest area captured by Sentinel-2 + SVM is larger than the three other classifications, with 7.8 km2. For agricultural areas, the Sentinel-2 + SVM captured the smallest area (22.7 km2), while the pan-sharpened Landsat-9 + RF captured an area of 27.6 km2. For the last class, the bare soil captured by RF is larger than SVM, with 247.4 and 249.7 km2 for pan-sharpened Landsat-9 and Sentinel-2, respectively. The SVM classifier captured smaller areas, with 237.5 km2 for pan-sharpened Landsat-9 and 243 km2 for Sentinel-2.

Figures 5 and 6 show the land-use classification obtained from the pan-sharpened Landsat-9 and Sentinel-2, respectively. The overall visual control shows a similarity between all classification outputs for both models. For zone 1, we can observe the excellent recognition of numerous roads, especially the highway that crosses the whole area and other secondary roads distributed across the entire zone. Also, the small water surface in the North of zone 1 (basins of the water treatment station of Settat town) was well identified in the four maps along with the surface occupied by the forest in the North-East.

Fig. 5
figure 5

Land-use maps obtained from the pan-sharpened Landsat-9 image

Fig. 6
figure 6

Land-use maps obtained from the Sentinel-2 image

Regarding zone 2, we can notice that the main river was clearly detected, with a few small water bodies distributed in zone 2 (irrigation water storage basins). However, some agricultural areas were misclassified as forests for Sentinel-2 + SVM.

The visual control made at a higher scale allows better analysis and comparison of classification results. Figure 7 presents the geographical location of the six selected classified areas. In Fig. 8A, we focused on water basins of the wastewater treatment station of Settat town. It can be noticed that the combination Sentinel-2 + RF was the only classifier capable of identifying the basin on the bottom right as a water area. In contrast, the different other combinations classified it as an urban area. Also, using both classifiers, a group of small basins located below the wastewater treatment station was only identified in the Sentinel-2 image. Besides, a small area of agriculture in the northeast corner was misclassified as forest (by the Sentinel-2 + SVM). The previous results are valid for the second case (Fig. 8B). The forest was identified in a similar pattern. However, misclassification of the road in the northwest corner was observed in the pan-sharpened Landsat-9 + SVM, where some pixels were classified as water. In the case of Fig. 8C, two observations can be made: (i) pixels classified as forest are very frequent in images classified by SVM, and (ii) the small lake located in the center of the image was well captured in the case of Sentinel-2 + RF.

Fig. 7
figure 7

Distribution of different selected areas

Fig. 8
figure 8figure 8

A series of frames showing the comparison of different classification results

Moreover, the two roads on the left were well classified in the case of the Sentinel-2. The classification results for the example shown in Fig. 8D are generally similar. The river path was well captured in the different combinations. The only apparent difference was that more pixels were classified as urban in Landsat-9 than in Sentinel-2. In Fig. 8E, water collection basins were well captured along with most agricultural areas. However, two important observations should be noted; agricultural lands were classified as forests by Landsat-9 + RF, Landsat-9 + SVM, and Sentinel-2 + SVM. And other agricultural or pasture lands were classified as urban. The only correct classification was observed in the Landsat-9 + RF combination.

Regarding Fig. 8F, a misclassification of some bare soil areas classified as forest areas was observed in the Sentinel-2 + SVM combination. Many different areas were classified as urban in the same classification. In addition, we can note that in the Landsat-9 + SVM case, another area was classified as water, even though it does not exist.

Accuracy Assessment of Classification

Model performance was evaluated for each land-use class based on the f1 score, recall, precision (Table 3), and the confusion matrix results (Tables 4, 5, 6, and 7). Comparing classification results (Tables 4 and 5) of the pan-sharpened Landsat-9 image showed that the SVM classifier (OA = 0.86, Kappa = 0.832 and macro-averaged f1 score = 0.86) performed slightly better than RF (OA = 0.855, Kappa = 0.826, and macro-averaged f1 score = 0.85). RF (Table 4) classified agriculture better than SVM (Table 5), with 100 well-classified samples for RF against 97 for SVM and an f1 score slightly higher for SVM (0.92 versus 0.89 for RF). Both algorithms classified bare soil with the same level of accuracy, with an f1 score between 0.92 and 0.93. SVM performed better than RF for the other classes, including forest (f1 score of 0.82 for SVM, versus 0.79 for RF), pasture (f1 score of 0.87 for SVM, versus 0.84 for RF), and water (f1 score of 0.82 for SVM, versus 0.80 for RF). Generally, we reported confusion between forest and agriculture/pasture and between urban and bare soil/pasture. In addition, a high misclassification was observed for water, with many samples classified as urban, forest, pasture, or agriculture.

Table 3 The resulting precision, recall, and f1 score values for different classification combinations
Table 4 Confusion matrix for the pan-sharpened Landsat-9 with the RF predictions (columns) and reference samples (rows)
Table 5 Confusion matrix for the pan-sharpened Landsat-9 with the SVM predictions (columns) and reference samples (rows)
Table 6 Confusion matrix for the Sentinel-2 with the RF predictions (columns) and reference samples (rows)
Table 7 Confusion matrix for the Sentinel-2 with the SVM predictions (columns) and reference samples (rows)

For the Sentinel-2 (Tables 3, 6, and 7), once again, the SVM (OA = 0.912, Kappa = 0.894, and macro-averaged f1 score = 0.90) performed better than RF (OA = 0.898, Kappa = 0.878, and macro-averaged f1 score = 0.91). Generally, SVM (Table 7) classified forest better than RF (Table 6), with 84 well-classified samples against 76 for RF and an f1 score of 0.90 for SVM against 0.85 for RF. Forest class was confused with agriculture (16 samples) and pasture (7 samples) for RF, whereas the confusion was more pronounced with pasture (14 samples misclassified) for SVM. It should also be noted that some pasture samples were misclassified as urban or bare soil, although the f1 score of the two classifiers is identical with values between 0.87 and 0.88. Similarly, it was observed that water was confused with urban or forest with an f1 score equal to 0.94 for both classifiers.

Based on previous results (OA, Kappa, macro-averaged f1 score) and model performance for each class (f1 score), it can be concluded that Sentinel-2 showed better results than the pan-sharpened Landsat-9 for both classifiers. The lowest result was obtained with pan-sharpened Landsat-9 + RF, while Sentinel-2 + SVM had the best performance.

Discussion

The current study is aimed at comparing the ability of the new product (Landsat-9) and Sentinel-2 to provide land-use maps using RF and SVM classifiers. The findings of this study will help choose the right combination (image/algorithm) for a better land-use classification. The Landsat-9 image was recently made available, which might prevent discussing the obtained results with other similar studies. However, since Landsat-9 and Landsat-8 have approximately the same characteristics, studies that have evaluated Landsat-8 data were considered for discussion purposes.

The visual and statistical comparisons of the classification results are different. Based on visual comparison, the RF model seemed to perform better than the SVM, given that several features were well identified. In contrast to the RF results, the general observation of SVM classification shows confusion between several land-use classes, like the confusion between agriculture/forest and pasture/bare soil or urban. The combination of the RF model and the Sentinel-2 gives the best result, and misclassification of several elements observed in the other combinations was not observed in this case. The Sentinel-2 + RF could identify the water with high efficiency, considering the other combinations classified the water basin as urban. This could be attributed to the fact that the basin (1.5 m deep) was not completely filled with water, thus, it being confused with the urban class. It should also be noted that all combinations could classify the water well despite the central basins’ green color, which is generally caused by a symbiosis between aerobic bacteria and algae. These results align with Toosi et al. (2019), who used different Landsat time-series and machine learning methods (e.g., least squares SVM radial, SVM linear, and regularized discriminant analysis) to detect mangrove cover in southern Iran. They conclude that the RF model performed better based on visual comparisons. In another study, Adugna et al. (2022) compared the SVM and RF classifiers for extensive area land cover mapping of Africa. They found that the RF model performed 3% higher than the SVM model.

Statistical validation of the classification results based on testing samples showed that both images successfully classified agriculture and bare soil with high performance. Water was the exception, and both classifiers found problems classifying water for the pan-sharpened Landsat-9 image. In contrast, the water classification results of the Sentinel-2 image are significantly better than that of Landsat-9, which could be explained by the spatial resolution difference between the two images. The statistical evaluation showed that the SVM model performed slightly better than the RF. Nevertheless, this finding is consistent with several other studies. Rahman et al. (2020) used RF and SVM algorithms and their combination to classify LULC in two regions, Bhola (rural) and Dhaka (urban), based on Landsat-8, Sentinel-2, and planet satellite images. Their results revealed that Sentinel-2 performed better compared to both other images. They also found that SVM is significantly better than RF and ensemble model techniques. In addition, Thanh Noi and Kappas (2017) examined the performances of three algorithms (RF, KNN, and SVM) for classifying Sentinel 2 data in the North of the Red River Delta, Vietnam. Results showed that SVM has the highest overall accuracy and produces the best LULC maps, followed by the RF and KNN algorithms. In another study, Rana and Suryanarayana (2020) tested the combination of principal component analysis (PCA) and three algorithms (MLC, RF, and SVM) to produce LULCs from a Sentinel-2 image. They noticed that combining PCA with the SVM model can give the best result than the other classifiers. In addition, Sonobe et al. (2018) used the Sentinel-2 to extract 82 different remote sensing indices for crop-type classification. The results showed that the SVM classifier performed well compared to RF (89.3 to 92.0% of OA), and a combination of SVM and RF led to a significant improvement with an OA of 90.2 to 92.2%.

Furthermore, several studies reported that sample number and location might explain the difference between the virtual and statistical comparison results. For example, Thanh Noi and Kappas (2017) showed the impact of sample size and design on the performance of models to produce LULC maps. They found that when the training sample size was sufficiently large, i.e., greater than 750 pixels/class or representing an area of approximately 0.25% of the overall search area, all classifiers used (RF, KNN, and SVM) revealed a similar high AO (greater than 93.85%). In addition, according to Belgiu et al. (2014), some variations may be related to the choice of classification features and the assignment of feature clusters to discrete hierarchical levels. The land cover type of a high-resolution pixel is more accurately identified than a low-resolution pixel since the latter generalizes the heterogeneous surroundings (Zhao et al. 2014).

Overall, the difference between the two classifiers (RF and SVM) is negligible. Despite the Landsat-9 resolution improvement using the pan-sharpening technique, the advantage of the Sentinel-2 image was still apparent. Also, the presence of many outliers in the Sentinel-2 data used for classification did not affect the algorithm’s performance. In general, several previous studies have reported the superiority of Sentinel-2 images over Landsat-8 images. Along the same line, Nyamekye et al. (2021) examined the performance of RGB bands from 3 sources (unmanned aerial vehicle, Landsat-8 and Sentinel-2) for artisanal and small-scale mining mapping in Ghana. The results showed that Landsat-8 and UAV showed poor performance, while Sentinel-2 gave the best results (OA = 0.949 and Kappa = 0.94). In another study in China, Wang et al. (2021) tried to identify cotton areas using two multispectral images (Landsat-8 and Sentinel-2) and a hyperspectral image (Orbita). This comparison showed that Orbita and Sentinel-2 performed better than Landsat-8. Also, both images (Landsat-8 and Sentinel-2) were used by Htitiou et al. (2019) for crop classification in irrigated areas in Morocco. Results showed a higher performance of Sentinel-2 (OA of 0.93 and Kappa of 0.91) than Landsat-8 (OA and Kappa of 0.9 and 0.88, respectively).

Finally, the comparative analysis of this work and other similar studies had shown the superiority of Sentinel-2, despite attempts to improve the accuracy of Landsat-9 images. This difference can be attributed mainly to the higher spatial and spectral resolutions. Also, the three red-edge bands of Sentinel-2 have been shown in several studies to be very useful for improving the accuracy of vegetation maps and for also classification of a wide range of crops (Forkuor et al. 2018; Immitzer et al. 2016; Ramoelo et al. 2015).

Conclusion and Future Work

The present study evaluates two machine learning algorithms (RF and SVM) for land-use mapping, using two of the most popular and commonly used satellite image sources, the recently launched Landsat-9 and Sentinel-2. The potential to improve the performance of Landsat-9 has also been tested by pan-sharpening the different bands for high-resolution data (15 m). For optimal performance of the two classifiers, a model tuning approach was applied by trying different combinations of the critical parameters for each model. The results showed that the SVM performed slightly better than RF. Furthermore, the Sentinel-2 demonstrated significant capabilities for multiple-variety classification compared to Landsat-9, despite the improved resolution of the latter.

Overall, this study represents a first attempt to evaluate the capabilities of Landsat-9 to generate accurate land-use maps. This study opens the path for future works, such as crop-type classification using phenological information derived from multispectral or hyperspectral images; testing other machine learning or deep learning classifiers to optimize performance and processing time. In addition, other methods of pan-sharpening algorithms or alternative data sources can be tested to improve the Landsat-9 resolution.