Introduction

Draining basins are territorial units, which correspond to extensions of a river and its tributaries, whose delimitation is a geological structure called watershed (Grannel-Pérez 2004). Large variations occur in land use and land cover (LULC) of watershed (Song et al. 2018), since several activities are developed around water resources, which impact geographic and physical environment and local societies (Salazar et al. 2015), such as urban expansion processes (Shao et al. 2020), supply, energy generation and food production. The LULC dynamic is associated with the rearrangement activities and enterprises depending on the coverage type (FAO 2000). On the report of Brazilian Institute if Geography and Statistics (IBGE 2013), the LULC classification indicates a typology of use according to the geographic distribution, identified trough homogeneous patterns of land cover.

Significant changes in surface cover derived from anthropogenic activities (Dibaba et al. 2020) can maximize negative effects on environmental resources, such as water quality and availability, being harmful to biodiversity and ecosystems (Guzha et al. 2018). Thus, the implementation of management policies and adaptation of priority uses of water (Tucci 1997) in conjunction with analyzes of LULC space–time dynamics are relevant for the management of water and land resources to achieve sustainable regional development (Feng et al. 2017). Activities whose have large-scale impact on the environment, in terms of surface coverage, are the constructions of dams for electricity generate (Lees et al. 2016), since river flows and geomorphology of flooded areas are changed (Wu et al. 2018), including changes in landscape structure (Gage and Cooper et al. 2015), loss of ecosystems and biodiversity (Palmeirim et al. 2014) and expropriation of local people (Jafari and Hasheminasab 2017).

In Brazil, on the Tietê River lower course, the Três Irmãos dam was built to generate electricity, which has been in operation since 1993 (Eletrobrás Furnas 2022). Despite hydroelectric power generation being a renewable energy source, the construction dam caused environmental and social damage (Campos et al. 2018). The implementation of Três Irmãos reservoir in the Pereira Barreto caused population displacement and flooded part of the city, decharacterizing the local landscape. Besides many natural vegetation hectares and areas destined for agriculture were flooded, interfering with several ecosystems and the economy of the region (Campos et al. 2018). Recent works were not found which analyzed the LULC around the Três Irmãos watershed reservoir. However, acquiring information related to LULC, around flooded areas or reservoirs, due the dam build, are important to investigate the main influences of the built reservoir and to take decisions in environmental management policies (Yulianto 2016).

Remote sensing is an important approach to study the space–time dynamics of the LULC (Mamnun and Hossen 2020). The mapping and classification of LULC can be performed through photointerpretation of satellites images, whose objective is to identify and distinguish surface objects (Crósta 2002). Another widely used method is digital classification, where a target analysis is performed by objective interpretation, distinct from the subjectivity of a visual analysis. Furthermore, digital classification is able to collect data, seeking and extracting information quickly with satisfactory results (Venturieri and Santos 1998).

Recent works have been conduced to detect changes and map LULC using spatial and temporal resolution sensors such as satellites from Landsat (Hishe et al. 2021) and Sentinel (Silva 2020) constellations. The objective is to classify forests (Younis et al. 2022; Hossain et al. 2022), different soil types (Ma et al. 2023), urban areas (Ackom et al. 2020), agricultural surfaces (Sicre et al. 2020) and floodplains (Yulianto et al. 2016). In addition, different classification algorithms such as Maximun Likelihood (Araújo and Mamede 2021), Random Forest and Support Vector Machine (Ren et al. 2021) are tested. LULC studies aim to test whether sensors and classifications algorithms are suitable for detecting and classifying the earth’s surface (Jia et al. 2014; Schulz 2021), as well as mapping space–time changes (Hussain et al. 2022; Hossain et al. 2023) to support policy decision-making in environmental management (Sarif and Gupta 2022), based on different classification systems (Gong et al. 2013).

The aim of this study was to investigate the evolution in LULC in the watershed of Três Irmãos hydroelectric plant, in order to find relationships between the LULC changes and the formation of reservoir. The Três Irmãos dam was the last to be built along the cascade of Tietê River reservoirs, in middle of 1990. Therefore, it was possible to find satellites images before and after the flood. A change detection assessment was applied in images based on the post-classification technique (Zhu 2017) to analyze the influence on LULC.

Materials and methods

Study area

The Três Irmãos reservoir watershed is located in Tietê River low course, in São Paulo State, Brazil (Fig. 1) and were developed for power generation. Worth mentioning the flooding reservoir processing has begun in August 1990, reaching its maximum volume in November 1990 (Eletrobrás Furnas 2022). The flooded area reaches 757 km2 (Tijoá Energia 2020), average depth of 17.2 m (Barbosa et al. 1999) and useful volume of 3.45 × 103 km3 (CESP 2013). Três Irmãos reservoir is interconnect to the Ilha Solteira reservoir, located in the Paraná River, through Pereira Barreto channel, which is 9 km long and 61 m deep for navigation purpose between the Tietê and Paraná Rivers (Tijoá Energia 2020). According Campos et al. (2018), Três Irmãos reservoir brought economics benefits including an increase power generation capacity (807.5 MW) and energy security for the country, development to the region, population growth, expansion of urban industry, opening road, work, ecotourism and other activities related to the multiple uses of water. In accordance with the report by the Integrated Water Resources Management System of São Paulo State, the main activities developed in watershed are sugarcane crop and livestock and industries dependents on agriculture and animals’ production.

Fig. 1
figure 1

Study area: Três Irmãos hydroelectric reservoir, located in the Tietê River, São Paulo State, Brazil

Acquisition and pre-processing of satellite images

Multispectral images were acquired from Landsat satellites between 1990 and 2018, comprising periods before and after the flooding of the Três Irmãos reservoir. The images are open source from United States Geological Survey (USGS) on the Earth Explorer webpage (http://earthexplorer.usgs.gov/) and were obtained from the sensors Thematic Mapper (TM–Landsat 5) and Operational Land Imager (OLI–Landsat 8), which have equivalent spatial and temporal resolutions for the visible bands of the electromagnetic spectrum. For both sensors, acquired products were Level-2, available in surface reflectance, whose atmospheric correction relied on Landsat 8 Surface Reflectance (L8SR) code, processed with Moderate Resolution Imaging Spectroradiometer–Climate Modeling Grid Aerosol (MODIS CMA) data (Vermote et al. 2016). The data were captured on 27 February 1990, 18 September 2000, 29 August 2010 and 18 July 2018.

To prevent the cloud effect in the image classification process, the satellite data were obtained in the seasonal period winter (July–September), except for the 1990 image that needed to match the period before the flood (February). In order to classify just the impact area, the images were crop using de boundaries of Três Irmãos reservoir watershed, delimited from digital elevation model (DEM), Suttle Radar Topography Mission (SRTM). The procedure of delimitation relies on the topography obtained from DEM/SRTM 1 Arc-Second Global data, also acquired on the Earth Explorer webpage.

LULC classification and change detection

In this study, the maximum likelihood classification algorithm was used to generate discriminated functions for each LULC class (Table 1), based on statistical parameters to determine the probability of a pixel belonging to the class of interest (Richards 2013). For the maximum likelihood classifier, training samples were created considering the spectral signature to specify the land cover class, this classification technique was consolidated and widely applied to map LULC (Gong et al. 2013; Jia et al. 2014). The LULC classification and post-classification processes were performed in freely available QGIS software, version 2.18. By the maximum likelihood algorithm, the supervised classification was carried out using the Semi-Automatic Classification Plugin (SCP) (Congedo et al. 2016) and the post-classification method (Zhu 2017) was performed using Land Cover Change tool, which makes comparative analyzes following classification pairs (time frames: 1990 and 2000; 2000 and 2010; 2010 and 2018; 1990 and 2018). Those analyzes quantify pixels changes for each class within the four time frames and they called pixel-by-pixel cross-tabulation (Jensen et al. 1987).

Table 1 Land cover class name and description adapted of FAO (2000)

Five thematic classes were adopted according to the level-3 classes recommended by Land Cover Classification System (LCCS) of the Food and Agriculture Organization of United Nations (FAO), for land cover classification, listed hereafter as Artificial and Natural Bodies Water (ANW), Natural and Seminatural Vegetation (NSNV), Pasture (PA), Cultivated and Managed Terrestrial Areas (CMTA) and Bare Soil Areas (BSA). To avoid an inconsistency classification, it was not considered a class for urban areas, since Landsat data have medium spatial resolution and cities in the impact area had small extensions.

How historical images were used, there is no possibility of carrying out field surveys to collect validation samples. Therefore, validation samples were explored from a high spatial resolution image projected in the Google Earth Pro tool, following the methodology mentioned by Olofsson et al. (2014). This methodology was applied only to the 2010 and 2018 data, since 1990 and 2000 Google Earth images presented spatial resolution equivalent to the images used in this work. The 1990 and 2000 classifications assessment were based on the original images. Thirty validation samples were selected from each class, which determined the “field truth”, to provide the confusion matrices and concordance coefficients.

Classification accuracy

The classification was assessed using confusion matrices for each year investigated, two global metrics (i.e., overall accuracy and kappa coefficient) and two class-based metrics (i.e., user and producer accuracies). The overall accuracy was obtained from a set of test samples and was calculated by the ratio of correctly classified pixels and the total amount of pixels, being optimal when greater than 85% (Jensen 1986). On the other hand, kappa coefficient is a statistic measure of classification concordance, whose values range from zero (minimum concordance) to one (maximum concordance) (Congalton 1991). According to Foody (2020), kappa coefficient does not evaluate precision and its magnitude is difficult to interpret, since it is an index of agreement. Thus, to complement the approaches for calculating classification accuracy levels, user accuracy is related to commission errors, i.e., assigns pixels to classes to which they do not belong, while producer accuracy is related to omission errors, i.e., represents the probability of a pixel being classified according to its attributes (Congalton 1991).

Results and discussion

LULC changes detection in the Tietê River watershed (1990 – 2018)

The LULC classifications for the years 1990, 2000, 2010 and 2018 of the study area are shown in the maps in Fig. 2, whose area statistics (in hectares and percentage) for respective years are summarized in Table 2. Comparing the maps in Fig. 2 through a first visual analysis, the CMTA land cover class remained predominant category for all the assessment periods. According to Trevisan et al. (2021), after the modernization of agricultural, livestock and industrial activities in 1970, it has been occurred an intense agricultural expansion in the Tietê river watershed, driven by the production of ethanol from sugarcane and others such as orange juice, vegetables oils, meat and milk.

Fig. 2
figure 2

LULC classification maps for a 1990, b 2000, c 2010 and d 2018

Table 2 Area of each LULC (hectare e percentage) for 1990, 2000, 2010 and 2018

In 1990, the ANW class accounted only 1% (5,172 ha) of the basin area, while the NSNV class covered 16% (139,463 ha). By that year, the others LULC categories followed the percentages: BSA 5% (42,308 ha), PA 21% (175,334) and CMTA 57% (488,670). This period corresponds to the moment before dam built and the flooded place for the generation of electric energy. By 2000, the flood caused due the dam construction expanded the ANW category for 58,758 ha, providing a 6% increase in the representativeness of the class in the study area. Meanwhile, the NSNV class showed a reduction of 93,456 ha, becoming the least representative category of the study area, with 5%. The flood covered an extensive region of Permanent Preservation Areas, with emphasis on riparian forests that were around the river. The PA class also suffered a significant reduction, about 12% compared to 1990. Despite CMTA class, in 2000, also has been partially flooded by the reservoir, a significant increase of 14% was observed, occupying 600,977 ha. The increase in water supply due to floods favors the water recirculation system from the rainy season to the dry season, providing advantages for large-scale agricultural production (Yoshida et al. 2020).

Although CMTA has exhibited a reduction in 2010, being the smallest area recorded for the class comparing the four dates, it remained the predominant watershed area. PA areas were increased, as a reflection of the reduction of CMTA areas. On the other hand, NSNV class showed an increase of 34,699 ha, since measures were implemented to recover riparian forests and policies to create legal reserves, such as Provisional Measure No. 2166–67, of 2001, in Brazil. In turn, BSA class kept practically unchanged, with an area of 66,879 ha. Based on analysis of 28-year Landsat data from the watershed impact area, the CMTA class remained dominant throughout the assessed period and its variation was inversely proportional to the PA class, since with the increase in the CMTA class, the PA class reduced. Ronquim and Fonseca (2018) suggesting the expansion of sugarcane cultivation areas has been replacing, preferably, pasture areas. Moreover, the NSNV showed an increase 49,298 ha since 2000, reaching 11% of the impact area.

The land cover change in percentage for each class is graphically presented in Fig. 3. The graphs have bars of different colors, the blue bars represent the unchanged areas, while the red bars represent areas that have been replaced (lost) to other classes, in turn the green bars represent the areas that increased (gained). The change in land cover over the 28 years was dynamic and did not follow a set pattern. The NSNV and PA classes experienced negative changes, meanwhile CMTA remained a considerable percentage throughout the period. The fact of the flood have been occurred, due the hydroelectric plant built, conditions were crated to adapt the impact area for implantation of agricultura crops, since the reservoir have been suplied several activities.

Fig. 3
figure 3

Graphical visualization of LULC change detection in percentage between the years a 1990 and 2000, b 2000 and 2010, c 2010 and 2018, d 1990 and 2018

Of these changes, the most impactful changes variations were observed in the first frame (1990–2000), being 91.2% gained of water, 69.8% lost of vegetation, 66.2% lost of pasture. Considering all period (1990–2018) BSA have been gained 65.8%. By the time, rates of NSNV were recovered into 2018, however, not reaching the amount of vegetation at the beginning of the time series. The period from 2010 and 2018 was tagged by the Brazilian Forest Code (Federal Law 4,771/1965, amended by Federal Law 12,651/2012) (Brazil 2012), which provides general rules for the protection of vegetation, including permanent preservation areas and legal reserves. Thereby, until 2018, the vegetation areas were environmentally compensated around the current watercourse, as determined by Forest Code.

Summarizing the changes in land cover from 28 years, classified images (1990 – 2018) were compared, resulting in a map (Fig. 4) to be observed the changes detection over the analyzed period. In the legend of the map, the class changes were clarified, presented by colors that show the categories changed from 1990 to 2018. From the changes, it was noted all classes were maintained in certain regions and other class changes did not occur (ANW to NSNV, ANW to PA, ANW to CMTA, ANW to BSA, BSA to ANW, BSA to NSNV). Those pairs of classes that did not show changes did not prevent the dynamicity of the impact area. Considering that part of the urban area of Pereira Barreto (class BSA) was flooded, no change detection was observed in the comparison map, since the amount of changed area was not significant for the spatial resolution of the 30 m image.

Fig. 4
figure 4

LULC change detection over 28 years

The cross-tabulation of pixel-by-pixel approach, detailed in pixel percentage, assists in discerning LULC analyzes (Chowdhury et al. 2020). Cross-tabulation were applied for four different frames (1990–2000, 2000–2010, 2010–2018 and all period 1990–2018), presented in Tables 3, 4, 5, 6. During these periods the PA and BSA classes were mostly changed to the CMTA class, likewise the NSNV class in the first frame. Camara and Caldarelli (2016) explained that the expansion of sugarcane cultivation, due to the high demand for ethanol production, from 1990 onwards in the São Paulo State, mainly replaced other crops and pasture areas. Nevertheless, total recoverable sugar indicator (ATR) data, showed a growth rate of the product value was almost null from 1990/2000 and 2013/2014, with a big drop in sugar price in 2006/2007 and 2009/2010, which explains the replacement of agricultural crops by pasture, between 2000 and 2010, with 15.05%. Tables 4, 5, 6 presents that the NSNV class have been maintained a gradual increase in its areas (3.91, 4.31 and 7.17%, respectively), which was due to the application of vegetation maintenance policies.

Table 3 Cross-tabulation of LULC changes in pixel percentage (1990–2000)
Table 4 Cross-tabulation of LULC changes in pixel percentage (2000–2010)
Table 5 Cross-tabulation of LULC changes in pixel percentage (2010–2018)
Table 6 Cross-tabulation of LULC changes in pixel percentage (1990–2018)

Image classification accuracy

Evaluation accuracy is an important step to validate the maps resulting from the classification and corroborate the efficiency of the methodology applied in the study area (Congedo 2016). The confusion matrices are shown in Tables 7, 8, 9, 10, as regard to the overall accuracy and the kappa coefficient are presented in Table 11, while the user accuracy and the producer accuracy are expressed in Table 12. The confusions regarding the classifications were not deep. In 1990, considering validation dataset, 2.64% of the CMTA pixels were classified as NSNV, 0.44% as PA, 0.41% as BSA and 0.65% of the NSNV pixels were classified as CMTA. Showed a satisfactory result, only 1.56% of the PA pixels were classified as CMTA and 0.13% of the CMTA pixels were classified as PA. As in previous reviews, the year 2010 presented low pixels confusions, 1.52% of pixels that were CMTA areas were classified as PA areas, while 0.03% of the CMTA pixels were classified as NSNV areas. The image classification of 2018 exhibited the poorly results compared to the classifications from other years, 2.28% of the ANW pixels were classified as CMTA. Despite all the analyses, it is not possible to confirm the true surface target, since there were no field visits.

Table 7 Confusion matrix in pixel percentage of the year 1990
Table 8 Confusion matrix in pixel percentage of the year 2000
Table 9 Confusion matrix in pixel percentage of the year 2010
Table 10 Confusion matrix in pixel percentage of the year 2018
Table 11 Overall accuracy and kappa coefficient obtained for image classifications of different years
Table 12 User accuracy and producer accuracy obtained after classification

The methodology performed excellently, with overall accuracy greater than 94% and a kappa coefficient greater than 0.92, being the 2000 classification presenting the highest values, 99.82% and 0.994, respectively. About Table 12, despite most values of user accuracy were satisfactory, over 86%, PA class had an accuracy of 68.05% and 60.67% for 2010 and 2018, repectively. The confusion between PA and CMTA classes was responsible for lowest user accuracy, since both classes may present similar spectral behaviors. Nonertheless, producer accuracy presented satisfactory values within the range 80–100%. The high accuracies found in this study are due to the adopted methodology. Several studies prove high accuracy with the use of the maximum likelihood for LULC classification (Araújo and Mamede 2021; Polat and Kaia 2021).

Conclusion

This paper assessed the LULC changes in the watershed of the reservoir flooded by the Três Irmãos dam built, on the lower course of the Tietê River, for generate electricity, during 28 years (1990–2018), using multispectral images. From the findings, the area of interest was predominantly rural, where sugarcane production has been intensified for the ethanol generation. In the first decade of the assessment (1990–2010) 69.8% of vegetation areas were lost, as much of the vegetation in the impact area was flooded due to the construction of the reservoir. Despite this, from 2010 to 2018, the NSNV class was partially recovered through the application of public policies and federal laws for the maintenance and conservation of vegetation. In addition to the areas of natural vegetation, a great dynamic was noted between agricultural areas and the pasture class, which showed gradual reductions and increases by years. Understanding the dynamic of the LULC over the years becomes an allowance for managers to elaborate and implement policies of environmental planning, land use planning, natural resource management and ecosystem services. We recommend improved monitoring for forested areas and policies for restoring natural vegetation. Furthermore, care is needed so that areas of natural vegetation are not replaced by agricultural areas, due to their importance.