Introduction

Soil degradation is one of the most important degradation processes all over the world; annually tons of soils are loss by water erosion, land use change, and inadequate management practices, causing an irreversible damage in agricultural and mountain areas (Prieto-Amparán et al. 2019). A good soil and water management is essential to prevent soil degradation, and several countries are developing river basin management plans to help solve the problem by using watershed prioritization’s methodologies (Javed et al. 2009). For a good soil and water management in a given area, it is crucial to characterize a watershed or basin and its drainage system because it helps to estimate how much water runs downstream, how much soil gets lost, and what measures can be applied (Eze and Efiong 2010), information useful for sustained soil and water development, conservation, and restoration.

In this paper, we define a watershed as an area that drains the entire rainfall into a particular stream outlet (Chopra et al. 2005; Pal et al. 2012). In this area, different hydrological, geological, and structural processes interact with each other, and those processes are relevant for water erosion, soil evolution and properties, and biodiversity management. Those processes have a close relationship with the morphometry of the watershed (Hajam et al. 2013) and are directly associated with surface runoff and its hydrological response (Price 2011; Méndez 2016).

Characterizing and describing a watershed and its drainage system through a morphometric analysis and quantitative description is relevant because these parameters show the evolution of the drainage system and the relationship between processes associated with soil degradation (Strahler 1964). The morphometric parameters and land cover are a prerequisite for watershed prioritization when trying to implement soil and water conservation measurements or for developing watershed management plans.

The morphometric parameters are the most important characteristics used to analyze a watershed because they can provide quantitative information (Ramu and Mahalingam 2012). Several studies (Horton 1945; Miller 1953; Nag 1998; Migiros et al. 2011) have shown that the morphometry of a watershed is affected by the interrelation of the architecture of the drainage network and factors such as climate, geology, relief, and structural forms of a basin. The use of basin’s morphometry as a methodology is not new, several authors (Horton 1932, 1945; Smith 1950; Miller 1953; Strahler 1964) have pioneered research on basin’s morphometry, but other methods that are more precise continue to be explored. Most recently, similar works to morphometric analysis have been carried out using geographic information systems (GIS) and remote sensing (RS) to assess morphological features and analyze basins’ properties in a more reliable and precise way (Nag and Chakraborty 2003; Rudraiah et al. 2008; Magesh and Chandrasekar 2012; Sharma et al. 2015; Resmi et al. 2019). In addition, statistical analysis and multicriteria analysis have been included to analyze some of the morphometric parameters by using different methodologies like principal component analysis (PCA) (Khanchoul and Saaidia, 2017; Meshram and Sharma, 2018), fuzzy analytical hierarchy process (Ahmed et al. 2018), and analytic network process (Chowdary et al. 2013; Gopinath et al. 2016), offering useful results for decision-makers. Another important factor to characterize a basin is its land use degradation because of the crucial role of vegetation cover on the response of a watershed to different rainfall events. The spatial distribution of vegetation cover—based on NDVI variation—can be used as indicator for basin degradation degree, and it helps to determine critical areas for soil conservation and vegetation cover restoration.

In understanding the performance of a watershed and the prioritization of the morphometric parameters and the NDVI, the analysis and process can be improved by using other techniques like principal component analysis (PCA) (Ahmed et al. 2018), because they reduce the number of variables to be used, i.e., from using all the variables to only use the most significant ones to the degradation processes. The easy access to data sources to get morphometric parameters and Normalized Difference Vegetation Index (NDVI) makes them good methods for watershed prioritization coupled with PCA reducing time while also increasing quality in the results.

Particularly in Mexico, researchers have come up with alternative methodologies for watershed prioritization. However, most of these methodologies have been developed for arid and semi-arid conditions. For example, in the northern part of Mexico, some researches have been using methodologies like overlay maps (Martínez-Ramírez et al. 2017), weighted sum (Olguín-López and Pineda-López 2010), and multicriteria analysis (Zarco-Arista et al. 2010), but conditions in those areas are mostly arid (the rainiest month could be 60 mm), and therefore, variables are different and do not apply to cases with high precipitations and extreme events.

In this paper, specifically, we present an alternative methodology for watershed prioritization using morphometric analysis and NDVI coupled with PCA and GIS in Mexico mainly in tropical and sub-tropical lands by using variables with high impact (selected by PCA) on soil degradation. With our work, we present an alternative methodology for watershed prioritization for areas with high precipitation and extreme rainy events aiming to fill this methodological gap.

We use the case of the Huehuetan river sub-basin to develop our methodological proposal and test it. The Huehuetan river sub-basin is located in the southeast part of Mexico, in the state of Chiapas, where high precipitation occurs even extreme events like hurricanes (the rainiest month could be 550 mm) (Baumann and Arellano 2003). These particular conditions of high precipitation and extreme events require specific methodologies to prioritize watersheds based on statistical methods like PCA where variables can be discretized and considering only those that have greater influence on degradation processes.

Materials and methods

Study area

The Huehuetan river basin is located at the southeast region of the Mexican state of Chiapas, near to the border with Guatemala (Fig 1), covering up to 752.42 km2 and emptying into the Pacific Ocean. In this study, we analyzed the upper part of the basin where a rain gauge station called Huehuetan is located (−92° 24′ 02.43″ W, 15° 00′ 05.38″ N), covering an area of 319.27 km2. We divided the Huehuetan river sub-basin into eight watersheds assessed through the SWAT model (Soil and Water Assessment Tool) using a digital elevation model (DEM) with a resolution of 15 m per pixel. The watersheds were classified using the Pfafstetter methodology (Verdin and Verdin 1999), and the drainage network was categorized by Strahler’s order methodology (Strahler 1957).

Fig. 1
figure 1

Location of the Huehuetan river sub-basin and delineated watersheds

The Huehuetan river sub-basin is part of the Sierra Madre de Chiapas and has a steep relief in its upper part. The 83.15% of the basin had slopes greater than 10%, and elevations vary between 30 and 2690 m above mean sea level (Fig 2); this abrupt topography and intense rainfall episodes favor extreme hydro-meteorological events with large volumes of runoff, soil erosion on hillslopes, sediment transport on channels, and mass movements due to soil saturation. The annual precipitation varies from 1200 mm in the lower part to more than 3500 mm in the middle and upper areas of the sub-basin from May to October mainly and being September the rainiest month with 550 mm on average. Lithosols soils are the largest soil unit in the area, followed by Andisols, Cambisols, and Luvisols. When frequent rainfall occurs, these soils generate large volumes of surface runoff, affect the water infiltration capacity, and cause soil pore saturation and landslides on steeper slopes.

Fig. 2
figure 2

Spatial distribution of the slope, drainage network, and elevations in the Huehuetan river sub-basin

Data source

In this study, we used a digital elevation model with a resolution of 15 m per pixel provided by the continuous Map of Mexican Elevations (CEM 3.0), downloaded in the official website of the National Institute of Geography and Statistics (https://www.inegi.org.mx/app/geo2/elevacionesmex/). The NDVI was calculated by using a satellite image of the LANDSAT 8 Satellite, downloaded at the official website of the US Geological Survey (https://earthexplorer.usgs.gov/) and considering bands 4 (Red) and 5 (Infrared) in the following equation (1):

$$ \mathrm{NDVI}=\frac{B5-B4}{B5+B4} $$
(1)

Determination of morphometric parameters using geographical information systems

We determined the morphometric parameters for each watershed using the formulas described in Table 1. Some parameters were generated with GIS tools. The hypsometric integral (HI) and the hypsometric curve (HC) were calculated with the CalHypso tool (Pérez-Peña et al. 2009) using equations adjustment (2 and 3) proposed by Strahler (1952) and Harlin (1978). The hypsometric curve was classified according to Strahler’s methodology (Strahler 1952).

$$ f\left(\mathrm{x}\right)=\mathrm{a}+\mathrm{bx}+\mathrm{c}{\mathrm{x}}^2+\mathrm{d}{\mathrm{x}}^3 $$
(2)
$$ {\left(\frac{h}{H}\right)}_{\mathrm{i}}={a}_i+{b}_i\left(\frac{a}{A}\right)+{c}_i{\left(\frac{a}{A}\right)}^2+{d}_i{\left(\frac{a}{A}\right)}^3 $$
(3)
Table 1 Formulas used to determine morphometric parameters

NDVI values for the Huehuetan watersheds

The NDVI values were calculated from the LANDSAT 8 OLI satellite image retrieved on December 25, 2018. The resulting values were grouped according to the classification shown in Table 2 and which was done by Gómez-Almonte (2005) and Merg et al. (2011).

Table 2 Classification of NDVI values

Principal component analysis

Multivariate analysis is a technique that identifies common patterns in the distribution of data and reduces the size of the initial matrix to facilitate its interpretation. The PCA was used to know the contribution of morphometric parameters to the soil degradation process and which of those parameters have greatest influence in soil degradation. The correlations generated by PCA allow to look closely at the information contained in multiple variables, and they help to reduce variables redundancy (Sharma et al. 2015; Meshram and Sharma 2018). Since the PCA technique depends on the total variance from the original variables, it is most suitable when all variables are using the same measurement units. Hence, it is necessary to express the variables in standard form, i.e., selecting the unit of measurement for each variable so the sample variance is 100%. Singh et al. (2009) summarize the PCA analysis in three steps:

  1. 1.

    Calculate the correlation matrix, R.

  2. 2.

    Calculate the principal component loading matrix by PCA.

  3. 3.

    In the principal component (PC) loading matrix, Eigenvalues greater than one indicate significant PC loading.

While using a correlation matrix allows to numerically affirm the degree of association between pairs of variables, it does not allow to link these variables directly to a particular parameter. On the contrary, a PC loading matrix determines a matrix of loadings and how they associated with a particular parameter (Farhan et al. 2017). To obtain the statistical values, principal components, and Eigenvalues, the RStudio software and the FactoMineR package were used (Lê et al. 2008).

Watershed prioritization

The values of the morphometric parameters for each watershed were determined and classified considering the following assertions (Javed et al. 2009; Londhe et al. 2010; Kanth and Hassan 2012; Tolessa and Rao 2013). The linear parameters that have direct relationship with watershed erodibility, drainage system amplitude, and dissection are watershed area (A), drainage density (D), stream frequency (Fs), drainage texture (Dt), form factor (Ff), mean bifurcation ratio (Rbm), ruggedness number (Rn), relief ratio (Rh), length of overland flow (Lg), constant of channel maintenance (C), basin relief (Bh), hypsometric integral (HI), and mean watershed slope (Pmc). The highest value of linear parameters was assigned to rank one; the second highest value was assigned to rank two and so on. The least value of the linear parameters was given to the last one in the rank. Shape parameters such as elongation ratio (Re), circularity ratio (Rc), compactness coefficient (Cc), and shape index (Sw) have an inverse relationship with erodibility (Biswas et al. 1999; Ratnam et al. 2005; Javed et al. 2009), i.e., the lower the value of the variable is, the greater its influence on degradation degree on watersheds is. Hence, watersheds degradation ranking was determined by assigning priority ranks.

The compound parameter (CP), the priority rank (NP), and the priority degree (GP) of each watershed were also calculated (Patel et al. 2012; Chauhan et al. 2016; Prabhakar et al. 2019). CP was calculated by averaging all parameters from watersheds. From the group of these watersheds, the highest priority rank (NP) was assigned to the watershed having the lowest compound parameter and so on. The watersheds have been categorized into three classes (GP) using the following equations (4, 5, and 6).

$$ {GP}_{\mathrm{High}}=\left[{CP}_{\mathrm{MIN}}\kern0.5em ,\kern0.5em {CP}_{\mathrm{MIN}}+\left(\frac{CP_{\mathrm{MAX}}-{CP}_{\mathrm{MIN}}}{3}\right)\right] $$
(4)
$$ {GP}_{\mathrm{Medium}}=\left[{CP}_{\mathrm{MIN}}+\left(\frac{CP_{\mathrm{MAX}}-{CP}_{\mathrm{MIN}}}{3}\right)\kern0.5em ,\kern0.5em {CP}_{\mathrm{MIN}}+2\times \left(\frac{CP_{\mathrm{MAX}}-{CP}_{\mathrm{MIN}}}{3}\right)\right] $$
(5)
$$ {GP}_{\mathrm{Low}}=\left[{CP}_{\mathrm{MIN}}+2\times \left(\frac{CP_{\mathrm{MAX}}-{CP}_{\mathrm{MIN}}}{3}\right)\kern0.5em ,\kern0.5em {CP}_{\mathrm{MAX}}\right] $$
(6)

In the same way that we calculated the values of CP, NP, and GP for the component of morphometry, first we calculated the NDVI values, and they were classified following the proposal done by Gómez-Almonte (2005) and Merg et al. (2011). After that, we calculate the CP, NP, and GP as suggested by Patel et al. (2012), Chauhan et al. (2016), and Prabhakar et al. (2019). The GP values were calculated applying Equations (4), (5), and (6), and classified as a function of high, medium, and low ranks.

Results and discussion

Main sub-basin characteristics

The parameters of shape, relief, and those relative to drainage network were calculated and presented in Table 3. The results indicate that in the upper part of the sub-basin, the water flow runs at high velocities generating a torrential flow. In the lower part, there are peak flows caused by surface runoff concentration. The sub-basin time of concentration is 3.37 h along 41.59 km of the main channel and reaches flows with highly erosive velocities. The sub-basin is elongated, classified as order seven. The sub-basin presents high drainage density and outstanding stream frequency. The sub-basins’ drainage system is dendritic and presents a mean bifurcation ratio around four streams before discharging into a stream of higher order.

Table 3 Morphometric parameters determined for Huehuetan river sub-basin

The sub-basin’s mean terrain slope is 38.79%, and the main channel’s mean slope is 6.4%; gradients greater than 100% are found in its upper area, and they influence directly erosive processes. Based on the longitudinal profile of the main channel, the sub-basin’s upper sector is approximately 2700 m above mean sea level, and the slope of the first 10 km downstream of the main channel is greater than 100%. Following the course downstream, between 10 and 20 km approximately along the main channel, the slope decreases. In the last part of the main channel (from 28 to 42 km), i.e., downstream, the slope is less than 10%. Due to these variations, the main channel’s mean slope is 6.4%; yet we must consider the slope variations along the main channel linked to the changes in length and elevation (Fig 3). The sub-basin’s mean terrain slope and the main channel’s mean slope values are important since they define the potential runoff and have a strong correlation with soil erosion, particles detachment on streams, sediments transport capacity, and volume of sediment deposited in the lower parts of the sub-basin. The sediments deposited in the lower parts of the sub-basin cause that the main channels lose part of their hydraulic section making agricultural and urban areas more prone to flooding.

Fig. 3
figure 3

Longitudinal profile of the main channel for the Huehuetan river sub-basin

The sub-basin’s hypsometric-adjusted curve corresponds to a third-degree polynomial model, and its integration produces an area of 30.47% below the curve and which according to Strahler (1952, 1957) corresponds to a sub-basin at an old stage (monadnock phase). According to the sub-basin’s morphometry and geoforms, relative areas at the beginning of the hypsometric curve present strong erosive processes caused by high runoff velocities and which are attenuated downward creating accumulation zones from transported materials (Fig 4).

Fig. 4
figure 4

The hypsometric curve obtained and adjusted for the Huehuetan river sub-basin

Morphometric parameters of watersheds

Drainage analysis coupled with hydrological observations provides useful information about the geological framework of the basin. The main factors controlling the evolution of the basin are rainfall availability as the source of water, drainage characteristics and its influence on runoff and infiltration capacity, rock type for which the lithology character governs the movement of the water, and slope because it controls the water energy flow. These factors can define variations between morphometric parameters and can be measure it by GIS (Singh et al. 2014).

On each delineated watershed, the morphometric parameters, geoforms, and soil physical properties vary according to their topography, for example, the slope is higher upstream. In relation to the area, the largest corresponds to watershed 01, followed by watershed 08, located at the lower and upper sectors of the sub-basin, respectively (Table 4). The mean slopes of each watershed increase according to its mean elevations, being the watersheds 07 and 08 (located in the sub-basin upper part) the steepest ones.

Table 4 Watersheds morphometric parameters for the Huehuetan river sub-basin

The Pmc values, in relation to Plc, are close to or greater than 20% in all watersheds, being the smallest difference on watershed 02, with 19.58%, and the largest on watershed 07, with 51.83%. These differences in slope, between terrain and channel, are related to topographic relief, drainage network density, and shape of channel cross-sections.

Stream order (U)

The stream order (U) is a morphometric parameter that determines the stream hierarchy of tributaries, and the higher the stream order is, the larger discharges and erosive velocities are (Mahala 2020). In the Huehuetan river sub-basin, the stream orders are six (watersheds 01, 04, 06, and 08), five (watersheds 03 and 07), and four (watersheds 02 and 05). In our case, the stream number decreases as the order number increases, and this can be explained by Horton’s law of streams (Horton 1945). In Figure 5, we present a graph that shows our data following a linear trend when graphing the stream order relative to the logarithms of the stream number. Any variation in this figure is related to changes in geological structures, active tectonics (Kamberis et al. 2012), lithology (Singh et al. 2014), and consequently, in relief and slope terrain and channels.

Fig. 5
figure 5

Relationship between stream order and stream number, by watershed, in the Huehuetan river sub-basin

Stream length (L u)

The stream length (Lu) is a hydrological feature affected by the slope. The higher slope is, the higher erosive velocities are, generating a dense stream network on upper lands but shorter on length (first order). However, when all streams are accumulated, it produces a larger total stream length (Lu). In contrast, smaller slopes will influence the stream order with less dense by increasing channel length and the total length to be shorter (Gopinath et al. 2016). In general, the sum of stream length decreases as the order number increases (Ramaiah et al. 2012; Magesh et al. 2013) and which is consistent with Horton’s law of streams (Horton 1945). Our results indicate that eight watersheds (Fig 6) fit Horton’s laws although some of them showed an inverse relationship mainly on the 3rd and 4th order (watersheds 01, 02, 07), due to their physical conditions that increase the drainage network to higher orders. Besides topography, lithology (Mishra and Nagarajan 2010), and tectonic processes, there are physical conditions that affect expected line trends in the drainage area (Migiros et al. 2011; Kamberis et al. 2012; Hajam et al. 2013).

Fig. 6
figure 6

Relationship between stream order and stream length, by watershed, in the Huehuetan river sub-basin

On the other hand, the mean stream length (Lsm) is directly proportional to the stream order (U) and follows a geometric progression (Table 5) as suggested by Horton’s law of streams (Horton 1945). The stream length ratio (Rl) also follows this behavior, although in some watersheds the values decrease, as observed from the relationship between 4th and 5th orders. This unexpected behavior is associated with tectonic processes (Kamberis et al. 2012) and its lithological responses that have occurred in the drainage areas and have generated changes on relief and slopes in some sectors of the sub-basin (Strahler 1957; Phani 2014; Gopinath et al. 2016).

Table 5 Stream length and stream length ratio, by watershed and stream order, on the Huehuetan sub-basin

Bifurcation ratio (R b)

The bifurcation ratio (Rb) expresses the relationship between the stream number from a given order and the stream number on the next lower order (Schumm 1956). The variation in this parameter in the same watershed is related to local geological conditions that affect lithological characteristics, drainage density, angle of connection between channels, and watershed shape and size (Singh and Dubey 2012). In our study, the Huehuetan river sub-basin presents high values of Rb between 7.0 and 2.0 while the mean ratios by watershed between 5.46 and 3.71. These high Rb values are related to early hydrograph peaks, potential for flash floods (Howard 1990; Rakesh et al. 2000), soil erosion susceptibility, and gully formation, processes affected by geological and lithological evolution such as deformation patterns, tectonism, and mass movements (Kamberis et al. 2012).

Drainage density

The drainage density (D) is a parameter related to the total stream length and watershed area (Horton 1945); it varies with the rainfall intensity, rock permeability, soil infiltration, and watershed slope. The magnitude of the drainage density controls landforms, soil erosion, and vegetation cover (Horton 1932; Strahler 1964) and provides a numerical measurement for landscape dissection, drainage efficiency, and runoff potential (Magesh et al. 2013). For the Huehuetan river sub-basin, the extreme D values vary from 3.92 to 5.78 km/km2 for watersheds 02 and 06, respectively (Table 6). In our study, while high values are associated with high topographic relief, impermeable rocks, superficial soils, and scarce vegetation cover, low values are linked to watersheds with permeable subsurface material, good vegetation cover, and low relief that increase the infiltration capacity. Singh and Singh (2011) and Singh et al. (2014) found similar results for other watersheds.

Table 6 Morphometric parameters for each watershed

Stream frequency

According to Horton (1945), stream frequency (Fs) is the ratio between the total number of reach segments for all stream orders and the watershed area. Hypothetically, it is possible to have watersheds with the same drainage density but differs in stream frequency and vice versa (Ramaiah et al. 2012). The stream frequency is associated with the evolution of the landscape (Magesh et al. 2013); it mainly depends on the watershed lithology and it influences the drainage texture (Hajam et al. 2013). Low Fs values are related to high permeable lithology (Joji et al. 2013; Abboud and Nofal 2017) and low flood occurrence (Maurya et al. 2016). However, areas of permeable soils exposed to intense rainfall events and high frequency can produce runoff due to soil saturation or insufficient infiltration capacity which, at the same time, cause the creation of new streams and trigger landslides. Our results indicate that higher Fs values are in the upper part of the sub-basin, mainly in watersheds 07, 06, and 08, indicating soils susceptible to erosion and gully formation. In addition, Fs is directly proportional to drainage density.

Drainage texture

The drainage texture (Dt) is a parameter corresponds to the relationship between the total number of streams for all stream orders and the perimeter of the watershed. According to Horton (1945), Dt represents the relative spacing among drainage lines and depends mainly on infiltration capacity, although it is also affected in a lower degree by rainfall, vegetation cover, and rocks permeability (Dash et al. 2013). Smith (1950) classified the drainage texture in five classes: very coarse (<2), coarse (2–4), moderate (4–6), fine (6–8), and very fine (>8). Based on this classification, the sub-basin textures of our study (Table 6) fall into the categories of coarse texture (watershed 02), fine (watersheds 06, 05, and 04), and very fine (watersheds 01 and 08). The variation of Dt values for our study indicates a high drainage network dissection and potential for runoff.

Form factor

According to Horton (1932), the form factor (Ff) parameter correlates the watershed area and the square of the watershed’s channel length. The Ff parameter is also an indicator of watershed topographic relief, geological structures, and erosivity of the channel’s bed material (Abboud and Nofal 2017). Watersheds with high form factors have a circular shape and big peak flows but short time of concentration, whereas elongated watersheds with low form factors have low peak flows and large time of concentration (Altaf et al. 2013; Kumar and Chaudhary 2016; Kusre 2016). In this study, all watersheds have a low form factor (Ff<0.20) and fitting the elongated shape of the watersheds and capable to manage floods better than those watersheds with circular forms.

Shape index

The shape index (Sw) is reciprocal to the form factor (Ff) and relates the stream length and watershed area. This index is associated with topographic relief, sediment transport, and flow concentration time (Vincy et al. 2012; Altaf et al. 2013). The values of the shape index for the studied watersheds varied from 5.53 in watershed 04 to 12.95 in watershed 01 (refer to Table 6). Those Sw values suggest that watershed 04 has the shortest concentration time, while the watershed 01 will have the longest concentration time.

Elongation ratio

The elongation ratio (Re) relates the diameter of a circle, with an area similar to the watershed, and the maximum stream length (Schumm 1956). Re compares the watershed shape to a circle, and its magnitude depends on the climatic and geologic past conditions of the sub-basin (Rai et al. 2017). The elongation ratio, proposed by Schumm (1956) and further interpreted by Strahler (1964), is used to evaluate the shape of the watersheds, i.e., circular (0.9–1.0), oval (0.8–0.9), less elongated (0.7–0.8), elongated (0.5–0.7), and more elongated (< 0.5) (Gupta et al. 2019). Elongated watersheds indicate a young stage of evolution caused by intense neotectonic activities, while a watershed with a shape close to a circle suggests an early stage of maturity (Lykoudi and Angelaki 2004). A circular watershed is more efficient to discharge runoff than an elongated one (Singh and Singh 1997). The Re values for the watersheds of our study varied between 0.31–0.48, and they can be classified as more elongated. The Re values of the watersheds help to explain the neotectonic influence on basin’s evolution, and high speed flows on channels (Strahler 1952).

Circularity ratio

The circularity ratio (Rc) is a quantitative parameter that allows visualizing the shape of the watershed. The Rc relates the area of a watershed to the area of a circle with a circumference that is equal to the perimeter of the watershed, i.e., both the watershed and the circle must have the same perimeter, but the area can vary (Miller 1953; Strahler 1964). The Rc parameter is affected by geological structures, vegetation cover, climate, relief, and slope (Vincy et al. 2012; Aher et al. 2014) and is used to correlate stream frequency, watershed shape, and runoff (Al-Saady et al. 2016). Magesh et al. (2011) reported that the Rc is related to the age of the watershed and it can be classified as low, medium, and high indicating that a watershed can be at youth, mature, and old stages of the life cycle, respectively.

Length of overland flow

The length of the overland flow (Lg) comprises the water that flows over the ground surface to stream channels. The overland flow occurs due to the inability of water to infiltrate the ground surface, either because of the high intensity of rainfall or because of soil poor infiltration capacity (Horton 1945; Schumm 1956). Horton (1945) defines Lg as the half of the reciprocal value of the drainage’s density (D) and it depends on channel slope and current vegetation cover conditions (Aher et al. 2014). Lg holds an inverse relationship with channel mean slope, and it is similar to the laminar flow length (Magesh et al. 2011); hence, the slope is important for the hydrological response of watersheds (Aher et al. 2014). The Lg values in our work varied from 0.09 to 0.13 km, indicating that the overland flow moves over short distances before runoff concentrates into tributary channels; this situation is mainly due to steep slope conditions, scarce vegetation, and soil susceptibility to generate gullies; causing the high values of drainage density and stream frequency.

Constant of channel maintenance

Schumm (1956) used the reciprocal of drainage density to represent the area needed to maintain 1 km of channel and called constant of channel maintenance (C). This parameter depends on rock type, permeability, vegetation cover, and topographic relief (Kouli et al. 2007; Dash et al. 2013), and it is an indicator of watershed erodibility (Singh and Singh 2011). The results in Table 6 show low values because the watersheds in our study have scarce vegetation, soils with low resistance to runoff, and mountainous relief (Altaf et al. 2013).

Compactness coefficient

The compactness coefficient (Cc), also known as Gravelius Index (GI), divides the watershed perimeter by a circumference with a similar area to this one. During rainfall events, Cc influences the amount of runoff and the shape of the hydrograph (Graveluis 1914; Maurya et al. 2016). From the hydrological standpoint, circular watersheds produce the most hazardous conditions because they produce, for similar surfaces, the shortest time of concentration and the highest peak flow (Ratnam et al. 2005; Javed et al. 2009; Altaf et al. 2013; Chandniha and Kansal 2017). When watershed tends to be circular, the relative distance from any point at the perimeter of the basin to the main channel is practically the same, and the concentration time becomes lower causing greater accumulation of surface runoff, bigger erosive velocities, and larger floods along the main channel. A Cc value equal to one occurs when the watershed shape is a perfect circle; a factor of 1.128 corresponds to a square-shaped watershed, and a coefficient around 3.00 is related to very elongated watersheds (Al-Saady et al. 2016). Based on our results (Table 6), the watersheds show square-shaped and very elongated shapes.

Ruggedness number

The ruggedness number (Rn) parameter is the relation between the watershed topographic relief and the drainage density (Strahler 1957); it combines the slope-length characteristics in one expression. Areas with low watershed topographic relief but high drainage density are ruggedly textured, and areas of higher topographic relief have lower dissection. Rn indicates that if the drainage density increases while the relief remains constant, the average horizontal distance from drainage divide to the contiguous channel will reduced. On the other hand, if the drainage density remains constant and the watershed topographic relief increases, the elevation difference will increase (Aher et al. 2014). The Rn values calculated for our study (Table 6) show that the Huehuetan watersheds exhibit steep slopes, high relief, and high drainage density.

Relief ratio

The relief ratio (Rh) is the relationship between the total relief of a watershed and the longest horizontal distance measured along its main channel (Schumm 1956). Rh measures the overall watershed steepness, and it is directly related to the length of overland flow and time for peak flood (Singh et al. 2014). Thus, Rh is an important indicator for general erosion and intensity (Samal et al. 2015). High values of Rh indicate a steep slope and high relief, while a lower slope values may be controlled by basement rocks in form of small ridges and mounds (Vittala et al. 2004). The Rh values (Table 6) obtained in our study are directly related to watershed topographic relief conditions, in the sub-basin, upper part are steeper slopes and higher Rh values (i.e., watersheds 05 and 06), while the lowest value (0.04) was found in the watershed 02. Our results align with the findings of Ansari et al. (2012), Joji et al. (2013), Aher et al. (2014), Babu et al. (2016), Kumar and Chaudhary (2016), and Abboud and Nofal (2017); they report similar behaviors of the Rh values for other watersheds.

Hypsometric integral

The hypsometric integral (HI) of a watershed represents the relative area under a given altitude (Strahler 1952). The hypsometric curve (HC) presents on the ordinate axis; the watershed elevation in relation to its maximum altitude, and on the abscissae axis, presents the relative areas with respect to the watershed surface. The relative values (percentages) presented in the curve allow comparisons between curves from different watersheds (Racca 2007). The hypsometric curves of a watershed are associated with flood responses, watershed erosion, and drainage network development and allow to identify active and inactive tectonic sectors (Al-Saady et al. 2016).

The value of the hypsometric integral, expressed in percentage, compares a given volume at time “t” in the watershed and its initial value, allowing to estimate the erosion volume occurred on the watershed along its geological span time (Bishop et al. 2002). Strahler (1952) found that HI has an inverse relationship with watershed relief, slope length, drainage density, and channel slope.

The hypsometric curve and the hypsometric integral can be associated with the watershed dissection degree and the relative age of its topographic relief. The convex curves in the upper part with high values of the hypsometric integral (HI>60%) are linked to early youth watersheds with little degraded landscapes but intense erosion processes (youth or inequilibrium stage). The S-shaped curves indicate that watersheds are close to full maturity (35%<HI<60%) and landscapes are close to equilibrium stage and lower degradation processes. The concave curves, with low HI values (<35%), are typical on dissected landscapes and old watersheds where degradation processes are minimal (monadnock stage) (Strahler 1952).

Correlation matrix between morphometric parameters

The correlation between morphometric parameters using the Pearson correlation coefficient (r) are summarized in Table 7. The results indicate that 20 values were highly positive correlations (with greater than 0.70). These correlations were watershed area (A) with drainage texture (Dt), drainage density (D) with stream frequency (Fs), ruggedness number (Rn), relief ratio (Rh), basin relief (Bh), and mean watershed slope (Pmc); stream frequency (Fs) with ruggedness number (Rn), basin relief (Bh), and mean watershed slope (Pmc); form factor (Ff) with elongation ratio (Re), circularity ratio (Rc), and hypsometric integral (HI); elongation ratio (Re) with circularity ratio (Rc), and hypsometric integral (HI); mean bifurcation ratio (Rbm) with constant of channel maintenance (C); ruggedness number (Rn) with relief ratio (Rh) and basin relief (Bh); length of overland flow (Lg) with constant of channel maintenance (C); and compactness coefficient (Cc) with shape index (Sw).

Table 7 Correlation coefficients of the morphometric parameters

On the other hand, we found 22 negative correlations with high level of significance, i.e., the mean bifurcation ratio (Rbm) with stream frequency (Fs); ruggedness number (Rn) with mean bifurcation ratio (Rbm); both length of overland flow (Lg) and constant of channel maintenance (C) with drainage density (D), stream frequency (Fs), ruggedness number (Rn), and relief ratio (Rh); compactness coefficient (Cc) and shape index (Sw) with form factor (Ff), elongation ratio (Re), and circularity ratio (Rc); basin relief (Bh) with mean bifurcation ratio (Rbm), length of overland flow (Lg), and constant of channel maintenance (C); and mean watershed slope (Pmc) with length of overland flow (Lg), and constant of channel maintenance (C). We also found 16 comparisons did not correlate (values between −0.10< r < 0.10), i.e., watershed area (A) with length of overland flow (Lg) and drainage density (D) with hypsometric integral (HI).

The correlation analysis shows a high level of correspondence (p<0.01) among 17 variables compared. Drainage density (D), length of overland flow (Lg), and constant of channel maintenance (C) were the most associated variables with five correlations, followed by compactness coefficient (Cc) and shape index (Sw) with four correlations each one, while stream frequency (Fs), form factor (Ff), ruggedness number (Rn), and basin relief (Bh) shown three significant correlations each one. On the other hand, watershed area (A) and relief ratio (Rh) were associated (p<0.05) only with drainage texture (Dt) and mean watershed slope (Pmc), respectively. We identified that the main correlations are related to five linear morphometric parameters and the correlation with four of these parameters are associated with shape parameters, and correlations with three parameters correspond to relief. Despite these correlations allow identifying high and significative relationship among parameters, they do not allow yet to group specific parameters into components and try to attach any physical significance (Sharma et al. 2015). Thus, the resulting large number data from the correlation analysis justifies the need for a procedure to discriminate parameters and find those parameters that have greater influence on basin’s evolution and its degradation without losing quality of the explanation of each selected variable.

Due to a large number of correlations among morphometric parameters, we used the principal component analysis to determine which morphometric parameters have greater influence on watersheds behavior. Multivariate analysis is a useful technique for identifying commons patterns in data distribution, leading to a reduction of the initial dimension of datasets and facilitating their interpretation (Khanchoul and Saaidia 2017).

Principal component analysis

We applied the principal component analysis (PCA) technique to all morphometric parameters that have influence on soil degradation process in the eight watersheds of the Huehuetan river sub-basin (Table 8). PCA is based on a “component loading” matrix and indicates numerically the level of relationship between each component and the original morphometric parameters (Farhan et al. 2017). The weights of the original parameters in each component are called “loadings,” and each component is associated with a particular parameter. Besides interpreting the processes that generate the observed relationships between the chosen variables, PCA also provides a simplified data matrix known as the “component score” (or weightings) matrix.

Table 8 Eigenvalues explained variability and principal component loadings

Eigenvalues greater than one indicate that the respective principal component presents more variance than the original standardized variables; thus, the higher the eigenvalue, the greater the variability (Khanchoul and Saaidia 2017). Based on the results of our analysis, the first three principal components (PC01, PC02, and PC03) showed a total variance of 92.34%.

The Table 9 shows that PC01 is highly correlated with Fs, C, D, and Lg (PC01>0.90). PC01 also shows satisfactorily correlated value of 0.75<PC01<0.90 with Pmc, Cc, and Rn and moderately (0.60<PC01<0.75) with Rh, Rc, Sw, Bh, Rf, and Re. In general, the PC01 is associated to linear parameters. The PC02 correlated satisfactorily with Re while moderately with Rf, HI, Sw, Bh, A, and Rn indicating that it is associated with the degradation of the watershed over time. The PC03 has a satisfactory correlation with Dt and moderately with A and Rh, and these two variables are associated with the shape of the watershed. On the other hand, Rbm was not correlated significantly with any principal component.

Table 9 Correlations between variables and factors (factor loadings)

The Fig 7 shows the circular correlation between principal components by pairwise comparison and the influence of each variable. This figure illustrates that the PC01 and PC02 represent almost 80.46% of inertia (Fig 7a). According to the small angle between vectors and its closeness to the circumference, drainage density (D), stream frequency (Fs), and mean watershed slope (Pmc) are positively well represented by PC01. On the other hand, the length of overland flow (Lg), constant of channel maintenance (C), and shape index (Sw) show a high negative correlation with PC01. Based on Fig 7b, most of the variables were correlated with PC01, except the watershed area (A) and the drainage texture (Dt); those variables were better represented by PC03.

Fig. 7
figure 7

Circular correlation between variables analyzed and the first three principal components

Comparing PC02 and PC03 (Fig 7c), most of the analyzed variables are not correlated with both principal components, but watershed area (A) and drainage texture (Dt) showed a high positive correlation with PC03. The hypsometric integral (HI) and elongation ratio (Re) were the most positively correlated with PC02.

Principal components analysis was used for all morphometric parameters to reduce the information contained in all variables, identify patterns, analyze the relationship between parameters, and therefore to reduce the data of the morphometric parameters influencing the sub-basin evolutions the most and that are related to erosion and degradation processes (Khanchoul and Saaidia 2017). In our study, PCA was useful to regroup parameters with physical significance and discriminate morphometric parameters with lower influence.

The results of this study reveal that PC01 is related to linear parameters, PC02 to watershed shape, and PC03 to relief. The first component values for our watersheds suggests that runoff yield and soil loss can be associated with linear parameters like drainage density (D) and stream frequency (Fs). Our results align with what Kamberis et al. (2012) found; D and Fs are parameters highly correlated with components like soil texture, land cover, and geology.

Watershed prioritization

Based on the maximum values among three principal components, variables used to determine watershed prioritization (Table 10) were stream frequency (Fs), drainage density (D), elongation ratio (Re), and drainage texture (Dt).

Table 10 Watershed prioritization based on both components and final priority

We calculated the priority rank (NP) according to the watershed degradation degree evaluated for each selected variable and considering the compound parameter (CP) as described in the methodology section and classified by priority degree (GP).

To review the importance of the morphometric parameters on watershed degradation and its impact on vegetation cover, the NDVI was used to compare both results and obtain more precise values. Table 10 shows the variations in priority as a function of vegetation cover, where watersheds 01, 03, and 06 decrease their priority rank due to high values of vegetation cover, specifically in the high vegetation class. On the other hand, watersheds 02 and 05 increase their priority rank due to the presence of the urban areas where vegetation cover is scarce and there are large paved areas (Fig 8).

Fig. 8
figure 8

Spatial variation of NDVI classes in the Huehuetan river sub-basin

The results of combining the morphometric parameters and vegetation cover estimated with NDVI are presented in Table 10. The morphometry and NDVI describe the watershed degradation caused by water concentrated in the channels and the overland flow on hillslopes, respectively. The use of morphometric parameters and NDVI together, generates a spatial distribution of the general degradation in the watershed and which is useful to define the best management practices for soil and water conservation on streams and hillslopes. Table 10 shows the priority rank for the watersheds as a function of morphometric and NDVI variables; it also shows the final priority combining both procedures.

There is a clear definition among the priority ranks of the watersheds, where the highest priority for intervention is located in the upper areas of the sub-basin, specifically on watersheds 07, 08, and 06. The priority rank indicates the sub-basin degradation condition. In the upper areas, there are steeper lands and deeper channel profiles. As the channels descends along the watershed (Fig 9), the slope or steepness of lands, the depth of the channels, and degradation conditions reduce too. The watershed 01 is classified as high priority by the morphometry component and low priority by NDVI component, but when both components (morphometry and NDVI) are combined, it can be classified as of medium priority. The difference in classification obeys to larger vegetation cover and shows the importance of biomass to control degradation.

Fig. 9
figure 9

Final prioritization of watersheds based on the combination of morphometry and NDVI components

In our study, the first-three highly prioritized watershed was found in the upper part of the sub-basin reflecting the importance of linear and shape parameters. Linear and shape parameters combined with heavy rainfalls, steeper slopes, and soils susceptible to water erosion have been generating changes in the evolution of the landscape. In our results, the watersheds that are classified as medium priority (01, 03, and 05) have vegetation cover values greater than the 60% and are even higher than values for a watershed classified with high priority. The NDVI classes show that vegetation cover plays an important role on the effects against soil erosion and sub-basin degradation (Table 10). Watersheds with low priority in our research (02 and 04) are spatially distributed in the lower part of the sub-basin, mainly in plain areas with mean elevation values of these two watersheds of 246 and 529 m above mean sea level (Table 4). For the watersheds 02 and 04, the low relief has an important role on the response of the drainage network, and it affects the degradation of the sub-basin causing frequent flooding and affecting agricultural and urban areas.

Conclusions

In this study, seventeen morphometric parameters coupled with NDVI of eight watersheds located in Huehuetan river sub-basin of Chiapas, Mexico, were chosen for the watershed prioritization trough principal component analysis. The component loading matrix obtained from correlation matrix showed that the first three components (PC01, PC02, and PC03) whose Eigenvalues are greater than one together account for 92.34% of the total explained variance. Based on our results, the first component is strongly correlated with linear parameters, the second component is strongly correlated with watershed shape, and the last one (PC03) is associated with relief. Based on the values of the morphometric parameters, three principal components were selected and defined as linear, shape, and relief components. Moreover, we can infer that in hydrological modelling processes such as runoff and sediment yield from small watersheds, our methodology could be useful to discriminate insignificant parameters without losing quality in the results. In addition, the use of both morphometric parameters and NDVI components improves quality watershed prioritization results.

We conclude that it is feasible to analyze fewer variables representing greater variance, as we showed with our methodology and the case we presented in this paper. Otherwise, if the analysis is run with more traditional methods such as weighted sum or multicriteria analysis, a high number of combinations between variable pairs will be required. The present study reveals that parameters related to drainage network (Fs and D) and watershed shape (Re and Dt) have a great influence on erosive processes of the Huehuetan river sub-basin; all these variables are associated with runoff-erosion processes on channels and drainage network dissection degree. The morphometric analysis allows inferring from basin susceptibility to degradation, and it is an indicator of the potential growth of the drainage system. On the other hand, when NDVI is confronted with morphometric analysis, this index functions as an indicator of the degree of soil protection by vegetation cover. The combination of both components (morphometric parameters and NDVI) to generate a final priority allows identifying the activities needed to mitigate processes of watershed erosion at streams and hillslopes. In this way, areas with higher degradation potential (i.e., sites with greater slopes and higher reliefs) are located in the upper part of the sub-basin, e.g., watershed 07, 08, and 06, for our research. This methodology (morphometric parameters and NDVI coupled with PCA) helps to reduce analysis time, maintain parameters variability, reduce morphometric parameters on analysis, improve basin public policies, and help decision-makers to implement the best management practices on those areas with greater degradation susceptibility on soil and land use change. The use of PCA for watershed prioritization is a good tool for discriminating the insignificant parameters from the analysis and improves quality of results.