Introduction

Soil moisture is a critical variable in many physical processes related to agriculture, hydrology, and meteorology or climatology (Maria Jose Escorihuela and Quintana-Seguí 2016; Wang et al. 2016). Assessments of soil moisture are especially significant for drought monitoring by agricultural industries and for observations of the ecological environment. In that regard, a great deal of attention has been paid to how changes in the land surface temperature (LST) and normalised difference vegetation index (NDVI) affect soil moisture. However, relatively less attention has been paid to how the temperature vegetation dryness index (TVDI), which is based on an empirical parameterisation of the relationship between LST and NDVI variations, impacts soil moisture. For example, the TVDI has not been studied in the area of the Longmen Mountains (TZ).

There are many different techniques for in situ soil moisture measurement (Skierucha and Wilczek 2010; Skierucha et al. 2012), but they are cost-intensive and require major efforts to be put in place (Chen et al. 2015). Moreover, only a few networks are available for in situ soil moisture measurement. Thus, remote sensing (RS) and geographical information system (GIS) are valuable spatiotemporal data and spatial analysis tools for soil moisture investigations (Chen et al. 2015; George P. Petropoulos et al. 2015).

Researchers in China and globally have investigated the impact of TVDI changes on soil moisture, owing to the availability of RS data of an areas over different time periods El Hajj et al. (2016). Watson et al. (1971) were the first to carry out a soil moisture inversion, using a model that combined the LST and thermal inertia. Sen and Yuan (2007) examined the relationship between the summer season NDVI and local-level climate variables, including precipitation and temperature in Minnesota from 1990 to 1997, using geographically weighted regression. Gao et al. (2011) presented a new drought assessment method that spatially and temporally integrated TVDI with a regional water stress index, based on a synergistic approach. Chen et al. (2011) investigated the potential of the TVDI for assessing soil moisture in the Huang-Huai-Hai plain from moderate resolution imaging spectroradiometer (MODIS) data. Chen et al. (2015) used the TVDI to investigate the differences in soil moisture under four types of mono-species forests and two types of mixed forests in Nanjing. Rébecca Filion et al. (2016) investigated the potential of radar and LANDSAT data in generating reliable soil moisture maps to support water management and agricultural practices in Mediterranean regions, particularly during dry seasons. Eftychia Taktikou et al. (2016) evaluated the capability of a soil water content predicted from RS to indicate the soil water content at a short time and space scale, through comparisons with daily soil moisture data determined in situ using dielectric devices. Wang et al. (2016) investigated the spatial distribution and limiting factors of the vegetation growth and NDVI change trends in two major alpine grasslands (alpine meadow and alpine steppe) in relation to soil temperature and soil water conditions, using datasets of the plateau scale soil water content, frozen soil type, vegetation index, and LST. A land parameter retrieval model was used to derive enhanced resolution soil moisture and vegetation optical depth data from advanced microwave scanning radiometer brightness temperatures, as sharpened by a modulation technique based on high-frequency observations (A.I. Gevaert et al. 2016). Comparisons with soil moisture from in situ measurements or model simulations indicated show that the TVDI is feasible for monitoring soil moisture (Chen et al. 2015).

Previous analyses indicated that soil moisture can be measured to some degree and have investigated the potential for obtaining the soil moisture through the relationship between the LST and NDVI, based on RS data. Although the findings from these studies are important, little research has focused on investigating the soil moisture in the ecologically fragile TZ, in western China. This research is important, both for helping us understand how the soil moisture and TVDI are linked over time and also for ensuring the assessment of the soil moisture. Therefore, this study investigates the suitability of RS data for estimating the soil moisture in the TZ.

The scale effects and cloud influence on optical RS data are very important scientific problems in RS (Chen et al. 2015) and can increase the uncertainty in soil moisture retrieval. LANDSAT 8 OLI/TIRS images have a higher spatial resolution (30–120 m) than Terra/Aqua MODIS images (250–1000 m) or National Oceanic and Atmospheric Administration-Advanced Very High Resolution Radiometer (NOAA-AVHRR) images (1100 m) where cloud cover affects the LST calculation and results in errors in the calculation. Therefore, this study uses good-quality images.

The objectives of this study are to (1) identify the distribution of the NDVI; (2) present the retrieved LST data based on atmospheric correction algorithms; (3) determine the suitability of the TVDI for estimating the soil moisture in the TZ; (4) estimate the soil moisture through RS data and establishment of the results with field measured soil moisture data.

Materials and methods

Study area

The study area is located in the northwest region of the Sichuan Basin between 103° 57′–104° 20′ E and 31° 15′–31° 41′ N (Fig. 1), in the transition zone between the Chengdu Plain region and the Longmen Mountains (TZ), and has a surface area of 79.48 km2. The northwest section of the study area is a part of the TZ, whereas the southeast section of the study area is a part of the Chengdu Plain. Thus, the terrain is high in the northwest section and low in the southeast section, and there is a large difference in altitude between these sections. The study area lies in the subtropical humid climate zone of the Sichuan Basin and has a mild climate, abundant precipitation, and four distinct seasons, i.e. typical characteristics of the continental monsoon climate. The average annual temperature is 15.7 °C, and the average annual precipitation is 1053.2 mm. The distribution of vegetation in the region exhibits a distinct vertical gradient, owing to terrain and climate effects.

Fig. 1
figure 1

Location of study area

Data sources

The main types of research data in this study included RS data, field measured soil moisture data, and auxiliary data. The RS data included LANDSAT-8 OLI/TIRS RS images taken on 28 April 2016, with track number 129/038. The RS data was acquired from the ‘US Geological Survey Earth Resources Observation and Science’ data centre (USGS EROS Data Center), and a digital elevation model (DEM) was acquired from the Data Center for Resources and Environmental Sciences, Chinese Academy of Sciences (RESDC), the Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences. The auxiliary data included 1:50,000 topography maps and vector boundary data for the study area, which were all obtained from a key lab of land resources evaluation and monitoring in the Southwest, Ministry of Education, i.e. Sichuan Normal University, while Global Positioning System (GPS) sampling data were obtained via field surveys.

Field work and soil samples processing

The field measured data was collected within a week before and after the RS imaging time, by handheld GPS devices. The weather was stable during the field work, and it could be determined that the soil moisture remained relatively stable. One hundred and twenty-four soil observation points of in situ soil moisture measurements were selected to ensure that each land use types had at least ten observation points (Fig. 1). However, cloud cover seriously influences LST retrieval and often limits the applicability of the TVDI for estimating soil moisture on a regional scale. Ninety-five effective field measured data points were retained using data exploration and elimination of abnormal values, such as the large errors caused in soil sampling or sample processing, points seriously influenced by cloud cover as determined based on GIS and Statistical Package for the Social Sciences software.

Previous results have shown that in the root zone of the soil profile, the relationships between the TVDI and soil moisture at depths of 0–20 cm in the soil samples were closer than those at depths (Patel et al. 2009; Chen et al. 2011; Sun et al. 2012;). The sampling depth in this study was approximately 20 cm, and each soil sample was crumbled before being placed in an aluminium box with a known accurate mass. Each sampling point was geo-referenced using a handheld GPS system. Samples were placed inside an oven preheated to 105 °C ± 2 °C and were then baked for 12 h. After removal and cooling to room temperature, each fresh soil sample was then immediately weighed to an accuracy of 0.001 g. For each sampling point, three measurements were made for the purpose of parallel determination. If there were significant discrepancies between the measurements for the first parallel sample, another round of drying would be carried out before re-weighing.

The main task during the field study was to measure the soil moisture using a direct method. The dried soil samples were weighed using a Mettler balance, and the percent gravimetric soil moisture was calculated using Eq. (1):

$$ {M}_s=\left({W}_t-{D}_y\right)/{D}_y\times 100 $$
(1)

where the Ms, Wt, and Dy were soil moisture, soil wet weight, and soil dry weight, respectively.

Remote sensing processing

The RS data were projected using the Gauss–Krüger projection and were referenced to the World Geodetic System-84 coordinate system. The geometric corrections were performed on the 2016 RS images (using the 1:50,000 topographical maps) to create the reference image.

The ‘Environment for Visualizing Images’ (ENVI) software offers quick atmospheric correction (QUAC) and ‘Fast Line-Of-Sight Atmospheric Analysis of Hypercubes’ (FLAASH) atmospheric correction tools. QUAC is an atmospheric correction method for multispectral and hyperspectral imagery that works with the visible and near-infrared spectra in the shortwave infrared wavelength range. FLAASH is a first-principles atmospheric correction tool that corrects wavelengths in the visible spectrum through the near-infrared and shortwave infrared regions and can correct images collected in either vertical or slant-viewing geometries. The atmospheric correction of the LANDSAT 8 OLI/TIRS images in this study is accomplished using the ENVI 5.1 FLAASH model, which is based on the ‘MODTRAN4 RT’ model (French and Norman 2003).

The maximum likelihood method and GPS field data were used for the supervised classification of the 2016 images. The GPS field information was used to compare validation points to evaluate the accuracy of the determined land use types. The overall classification accuracy was above 90%, with a kappa coefficient of 0.856 or higher. Land use information was obtained on farmlands, forested lands, grasslands, water bodies, built-up land, and unused lands (Fig. 2).

Fig. 2
figure 2

Land use patterns in study area

The areas of each land cover types in the study area are presented in Table 1. According to Table 1, the land use was dominated by forested lands and farmlands, and these land types accounted for 83% or more of the study area. The data regarding the dependence of the land use landscape patterns on the elevation in the study area were obtained through spatial superposition analysis and statistical analysis of the land use maps and elevations. The results showed that cultivated lands, built-up land, and water areas were primarily distributed within the 535~854-m elevation range, and the proportion of the corresponding surface area in the study area was occupied more than 48%. The forested areas were mainly distributed in the 854~3299-m elevation range, and the proportion of the surface area in the study area was 38.47%. The unused lands were mainly distributed in the 854~2209-m elevation range.

Table 1 The areas of each land cover types

Calculation of NDVI

The NDVI is an appropriate indicator for vegetation growth status, biomass, and vegetation coverage (Liang et al. 2012). It ranges from − 1 to 1. Negative values indicate that the ground is covered by clouds, water, or snow, and thus, reflects visible light; 0 indicates the presence of rocks or bare soils, with the near-infrared and red bands being approximate or equal; and the positive values indicate the presence of vegetation cover, with the value increasing with greater coverage. The NDVI was calculated using the following Eq. (2):

$$ \mathrm{NDVI}=\left(\mathrm{NIR}-R\right)/\left(\mathrm{NIR}+R\right) $$
(2)

where the NIR and R are the near-infrared band and red band reflectance values, respectively.

After an atmospheric correction of the results, some pixel values would become negative. This means that outliers beyond [− 1,1] existed among the NDVI values. To facilitate calculations, the background for the NDVI values of these pixel values was set as 0 (Peng et al. 2016). The outliers were eliminated using Band Math in ENVI 5.1, yielding the processed NDVI data: (NDVI lt − 1) × 0 + (NDVI gt 1) × 0 + (NDVI ge − 1 and NDVI le 1) × NDVI

where the lt denotes ‘less than’, gt denotes ‘greater than’, denotes ‘less than or equal to’, and ge denotes ‘greater than or equal to’.

Retrieval of fractional vegetation covers

A pixel binary model was used to determine the fractional vegetation covers (FVC). The NDVI value of a mixed pixel is formed by two components: areas with vegetation cover (NDVIveg) and those with bare soil/no vegetation cover (NDVIsoil) (Zhao 2003). The FVC is calculated using the following Eq. (3):

$$ \mathrm{FVC}=\left(\mathrm{NDVI}-{\mathrm{NDVI}}_{\mathrm{soil}}\right)/\left({\mathrm{NDVI}}_{\mathrm{veg}}-{\mathrm{NDVI}}_{\mathrm{soil}}\right) $$
(3)

where the NDVIveg represents the NDVI values of the pure vegetation cover pixels, and NDVIsoil represents the NDVI value of the bare soil cover pixels.

The NDVI value of a pixel is affected by various factors, including the atmosphere, surface conditions, year, season, and regional conditions. These cause NDVIsoil and NDVIveg to vary over space and time. For the calculation of NDVI values in this study, the confidence interval was set to 5% and 95% of the cumulative probability distribution table for the region’s NDVI values. The NDVI values in the vicinity of 5% and 95%, specifically 0.003794 and 0.773399, were selected as the NDVIsoil and NDVIveg, values, respectively.

Retrieval of the land surface temperature

Currently, the algorithms used for the RS retrieval of LST include the atmospheric correction algorithms (Sobrino et al. 2004; Li et al. 2004; Mao et al. 2005), single-window algorithms (Qin et al. 2001; Jiménez-Muoz et al. 2014; Zhou et al. 2011), and split-window algorithms (Wan and Dozier 1996; Ri et al. 2013). The LANDSAT 8 OLI/TIRS contains two thermal infrared bands: the band10 (10.60~11.19 μm) and band11 (11.50~12.51 μm). The US Geological Survey (USGS) has pointed out that some calibration errors can be found with the band 11. It is recommended to use only the TIRS band 10 in quantitative research rather than using two channels (Gao et al. 2017). As LANDSAT 8 OLI/TIRS 10 data is located at a lower atmospheric absorption band than TIRS11, its atmospheric transmissivity is higher, making it more suitable for single-band inversions of the LST (Jiménez-Muoz et al. 2014; Yu et al. 2014). The main difference between the single-window and single-channel algorithms is that in terms of atmospheric parameters, the former requires both near-surface temperatures and atmospheric moisture content, whereas the latter only requires atmospheric moisture content. Hence, the LANDSAT 8 OLI/TIRS 10 data and atmospheric correction method were adopted in this study for the inversion of the LST.

The atmospheric correction algorithms for calculating the LST were as follows: first, the atmospheric effect on surface thermal radiation was estimated; second, this atmospheric impact was subtracted from the total thermal radiation observed by satellite sensors, and the surface thermal radiation intensity was obtained; finally, the thermal radiation intensity was converted to the corresponding LST.

Land surface emissivity

According to previous studies (Qin et al. 2004; Sobrino et al. 2004), the calculation process for land surface emissivity is as follows: first, the RS image is divided into town, water, and natural surface; second, the land surface emissivity values of the natural surface and towns are calculated based on the FVC. The emissivity value of the water is 0.995 because water continuity is strong and has fewer mixed pixels (Qin et al. 2004); the land surface emissivity values of the natural surface and towns are calculated based on the FVC as follows:

$$ {\varepsilon}_{sf}=0.9625+0.0614\mathrm{FVC}-0.0461{\mathrm{FVC}}^2 $$
(4)
$$ {\varepsilon}_{bd}=0.9589+0.086\mathrm{FVC}-0.0671{\mathrm{FVC}}^2 $$
(5)

where εsf, εbd are the emissivity values of the cities and towns, and natural surfaces, respectively, and FVC is the fractional vegetation cover of a pixel.

The ground radiation brightness

The thermal infrared radiation brightness is calculated as follows:

$$ B\left({T}_s\right)=\left[{L}_{\lambda }-{L}^{\uparrow }-\tau \left(1-\varepsilon \right)\times {L}_{\downarrow}\right]/\tau \times \varepsilon $$
(6)

where the B(TS) is the radiation brightness values in the thermal infrared band of a blackbody (Wm−2sr−1μm−1), Lλ is the thermal infrared radiation brightness value, Lis the upwelling atmospheric radiance (Wm−2sr−1μm−1), L is the down welling atmospheric irradiance divided by π (Wm−2sr−1μm−1), ε is the land surface emissivity, and τ is the total atmospheric transmittance along the path between the land surface and the satellite sensor. The imaging time and the centre latitude and longitude information were obtained from the official website of NASA (http://atmcorr.gsfc.nasa.gov/) and were used to determine the τ, L, and L values as 0.76, 1.79 and 2.98 Wm−2sr−1μm−1, respectively.

LST

The inverse function of the Plank formula and the thermal infrared radiation brightness values was used to calculate the LST (Sospedra et al., 1998). Thermal radiances were converted to corresponding luminance temperatures, according to Planck’s law. The temperature value was calculated in Kelvin (Ts) or Celsius from the above processes, as follows:

$$ {T}_s={k}_2/\left[\ln \frac{k_1}{B\left({T}_s\right)}+1\right] $$
(7)
$$ \mathrm{LST}={k}_2/\left[\ln \frac{k_1}{B\left({T}_s\right)}+1\right]-273.5 $$
(8)

where Ts is the effective at-satellite brightness temperature in Kelvin (K), the LST is the land surface temperature in Celsius (°C), and k1, k2 for LANDSAT 8 OLI/TIRS are constant values of 774.8853 W/(m2. Μm.sr) and 1321.0789 K, respectively.

Calculation of the temperature vegetation dryness index

The TVDI is a method for land surface soil moisture retrieval based on the optical and thermal infrared RS channel data (Sandholt et al. 2002). Therefore, the TVDI was used to estimate the soil moisture in the TZ. If an image contains a large space of soil moisture and green vegetation, the space presents a triangle. The triangle forms because the LST decreases as the vegetation cover increases. The scatter plot between the NDVI and LST is termed as the LST_NDVI space and is closely associated with surface evapotranspiration and moisture Amani et al. (2016).

Any random NDVI value corresponds to a set of dry and wet edges. The maximum LST decreases as the NDVI increases along the x-axis on the dry edge (Sandholt et al. 2002), and the stomatal resistance to evapotranspiration is a key factor, which is partly controlled by the limited moisture availability (Sandholt et al. 2002; Chen et al. 2015). The wet edge consists of a group of points forming a horizontal line. The LST is independent of the NDVI on the wet edge, and for the points closer to the wet edge, the evapotranspiration capacity and soil moisture become higher (Chen et al. 2015).

The computation formula was calculated as follows (Sandholt et al. 2002):

$$ \mathrm{TVDI}=\frac{\mathrm{LST}-{\mathrm{LST}}_{\mathrm{min}}}{{\mathrm{LST}}_{\mathrm{max}}-{\mathrm{LST}}_{\mathrm{min}}} $$
(9)

where the LSTmax and LSTmin represent the maximum and minimum LST values when the NDVI equals a particular value, whereas LST represents the surface temperature of any random pixel.

After the LST_NDVI eigenspace has been simplified to a triangle, LSTmax and LSTmin were concurrently subjected to linear regression:

$$ {\mathrm{LST}}_{\mathrm{min}}={a}_1+{b}_1\times \mathrm{NDVI} $$
(10)
$$ {\mathrm{LST}}_{\mathrm{max}}={a}_2+{b}_2\times \mathrm{NDVI} $$
(11)

where a1,a2 and b1,b2 are the intercepts and slopes of the dry and wet edges, respectively, and the coefficients of the a1,a2 and b1,b2 can be determined by a least squares fit to the actual data, respectively.

Results and discussion

NDVI patterns

The NDVI values for the study area range from 0.00244 to 0.80481, and the overall spatial pattern shows a reduced NDVI from the TZ to plain region. The main land use type in the northwestern TZ region is woodlands, and the vegetation coverage is high. The NDVI values of the forest land and grassland in the mountain area are high. Some of the areas are affected by human activities. The main land use types are construction land, bare land, and water, where vegetation coverage is low. The NDVI values in the eastern plains are low. Hosseini and Saradjian (2011) studied four different soil water content estimation models, NDVI_LST, enhanced vegetation index (EVI)_LST, NDVI_LST_NDWI, and EVI_LST_NDWI; statistically, it was proven that when the EVI was replaced with the NDVI in the model, the accuracy of the soil moisture estimation increased (Fig. 3).

Fig. 3
figure 3

Distribution of NDVI in study area

LST patterns

In summary, there was a significant decline in the LST from the plains to the mountainous areas (Fig. 4). The former is affected by the lower altitude and human activities, which caused higher temperatures. In that regard, a higher altitude may be an important factor for reducing LST, which, when combined with the reduced human activities in the mountainous areas and higher vegetation coverage, resulted in a relatively lower LST. There were abnormal inversion results owing to thick cloud covers.

Fig. 4
figure 4

Distribution of the average LST in study area

The north western region of the study area is located in the TZ, and parts of the areas in the north western region are covered by clouds. As a result, temperatures in these areas are below 5 °C. For areas not affected by cloud cover, a temperature range of 5~10 °C in the mountainous areas from April to May is considered normal. Temperatures in the central mountainous area, at 10~20 °C, are generally higher than those in the northwestern mountainous areas. The main land use type is woodlands, and the vegetation coverage is high. Some of the areas are affected by human activities or a slope direction/gradient, resulting in temperatures above 20 °C.

In the eastern plains, which are predominantly arable lands, the LST values are mostly near 20~30 °C. The central area of the plains is the main urban area, consisting mostly of lands for construction, the LST values there are 30~35 °C. According to the principle of the urban heat island effect, the LST values in cities are higher than those in surrounding areas. The LST even reached 35 °C and above in the middle of the urban area in study area. Field observations indicated the distribution of factories at these areas, and the higher LST values resulted from the industrial developments, emissions, and dense population distribution.

Analysis of the LST_NDVI eigen space

After setting the step size of the NDVI to 0.01, the EXCEL software and function calculations were used to extract the LSTmax and LSTmin values corresponding to the same NDVI value. The spatial scatter diagram of the LST_NDVI eigen space approximated a triangle (Fig. 5), which was similar to that in the description by Sandholt et al. (2002). The relationship between the NDVI and LST is near-linear. According to Fig. 5, the scatter tendency of the LSTmax and LSTmin can be divided into three segments:

Fig. 5
figure 5

LST_NDVI eigen space

When NDVI ≤ 0.15, the scatters for LSTmax did not have a corresponding maximum value, whereas those for LSTmin formed a straight line with a positive gradient;

When 0.15 < NDVI ≤0.6, the scatters for both LSTmax and LSTmin approximated horizontal straight lines with near-zero gradients; and,

When 0.6 < NDVI ≤0.8, the LSTmax and LSTmin lines were both straight, but with negative and positive gradients, respectively.

The plots of LSTmin and LSTmax as a function of NDVI are shown in Fig. 6. There was a strong correlation between the LST and NDVI. The y-axis represents the LST, and the x-axis shows NDVI (Fig. 5). The LST and NDVI relationship slope could be effectively determined during the growing season with a certain level of the NDVI. The derived LST/NDVI slope from the image windows was significantly correlated to the in situ soil moisture (Xin et al. 2006). With the increase in NDVI values, the LST values decreased, and vice versa.

Fig. 6
figure 6

Equations of LST_NDVI eigen space with dry edge and wet edge

The general trend of the dry and wet edges decreases and increases as with the NDVI value, respectively. Thus, LSTmax decreases, but LSTmin increases. To determine the parameters describing the dry and wet edges, the LSTmin and LSTmax values observed for small intervals of the NDVI are extracted in the LST_NDVI eigen space. The equations for the dry and wet edge for the three scenes are shown in Fig. 6.

Spatial variation of TVDI

The TVDIs of the TZ from the plain region to mountains were calculated, as shown in Fig. 7. According to Fig. 7, the spatial pattern variation of TVDI values has a significant reduction from the northwestern mountainous regions to southeastern plains area. The northwestern mountainous region has a high TVDI value, whereas the plain region has a low TVDI value owing to the different FVC values and human activities.

Fig. 7
figure 7

Spatial distribution of TDVI in study area

Comparison of TVDI with in situ measurements

The effectiveness of the TVDI as an index for assessing soil moisture was validated by systematically designed in situ soil moisture measurements for each grid. The obtained TVDI value was compared with the measured soil moisture. Fifty-eight points were randomly selected for in situ soil moisture measurement to ensure that each land cover type had at least five observation points. There is a significant negative correlation between the TVDI and the in situ measured soil moisture (R = 0.710). Linear relationships exist between the soil moisture and the TVDI, NDVI, and LST (Table 2).

Table 2 Regression model of soil moisture

According to Table 2, all P values in the regression model were much less than 0.05. Therefore, the TVDI can reflect the soil moisture status better than the NDVI and LST. Patel et al. (2009) also revealed that a significantly strong and negative relationship exists between the TVDI and in situ soil moisture, particularly when the vegetation cover is sparse.

To verify the regression model accuracy, 38 points of in situ soil moisture measurement that were not involved in the model construction were used. The results show that the TVDI can reflect the soil moisture status. The root-mean-square error (RMSE) was 0.04897, and the P values were far less than 0.01. This proved that the accuracy of the soil moisture inversion was relatively high. The TVDI was also found to be satisfactory in capturing the temporal variation in the soil surface moisture status.

Researchers in China studies have used the TVDI to estimate soil moisture from different RS data and have produced different R2 values (Chen et al. 2015; Bai et al. 2017). For example, in these different RS data, for NOAA-AVHRR images, R2 = 0.23–0.81 (Goward et al. 2002; Wang et al. 2004); for Terra/Aqua MODIS images, R2 = 0.12–0.83 Chen et al. (2012), and for LANDSAT TM images, calculated R2 values is R2 = 0.15–0.80 (Chen et al. 2015) etc. The R2 values between the TVDI and soil moisture are still low in some studies (Chen et al. 2011; Chen et al. 2012). This is owing to scaling effects in the TVDI values retrieved from RS images. In that regard, each pixel in the LANDSAT TM image represents a grid of 30 m × 30 m, but each observation point of the in situ soil moisture measurement represents only a point on the soil surface. Thus, the in situ measured soil moisture cannot ensure a perfect match with the corresponding pixel in the image (Chen et al. 2015).

Spatial distribution of soil moisture

The spatial distribution of the soil moisture content within the study area based on the inversion results is shown in Fig. 8.

Fig. 8
figure 8

Spatial distribution of soil moisture in study area

In terms of the overall distribution of the soil moisture content, the levels were relatively high in the northwestern and central mountainous areas but were relatively low in the southeastern plains. For areas within the TZ, a gradual decline in soil moisture levels was apparent from the plains towards the mountainous areas. This resulted from the general variation in soil moisture levels with altitudinal changes (535~3506 m), as well as the fact that the southeastern region of the study area sits on the plains of Chengdu and comprises predominantly arable lands and lands for construction, making it more affected by human activities. Second, although the northwestern and central regions of the study area are located in the mountainous areas and are subjected to less human impact, they are located on the Longmenshan seismic belt. Consequently, there are frequent mountain hazards, landslides, and debris flows. In addition, the 2008 earthquake had a major impact on the mountainous areas, making it difficult to restore the damaged vegetation cover. Nevertheless, gradual changes in the TZ’s soil moisture levels may still be seen. Finally, the soil moisture content in localised areas was also affected by other factors such as the slope direction and land use type.

Conclusions

In this study, we examined how much of the soil moisture variability was associated with TVDI variability within and across the study area regions. A triangulation method was used to estimate the surface soil moisture. This method combines visible, infrared, and thermal data. The LST and NDVI values were obtained to derive TVDI values for the assessment of surface soil moisture through remotely sensed data and for the establishment of the results with field-measured soil moisture data. The method interpreted the pixel distribution in the LST_NDVI space. A triangle formed because the LST decreased as the vegetation cover increased. The scatter plot between the NDVI and LST is termed as the LST_NDVI space and is closely associated with surface evapotranspiration and moisture.

Our study is valuable in providing a spatially detailed account of where and by how much soil moisture varies and how much of this was driven by TVDI variability in the TZ. To a certain extent, this study supplements existing soil moisture research in a region with unique topographies, i.e. ranging from 535 to 3506 m. The soil moisture inversion of the study area contributes to the observation of changes in moisture levels as the plains transition towards the mountainous areas, and as arable lands transition towards woodlands. The results are of great significance for the monitoring of moisture content in arable lands and in the overall ecological environment.

We found a significant relationship between the TVDI and soil moisture at the observation points. Correlation and regression analyses were conducted to relate the TVDI to the in situ calculated soil moisture. The spatial pattern of the TVDI generally shows a low moisture distribution over the study area. A significant (P < 0.05) negative correlation of r = 0.710 was found between the TVDI and in situ soil moisture. The larger the TVDI value, the drier the soil; conversely, the higher the soil moisture levels, the smaller the TVDI values.

We found that the TVDI was adequate for the temporal variation of surface soil moistures. The TVDI provides a consistent appraisal of the moisture situation. Consequently, it can be used to evaluate wet conditions. The accuracy of the soil water inversion in this study was high. For the study area, this proved the feasibility of using the TVDI for the soil moisture inversion. The soil moisture inversion was carried out at a depth of 20 cm over large areas throughout the study area. In the process, the calculation of the vegetation coverage was of vital importance. For the plains of Chengdu in particular, and the Sichuan Province in general, there have been very few studies with moisture inversions conducted based on the TVDI, nor have results been verified using measured data from field samples.

We thought that the effects of scaling might also have an impact on the accuracy of soil moisture and found that the appraisal of soil moisture using the TVDI was possible at medium spatial resolutions, as the relationship of soil moisture with LST and NDVI leads to an adequate number of representative pixels for developing a triangular scatter plot. The study area consisted predominantly of woodlands and arable lands, leading to high vegetation coverage. The LANDSAT 8 OLI/TIRS data selected for the study not only facilitated the calculation of the TVDI but also fully met the accuracy requirements for the soil moisture inversion. Our approach of using LANDSAT 8 OLI/TIRS data for temperature inversions based on the NDVI proved to be reliable and had a high resolution. However, cloud cover seriously influences the LST retrieval and often limits the applicability of the TVDI for estimating soil moisture on a regional scale.