1 Introduction

Climate change, which has resulted in high temperatures, precipitation, floods, and global sea-level rise, is the most serious issue that has emerged in recent decades (Khan et al. 2010; Swain et al. 2020). Severe weather conditions are prevalent over the world, causing flooding and significant infrastructure damage (Seyyedi et al. 2014; Awais et al. 2021). So, there is a need for a long-term and accurate precipitation dataset to examine the climate simulation model and other hydrological studies (Moazami et al. 2013; Tapiador et al. 2017; Lu and Yong 2018; Ricciardelli et al. 2018). Most essential phenomena, including ground-water recharge assessment, and flood and drought monitoring, need precipitation as input parameters, indicating its importance in water resources (Behrangi et al. 2011; Su et al. 2011; Jiang et al. 2012; Bahrawi et al. 2021). Precipitation estimate has traditionally been done using on-ground rain gauges, which give real-time details regarding rainfall occurrence but not geographically dispersed data (Artan et al. 2007; Alharthi et al. 2020). Long-term and continuous data gauge monitoring systems will also necessitate a significant financial and technological commitment. Rainfall quantification is difficult to estimate because of unequal distribution or geographical inhomogeneity, especially when data collecting by the ground station is scarce (Gu et al. 2010; Camberlin et al. 2019). Rainstorms also have a lot of spatial unevenness in dry and semi-arid environments, especially when the rainfall intensity is strong (Habib and Nasrollahi 2009; Farran et al. 2021). As a result, sensor-based precipitation estimates are becoming increasingly promising in recent years, with satellite sensors dominating because of their greater regional coverage and temporal precision (Maggioni and Massari 2018; Hussain et al. 2022; Bahrawi and Elhag 2020). The data available from various sources are currently being employed to understand precipitation patterns on a global and regional scale (Collischonn et al. 2008; Poinsot 2020). The advantages of satellite-based measurements include non-disturbed coverage in areas where ground-based gauges cannot install, specifically in the hilly and oceanic regions (De Coning 2013; Gado et al. 2017; Awais et al. 2022). Several satellites, including the Climate Prediction Center morphing technique (CMORPH), the Tropical Rainfall Measuring Mission (TRMM), the Multi-satellite Precipitation Analysis (TMPA), the Global Precipitation Measurement (GPM), and the Precipitation Estimation from Remotely Sensed Information Using Artificial Neural Networks (PERSIANN), have been launched to cover global climatic conditions (Kummerow et al. 1998; Joyce et al. 2004; Elhag et al. 2021b). Although the data obtained from these satellite products were good, they were flawed owing to random and systematic errors mostly connected to their retrieval technique and observation; therefore, they must constantly be compared to ground-based real-time data for adjustments (Segond et al. 2007; Habib et al. 2014; Fang et al. 2015; Vu et al. 2018). TRMM satellite was launched in 1997 to estimate moderate to heavy rainfall and has since gathered extensive measurements through its precipitation radar (P.R) system to provide an understanding of precipitation distribution. This space-borne satellite is capable of covering high-resolution vertical precipitation profiles from global to ground scale (Kozu et al. 2001; Gupta et al. 2021). Saudi Arabia has a land area of roughly 2,250,000 km2, necessitating a large number of rainfall measuring locations. However, current low-density rain gauges across the country are insufficient to get an accurate spatial distribution of rain (Almazroui 2011; Elhag et al. 2021a). Consistently altering land use and high rainfall intensities can swiftly generate flash floods during or after the rainstorm, causing significant infrastructure damage (Abera et al. 2016). Numerous researches were conducted to evaluate the performance of rainfall estimation by different satellite sensors. Choubin et al. (2019) constructed a BIAS correction framework for TRMM Satellite in which a correction factor has been established for gauged and ungauged pixels providing satisfactory results. Gauged pixels gain 57% BIAS correction while ungauged got the value of 25%, indicating significant improvement in remotely sensed precipitation estimation (Choubin et al. 2019). Dinku et al. (2010) investigated the validation of satellite rainfall for several locations in Colombia and found that the PERSIAN and GSMaP-MVK products, respectively, overestimated and underestimated their values (Dinku et al. 2010). Jiang et al. (2018) compared two high-resolution satellites, TRMM and CMORPH, for estimating precipitation over Shanghai. They found that CMORPH is more accurate than TRMM, but that it underestimates readings in particular intensity ranges (Jiang et al. 2018). Likewise, Almazroui (2011) presented a study on the calibration of TRMM data in contrast with rain gauges for the period of 10 years (1998–2009) on KSA which indicated that TRMM and rain gauge data had a higher level of significance. Precipitation measured by satellite is 92% accurate which showed an error of 8% in its measurement, so for correction of TRMM value correction factor of 0.93 has been defined in that article (Almazroui 2011). Haile et al. (2011) conducted a study and found that the CMORPH satellite has great capability for finding high spatio-temporal (8 × 8 km2 — daily) rainfall (Haile et al. 2015). Mahmoud et al. (2018) conducted a study on Integrated Multi-satellite Retrievals for Global Precipitation Measurement (GPM) mission (IMERG) and analysis precipitation data on the region of Saudi Arabia. He tested the product on early, late, and final rainfall runs and discovered that it provided correct results in certain locations but inaccurate results in others. Following these evaluations, we can see that some tweaks or corrections to remotely sensed data are required in order to get more accurate precipitation data for TRMM (Mahmoud et al. 2018).

The major emphasis of this study was on the evaluation and accuracy assessment of satellite-measured precipitation (TRMM 3B43) for both station and regional levels. The novelty of this work is that it uses in-depth several error measures, namely, relative BIAS (R.B.), root-mean-square error (RMSE), mean absolute error (MAE), correlation coefficient (CC), and coefficient of determination (R2) on regional and station scales to evaluate the TRMM performance throughout Saudi Arabia. Furthermore, the limitations that result in larger R.B. and low correlation values were also analyzed. This study will provide a platform for arid-region researchers to better comprehend the quality of remotely sensed data on a larger scale and identify the restrictions that lead to low satellite data accuracy and serve as a foundation for object-oriented activities such as flood monitoring, drought management, and other hydrological processes.

2 Material and methods

2.1 Study area

The research covered the whole KSA from 34°32′00″ to 55°40′00″ in latitude and from 16°22′00″ to 32°15′00″ in longitude as represented in Fig. 1. Based on administrative division, KSA is subdivided into thirteen regions: Madinah, Makkah, Riyadh, Qassim, Tabuk, Asir, Jazan, Najran, Al-Jouf, Hail, Al-Baha, and Eastern and Northern Region as shown in Table 1.

Fig. 1
figure 1

Location of the study area and the administrative regions of the Kingdom of Saudi Arabia

Table 1 Rain gauges considered in the study

The temperature is high throughout the year in the most remote portions of the Kingdom, with moderate winds, but meteorological features vary greatly from season to season. Heavy rainstorms occur just a few days a year in some sections of the state, making it the driest in the world. The majority of the land is made up of desert impermeable sections that have a limited ability to absorb precipitation and can result in flash floods even with ordinary rainfall (Abdullah and Al-Mazroui 1998; Bahrawi et al. 2020). Most of the rainfall events occur in summer, which mainly causes the northward advancement of the southwesterly monsoon and results in nearly 300 mm, which is 60% of the annual occurrence from October to March (ŞEN and Al-Suba’i 2002).

The majority of precipitation falls on the high topographical regions and mountainous slopes, whereas rainfall in the rest of the country is modest and uneven, with high intensities and short durations. Rainfall variability patterns locally or regionally based on the resulting impact of El Niño Southern Oscillation (ENSO), the North Atlantic Oscillation (NAO) patterns (Almazroui 2011; Hasanean and Almazroui 2015). The average annual rainfall in most arid regions is less than 100 mm, whereas mountainous places receive more than 300 mm. (Hag-elsafi and El-Tayib 2016).

2.2 Data acquisition

2.2.1 Rain gauge dataset

Rain gauges are considered the reference for precipitation measures because of their provision for real-time rainfall estimation. In this research, data from all thirteen regions have been collected from the Ministry of Environment, Water and Agriculture (MEWA) Saudi Arabia, for 5 years from January 1, 2008, to December 31, 2012 (website: https://www.mewa.gov.sa/en/Pages/default.aspx, accessed on March 2020). Twenty-eight regional stations have been selected in thirteen regions for data collection covering the whole KSA (see Table 1). The 28 stations were taken under consideration for data comparison and evaluation. The graphical representation of station data for each year was prepared by intensity distribution weightage (IDW) in GIS to understand the rainfall distribution pattern visually (see Fig. 2).

Fig. 2
figure 2

Sample representation of station rainfall distribution pattern of January, April, August, and December for the year 2011 over KSA by inverse distance weighting (IDW)

2.2.2 Satellite dataset

Tropical rainfall measuring mission (TRMM-3B43) satellites were used to acquire precipitation datasets for this study. The satellite recorded precipitation with an extent of 50 S to 50 N, with a spatial resolution of 0.25° (Liu et al. 2012). TRMM measured rainfall using a variety of sensors, including visual, infrared, and microwave imagers (see Table 2). The sensors worked together to offer a more accurate rainfall estimate (Kummerow et al. 2000; Elhag and Yilmaz 2021). TRMM gives a greater spatial extent of rainfall distribution for the kingdom than point data provided by ground stations (see Fig. 3). Satellite data was accessible in near real-time (3 h), with the daily and monthly temporal resolution, allowing accurate rainfall monitoring. Precipitation data were obtained from the online data source of NASA for the period of 5 years 2008–2012 (website: https://disc.gsfc.nasa.gov, accessed on March 2020).

Table 2 Typical information of TRMM instruments
Fig. 3
figure 3

Sample representation of TRMM rainfall distribution pattern of January, April, August, and December for the year 2011 over KSA

2.3 Data processing

Several pre-processing process steps were involved prior to the evaluation of the TRMM and ground datasets. The process begins with the acquisition of gauge and satellite data from their respective sources. Different procedures were adopted in Geographical Information System (GIS) ArcMap 10.5 to enable our satellite data for further interpretation (see Fig. 4). The data downloaded from the NASA website was in.nc4 file format, which needs to be converted in a more accessible processing form; therefore, data is then converted to Geotiff files by python designed tool (SMD) in ArcMap environment.

Fig. 4
figure 4

Evaluation methodology flowchart

Ground control points (GCPs) were selected in order to extract TRMM pixel values from raster images. Station coordinates were obtained in Universal Transverse Mercator (UTM) format from decimal degrees for working convenience using UTM converter (http://www.utmconverter.com). A coordinate matching process (CMP) was applied to the gauge points in order to ensure exact value retrieval. Following the abstraction of rainfall values, a correlation to TRMM monthly rainfall with ground data was constructed on a regional and individual station basis.

2.4 Data comparison and evaluation

The study’s comparison stage was built on the foundation of the station and regional assessments. In the second stage of the investigation, individual station data was reviewed to establish TRMM’s overall dependability. There were four main statistical parameters used to evaluate TRMM dataset from which the first three parameters, including root-mean-squared error (RMSE), mean absolute error (MAE), and relative BIAS (R.B.), were described as the error within the satellite data. In contrast, the second group containing correlation coefficient and regression provides information regarding the extent of agreement between both data (see Table 3).

Table 3 Statistical measures

3 Results and discussion

The current research is based on comparing TRMM satellite rainfall data to ground-based measurements. Most of the study area got much lower rainfall throughout the years, indicating it was an arid environment. The maximum rainfall observed by the gauges and satellite during the whole study period was 117 and 120 mm/month, respectively, and there were lots of months in each region when no precipitation occurred. As per ground observations, the southern region, particularly Abha, had the highest average rainfall, while the north-western area like Arar, Jawf, and Tabuk had the lowest throughout the years. The satellite spatio-temporal pattern was also found to be similar to the ground observation spatio-temporal pattern for almost all regions (Figs. 2 and 3).

Most rainfall events occurred in winter (December–January) and summer (April–May) with higher intensities. Figure 5 shows a sample representation of the time series of Abha and Riyadh stations to demonstrate the similarities and dissimilarities between the two data sets. Visually, the figure shows relatively reasonable similarities.

Fig. 5
figure 5

Sample representation of the time series of the rainfall of the satellite and gauge data for Abha and Riyadh stations

3.1 Regional evaluation

The study was conducted to evaluate the performance of TRMM satellite rainfall for 5 years (2008–2012) on a regional scale. Statistical parameters were applied to test the reliability of satellite datasets over the Kingdom.

Regression analysis was applied to each region in order to assess the degree of agreement between stations within each region (see Fig. 6), starting with the southern part, which includes Asir, Jazan, and Najran. The behavior is similar, with R2 values of 0.67 and 0.60 for Asir and Najran, respectively; however, it drops for Jazan to 0.40. Makkah and the Al-Baha area, both of which are located in the south-western part of the Kingdom, have regression coefficients (R2) of 0.61 and 0.51, respectively. Similarly, the eastern area comprises five stations (Hafar Al-Batin, Qaisumah, Dammam, Al-Ahsa, and Dhahran) and shows an average R2 of 0.58. The performance of the Madinah province is superior, as seen by the region’s R2 of 0.69. Moving toward the north-western part of the country, three areas can be found: Tabuk, Hail, and Al-jouf, which provide values of R2 equal to 0.80, 0.69, and 0.59, respectively.

Fig. 6
figure 6

Scatter plot and regression analysis of rainfall data of the stations within each administrative region of KSA

The value of R2 in the northern part of the Kingdom (Northern Border) was up to 0.55, suggesting fair performance. However, Qassim, which is close by, did not have a significant relationship and gained a value of 0.29 for the recorded years. The correlation was found better in the North (Tabuk, Hail, Madinah) and Asir regions. However, Riyadh and Qassim, showed a very low correlation (less than 0.50), indicating low assessment accuracy in these areas, while a moderate correlation was found in the eastern and north-eastern regions (see Fig. 7a).

Fig. 7
figure 7

Administrative region-based spatial distribution of the statistical performance measure between rainfall ground stations and TRMM: a CC, b RMSE, c MAE, and d BIAS

In Makkah, Jazan, and Qassim regions, the RMSE revealed a higher degree of error, suggesting a very low satellite accuracy, with errors of 13.16 mm, 14 mm, and 17.34 mm respectively. Whereas the Riyadh, Eastern, and Asir provinces did better than the previous regions, their inaccuracy ranged from 9.44 to 9.62 mm. Al-Baha and Najran did well on the south-western and southern sides, whereas Madinah performed modestly on the western side, with an inaccuracy of 6.79 to 7.69 mm, respectively. The northern and northwestern areas of the Kingdom had substantially lower degrees of RMSE, which did not surpass 5 mm. Satellite measurements did well in most portions of Saudi Arabia except in Qassim, Jazan, and Makkah (see Fig. 7b). Maximum MAE values were found in Makkah, Qasim, and Jazan, ranging from 6.24 to 6.72 mm. Meanwhile, moderate errors of 4.04 and 5.06 mm were observed in Riyadh and Asir, indicating poor satellite performance in these locations. The Northern, North-Western, and Eastern Regions, on the other hand, significantly improved MAE from 2.18 to 3.60 mm, except for western sections and Qassim, where MAE was found to be moderately low (see Fig. 7c). For each region, the R.B. was determined (see Fig. 7d). The findings revealed that the satellite overestimated rainfall up to 52.38% in Madinah, Tabuk, Hail, Al-jouf, and Qassim. Similarly, in several areas, such as the Eastern region, Jazan, Al-Baha, and Riyadh overstated values by more than 40%. The northern border and Najran, on the other hand, were likewise underestimated, although only by a factor of 10%

3.2 Station-based evaluation

Statistical evaluation was applied to each station separately to evaluate the satellite response at the station scale. Interpolation was done using the inverse distance weighting (IDW) approach in Geographic Information Systems (GIS) software to assess the variability of the statistical measures spatially. The functional relation between the satellite and the ground station was investigated using the statistical metrics such as the R-squared, correlation coefficient, RMSE, MAE, and R.B. (see Fig. 8a-d). According to station-based regression analysis, six stations across the Kingdom had values equal to and more than 0.80, in some regions like the eastern, southern, and south-western. Meanwhile, some regions were found to have moderate c ranges between 0.60 and 0.80, and most of them are situated on the southern, western, and northern-western sides (Elhag et al. 2020). Numerous stations in the research area had poor satellite performance (R2 < 0.60), with Wadi-Dawasir and Qaisumah stations having the least regression of 0.20 and 0.28, respectively, over the whole study period (see Table 4 for detailed values of R2 for each station with regression equations). The correlation coefficient (CC) was found to be high in the southern, eastern, and north-western stations from which five stations (Bisha, Turaif, Dhahran, Damam, and Al-Ahsa) with CC > 0.9. Similarly, nine stations, the majority of which are located in the Makkah, Madinah, and Asir areas, have a CC ranges between 0.80 and 0.90. Southwestern, north-western, and two stations from the northern border have CC ranges between 0.60 and 0.80 indicating moderate correlation. Sharorah, Wadi Dawasir, Qaisumah, Qurayyat, and Qassim stations provide a lower correlation (less than 0.6). However, overall, most of the observing sites showed an acceptable correlation between satellite and ground observation. In station-based evaluation, RMSE displayed a higher degree of error in Makkah, Qassim, Jazan, and Qaisumah stations, ranging between 13.57 and 18.35 mm. Meanwhile, nine stations in the Kingdom exhibited moderate error (12.78–8.33 mm) from which Abha, Jeddah, Wajh, and Taif were dominated with a higher degree of error (up to 10.3 mm), and the rest of the stations has a minor error (RMSE = 8.33–9.87 mm).

Fig. 8
figure 8

Station-based spatial distribution statistical performance measures between rainfall ground stations and TRMM: a CC, b RMSE, c MAE, and d BIAS

Table 4 Statistical indices for each station (R2, RMSE, MAE, %BIAS, and CC)

In general, the central (Qassim), southern (Jazan, Abha), and south-western (Makkah) stations of the country have higher values of RMSE. Mean absolute error (MAE) results were found in most of the stations (21) less than 5 mm except in the south-western part (Makkah), southern part (Jazan, Abha), and the central part (Qassim) region where values range between 5 and 8.7 mm. Likewise, the R.B. results depicted that the TRMM satellite provides under and over-estimation for the whole Kingdom. There were sixteen stations with underestimated values, of which 12 had R.B. values of more than 25%, and the majority of them were located in the north to the north-western part of the country; nevertheless, Bisha, Sharorah, and Jeddah had the same behavior but on the southern side. Taif, Makkah, Al-Ahsa, and Qurayyat all had lower R.B. values below 25%.

The south and south-western stations mainly overestimated the precipitation, and most of them range between 25 and 50% except Abha, which touches the values of 68%. The maximum fluctuation was noticed in the eastern region, where R.B. went to 94% in Hafar Al-Batin. There were some stations (8) that showed an underestimation of less than − 20%, from which Makkah, Wadi-Dawasir, Qurayyat, Rafha, and Turaif reached an overestimation of less than 10%. Overestimation and underestimation were dominating for the stations.

Remotely sensed data were examined throughout the research to determine the causes of stations’ and regions’ poor performance (Elhag et al. 2017). It was detected that the elevation had a relationship with the value of R2 during the studied period. Low elevations up to about 261 m and elevations between 786 and 2102 m have a relatively high value of R2 = 0.7 on average. However, from elevations 358 to 768 m, there is a relatively low R2 = 0.25 on average except at elevation = 639 m. The values of R2 versus the elevations are presented in Fig. 9.

Fig. 9
figure 9

The variability of R2 with the elevation of the rain gauges

The high R2 with elevated areas using the sensors of the TRMM satellite has been reported in the literature by Xiaojun et al. (2021). However, they claimed that low R2 is reported in plain areas which does not match our results (Elhag 2016). Overall, for the entire study period in the whole Kingdom, regression analysis provided R2 = 0.56 indicating an average approximation of TRMM satellite with ground-based data (see Fig. 10). Based on the statistical parameterization obtained from the TRMM rainfall data, it is recommended to use these results for hydrological processes like drought estimation using monthly rainfall depth because of its better spatial distribution and average performance over Saudi Arabia. For future research, it is recommended that this work could be extended with higher temporal resolution and detection likelihood for extreme events be taken into consideration and suggested to be extended with longer duration and dense observatories (Psilovikos and Elhag 2013).

Fig. 10
figure 10

Scatter plot and regression analysis of monthly rainfall from the study stations

4 Summary and conclusions

This research mainly focused on evaluating the Tropical Rainfall Measuring Mission (TRMM) satellite by taking rain gauge observation as a reference over the entire Kingdom of Saudi Arabia for 5 years from January 2008 to December 2012 on a monthly time scale. The evaluation was performed using statistical performance indicators, namely, R2, RMSE, MAE, R.B, and CC. There were two major components in the evaluation, namely, regional-based and station-based. The regional-based evaluation indicated the performance versus ground station data on a regional scale (13 regions in KSA), whereas the station-based examination provided information about each station (28 stations). Regression and correlation analysis provided a functional relationship between both datasets that have been developed and presented in Table 4. The highest R2 = 0.92 for the Dhahran station and the lowest R2 = 0.2 for Wadi Dawasir station. RMSE and MAE are the error indicators that showed high values for Makkah (RMSE = 18.4 mm, MAE = 8.03 mm), Jazan (RMSE = 14 mm, MAE = 8.72 mm), and Qassim (RMSE = 17.3 mm, MAE = 6.8 mm), while the minimum values were at Tabuk (RMSE = 4.3 mm, MAE = 1.76 mm) and Hail (RMSE = 4.5 mm, MAE = 2.23 mm). Likewise, R.B. indicated overestimation and underestimation between ground and satellite data. R.B. showed a higher percentage for most of the regions. The maximum overestimation occurred at Hafar Al-Batin (R.B. = 94.6%), while the minimum overestimation occurred at Wadi-Dawasir (R.B. = 7.5%). However, the maximum underestimation occurred at Jeddah (R.B. =  − 52.8%), while the minimum underestimation occurred at Qurayyat (R.B. =  − 5.9%).

The station-based evaluation was also gone through the same statistical analysis. Regression and correlation indices showed excellent behavior in the stations of Asir and the Eastern region, while moderate behavior was found in southern and southwestern observatories and lowest at mixed locations in most of the southern and northern sides. Similarly, higher R.B. was observed for most southern and eastern parts, while lower at north and south-western stations. Overall, the results indicated a moderate behavior of the TRMM satellite to observe rain over Saudi Arabia. However, more promising studies for the Kingdom with higher ground observations and longer periods are recommended to explore further satellite performance.