1 Introduction

The atmosphere of Earth significantly limits astronomical observations beyond the role in sustaining life. The Earth’s atmosphere contains a mixture of several gases, such as N\(_2\), O\(_2\), Ar, CO\(_2\), O\(_3\), etc. and these gases significantly influence the atmospheric absorption characteristics [1]. The water vapor in the atmosphere also contributes to opacity in the infrared spectrum, particularly at mid-infrared and far-infrared wavelengths [2]. Consequently, the monitoring of precipitable water vapor (PWV) along the atmospheric column over multiple years is crucial to identifying optimal sites to establish an observatory which will make observation in microwave, millimeter, submillimeter and infrared regions. The impact of water vapor is prominently observed in the near infrared spectral range, specifically at wavelengths below 3000 nanometers. Figure 1 shows a simulated Earth transmission spectrum generated using the NASA Planetary Spectrum Generator (PSG) tool [3].

Fig. 1
figure 1

Atmospheric transmittance due to gases absorption as a function of wavelength. The Earth’s transmission spectra (from 400nm to 3000nm) were generated using the NASA Planetary Spectrum Generator (PSG) tool. Atmosphere and surface composition as follows surface pressure: 5.8546e-01 bar; Molecular weight: 28.97 g/mol; Gases: H2O; Surface temperature: 275.45K; Albedo: 0.30; Emissivity:0.70

Some factors as astronomical seeing, cloud cover, humidity, wind and water vapor are the decision maker for the site selection for an observatory. Therefore, PWV is also a crucial atmospheric component for infrared (IR) observations [4] and affects the accuracy and reliability of these observations through its absorption and re-emission of infrared radiation. Moreover, the level of atmospheric water vapor is pivotal in shaping the accuracy and quality of infrared (IR) observations [5].

The PWV measured in millimeters or centimeters quantifies the total water vapor from the surface of the Earth to the upper atmosphere, representing the depth of water in a vertical column of unit cross-sectional area if liquefied [6]. The PWV introduces phase errors, excess noise, and coherence loss, which affect the precision of infrared observations [7]. It should be recognized that PWV is influenced both by altitude and geographical latitude [8]. However, water vapor is the main greenhouse gas responsible for the most significant absorption compared to other gases [9]. Additionally, the re-emission of absorbed energy contributes to the sky background, impacting the sensitivity of IR observations [10].

Evaluating atmospheric properties through on-site ground-based meteorological data and formulating extensive observational strategies is essential, but may not be sufficient. Numerous site evaluation studies have been carried out to identify suitable locations for astronomy worldwide [11]. Research on PWV monitoring for both potential and active observatory sites globally, including high altitude sites in Chile [12, 13], Kitt Peak National Observatory (KPNO) [14], Roque de los Muchachos Observatory (ORM) [15], and the Thirty Meter Telescope (TMT) [16], is being conducted.

Astronomical observatories are ideally located a top of high mountains to circumvent the adverse impacts of the Earth’s atmosphere, including atmospheric turbulence, attenuation, scattering, and spectral absorption lines. The DAG (with its new administrative structure and name Türkiye National Observatories) is located at an altitude of 3170 m, which is the highest altitude and the robust infrastructure observatory site in Türkiye, and is planned to serve as an observatory around the world [17]. The aim of this study is to examine the precipitable water vapor for the NIR observations at the DAG and to make higher quality NIR observations.

2 Study area and data

Today, it has become quite common to see the use of various techniques (satellite-based, ground-based, such as GPS meteorology or radiosonde balloons) for different purposes in many different disciplines [18,19,20,21]. The objective of this research is to create a database of PWV data gathered through various methods such as satellite imagery and a radiosonde balloon, with the purpose of comparing these data sets and conducting temporal analyzes.

The DAG site as Türkiye’s the largest observatory site was selected as the locations for this study. First of all, the TPW values from the satellite were compared with the PWV values from radiosonde stations distributed homogeneously throughout Türkiye. The analyses of PWV values from satellite for the DAG site were made using the NWC SAF SPhR product. The PWV data from radiosonde observations for the DAG were also supplied by the Türkiye State Meteorological Service (TSMS). Both limitations in temporal resolution (with observations conducted only at 00:00 UTC and 12:00 UTC, daily) and the atmospheric drift effect of the balloon data from Erzurum meteorological ground station (17095-ERZM), where far away about 14 km, were taken into consideration for the PWV analyses for the DAG site. After the radiosonde balloon launch, it may deviate easily from their initial launch point due to wind or turbulence. These deviations take place typically within a few kilometers in the lower troposphere and also increase with altitude. These drift variations are highly influenced by the seasonal weather conditions of the location [22].

As the preliminary information, it would be appropriate to mention some geographical and climatic characteristics of the DAG site and Türkiye. While Türkiye is located between 26 - 45 degrees east longitude and 36 - 42 degrees north latitudes, the DAG site is located at an altitude of 3170 m on the Karakaya - Erzurum hill, at 39.78 degree north latitude and 41.23 degree east longitude. Besides, Türkiye has an average altitude of 1130 meters above sea level and annual average temperature values range between \(3.6^{\underline{\circ }}\)C and \(20.1^{\underline{\circ }}\)C [23]. The average annual precipitation and duration of sunshine for Türkiye is approximately 648 mm and 6.9 hours per day, respectively. The average global solar radiation in Türkiye is also about 4.2 kWh\(m^{2}\) [24, 25]. The Mediterranean and Aegean coasts of Türkiye and the Marmara region have a Mediterranean climate, a continental climate in the interior, and a black sea climate in the black sea region [26, 27]. Continental climate characteristics are observed in Erzurum especially and winters are long and snowy, whereas summers are short and warm.

Fig. 2
figure 2

The locations of stations in the Türkiye’s radiosonde network and DAG site. As can be seen from the map, RS stations are located to cover the whole country

Türkiye has a radiosonde (RS) observation network with consisting of 11 meteorological stations. The meteorological station located in the city center of Erzurum has been conducting observations since 1929. Figure 2 shows the study areas along with the RS stations located in these regions. Based on observations spanning approximately 90 yearsFootnote 1, the lowest temperature observed was -37.2 \(^{\circ }\)C on December 28, 2002, while the highest temperature recorded was 36.5\(^{\underline{\circ }}\)C on August 11, 2006. According to TSMS records, the annual precipitation is approximately 432.2 mm. The least precipitation falls in the form of snow in winter, with approximately 50 days of snowfall. The snow cover period lasts approximately 114 days. The province receives the most precipitation in spring and summer [28]. The distance between the Türkiye RS stations was about 500 km on average. These RS stations were operated twice a day, 00:00 UTC and 12:00 UTC, as applied throughout the world. One of these stations (ERZM-17095) was located in the DAG near-region (Erzurum city center). The distance between the ERZM RS station and the DAG site was approximately 14 km.

In Table 1, the international station code (ID), city names, latitude, longitude, and elevation information of the ground stations of the TSMS used for validation are provided. These stations contain the distribution of all possible precipitable water vapor values based on various location characteristics (latitude, longitude, altitude) and different climatic zones.

Using python and python’s siphon [29] and metpy [30] modules, RS data were downloaded from the University of Wyoming websiteFootnote 2 and were calculated using the following (1).

$$\begin{aligned} PWV = - \frac{1}{{\rho }_l g} \int _{P_{bottom}}^{P_{top}} r \,dp \end{aligned}$$
(1)

where \(\rho _{l}\) is the density of water, g is the gravitational constant, r is the mixing ratio of water vapor and \(P_{bottom}\), \(P_{top}\) and p are the pressure of air [31].

Table 1 Information of radiosonde stations in Türkiye and Cyprus

An additional source of PWV data comes from meteorological satellites that employ remote sensing techniques. The indispensable role of geostationary satellites in delivering high temporal resolution is recognized. Consequently, PWV measurements sourced from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) on the Meteosat Second Generation (MSG) satellites have been incorporated into the DAG database. This data set is included in the Satellite Application Facility on Support to Nowcasting and Very Short-Range Forecasting (NWC SAF) atmospheric stability collection and has been transmitted through TSMS FTP from early 2019.

The NWC SAF v2013 SEVIRI Physical Retrieval (SPhR) product, now known as iSHAI in v2018, was utilized in the study. This product encompasses subdatasets for the total precipitable water (TPW), layer precipitable water (LPW) vapor content (BL: Precipitable Water in the Boundary Layer, ML: Precipitable water in the middle layer and HL: Precipitable Water in the Upper Layer), along with instability indices (KI: K-Index, LI: Lifted Index and SHW: Showalter Index). BL, ML, and HL represent precipitable water vapor levels at pressure levels of 1000-850 hPa, 850-500 hPa, and 500 hPa to the top of the atmosphere, respectively. An example of a TPW image showing the work area is provided in Fig. 3.

Fig. 3
figure 3

A representative exemplar of the NWC SAF TPW product (map) over Türkiye, captured on 2020-07-29 at 15:00 UTC

Although TPW data are well known and are frequently used in meteorology and atmospheric sciences, this study provides new information for other disciplines such as astronomy, hydrology, and agriculture. Another important point is that the reliability of the product has been demonstrated by a validation study of the TPW data at different points handled in Türkiye.

NWC SAF provides software used to produce a variety of atmospheric data for use in short-term weather forecasts. In Türkiye, TSMS produces atmospheric data for Türkiye and its surroundings using this software. SPhR data are generated by the TSMS and transmitted to DAG over the internet via FTP. The acquired data underwent a series of processing steps, including reprojection, georeferencing, and clipping, utilizing the capabilities of Python’s GDAL library. The Python libraries numpy, pandas, scipy, scikit-learn, cartopy, and matplotlib were used to perform tasks such as image processing, correlation analysis, and regression analysis, as well as to visualize the pixel values of satellite imagery.

The PWV values from the TSMS RS measurements and satellite imagery taken at 00:00 and 12:00 UTC were used to assess the accuracy of the NWC SAF TPW data. This product is produced by TSMS with a temporal resolution of 15 minutes and a spatial resolution of 3 km, covering Türkiye and its surrounding areas. PGE13 SPhR is based on the calculations of Jun for the GOES instruments [32, 33]. The development of PGE13 SPhR aims to achieve functional execution. The calculation focuses on observing the temperate climate and convective storms. The algorithm has only been applied to clear pixels or fields of interest (FOIs) with configurable width. The default number of FOI pixels was set to 3x3 [34].

3 Analysis method

In this study, correlation and regression analysis were used to evaluate the relationship between satellite imagery and radiosonde balloon retrievals. Regression analysis provides the mathematical expression of the relationship between two or more variables, while correlation analysis determines the direction (positive/negative) and strength of the relationship. Python programing language, “scipy.stats” module and “linregress” function were used for analysis in the study. The “linregress” function calculates a linear least squares regression for two sets of measurements and returns the slope of the regression line, the intercept of the regression line, Pearson correlation coefficient, the p-value and the standard errors (for the slope and intercept). The slope of a line indicates its steepness, while the intercept indicates the point at which it intersects an axis. The slope and intercept define the linear relationship between the two variables and can be used to estimate the average rate of change. The p-value is a hypothesis test whose null hypothesis is that the slope is zero. It employs the Wald test with a t distribution of the test statistic. The reader can refer to the link below for more detailsFootnote 3. Since PWV is a quantitative variable, it would be appropriate to use \(R^{2}\) (it is the coefficient of determination and the square of the Pearson correlation coefficient), MBE (mean bias error), MAE (mean absolute error), and RMSE (root mean square error) indexes to interpret the correlation and regression analysis.

Statistical parameters were used to determine the accuracy of the NWC SAF TPW product in the whole of Türkiye and to evaluate the results. Of these parameters, \(R^{2}\), MBE, MAE, and RMSE are calculated using the expressions given in (2345) to (6). In these expressions, n is the total number of data, and \(R_{i}\) and \(\overline{R}\) are the radiosonde PWV data and the average of these data, respectively. Si and S are the TPW data obtained from the satellite, and the average of these data. \(R^{2}\) represents the amount of variance explained by the independent variables in the model. The MBE is the average of the differences between the radiosonde PWV and the TPW values calculated from the satellite data. MBE provides information that the estimate made is higher or lower than the actual values. The MAE gives the average size of errors in a set of estimates without considering their direction. The RMSE is a quadratic error calculation rule that also measures the mean magnitude of the error. For high accuracy between the two quantities, the R2 value should be high (closer to one) and the MBE, MAE, and RMSE values should be low (closer to zero). For high accuracy between the two quantities, the R2 value should be high (closer to one) and the MBE, MAE, and RMSE values should be low (closer to zero). The minimum value of \(R^{2}\) is 0 and the maximum value is 1. In essence, the higher the accuracy of a model in making predictions, the closer its R2 will be to 1. Thus, when \(R^{2}\) is high and the MBE, MAE, and RMSE indices are low, it is decided that there is high agreement between the two quantities (satellite imagery and balloon data).

$$\begin{aligned} RS_{PWV}= & m \times SPhR_{TPW} + b \end{aligned}$$
(2)
$$\begin{aligned} R^{2}= & \frac{[\sum \limits _{i=1}^{n} (R_{i}-\overline{R})(S_{i}-\overline{S})]^{2}}{\sum \limits _{i=1}^{n} (R_{i}-\overline{R})^{2} \sum \limits _{i=1}^{n} (S_{i}-\overline{S})^{2}} \end{aligned}$$
(3)
$$\begin{aligned} MBE= & \frac{1}{n} \sum \limits _{i=1}^{n} (R_{i}-S_{i}) \end{aligned}$$
(4)
$$\begin{aligned} MAE= & \frac{1}{n} \sum \limits _{i=1}^{n} |R_{i}-S_{i}| \end{aligned}$$
(5)
$$\begin{aligned} RMSE= & \sqrt{ \frac{1}{n} \sum \limits _{i=1}^{n} (R_{i}-S_{i})^{2} } \end{aligned}$$
(6)

In this study, the pixel values of the NWC SAF images at the TSMS ground station locations were used for validation, while the pixel values at the DAG site were used for time series analysis. The NWC SAF TPW data has a temporal resolution of 15 minutes due to RS measurements being taken twice daily (at 00:00 and 12:00 UTC). To ensure consistency in measurement times, only the 00:00 and 12:00 UTC measurements of the NWC SAF TPW data were used for validation. This allowed for a spatial and temporal comparison of the two datasets.

4 Results and discussion

Prior to performing the correlation analysis, the statistical information of the dataset compiled from radiosonde and satellite data for the reference stations is presented in Table 2. Upon examining of the Table 2, it is evident that both the night and day time SPhR and RS data exhibit similar values. The analysis of the minimum values reveals that a slightly higher value of 2.19 mm is calculated in the RS day time measurement while the two data sources display similar values. The comparison of the maximum values shows that the SPhR values are 7 mm and 5 mm higher than the RS values for day time and night time, respectively. There is an approximate difference of 1.5 mm between the mean and median values, with the mean values higher than the median values. Although the SPhR mean values at night are lower than the RS mean values, the opposite is observed for day time. The examination of the standard deviation in the table indicates a high calculation for the SPhR daytime, suggesting that the SPhR TPW values are measured in a wider range and larger values during the daytime. This observation is consistent with results from other indices presented in the table.

Table 2 Statistics of all reference station data

The results of the correlation analysis are presented in Table 3, where m and b are slope and intercept of the regression line represent the coefficients between the SPhR and RS PWV values. As far as the examination of the \(R^2\) values in the table was concerned, the lowest coherence is observed at 17095-Erzurum 12:00 UTC (0.83) and 17196-Kayseri 12:00 UTC (0.84) while the highest coherence is found at 17030-Samsun (day and night; 0.95 and 0.94) and 17240-Isparta 00:00 UTC (0.95). The cities of Erzurum and Kayseri stations, which exhibit the lowest correlation, share common characteristics such as being far from sea influence, having high altitudes, and low PWV values. Considering that the MSG satellites (MSG-4 or Meteosat-11) carries the SEVIRI sensor, which produces the SPhR product, and is located at 0 degrees longitude, the farthest radiosonde station from the satellite sub-point is Erzurum. This distance may affect the correlation at the Erzurum station. Stations with high consistency, except Isparta, share common features such as proximity to the sea and high PWV values. \(R^2\) values ranging from 0.83 to 0.95 were calculated for the reference stations. These values indicate a strong statistical relationship, demonstrating the reliability and accuracy of the SPhR TPW data on Türkiye.

Table 3 Statistics of the correlation analysis between SPhR TPW and RS PWV

When examining the MBE values in Table 3, it is observed that they range from -2.07 mm to 2.01 mm. The negative MBE values indicate that the TPW data generally measured higher than the RS data while the positive MBE values indicate that the TPW data generally measured lower than the RS data. The TPW values were predominantly higher than RS PWV values at the 17351-Adana station, as well as at the 17064-İstanbul (day time) and 17196-Kayseri (day time) stations. The MAE and RMSE values in the Table 3 range from 1.17-2.61 mm and 1.57-3.44 mm, respectively. Similar values were observed at other stations. The lowest (best) MAE and RMSE values were recorded at 17240-Isparta at 12:00 UTC, while the highest (worst) correlation was observed at 12:00 UTC at the 17351-Adana station. Higher MAE values were calculated during the day time compared to night time at the İstanbul and Adana stations while higher MAE values were calculated at night time compared to day time at the Samsun, Isparta, and Erzurum stations. In terms of RMSE values, the night time values were higher than the day time values at the Samsun and Isparta stations while day time values were higher than night time values at the other stations. In terms of astronomical observations, especially for night time, reliable accuracy and high temporal resolution PWV data will determine the accuracy and precision of IR observations.

In Fig. 4 (night time) and Fig. 5 (day time), the distribution of SPhR TPW and RS PWV data from the reference stations is presented. All graphs clearly show a consistent distribution of both PWV values. High PWV values were observed at Samsun, Istanbul, and Adana while the relatively low PWV values were recorded at the Erzurum, Kayseri, and Isparta stations. The common characteristic of stations with low PWV values is their high altitude. The Erzurum meteorology station, located at an altitude of 1869 meters, records lower PWV values compared to other regions. The establishment of the DAG in Konaklı/Erzurum at an altitude of 3170 meters offers a notable benefit by reducing PWV values and minimizing negative atmospheric impacts on astronomical observations.

Figure 6 displays the aggregated distribution patterns of all stations throughout the day time and night time, revealing a comparable trend with marginally greater consistency (\(R^2\) = 0.94) observed at night time.

Fig. 4
figure 4

Scatter plots of six reference stations (00:00 UTC). RS and SPhR comparison was made for these stations. The \(R^2\) values range between the lowest 0.9 and the highest 0.95

Fig. 5
figure 5

Scatter plots of six reference stations (12:00 UTC). RS and SPhR comparison was made for these stations. The \(R^2\) values range between the lowest 0.83 and the highest 0.94

Fig. 6
figure 6

Scatter plots of all reference stations at the nighttime (00:00 UTC) and daytime (12:00 UTC). The \(R^2\) values are 0.94 for nighttime and 0.93 for daytime

Given these considerations, the SPhR TPW data, which were validated earlier, demonstrate a strong correlation with the radiosonde measurements. The utilization of TPW data for IR astronomy at the DAG site is expected to be highly beneficial due to their precision, temporal resolution, and accuracy. Consequently, the PWV measurements at the DAG site have been analyzed over time using SPhR TPW data.

In Fig. 7, the time series of SPhR TPW values for the DAG site are presented at 15 minutes intervals, hourly, daily, and monthly. The upper left panel of Fig. 7 displays the instantaneous SPhR TPW values obtained from the 15 minutes images of the DAG site. The hourly median TPW values derived from these 15 minutes values are shown in the upper right panel while the daily median TPW values are presented in the lower left panel, and the monthly median TPW values are shown in the lower right panel of Fig. 7. The 15 minutes TPW pattern for the DAG site is illustrated in Fig. 7. It is evident from the figure that data transmission was interrupted in early 2020, resulting in missing data. This lack of data will have a slight impact on the statistical analysis of the dataset.

As shown in Fig. 7, the mean values of total precipitable water vapor are higher in the summer season compared to the winter season. The dataset for this study only included one winter season, which may explain the relatively high TPW statistics. Future studies with a wider date range are expected to yield lower statistical values. The hourly median TPW data distribution is similar to the previous plot, with higher values observed in summer and lower values in winter. Daily TPW values were calculated taking the median of 15 minutes TPW values, with minimum and maximum values ranging from 2-14 mm. The monthly median TPW values were also calculated from the 15-minute TPW values, showing values varying from about 3-10 mm. The monthly TPW values for DAG generally show a decreasing trend from June to January, followed by an increase in the first seven months of 2020. TPW values decreased from July to the end of the year. In particular, on winter days with low TPW, favorable conditions in other atmospheric parameters such as cloud cover, temperature, and wind speed will enable high-quality astronomical observations.

Fig. 7
figure 7

15-minute (Upper Left), Hourly (Upper Right), Daily (Lower Left) and Monthly (Lower Right) SPhR TPW data over DAG site

The PWV indices relevant to infrared astronomical observations are systematically classified into four distinct categories [2]: (1) PWV < 3 mm, considered good or excellent; (2) 3 \(\le \) PWV < 6 mm, classified as fair or mediocre; (3) 6 \(\le \) PWV < 10 mm, classified as poor; and (4) PWV \(\ge \) 10 mm, considered extremely poor. Low PWV values are conducive to detailed observations, such as direct imaging of exoplanets and optimal planning of observation times during nights with favorable conditions.

The use of precise atmospheric data during the planning phase of astronomical observations is crucial for making accurate decisions. The PWV values in the tables are derived from the values of the closest pixel to the location of the DAG site in all 15 minutes NWC SAF images. During the study period, the mean and median PWV values at the DAG site were approximately 7 mm with the minimum and maximum values calculated as 0.59 mm and 24.12 mm, respectively.

In a separate study, the GNSS meteorology method was used to evaluate precipitable water vapor at the DAG site [35]. Statistical results from the collected data (October 2016 to June 2017) reported an average PWV of 3.2 mm and a median of 2.7 mm. A comparison with these results shows a slight increase in the PWV measurement values in this study. The slight difference in PWV measurements between the studies can be attributed to the different data collection periods; the previous study covered one winter and one summer season using the GNSS meteorology method, whereas this study included one winter and two seasons. Figure 7 shows that PWV measurements are generally higher during the summer months than in the winter months. It is imperative to emphasize that the selection of data collection intervals significantly impacts the research findings and, according to the criteria established by [36], the PWV values at the DAG site are considered suitable for infrared observations.

The statistics of the TPW datasets are as follows: The dataset has a total count of 27,260 images. The recorded minimum value is 0.59 mm, while the maximum value is 24.12 mm. The mean value is 7.10 mm and the median is 7.06 mm. In addition, the standard deviation is 3.14 mm.

Table 4 Statistics of layer precipitable water over the DAG site

This section of the research demonstrates the quantification of near-surface water vapor depletion at the DAG site. In this study, atmospheric data from three distinct layers were gathered using the SPhR product. During the early stages of the study, the reliability of the product was evaluated and shown to be very accurate.

The DAG site is characterized by an average pressure level of 700 hPa. Based on the atmospheric conditions of the DAG site, a significant amount of water vapor that has accumulated within a vertical column in the atmosphere would expected to undergo removal. Figures 7 and Table 4 provide information on the current situation. The distribution of all precipitable water vapor values was obtained from 15 minutes satellite images.

Fig. 8
figure 8

TPW values of different pixels over the DAG site. Distribution of precipitable water layer data on the DAG site across the entire dataset (June 2019 - Dec 2020)

During the study period, a value of 0.29 mm was observed in BL images in 16 instances out of a total of 50932 satellite images. This recurring value may be attributed to the measurement error and can be excluded from the analysis. From this perspective, it can be conclusively asserted that the DAG site has been shielded from the influence of the planetary boundary layer with respect to water vapor. The analysis of the red color distribution in the graph and the “HL” column data from the table indicates that the air mass above 500 hPa over the DAG telescope has an average precipitable water vapor of 0.48 mm.

The observation plans for astronomical observations should be made when low values of precipitable water vapor are measured at a site, as the PWV values above 10 mm are not optimal for astronomical observations. During the dry season, it is observed that both lower and upper level water vapor values are relatively low when the vertical distribution of water vapor is analyzed at the DAG site. The primary contribution to precipitable water vapor that could impact astronomical observations is found in the middle layer of the atmosphere, particularly between 700-500 hPa.

The DAG is established on the Karakaya Hills in an area with many hills and mountain range. Because the DAG telescope will observe at a zenith angle of 0-70 degrees, the air masses around the site remain in the field of view of the telescope. In terms of terrain structure, meteorological conditions can vary significantly within a few kilometers. In order to understand the severity of changes in these meteorological parameters, the site must be evaluated together with its environment.

The previous investigations at the DAG site included a vertical analysis of water vapor. In order to examine the PWV for all directions, we need that the water vapor concentrations how vary spatially within the DAG site. The results of these types of analysis are presented in Fig. 8 and Table 5. The graphs display the Total Precipitable Water (TPW) values of a single pixel (1x1) at the location of the DAG telescope as well as the values of the nine surrounding pixels (TPW [3x3 mean]) and their differences.

In the eastern regions of Türkiye, the SEVIRI sensor reaches approximately 5 km pixel resolution. Therefore, the area is covered by a single pixel of size approximately 25 \(km^{2}\). Furthermore, the total area covered by nine pixels approximates 225 \(km^{2}\) (15 km by 15 km).

At this stage of the study, the PWV values of the DAG site and its surroundings were compared. The plot graphs of this comparison are shown in Fig. 8. In the figure, the black line plot shows the value of a single pixel, the gray line plot shows the value of 3x3 pixels, and the red line plot shows their differences.

Table 5 Statistics of layer precipitable water vapor over Türkiye

The mean precipitable water vapor measurements indicate values of 7.05 mm for the 1x1 pixel and 7.61 mm for the 3x3 pixel region. A comparable disparity was also observed in the median measurements. Although the minimum values recorded for both areas were identical at 0.59 mm, a 0.35 mm discrepancy was observed in the maximum values. In the datasets depicted in the graphical representations and tabular format, it becomes apparent that the precision of site selection surpasses that of the adjacent regions on the hill where the observatory is located.

Figure 9, the precipitable water vapor values for the three different layers (boundary layer, mid layer, and total) are displayed as maps. The maps represent Türkiye’s nighttime PWV as only nighttime values were used in producing them, because astronomical observations were conducted at night.

Table 5 presents the statistics of the PWV values for both the sea level and the mountain regions. The HL values are not comparable to other layer values, so they are not displayed as a map. This inconsistency is similarly observable in the table. Upon jointly reviewing the table and maps, it becomes clear that there is a marked disparity in the distribution of water vapor between the sea level and the mountainous areas of Türkiye. In addition, attributed to the modulation of altitude in precipitable water vapor and the elevated terrain in eastern Türkiye, it is manifest that the PWV values exhibit a decreasing trend from the western to the eastern regions of the country.

Fig. 9
figure 9

Three different layers PWV maps over Türkiye

Upon the examination of the boundary layer (BL) map depicted in Fig. 9, it becomes evident that elevated concentrations of water vapor predominantly characterize the coastal areas of Türkiye at sea level. In contrast, the central and southeastern territories of Türkiye display diminished Precipitable Water Vapor values, with the PWV metrics registering as nil in the Mediterranean and Eastern Anatolia regions, attributable to the elevated altitudes. Further examination of the Mixing Layer (ML) map reveals that regions situated at higher altitudes manifest lower PWV values relative to other areas. In contrast, PWV values at sea level, particularly along the coastal zones of Türkiye, are comparatively low.

In the middle of Fig. 9, lakes and plains are identified as areas with high PWV values on the ML map. The TPW map, which shows the total PWV values in all layers, serves as the final PWV map of Türkiye during the study period. The map patterns for the other layers are also shown in this map. It is evident when all maps are taken into account that PWV values tend to accumulate at lower levels of the atmosphere.

In the DAG site, the lowest PWV value (equal to zero) is in the boundary layer (BL), with similar PWV values (ranging between 0.07 mm and 0.48 mm) in the upper layers. The eastern regions of Türkiye, particularly mountainous areas, typically have lower PWV values. This is because humid air releases moisture as it ascends, leading to decreased PWV values, which signify lower humidity and drier weather conditions. Analysis of the maps in Fig. 9 reveals that settlements situated at higher altitudes, such as DAG, have significantly lower PWV levels compared to those at sea level. Therefore, it is highly consistent with statistical data that the selection of observatory sites in these areas is preferred. As a result, it can be inferred that conducting highly sensitive activities such as astronomical infrared observations at higher altitudes would result in less atmospheric interference. It is evident that the mean PWV value observed at the DAG observatory is less than 10 mm in all layers.

5 Conclusion

In this research, we presented our initial findings on the PWV content at the DAG site to enhance our understanding of the site’s atmospheric characteristics, especially. DAG, which houses Türkiye’s largest telescope with a diameter of 4 meters and is the first near-infrared (NIR) telescope in the country, is planing to achieve first light in late summer in 2024.

This research aimed to explore the impact of water vapor on infrared astronomical observations using DAG telescopes. The effective monitoring of PWV would result in the generation of higher-quality astronomical observation products in the IR region and a reduction in the operating cost of the devices used at the site. In this regard, PWV values have been analyzed in three parts. The initial assessment focused on evaluating the accuracy of the satellite imagery data used. This step is crucial to ensure that subsequent analyses, such as temporal and spatial evaluations, are based on accurate and reliable data. Subsequently, a temporal analysis was performed on the DAG site using satellite-derived data. Lastly, the study conducted a spatial analysis of PWV data values across Türkiye, considering stratification into atmospheric layers (low, medium, and high). The main findings of the study are listed below.

  • The TPW dataset was initially validated at six radiosonde stations in Türkiye. The validation results showed that the SPhR TPW data with a 15 minutes temporal resolution had a strong relationship (\(R^2 > 80 \)) with radiosonde data at both day time and night time.

  • The correlation analysis yielded \(R^2\) values between 0.83-0.95 for day and night time for the reference stations considered. This is indicate a strong statistical relationship and the reliability of the SPhR TPW data across Türkiye.

  • The mean bias error values between the SPhR TPW and radiosonde data vary ranged from -2.07 mm to 2.01 mm. This means that the SPhR TPW data measured on average about 2 mm of small or large. This measurement error is a relatively acceptable amount. Also The SPhR TPW data generally showed lower values compared to the radiosonde measurements.

  • Temporal analysis of SPhR TPW data was performed for the DAG site from June 2019 to 2020. During the study period, the mean and median TPW values at the DAG site were approximately 7 mm and minimum and maximum values were calculated as 0.59 mm and 24.12 mm.

  • Additionally, higher TPW values were observed in the DAG area during the summer months, while lower TPW values were observed in the winter months.

  • Monthly median TPW values at the DAG site varied between approximately 3-10 mm, with a decreasing trend from June to January and an increase in the first seven months of 2020.

  • Spatial and vertical analysis of precipitable water vapor data was conducted over DAG site and Türkiye from SPhR data. The results show that the values of the DAG site remain low compared with those of its surroundings.

  • The lower and upper-level water vapor values are quite low and the main contribution comes from the middle layer of the atmosphere especially 700-500 hPa.