1 Introduction

In general, hydrological events with extreme occurrences mostly induced by climate fluctuations have social and economic effects on community and, thus has long been the continuing topic of research, considered in a wide spectrum of scientific communities ranging from climatology to public health (e.g., among others, Kalaycı and Kahya 1998; Şarlak et al. 2009; Kaya et al. 2012; Meral et al. 2014 and 2015). There is a need for accurate and consistent data to make climate assessments and make accurate predictions and modeling. It is noteworthy that inhomogeneous data sets do not give reliable results in statistical analyses; therefore, it is necessary to ensure the homogeneity of time series through some statistical methods, combined with metadata, which is pertinent to inhomogeneous data sets. Metadata is a historical information in a station including the state of observation, providing important asset in data quality to be used in homogeneity studies. The best way to maintain homogeneity is keeping gauging stations in a required environment and the measuring system under proper management (Demircan et al. 2015). Availability of long-term, homogeneous, and continuous precipitation series is always essential for climate and hydrologic studies. Nevertheless, precipitation data may suffer from inhomogeneity, owing to various non-climatic factors. The observational data network across Turkey is under continuous development; however, at the same time, it is prone to the effects of urbanization and changes in land-use conditions (Kömüşcü 2010). Homogeneity tests of time series can be examined under two headings as absolute method and relative method. In the former, the test is separately applied for each station as the reference stations are involved in testing in the latter (Wijngaard et al. 2003).

Hanssen-Bauer and Førland (1994) conducted a homogeneity analysis of rainfall data from 165 rainfall stations in Norway using the standard normal homogeneity test (SNHT). They found that 50 of the 165 stations were homogeneous in their study. Tarhule and Woo (1998) studied the alterations in precipitation characteristics in northern Nigeria using the Pettitt test. Mihajlovic (2006) analyzed the homogeneity of the monthly total rainfall series using the SNHT. Kang and Yusof (2012) analyzed the homogeneity of the precipitation data on the peninsula of Malaysia using SNHT, Buishand, Pettitt, and Von Neumann tests. Al-Lami et al. (2014) examined the homogeneity of 36 meteorological stations’ data in Iraq for the periods 1981–2010. They applied the Pettitt test, SNHT, Buishand range test, and Von Neumann ratio test to detect the inhomogeneity of precipitation series.

A number of techniques have been used for analyzing the homogeneity of hydrologic and meteorological variables. Türkeş (1996) analyzed homogeneity of annual precipitation data of 91 stations in Turkey during the periods 1930–1993 using the Kruskal-Wallis test prior to analyzing their spatial and temporal characteristics. Kahya and Kalaycı (2004) tested homogeneity of trends in monthly streamflow in Turkey using a method developed by Van Belle and Hughes (1984). Karabörk et al. (2007) checked through the homogeneity of rainfall data in Turkey by the SNHT and Pettitt test, and considered a station to be inhomogeneous if at least one of the two tests indicated so. As a result, their outcomes showed that 43 out of 212 stations were found to be inhomogeneous. Göktürk et al. (2007) conducted homogeneity assessment of Turkish rainfall series using the SNHT. They suggested that homogeneity tests should be applied correctly, and metadata should be checked to explain the homogeneity of the stations. Fırat et al. (2010) applied the Pettitt test, SNHT, and the Runs test (Swed-Eisenhart) for testing the homogeneity of precipitation data spanning from 1968 to 1998 for 267 meteorological stations throughout Turkey.

The aim of this study is to explore the homogeneity characteristics of monthly precipitation series at carefully selected 160 meteorological stations in Turkey using four common absolute homogeneity tests of the SNHT, Pettitt test, Buishand test, and Von Neumann ratio test. Motivation of this research was inspired from the fact that there recently have been increasing body of hydrology and climate investigations with various aims in Turkey, which mostly require reliable updated precipitation data. At the same time, this study carries a role in complementing previous studies in the same matter by employing a number of different homogeneity tests and having recourse to metadata if possible.

2 Data and study area

We used monthly precipitation data spanning from 1974 to 2014 compiled by Turkish State Meteorological Service (MGM). Our data network consists of specially selected 160 meteorological stations with no missing values distributed all over Turkey. The geographical location of these meteorological stations, which are almost uniformly distributed across Turkey, is illustrated in Fig. 1.

Fig. 1
figure 1

The geographical distribution of 160 precipitation stations over Turkey used in this study

The geographic location of Turkey lies over the coordinates with latitudes 36° and 42° N and longitudes 26° and 45° E. The country is a peninsula in the Eastern Mediterranean, with three sides surrounded by the sea. The climate of Turkey is changeable due to its geographic location. Three main climate types observed can be listed as Mediterranean climate, continental climate, and temperate oceanic climate. Mediterranean climate is defined by hot and moderately dry summers as well as cool and rainy winters. This climate type is observed for the Mediterranean and Aegean coastal areas of the country. Continental climate is characterized by hot summers and cold winters, which could be observed in the interior regions of Turkey. Precipitation occurrence is less and especially in the form of snow during wintertime. The Black Sea coastal region has a temperate oceanic climate. This climate type is characterized by warm and wet summers with cool and rainy winters. This region receives considerable amount of precipitation and its eastern part has frequent precipitation throughout the year. Sensoy et al. (2008) indicates that the central Anatolia is the most vulnerable to drought in the country due to the mountainous location. Taurus Mountains in the south and Caucasian Mountain in the north and Black Sea Mountains hold the rain clouds, resulting in continental climate. While the eastern part of the Black Sea coasts receives 2200 mm average rainfall, whereas the interior regions such as Konya and Iğdır receive 250–300 mm average rainfall annually. In addition, the Aegean and Mediterranean coasts have different precipitation patterns depending on location; thus, annual precipitation in this region varies from 580 to 1300 mm (Dundar 2015).

3 Methodology

3.1 Homogeneity tests

Hydrological and climatological studies require that the data be homogeneous. However, changes in routine observation methods, relocation of stations, or changes in measurement techniques can cause homogeneity to deteriorate in the data. In addition, changes in population due to population growth in the area being measured, urbanization, or changes in nature due to natural influences can also lead to a deterioration of the homogeneous structure of data records. As a result, studies on this data may lead us to the wrong conclusion. Therefore, homogeneity analysis is necessary.

In this study, we applied four broadly used homogeneity tests to the precipitation data, namely, the Buishand test, the Von Neumann ratio test, the SNHT, and the Pettitt test. Wijngaard et al. (2003) suggested that these tests were sufficient to determine the homogeneity of the data. The most important property of these tests is that they should complement each other. From these tests, the Buishand, SNHT, and Pettitt tests investigate whether there is a jump in the time series and determine the breaking point. The Von Neumann ratio test assumes that the series do not randomly disperse and do not give information about when homogeneity is impaired. While the SNHT is sensitive to breaks at the beginning and end of the time series, the Pettitt test and the Buishand test are more susceptible to detecting breaks in the middle of the time series.

3.1.1 Von Neumann ratio test

In this homogeneity test, according to the hypothesis H0, the data were not randomly distributed. According to the alternative hypothesis, the time series considered is randomly distributed. Von Neumann ratio test does not determine a specific location where homogeneity is impaired and does not provide information about when homogeneity is impaired (Wijngaard et al. 2003).

In this test, N describes the mean squared proportion of the variation of the variance in succession (Von Neumann 1941):

$$ N=\frac{\sum_{i=1}^{n-1}{\left({Y}_i-{Y}_{i+1}\right)}^2}{\sum_{i=1}^n{\left({Y}_i-\overline{Y}\right)}^2} $$
(1)

Critical values of this test are given in Wijngaard et al. (2003). If the calculated N value exceeds the critical N value, H0 hypothesis is considered homogeneous.

3.1.2 Buishand test

In Buishand test, which assumes that the data is normally distributed, the data is distributed independently and randomly according to the H0 hypothesis. This test is sensitive to breaks in the middle of the time series (Wijngaard et al. 2003).

$$ {S}_0^{\ast }=0;{S}_k^{\ast }={\sum}_{i=1}^k\left({X}_i-\overline{X}\right),\kern0.5em k=1,2,\dots, N $$
(2)

where:

\( \overline{X} \) :

: the mean of time series observations (X1, X2, …, XN)

k :

: the number of the observation at which a break point has occurred

Rescaled adjusted partial sums are obtained by dividing the \( {S}_k^{\ast } \) by the sample standard deviation (Buishand 1982):

$$ {S}_k^{\ast \ast }={S}_k^{\ast }/{D}_X,\kern0.5em k=1,2,\dots, N $$
(3)
$$ {D}_X=\sqrt{\frac{\sum_{i=1}^N{\left({x}_i-\overline{x}\right)}^2}{N}} $$
(4)

The statistic that can be used to analyze homogeneity is expressed as follows:

$$ Q={\max}_{0\le k\le N}\left|{S}_k^{\ast \ast}\right| $$
(5)

\( Q/\sqrt{N} \) value is compared with the critical value given by Buishand (1982). If a calculated value is less than the critical value, the null hypothesis is accepted; otherwise, the null hypothesis should be rejected (Alghazali and Alawadi 2014).

3.1.3 Standard normal homogeneity test

SNHT is a homogeneity test frequently used in climate studies. This method, developed by Alexandersson, has successfully been used in testing many climatic and hydrological scales (Alexandersson 1986). This method is flexible and simple to use. Alexandersson divides the reference point of the examined series by a “k” point and calculates T(k).

$$ T(k)=k{\overline{z}}_1^2+\left(n-k\right){\overline{z}}_2^2\kern0.5em k=1,\dots, n $$
(6)

where

$$ {\overline{z}}_1=\frac{1}{k}\frac{\sum_{i=1}^k\left({Y}_i-\overline{Y}\right)}{s}\kern1em \mathrm{and}\kern0.5em {\overline{z}}_2=\frac{1}{n-k}\frac{\sum_{i=k+1}^n\left({Y}_i-\overline{Y}\right)}{s} $$
(7, 8)

If the break is located at the point K, T(k) reaches its maximum value at k = K. The test statistic T0 is defined as:

$$ {T}_0=\max \left(T(k)\right)\kern0.5em \mathrm{for}\kern0.5em 1\le k<n $$
(9)

Jaruskova (1994) has developed the following equation based on this test statistic.

$$ {T}_0=\frac{n{\left(T(n)\right)}^2}{n-2+{\left(T(n)\right)}^2} $$
(10)

If T0 exceeds the test statistic value, the null hypothesis is rejected. Critical values of this test are given in Wijngaard et al. (2003).

3.1.4 Pettitt test

This nonparametric method, suggested by Pettitt (1979) to detect the point of change in a time series, finds the point of change on a monthly or annual scale. While the null hypothesis indicates independent and random distribution of the series, the alternative hypothesis indicates a sudden change. Pettitt test is a nonparametric rank test. To calculate the statistics, the ranks r1rn of the Y1Yn are used (Pettitt 1979):

$$ {X}_k=2\sum \limits_{i=1}^k{r}_i-k\left(n+1\right),\kern0.5em k=1\dots n $$
(11)

If there is a break in year K, the statistic is maximal or minimal near the year k = K:

$$ {X}_K=\underset{1\le k\le n}{\max}\left|{X}_k\right| $$
(12)

The statistical significance for a probable level α is given as:

$$ {X}_{K\alpha}={\left[-\ln \alpha \left({n}^3+{n}^2\right)/6\right]}^{1/2} $$
(13)

The critical values of this test are given in Pettitt (Pettitt 1979). If the XK values do not exceed the critical values, the H0 hypothesis is accepted; that is to say, it is homogeneous.

3.2 Double mass curve analysis

Double mass curve analysis (DMCA) is frequently used in the literature for homogenization by correcting errors in hydro-meteorological data. Rainfall data may have lost their homogeneity properties due to faults coming from various sources. These errors are classified into two types, randomly and systematically. Errors that are described as random and sampling error occur from misreading in observations. Systematic faults may occur due to changes in station location, misplacement of the measuring instrument, urbanization, and so on. These types of errors are incompatible and the observed values are always less or always greater than the actual values. Therefore, homogeneity of rainfall data can be obtained by correcting such mistakes. For this correction, DMCA is used. This technique is widely used for detecting and correcting inconsistent precipitation data. (Linsley et al. 1982).

The control of the values of a station by DMCA is made by comparing the annual or monthly total values of that station and the average annual or monthly values of a group of stations neighboring that station.

The correction is performed by adjusting the records prior to the break to reflect the new state (after the break). To accomplish this, the rainfall records prior to the break are multiplied by the ratio of slopes after and before the break (Ponce 2014).

$$ {M}_c=\frac{c}{a}{M}_a $$
(14)

where,

Mc : Adjusted precipitation

Ma : Observed precipitation

c : Slope of graph to which records are adjusted

a : Slope of graph at time Ma was observed

4 Results and discussion

Our precipitation data set having the most updated observations across Turkey was compiled from 160 meteorological stations for the periods 1974–2014. We analyzed the homogeneity of time series at each station using the SNHT, the Pettitt test, the Buishand test, and the Von Neumann ratio test at significance level of 5%. In this study, all of the 160 precipitation stations were particularly selected as having no missing value, among a total of nearly 250 stations. In this study, R statistical software was used for the homogeneity analysis. Our decision criterion for a station to be homogeneous was rigidly set as a full confirmation through all of the four tests used in this study. Previous similar studies in Turkey did not adopt such a rigid criteria for the confirmation of homogeneity of precipitation series as applied in this study.

Keeping this criterion in mind, all stations were subjected to the four homogeneity tests. As a result of overall evaluation, we determined 44 out of 160 stations to be inhomogeneous since either one or more tests did not confirm. Test-wisely speaking, the results of the SNHT and the Pettitt test showed that a set of 25 out of 160 stations were impacted from inhomogeneity. According to the Buishand test, 13 out of 160 stations were found to be inhomogeneous while the Von Neumann test revealed 22 inhomogeneous stations. Moreover five out of 160 stations possess inhomogeneity characteristic with the respect to the results of all four tests. Table 1 summarizes all these tests’ results. Since we set the level of significance at 5%; the critical values are 167, 8.10, 8.07, and 1.49 for the Pettitt, SNHT, Buishand, and Von Neumann ratio tests, respectively.

Table 1 Inhomogeneous precipitation stations and the homogeneity tests results

It is important to visually inspect the distribution of inhomogeneous stations in order to catch an underlying unexpected specific reason or to justify the required homogeneity correction. For this purpose, we mapped the distribution of inhomogeneous stations depending on applied test method in Fig. 2. The Pettitt test (Fig. 2a) revealed 25 inhomogeneous stations as twice as that of the Buishand test (Fig. 2b) (implying that the former exhibits more conservative outcomes than the latter) as both tests are capable of detecting a break point in the middle of a series. It is important to note that all 13 inhomogeneous stations detected by the Buishand test were also comprised in the set of inhomogeneous stations of the Pettitt test.

Fig. 2
figure 2

a Stations whose homogeneity was not approved by the Pettitt test; b The same as (a), except for the Buishand test; c The same as (a), except for the SNHT; d The same as (a), except for the Von Neumann ratio test; e Stations whose homogeneity was not approved by all four tests

The SNHT, known as a good tool for detecting a break point at the head or end part of a time series, brought out a fact that the reason behind inhomogeneity of 25 stations (Fig. 2c) was a break point appearing mostly in the second half of time series (in particular 1996 or later). It is also noticeable that the year of 2008 was the most frequent break point as happened at six stations. The Von Neumann ratio test, which focusing on all parts of the time series, pointed out to seven inhomogeneous stations that were not caught by the other three tests; namely, Cankiri, Tatvan, Hakkari, Keskin, Ilgin, Baskale, and Milas stations (Fig. 2d).

Among the 44 inhomogeneous stations, five stations (Nallihan, Sinop, Ardahan, Kars, and Mardin), which could not pass the four tests, are shown in Fig. 2e. Two stations (Nallihan and Odemis) still remain inhomogeneous after applying DMCA to the set of 44 non-homogeneous meteorological stations. It is noted that one of these two stations (Nallihan) takes place in Fig. 2e.

Subsequently, we adopted the DMCA to each of 44 inhomogeneous stations included in Table 1 to see if any station could possibly turn out to be homogeneous. We drew a double mass curve by setting the annual precipitation total of an inhomogeneous station under consideration at ordinate axis and corresponding an average of precipitation total values of neighboring homogeneous stations at horizontal axis. We used minimum four neighboring stations in this analysis. Our results showed that 42 out of inhomogeneous 44 stations passed all the four analysis tests after being applied to correction based on the DMCA. However, it was not possible for only two stations, namely Odemis and Nallihan, to make them homogeneous. At this point, we thought that it would be useful to exemplify what was done in each station subjected to DMCA. For this purpose, we randomly selected Akcakoca station, which is located in the northwestern Turkey, and presented its homogeneity test results prior to the correction in Table 2 with a plot in Fig. 3a. It is obvious that there is a jump in the mean around the break year of 1989. It is concluded that the new data set found after such analysis are homogenous (please see Table 2 and Fig. 3).

Table 2 Homogeneity test results for Akcakoca station prior to the correction
Fig. 3
figure 3

Testing of homogeneity by SNHT, Pettitt, and Buishand tests. a Homogeneity test results prior to the correction. b Homogeneity test results after the DMCA correction

Applying the DMCA to precipitation data at Akcakoca station, the detected anthropogenic influence in this station data was successfully corrected (Fig. 3b). Moreover, the corrected data of Akcakoca station has successfully passed from all our four homogeneity tests at the 5% significance level. In this particular example, we used a group of four neighboring stations (namely Sakarya, Bolu, Duzce, and Zonguldak) of Akcakoca station for the DMCA.

In Fig. 4, we showed the linear regression fit to precipitation series in Akcakoca station before and after the homogenization. A visual inspection and noticeable difference in the numerical value of slope coefficient strongly indicate that homogenization made important change in the variability of the station. Specifically speaking, an obvious upward trend-like behavior almost diminished after the homogenization procedure applied.

Fig. 4
figure 4

Plots of the trend for precipitation series in Akcakoca station. a Results prior to the correction. b Results after the correction

4.1 Inhomogeneity interpretations based on metadata

We had limited amount of metadata for our data network (only for 12 stations) (Table 3); therefore, we had to interpret inhomogeneity reasons for those with available information. Utilization of metadata in understanding the reason(s) behind the detected inhomogeneity at a station finds the best example in our data network at the two stations, namely Artvin and Uzunkopru, whose break point and moving years are very close, providing a direct cause explanation. Specifically speaking, the gauging station in Artvin was first moved to a location 300 m away and 31 m higher elevation in 1983, and then was subjected to other moving in 1989 and 2009. Our statistical tests showed two breaking points as 1984 and 1987 are both in the vicinity of moving years. The case in the gauging station in Uzunkopru indicated a smooth example having one break year (2013) and its metadata providing one moving year just a year later (2014) to a location at 2.4 km in northeast.

Table 3 Metadata of inhomogeneous precipitation stations

In the inhomogeneous 12-station set (Table 3), we counted six stations in which the break year took place before moving year, namely Bartin, Trabzon, Corlu, Simav, Cumra, and Uzunkopru stations. The length of time period between the break point and moving year has a span from 1 to 31 years. Among these, Uzunkopru and Corlu stations having such respective span as 1 and 6 years are the ones that were possibly affected by moving and allowing dispersion in the sensitivity of statistical test although the break year happened prior to the moving year. However, majority came out to be in favor of more than 19 years, implying that environmental factors certainly played a significant role in the inhomogeneity process as the case in the following stations: Bartin, Trabzon, Simav, and Cumra. Bartin station has a break point in 1994, experienced a negligible move within 5 m in 2013 at nearly the end of the record, indicating that there is relation between moving and break point. The second to fourth stations had similar pairs for the periods 1987–2006, 1981–2012, and 1977–1997, implying a possible reasoning for inhomogeneity, which might be logical to search in other causing physical parameters other than moving.

The remaining stations in the inhomogeneous 12-station set (Table 3), in which break year take places after moving year, are Samsun, Giresun, Ardahan, Balikesir, and Bafra stations. The length of time period between moving year and break point has a span from 5 to 30 years; therefore, we interpreted that moving was always not perceived as single direct effect to cause the detected break point in such stations. The Samsun station with a break point in 1986 was subjected to move twice, first in 1973 (to a location 16 km away) and second in 2015 (to a location in northeast 100 m away). A recorded move in Giresun station happened at the end of first quarter of the series (1984) but its break point appeared in 1999. As opposed to the other stations in the same category, Ardahan and Balikesir stations have relatively less span between the moving year and break point having such pairs as of 1994–2001 and 2003–2008, implying that the effect of moving station on the homogeneity condition was gradually sensed. Finally, Bafra station experienced a move in 1982 but its break point happened in 2012, implying that moving has no effect on inhomogeneity condition whatsoever.

5 Conclusions

In this research, the homogeneity of the most recent monthly precipitation data set comprised of 160 meteorological stations across Turkey for the periods 1974–2014 was analyzed for the confident future use. The data set was selected in such a way that there is no missing observation in each series. We applied the SNHT, Pettitt test, Buishand test, and Von Neumann ratio test to each precipitation series and individually evaluated the outputs of these tests at the 5% significance level to determine inhomogeneity with break years.

The results of the SNHT showed that 16% of the entire data set was impacted from inhomogeneity. According to the Buishand test, 8% of the entire data set was found to be inhomogeneous. The Von Neumann ratio test revealed 14% inhomogeneity portion of the data network. Finally, the Pettitt test showed that 16% of the entire data set was found to be inhomogeneous. In addition, five out of 160 stations have an inhomogeneity according to the results of all tests (please see Fig. 2). Therefore, 44 out of 160 stations are said to be inhomogeneous as they failed according to at least one of the four tests. The breaks captured by the Pettitt test and Buishand test usually tend to appear in the middle of the study period, whereas the results of SNHT frequently showed inhomogeneity at the beginning or end of the records. Our results showed that 42 out of 44 inhomogeneous stations responded affirmatively to all the four tests after correction based on the DMCA. In conclusion, the size of our precipitation data set was decreased to 158.

In this study, we used four different homogeneity tests to determine the inhomogeneous precipitation stations. If there is any heterogeneity in the data, the statistical parameters do not reflect the physics of the phenomenon, the practices and interpretations made are erroneous; in short, the healthy result cannot be obtained for the analysis to be performed. Available metadata was used to interpret the detected inhomogeneity. In this study, in order to determine the reasons underlying the homogeneity of station data, we made comments using the metadata obtained from the Turkish Meteorology General Directorate. Accordingly, it is estimated that the cause of homogeneity in the Artvin station is the change of station location. The simultaneous appearance of fracture years in data sets in multiple locations suggests that these breaks are natural breaks of the climate. This situation was observed at Artvin-Trabzon and Corlu-Balıkesir stations. Outside of these stations, there are environmental factors and an anthropogenic effect. This information is important because it is used in data quality, continuity, and homogeneity studies.

Our results showed that these homogeneity methods could be used in an accomplished way to test the homogeneity of rainfall series in Turkey. The results of this study, in general, came out to be consistent with earlier indications of studies, which were mentioned in the introduction section. This study carries a role in complementing previous studies in the same matter by employing a number of different homogeneity tests and having recourse to metadata. We hope that the updated outputs of this study would have recourse at the very first phase of future studies in hydrology and meteorology fields in Turkey.