1 Introduction

GNSS technology has widely been applied in a variety of geophysical studies, and thousands of permanent GNSS stations have been established around the world for this purpose. Analyzing the time series of GNSS stations in the International Terrestrial Reference Frame (ITRF) provides useful information for the study of the global plate motion, crustal deformation, and earthquakes. All these studies are based on the correct interpretation of the coordinate time series as well as on reliable and accurate station coordinates and velocities. The GNSS coordinate time series mainly consists of tectonic deformation (e.g., Dragert et al. 2001; Jiang et al. 2014), non-tectonic deformation (e.g., Vandam et al. 1994; Tiampo et al. 2004), and noise (Williams et al. 2004). Analyzing the non-tectonic deformation signals by geometric and physical models is crucial in studying the impacts of various geophysical phenomena on GNSS reference stations. Nikolaidis (2002) applied a weighted least-square algorithm to solve the geometric model according to linear trend, annual, and semi-annual variations, and other characteristics of GNSS coordinate time series. Dong et al. (2002) investigated the effects of atmosphere, non-tidal ocean, snow, soil moisture, and other environmental factors on GNSS station displacements, and explained the main causes of the annual and semi-annual variations of GNSS time series. However, these geometrical model and physical model cannot directly extract mass loading signals (annual and semi-annual periods) from the GNSS time series. Therefore, a method separating the seasonal non-tectonic deformation signals from the GNSS vertical time series is in really needed.

The Independent Component Analysis (ICA) method is a blind source separation method, which can separate statistically independent source signals from multi-dimensional mixed observation signals (Hyvärinen and Oja 2000). If the geophysical phenomena are independent; then, the independent component signals (ICs) decomposed by the ICA method will reflect their physical phenomena. The Phase Space Reconstruction-based Independent Component Analysis (PSR-ICA) method, a single-channel ICA method, can effectively separate the independent source signals, even though they are the same frequency (Dai et al. 2014). Therefore, in this study, we use the PSR-ICA method to analyze the seasonal variation of GNSS vertical coordinate time series. However, the vertical time series of a GNSS station is a non-stationary signal, and the PSR-ICA algorithm can only process stationary signal; therefore, we use an adaptive time–frequency decomposition method called Ensemble Empirical Mode Decomposition (EEMD) (Wu and Huang 2009), to process non-stationary GNSS vertical coordinate time series and extract the stationary seasonal signal.

2 EEMD and PSR-ICA Methods

The EEMD and PSR-ICA methods are jointly used to analyze the GNSS vertical time series based on their respective advantages. First, GNSS vertical time series is decomposed using the EEMD method, and then, the noise signal and seasonal signal are classified and reconstructed based on the Hurst parameter (Rodriguez et al. 2009). After that, the PSR-ICA method is used to decompose the reconstructed one-dimensional seasonal signal into several ICs that are then compared with non-tectonic crustal deformations caused by mass loadings. The data processing flow chart is shown in Fig. 1.

Fig. 1
figure 1

Flow chart of single-channel ICA data processing for the GNSS vertical time series analysis

2.1 EEMD Decomposition and Extraction of the Seasonal Non-Tectonic Signal

The EEMD, as a modified EMD algorithm, which can be used to decompose a nonlinear and non-stationary time series \(x(t)\) into several Intrinsic Mode Functions (IMFs) and the residual signal (Wu and Huang 2009; Huang et al. 1998). Then, the IMFs with different noise contents are classified and reconstructed based on the Hurst parameter. The general equation of the EEMD algorithm is given below:

$$x\left( t \right) = \mathop \sum \limits_{j = 1}^{D} \overline{{c_{j} }} \left( t \right) + r_{D} (t),t = 1,2, \ldots ,T$$
(1)

where \(\overline{{c_{j} }} \left( t \right)\) is the IMF component and \(r_{D} (t)\) is the residual signal.

The IMFs of the EEMD algorithm can be divided into stationary and non-stationary parts. The non-stationary IMFs can be reconstructed as a long-term trend signal (Qin et al. 2012), and the stationary IMFs need to be analyzed by the Hurst parameter. Hurst parameter has been used to analyze noises in many studies (Montillet et al. 2013; Schroeder and Wiesenfeld 1991). The relation between Hurst parameter (\(H\)) and noise signal is that the white noise is corresponding to \(H = 0.5\), the flick noise is corresponding to \(H = 1\) and the random walk is corresponding to \(H = 1.5\).

The noise in the GNSS time series can be analyzed usually after removing the linear trend and seasonal variation. However, the EEMD method can decompose the noise, without removing the linear trend and seasonal variation, into a series of IMFs ordered by frequency, and these IMFs can be classified using the Hurst parameter. For example, Montillet et al. (2013) used the EMD method and the Hurst parameter to extract the white noise (\(H \le 0.6\)) of the GNSS vertical time series. Furthermore, several studies suggest that the best noise model of most GNSS vertical time series is the white plus flicker noise model (WN + FN) (Li et al. 2012; Zhang et al. 1997; Mao et al. 1999; Amiri-Simkooei et al. 2007). Therefore, we define the IMFs whose \(H\) is within \(\left[ {0.6, \, 1.1} \right)\) as colored noise, which can be best characterized by the WN + FN model.

The procedure of the time–frequency analysis can be described as follows.

  1. 1.

    Decompose time series \(x\left( t \right)\) into IMFs and residual signal.

  2. 2.

    Extract the non-stationary IMFs and the residual signal, and reconstruct them to long-term trend signal.

  3. 3.

    Estimate the Hurst parameter of each stationary IMFs. Sum over the IMFs whose \(H\) are within \([0,1.1]\) as noise signal, and sum over the remaining IMFs as one-dimensional seasonal signal \(x'\left( t \right)\).

However, the EEMD method cannot decompose components in the same frequency; therefore, the one-dimensional seasonal signal should be analyzed by the single-channel ICA method.

2.2 Seasonal Non-Tectonic Signal Analysis by PSR-ICA Algorithm

PSR-ICA algorithm is a one-dimensional stationary signal processing method (Dai et al. 2014, Packard et al. 1980; Takens 1981), which does not rely on the time–frequency feature of the components. Therefore, it can separate ICs from the one-dimensional seasonal signals to reflect the effect of the seasonal mass loadings. The procedure of the PSR-ICA algorithm is shown in Fig. 2.

Fig. 2
figure 2

Flow chart of PSR-ICA processing for the seasonal signal analysis

In this flow chart, \(x'\) is the one-dimensional seasonal signal; \(X\) is the reconstruction phase space; \(Y = WX\) is to obtain the ICs from the reconstruction phase space; \(X^{K}\) is the \(K {\text{th}}\) independent subspace; \(x^{K}\) is the reconstructed one-dimensional seasonal signal.

2.2.1 Channel Extension

Phase Space Reconstruction (PSR) is a method of obtaining nonlinear dynamics features through delaying and embedding a one-dimensional signal into a high-dimensional phase space to reflect the characteristics of the signals (Kennel et al. 1992). The multi-dimensional phase space contains all signals of physical sources, affecting the seasonal non-tectonic deformation of GNSS stations.

$$X_{i} = \left\{ {x_{i} ',x_{i} '\left( {1 + \tau } \right), \cdots ,x_{i} '\left( {1 + \left( {m - 1} \right)\tau } \right)} \right\},i = 1,2, \cdots ,M$$
(2)

where \(M = T - (m - 1)\tau\), \(m\) is the embedding dimension, \(\tau\) is the delay time. \(m\), and \(\tau\) are the main parameters of the PSR algorithm, whose optimal solution can be obtained using the False Nearest Neighbors method and the Mutual Information method, respectively (Dai et al. 2014; Kennel et al. 1992; Fraser and Swinney 1986).

2.2.2 Independent Component Signal Separation

Assume a set of independent source signals \(S\left( {t'} \right):\,S\left( {t'} \right) = [s_{1} (t'),s_{2} (t'), \cdots ,s_{n} (t')]^{\text{T}}\) and their corresponding observation signals \(X\left( {t'} \right):\,X\left( {t'} \right) = [x_{1} (t'),x_{2} (t'), \cdots ,x_{\text{m}} (t')]^{\text{T}}\), the relationship between \(X\left( {t'} \right)\) and \(S\left( {t'} \right)\) can be described as follows (Hyvärinen and Oja 2000):

$$X\left( {t'} \right) = AS\left( {t'} \right),t' = 1,2, \ldots ,M$$
(3)

where \(A\) is \(m \times n \,\)-order unknown linear mixed matrix (\(n \le m\)).

ICA algorithm is to separate the ICs by solving the mixed matrix \(W\).

$$Y\left( {t'} \right) = A^{ - 1} X\left( {t'} \right) = WX\left( {t'} \right)$$
(4)

ICs \(Y\left( {t'} \right)\) can be used to estimate \(S\left( {t'} \right)\) effectively. FastICA algorithm, a widely used ICA algorithm, is used to separate ICs from \(WX\left( {t'} \right)\) in this study. FastICA algorithm uses the Negentropy, a measure standard of non-Gaussian distribution, to measure the non-Gaussian maximum of \(WX\left( {t'} \right)\), and it follows the fixed-point iteration theory, which makes the convergence faster and more robust (Hyvärinen 1999).

2.2.3 Independent Component Signals Reconstruction

These ICs \(Y\left( {t'} \right)\) can reflect the implicit information of the observation signals. However, sometimes, the number of ICs are more than the seasonal mass loadings. Therefore, the ICs should be artificially classified into several clusters. Then, the ICs of the same cluster, corresponding to the same mass loading, are multiplied by their corresponding column of the mixed matrix to construct independent subspaces \(X^{K}\) for reconstructing the source signal \(x^{K}\) (Cardoso 1998). Therefore, the original multi-dimensional phase space is equal to the sum of independent subspaces.

$$X = X^{1} + X^{2} + \cdots + X^{K}$$
(5)

where \(X^{K}\) is the \(K {\text{th}}\) independent subspace, and it can be described as follows:

$$X^{K} = \mathop \sum \limits_{j \in Q} a_{j} y_{j}$$
(6)

where \(Q\) is the number of ICs in a cluster, and \(a_{j}\) is the corresponding column of the mixed matrix to the \(y_{j}\), where \(y_{j}\) is the \(j\) th IC. The independent subspaces \(X^{K}\) can be reconstructed to a one-dimensional source signal \(x^{K}\) based on the inversion of the PSR method,

$$x_{i}^{K} = \left\{ {\begin{array}{*{20}l} {\frac{1}{A}\sum\limits_{h = 1}^{A} {X_{j,i - (j - 1)\tau }^{K} } } \hfill & {1 \le i \le 1 + (m - 1)\tau } \hfill \\ {\frac{1}{m}\sum\limits_{h = 1}^{m} {X_{j,i - (j - 1)\tau }^{K} } } \hfill & {1 + (m - 1)\tau \le i \le T - (m - 1)\tau } \hfill \\ {\frac{1}{B}\sum\limits_{h = m + 1 - B}^{m} {X_{j,i - (j - 1)\tau }^{K} } } \hfill & {T - (m - 1)\tau \le i \le T} \hfill \\ \end{array} } \right.$$
(7)

where \(A = ceil(i/\tau )\), \(B = ceil((N - i + 1)/\tau )\), and \(ceil( \cdot )\) denotes the rounding-up function.

3 GNSS Vertical Time Series Analysis

3.1 Analysis of Observation Data and Mass Loadings

The seasonal non-tectonic deformation of GNSS reference station mainly includes the mass loading of atmosphere, soil moisture, non-tidal ocean, snow, etc. However, in most parts of China, the non-tidal ocean mass loading and snow mass loading had a little or no impact on crustal deformation, and they can be neglected. Thus, the effect of atmosphere and soil moisture mass loading are the major factors in the analysis of the seasonal non-tectonic deformation (Wang et al. 2005).

To explore the single-channel ICA process and its effect on separating non-tectonic deformation, we use the daily vertical time series of GNSS reference stations collected by the CMONOC. These daily GNSS solutions are generated using the GAMIT/GLOBK software with the double-difference ionosphere-free code and phase observations. Meanwhile, the station receiver antenna phase center, satellite antenna phase center, ocean tides, solid earth pole tides, and solid earth tides are corrected with the absolute phase center model, IGS ANTEX model, FES2004 ocean tide loading model, and IERS2003 model, respectively. The tropospheric delay and satellite coordinates are estimated with the station coordinates during the daily data processing, and then, each daily solutions are transformed into IGS08 using seven-parameter transformations. The data, consisting of 13 GNSS reference stations in the area from 90°E 15°N to 120°E 42°N (see Fig. 3), cover a time span from January 2001 to December 2013.

Fig. 3
figure 3

Distribution of the selected CMONOC GNSS stations

The vertical time series of those 13 GNSS stations with strong noise is manifested as non-stationary annual variation. To explore the data sets, we randomly choose three GNSS reference stations (BJFS, JIXN, and LUZH) as proxies to demonstrate the process of our methods and its results in detail. Figure 4 shows the vertical displacement time series of BJFS, JIXN, and LUZH sites for a time interval of 13 years.

Fig. 4
figure 4

Vertical time series of GNSS reference stations (a) BJFS, (b) JIXN, and (c) LUZH

To illustrate the non-tectonic deformation in the selected area, we calculate the correction values of atmospheric and soil moisture mass loadings of the 13 sites, using the 6-hour sampling atmosphere surface pressure data and the daily sampling National Center for Environmental Prediction (NCEP) reanalysis II soil moisture data from 2001 to 2013. The corrections of mass loadings are calculated using the Quasi-Observation Combination Analysis software (QOCA) (available at http://qoca.jpl.nasa.gov). Results are shown in Fig. 5 for the three selected sites.

Fig. 5
figure 5

Correction values of atmospheric mass loading and soil moisture mass loading (a) BJFS, (b) JIXN, and (c) LUZH

After removing the linear trend of GNSS vertical time series using the linear fitting method, the power spectrum is used to analyze the changes of the seasonal variation of GNSS vertical time series before and after removing the correction values of atmospheric and soil moisture mass loading, as shown in Fig. 6. After the correction, the annual variation of residual time series is significantly weakened, indicating that the correction can effectively reduce the annual variation in the GNSS vertical time series. However, the mass loading corrections cannot completely eliminate the annual variation in the GNSS reference stations. Therefore, decomposing the GNSS vertical time series into several sub-signals (relating to the main mass loadings) is an important way to illustrate the annual variation in the GNSS vertical time series.

Fig. 6
figure 6

Power spectrum of GNSS vertical time series before (black line) and after (red line) removing the atmospheric loading and soil moisture loading a BJFS, b JIXN, and c LUZH

3.2 EEMD Decomposition and Reconstruction

The EEMD method is used to decompose the vertical time series into IMF components and residual signal. The \(H\) values of IMFs are calculated by the Detrended Fluctuation Analysis method (Grech and Mazur 2013). As the calculation error can be up to 0.1, the IMFs whose \(H\) values are within [0, 1.1] are reconstructed as noise signal. The noise, the artificial reconstructed seasonal signal, and the long-term trend of BJFS, JIXN, and LUZH sites are shown in Fig. 7.

Fig. 7
figure 7

The reconstructed noise, seasonal signal, and long-term trend signal a BJFS, b JIXN, c LUZH

The EEMD method is also used to de-noise the sum of atmospheric mass loading and soil moisture mass loading. The comparison between the reconstructed seasonal signal and the sum of the de-noised atmospheric and soil moisture mass loadings of the 13 GNSS sites are shown in Fig. 8.

Fig. 8
figure 8

Comparison between the reconstructed seasonal signals decomposed by EEMD and the sum of the de-noised atmosphere and soil moisture mass loading for the 13 GNSS sites (20 mm offset per station)

The correlation coefficients, RMS (Root Mean Square) between the seasonal signals, and the sum of the corrections of de-noised seasonal mass loadings (atmospheric mass loading and de-noised soil moisture mass loading) are calculated, and their values for the 13 GNSS stations are shown in Table 1.

Table 1 Correlation between seasonal signals and sum of the de-noised atmospheric and soil moisture mass loading

The correlation coefficients of the seasonal signals and the de-noised sum of atmospheric mass loading and soil moisture mass loading are 0.61–0.86, and ΔRMS are 18–48 %. These results are reasonable, including the low correlation of WUHN, XNIN, KMIN, and XIAM sites (in bold), because the mass loading correction cannot completely reflect the seasonal variation of GNSS vertical time series (see Fig. 6).

From the comparison in Fig. 8 and correlation analysis in Table 1, we can say that there is a strong correlation between the GNSS station annual variation and the two mass loadings in the selected area.

3.3 Simulation Experiment

We mixed the de-noised correction of the atmosphere and soil moisture mass loading (red line in Fig. 8) as the simulated non-tectonic deformation signals. First, the PSR method is applied to obtain the multi-dimensional embedding matrix, and the time delay is 8 and the embedding dimension is 2. Then, the FastICA method is used to analyze the multi-dimensional embedding matrix, and the two ICs are obtained. We rename them as Reconstructed signal A (RS-A) and Reconstructed signal B (RS-B) without artificial classification and reconstruction. The comparison of reconstructed signals and its corresponding simulated signals (de-noised atmospheric and soil moisture mass loading) are shown in Figs. 9 and 10, respectively.

Fig. 9
figure 9

Comparison between the reconstructed signal A separated by PSR-ICA and the simulated signal (de-noised atmospheric mass loading) (10-mm offset per station)

Fig. 10
figure 10

Comparison between the reconstructed signal B separated by PSR-ICA and the simulated signal (de-noised soil moisture mass loading) (10-mm offset per station)

The correlation coefficients and RMS between reconstructed signals and their corresponding simulation signals are calculated, respectively, and their values are shown in Table 2.

Table 2 Correlation coefficients of reconstructed signals (Reconstruction A and Reconstruction B) and their corresponding mass loading signals

The simulation experiment shows that the signals reconstructed by the PSR-ICA method can effectively reflect the atmospheric and the soil moisture mass loading. The correlation coefficients of the RS-A and the atmospheric mass loading are 0.81–0.96, and the ΔRMS are 11–71 %. In addition, the values of the RS-B and the soil moisture mass loading are 0.71–0.98 and 14–77 %, respectively. These facts suggest that the atmospheric and the soil moisture mass loadings can be separated by single-channel ICA.

3.4 GNSS Vertical Time Series Analysis

The seasonal variation of GNSS vertical time series (black line in Fig. 8) is also used to verify the effectiveness of the PSR-ICA method.

First, the time delay and the embedding dimension are calculated using the mutual information method and the FNN method, respectively. The determined time delay is 51, which is quite different from that of the simulation experiment. A possible explanation could be the difference between seasonal signal and mass loading corrections. The embedding dimension is 2 in most cases, and 3 for the KMIN site, which indicates that the ICs should be reconstructed into two main components.

Then, the FastICA method is used to separate the multi-dimensional embedding matrix. A number of ICs are obtained. Three ICs for the KMIN site are artificially reconstructed into two reconstructed signals, Reconstructed signal 1 (RS-1) and Reconstructed signal 2 (RS-2). The comparison of the two reconstructed signals for the 13 GNSS sites and their corresponding mass loadings (de-noised atmospheric and the soil moisture mass loadings) are shown in Figs. 11 and 12, respectively.

Fig. 11
figure 11

Comparison between the reconstructed signal 1 separated by PSR-ICA and the atmospheric mass loading (20-mm offset per station)

Fig. 12
figure 12

Comparison between the reconstructed signal 2 separated by PSR-ICA and the soil moisture mass loading (20-mm offset per station)

The correlation coefficients and RMS between the reconstructed signals (RS-1 and RS-2) and their corresponding mass loading signals (de-noised atmospheric mass loading and de-noised soil moisture mass loading) are calculated, respectively, the results are shown in Table 3.

Table 3 Correlation between the reconstructed signals (RS-1 and RS-2) and its corresponding mass loading signals

From the correlation analysis of the reconstructed signals (RS-1 and RS-2) and their corresponding mass loading signals, we can say that some physical sources affect the crustal deformation. (1) The correlation coefficients of the RS-1 and the atmospheric mass loading are between 0.50 and 0.86, and ΔRMS are within the range of 5–47 %, which suggest that the RS-1 can be considered as a seasonal non-tectonic deformation mainly affected by the atmospheric mass loading; and (2) The correlation coefficients of the RS-2 and the soil moisture mass loading, except for the WUHN site (in bold), are in the range of 0.50–0.83, and ΔRMS are between 8 and 44 %, which means that the RS-2 can be considered as a seasonal non-tectonic deformation mainly affected by soil moisture mass loading. The vertical time series of WUHN station may be affected by the change of multipath effect and the instability of the station monumentation. The seasonal variation is not obvious and covered by noise and irregular trend variation in some years, which may seriously affect the seasonal signal extraction and the independent signal separation.

According to the analysis of experimental results in Sects. 3.3 and 3.4, the major mass loadings (atmospheric mass loading and the soil moisture mass loading) can be separated effectively by the single-channel ICA.

4 Discussion and Conclusion

We have shown that the PSR-ICA method can separate the independent source signals from the mixed mass loading signal in the simulation experiment. However, the correlations are 4–19 % and 2–29 % lower, respectively, between the two separated signals using the PSR-ICA method and atmospheric mass loading and soil moisture mass loading in the simulation experiment. These facts may indicate that the surface mass loadings, such as atmospheric mass loading and soil moisture mass loading, have a weak joint effect on crustal deformation in mainland China.

The PSR-ICA decomposition of the seasonal variation of GNSS vertical time series shows lower correlations with atmospheric and moisture loadings with respect to the simulation experiment. The correlations are systematically 15–20 % lower, which are mainly caused by the difference between simulated signals (simulated by the sum of the de-noised correction of the atmosphere and soil moisture mass loadings) and GNSS time series. Their correlation coefficients are 0.61–0.86, and the PSR-ICA decomposition results of these two experiments are both compared with the de-noised correction of the atmosphere and soil moisture mass loading. However, the mass loading corrections also have errors, such as for the KMIN site, there is no snow in this area during 2001 and 2013, and the snow mass loading corrections have a seasonal variation (up to 1.5 mm). The spatial resolution of mass loading data is 2.5° × 2.5° that mass loading corrections cannot accurately reflect the effect of mass loadings at specified points. Therefore, with the rapid increase of GNSS reference stations worldwide, the separation of mass load signals from GNSS time series is helpful to study the effect of mass loadings on the crustal deformation and improve the accuracy of the correction model of mass loadings.

We use the EEMD method to decompose the GNSS vertical time series into noise signal, seasonal signal, and long-term trend signal. The PSR-ICA method is applied to analyze the seasonal signal. We successfully obtain the seasonal non-tectonic deformation signals. Through the case study of the selected GNSS reference stations, the following three conclusions can be drawn: (1) By obtaining noise signal, seasonal non-tectonic deformation signal, and long-term trend signal using the EEMD method, we found that the seasonal signal is strongly correlated with the sum of the major mass loading corrections; (2) In the simulation experiment, the high correlation between the separated signals and mass loadings (correlation coefficients are 0.71–0.98 and ΔRMS are 11–77 %) has proved that the PSR-ICA algorithm can accurately extract the physical source signals from the non-tectonic deformation signals; (3) The high correlation between the signals separated from seasonal signals and mass loadings (correlation coefficients are 0.50–0.86 and ΔRMS are 5–47 %) can effectively illustrate the seasonal variation (mainly annual variation) in the vertical time series of the GNSS reference station, and the separated signals can more accurately reflect the effects of various mass loadings on the crustal deformation.