Keywords

1 Introduction

The center of mass (CM) of the Earth is defined as the center of the mass of the entire Earth including solid Earth, ocean and atmosphere (Montag 1999; Pavlis 1999) (see Table 1 for a list of abbreviations used in the paper). Sea level change, glacier melting, atmospheric circulation, and mantle convection result in the CM movement relative to the center of the figure (CF) of the Earth surface, which reflects the global mass redistribution and the interaction between solid Earth and mass loading (Wu et al. 2012; Cheng et al. 2013a). The geocenter motion (GCM) is helpful to study the problem that the International Terrestrial Reference Frame (ITRF) implementation is not completely consistent with the International Terrestrial Reference System (ITRS) definition, which will further improve the precision of the reference frame origin (Blewitt 2003; Dong et al. 2003; Metivier et al. 2010; Altamimi et al. 2011). The GCM has an important impact on the reference frame transformation of Satellite Laser Ranging (SLR), Global Navigation Satellite System (GNSS) and Doppler Orbitography and Radio-positioning Integrated by Satellite (DORIS) system (Wei et al. 2016; Riddell et al. 2017). It is also an important topic for studying the Earth’s mass redistribution, such as ocean tide, glacial isostatic adjustment, atmospheric and ocean circulation, geodynamic process in the Earth’s interior (Trupin et al. 1992; Dong et al. 1997; Watkins and Eanes 1997).

Table 1. Abbreviations used in this paper

SLR data were processed to estimate effectively GCM (Bouille et al. 2000; Crétaux et al. 2002). The fewer SLR sites, the maldistribution of sites and the absence of SLR data on the ocean lead to the deterioration of the measurement precision. GNSS data were also used to determine GCM (Blewitt and Clarke 2003). Although there are more International GNSS Service (IGS) stations all over the world, GNSS satellite orbital altitude is so high that the precision of GNSS-derived GCM is lower than that of SLR-derived products. The DORIS-derived GCM in Z direction is very different from the other two directions and its precision is only up to centimeter-level. DORIS-derived GCM has the lowest precision among the series estimated by these three space geodetic techniques (Kuzin and Tatevian 2005; Kong et al. 2017). The time series of GCM in different time-span can be accurately estimated by using the SLR data of Lageos-1, Lageos-2 and other geodynamical satellites.

The wavelet transformation, least squares spectral analysis, singular spectrum analysis (SSA) can be used to discover the characteristics of GCM in X, Y and Z directions (Guo et al. 2009; Wei et al. 2016). However, when analyzing the GCM, these methods cannot take into account the correlation of all directions of GCM. Multi-channel singular spectrum analysis (MSSA), as the extended form of SSA, is one of the effective statistical data analysis methods in oceanography, geoscience, meteorology and other fields (Vautard and Ghil 1989; Wang et al. 2016; Zotov et al. 2016; Zhou et al. 2018). The MSSA is a method for analyzing nonlinear time series. It is also able to denoise data, extract periodic oscillation signals, identify trends from multidimensional time series and build prediction models (Shen et al. 2017, 2018). Compared with SSA, during the process of multidimensional time series, the correlations among different channels are taken into account, so we apply the MSSA method to the analysis of GCM and study the ability to extract periodic signals of GCM.

The monitoring and modeling of GCM is a key issue for constructing a millimeter-level, dynamic and real-time global reference frame. However, due to the complexity of obtaining multi-source observations and data processing, The GCM parameters cannot be obtained in real-time or quasi-real-time (Altamimi et al. 2011; Zhao et al. 2019). Therefore, the MSSA model is applied to predict GCM parameters and proposes a GCM prediction method that combines linear model (LM), MSSA and autoregressive moving average model (ARMA).

SLR-derived GCM series from Jan. 1993 to Feb. 2017 updated by the Center for Space Research (CSR), the Texas University at Austin, are used to study GCM variation in this paper. The trend and periodic variations of GCM are investigated by using MSSA. Finally, based on historical GCM data, the fusion method of LM, MSSA and ARMA models is used to predict GCM parameters.

2 Data Collection and Methodology

2.1 SLR-Derived GCM Products

The GCM products used in this paper are obtained from CSR at University of Texas website (http://ftp.csr.utexas.edu/pub/slr/geocenter/). The GCM products (GCN_L1_L2_30d_CF-CM) were solved with UTOPIA and LLISS from SLR data of geodynamical satellites (e.g. Lageos-1/2, Starlette, Ajisai and Stella) in SLRF2014 (Cheng et al. 2013a; Pearlman et al. 2019). CF-CM is intended to reflect the true degree-1 mass variations without being affected by the higher-degree site loading effects (particularly at the annual frequency) (Ries 2016). The GCM products are often used to study the local and global mass balance with GRACE and are currently the best geocenter coordinate result recognized internationally. Here we download the GCM products from Jan. 1993 to Feb. 2017.

2.2 Multi-channel Singular Spectrum Analysis

There is a time series \(x_{{li}} \left( {l = 1,...,L;i = 1,...,N} \right)\) in which l is channel number and i is time sequence number. The rank of \(x_{{li}}\) is arranged according to the time delay phase space M (\(1 \le M \le N/2\)) that is the window length and also called the step number of time lag. The integer multiple of the main cycle is generally chosen as one window length in MSSA (Oropeza and Sacchi 2011; Shen et al. 2018).

The trajectory matrix of the channel \(l\) is

$$ {\mathbf{X}}_{i} = \left[ {\begin{array}{*{20}c} {x_{{1i}} } & {x_{{2i}} } & \cdots & {x_{{Ki}} } \\ {x_{{2i}} } & {x_{{3i}} } & \cdots & {x_{{K + 1i}} } \\ \vdots & \vdots & \vdots & \vdots \\ {x_{{Mi}} } & {x_{{M + 1i}} } & \cdots & {x_{{Li}} } \\ \end{array} } \right]\,\left( {1 \le i \le N} \right) $$
(1)

where \(K = N - M + 1\). The multi-channel trajectory matrix can be indicated as

$$ {\mathbf{X = [X}}_{{\mathbf{1}}} {\mathbf{,X}}_{{\mathbf{2}}} {\mathbf{,}} \cdots {\mathbf{,X}}_{{\mathbf{N}}} {\mathbf{]}}^{{\mathbf{T}}} $$
(2)

Matrix X has \(L \times M\) rows and \(N - M + 1\) columns. Similar to SSA, the next step is to decompose the singular value of X. We define the matrix \({\mathbf{S}} = {\mathbf{XX}}^{T}\), where \({\mathbf{X}}^{T}\) is the transposed matrix of X. Suppose that \(\lambda _{1} , \cdots ,\lambda _{M}\) are the eigenvalues of matrix S, that is, the singular values. These eigenvalues are arranged in the descending order. The larger singular value generally represents the larger energy signal and the smaller one corresponds to the noise part. Matrix X can be expressed in the elementary matrix as

$$ {\mathbf{X = P}}_{{\mathbf{1}}} {\mathbf{ + P}}_{{\mathbf{2}}} {\mathbf{ + }} \cdots + {\mathbf{P}}_{{\mathbf{D}}} $$
(3)

where D represents the number of singular values, and \({\mathbf{P}}_{i} = {\mathbf{S}}_{i} {\mathbf{U}}_{i} {\mathbf{V}}_{i}^{T}\) in which \({\mathbf{U}}_{i}\) is the temporal empirical orthogonal function and \({\mathbf{V}}_{i}\) is the temporal principal components.

The GCM time series contain different signals, such as annual term and semi-annual term. It is necessary to use the w-correlation method (Hassani 2007) to merge elementary matrix \({\mathbf{P}}_{i}\) representing the same signal into a group. Suppose that the time series after reconstruction of each elementary matrix \({\mathbf{P}}_{i}\) is \({\mathbf{Y}}_{i}\), the correlation of any two reconstructed time series can be expressed by w-correlation as

$$ \rho _{{i,j}}^{w} = \frac{{(Y^{{(i)}} ,Y^{{(j)}} )}}{{\left\| {Y^{i} } \right\|_{w} \left\| {Y^{j} } \right\|_{w} }},(1 \le i,j \le L) $$
(4)

Where \(\left\| {Y^{i} } \right\|_{w} = \sqrt {(Y^{{(i)}} ,Y^{{(i)}} )}\), \((Y^{{(i)}} ,Y^{{(j)}} ) = \sum\limits_{{k = 1}}^{N} {w_{k} y_{k}^{i} y_{k}^{j} }\), and \(w_{k} = \min (k,M,N - k)\). The larger the absolute value of \(\rho _{{i,j}}^{w}\) is, the greater the correlation of the corresponding components of i and j is, which should be classified as the same periodic signal component. Then the corresponding trajectory matrix has been build.

The corresponding group of trajectory matrix is converted into a new time series with the length of N, which is called the reconstructed component (Golyandina et al. 2001). Then the reconstructed component (RC) is

$$ {\text{x}}_{{li}}^{k} {\text{ = }}\left\{ {\begin{array}{*{20}l} {\frac{1}{i}\sum\limits_{{j = 1}}^{i} {a_{{i - j}}^{k} E_{{lj}}^{k} } } \hfill & {1 \le i \le M - 1} \hfill \\ {\frac{1}{M}\sum\limits_{{j = 1}}^{M} {a_{{i - j}}^{k} E_{{lj}}^{k} } } \hfill & {M \le i \le N - M + 1} \hfill \\ {\frac{1}{{N - i + 1}}\sum\limits_{{j = i - N + M}}^{M} {a_{{i - j}}^{k} E_{{lj}}^{k} } } \hfill & {N - M{\text{ + }}2 < i \le N} \hfill \\ \end{array} } \right. $$
(5)

3 Analysis of GCM

3.1 GCM Seasonal Variation

The GCM time series comprising monthly data (\(L = {\text{290}}\)) from Jan. 1993 to Feb. 2017 are used to analyze GCM variation. The trajectory matrix of GCM are decomposed by selecting the window \(M = {\text{108}}\) (month), which is determined by the period of the annual term of GCM and many practical experiments. Singular values in descending order are shown in Fig. 1.

Fig. 1.
figure 1

Singular values of GCM series determined by MSSA

As shown in Fig. 1 that the values starting from the 20th singular value are already smaller than the first 4th previous singular values, and the values after 20th order change smoothly so that they can be ignored. The w-correlations \(\rho _{{i,j}}^{w}\) of the first 20 reconstruction parts of GCM in the three directions except for the trend term are shownin Fig. 2.

Fig. 2.
figure 2

W-correlations for the first 20 reconstructions. (a) X direction, (b) Y direction, and (c) Z direction.

The greater w-correlation \(\rho _{{i,j}}^{w}\) means the corresponding components belong to the same periodic term. As shown in Fig. 2 that the time series in the X direction is not completely separated from each other when \(i > 10\) and the separation effect in both Y and Z directions also deteriorates after \(i >10\), which may be caused by noises. Hence, the first 10 groups (RC1, RC2, …, and RC10) are used to reconstruct the new GCM series. RC1 and RC2 represent a periodic term in the original series; RC3 and RC4 represent the other one; RC5 and RC6, RC7 and RC8, RC9 and RC10 can be combined into one periodic component, respectively. Figure 3 shows the original GCM series in X, Y and Z directions except for the trend term and the time series reconstructed by the first 10-order reconstruction components.

As shown in Fig. 3 that the fluctuation ranges of raw GCM in X and Y directions are smaller than that in Z direction. Although the offset of CM relative to CF in the Z direction is large, the fluctuation amplitude is small, and most of them are negative.

The correlation coefficients between the time series reconstructed by the first 10 singular vectors and the corresponding original time series in X, Y and Z directions are 73.34%, 86.78% and 83.84%, respectively, which indicates that they have good consistency. It shows that MSSA model can effectively extract relatively complete information about the main components in the three directions.

Fig. 3.
figure 3

Time series of GCM from 1993 to 2017.2. (a) X direction, (b) Y direction, and (c) Z direction.

Table 2 shows the singular spectrum values and the variance contributions of GCM time series calculated by MSSA from 1993 to Feb. 2017. The variance contribution of the first 10-order reconstruction components has reached 65.80%, which can characterize the variation of GCM effectively. Furthermore, the variance contribution of reconstruction RC1 and RC2 is significantly larger than that of other components, which indicates that the corresponding cyclophysis is most obvious. The first 10-order reconstruction components can be combined into five periodic terms according to the principle of w-correlation. Figures 4, 5 and 6 show the reconstructed component and the corresponding power spectrum in the three directions.

Table 2. Singular values and variance contributions of the first 10 orders based on MSSA
Fig. 4.
figure 4

Combination of RC and the power spectrum analysis in X direction

Fig. 5.
figure 5

Combination of RC and the power spectrum analysis in Y direction

Fig. 6.
figure 6

Combination of RC and the power spectrum analysis in Z direction

As shown in Figs. 4, 5 and 6 that the five main periodic terms of GCM in three directions are basically the same. There is the annual term, semi-annual term, quasi-0.6 years term, quasi-1.5 years term and long-term term. The cycles are respectively 0.99 years, 0.5 years, 0.58 years, 1.47 years and 6.09 years. The fifth main periodic term is also mixed with a small periodic term of about 1.4 years. The variance contributions of these five main periodic components in the three directions are different, and annual characteristics make great domination.

The annual periodic oscillations in the three directions are relatively stable, and the periodicity is the most obvious, with amplitudes of about 1.7 mm, 2.8 mm and 4.4 mm respectively. The valley value of the annual variation of GCM in X direction occurs from August to September and the peak value appears from February to March. The valley value of the annual variation of GCM in Y direction occurs from May to June and the peak value appears from November to December. The valley value of the annual variation of GCM in Z direction occurs from July to August and the peak value appears from January to February. The semi-annual term oscillation is not relatively obvious. Although the semi-annual variations of 0.5 years are shown in these three directions, the corresponding amplitude variation characteristics are not the same. In the past decade, the amplitude of the semi-annual cycle in three directions have gradually increased. The cyclophysis of quasi-1.5 years and quasi-0.6 years, one of the five main periods of GCM, also exhibits strong seasonal characteristics. The oscillation characteristics of quasi-1.5 years and quasi-0.6 years period in Y and Z direction are similar, and the amplitude gradually decreases with time. The oscillation of quasi-1.5 years period in X direction has obvious fluctuations and reached its maximum in February 2005. The seasonal characteristic of the quasi-0.6 years period in X direction is obvious, which may be due to the inclusion of many other signals in the period.

The annual term, semi-annual term, quasi-0.6 years term and quasi-1.5 years term mentioned above belong to the seasonal cycle of GCM. The solar radiation, changes in the gravitational field and other Earth external energy cause the surface mass redistribution of land water, ocean and atmosphere, which results in the significant seasonal GCM. The major reason for the seasonal cycle of GCM is the seasonal variation of land water storage (Guo et al. 2008; Feissel-Vernier et al. 2006). The exchange of water mass in the Earth’s hemisphere has a clear annual cycle. Greater water mass in the northern hemisphere appears during June-August, instead of appears during December-February in the southern hemisphere (Blewitt et al. 2001; Baur et al. 2013). The peak and valley values of the annual term in Z direction may be the reflection of water mass redistribution.

The long period terms in X, Y and Z directions analyzed by MSSA are all 6.09 years. The major reason for the secular variation of center of mass of the Earth system is the glacial isostatic adjustment. The influence of the glacial ablation on the solid Earth causes GCM velocity of less than 1 mm/yr (Klemann and Martinec 2011). The amplitude of the long-period term in the X direction is maintained within 0.5 mm, but the amplitude in the Y and Z directions has a sudden increase of 1 mm in 1997, which may be caused by the El Nińo that was strongest in the 20th century (Guo et al. 2009).

3.2 GCM Trend Variation

The GCM series in X, Y and Z directions are directly decomposed and analyzed by using MSSA to determine their trends. The GCM trend variation from 1993 to 2017.2 is shown in Fig. 7. The secular velocity of GCM is calculated by least square method. Table 3 shows the comparisons among the proposed method and the reported methods.

As shown in Fig. 7 that the trend variation in the three directions of GCM are nonlinear. After 1997, the moving direction of GCM in the three directions remained stable. It can be seen from Table 3 that the long-term speed of GCM obtained from different data and time has a relatively large difference, and there is no accurate reference standard at present. In this paper, the variation rates in X, Y and Z are 0.05 mm/yr, 0.04 mm/yr and −0.10 mm/yr, respectively, all of which are less than 1 mm/yr. After 1997, the long-term variation rate of GCM in Z direction is negative, which may be caused by glacial isostatic adjustment (Wu et al. 2010).

Fig. 7.
figure 7

GCM trend variation by using MSSA

Due to the local expansion of the zero-degree hemisphere and the condensation of the 180-degree hemisphere, the variation rate of GCM in X direction is positive, which shows that CM moves towards the X direction relative to CF. Besides, the maximum variation of atmospheric pressure occurs in Central Asia (Blewitt et al. 2001; Greff-Lefftz and Legros 2007). The solid Earth is a viscous elastomer so that the maximum fluctuation of atmospheric pressure in Central Asia may have a certain impact on GCM in Y direction, which results in the positive of the secular velocity.

Table 3. Secular velocity of GCM in different studies (mm/yr)

4 Prediction of GCM

4.1 Principle of Prediction

With the continuous development and improvement of space technology, there is a growing demand for the prediction parameters of GCM, such as millimeter level, dynamic real-time global reference framework and high-precision satellite navigation and positioning (Metivier et al. 2010; Zhao et al. 2019). To meet this demand, therefore, the combined method of LM and MSSA and ARMA is applied to predict the GCM parameters. The flow chart of prediction is shown in Fig. 8.

Fig. 8.
figure 8

The prediction method of GCM

  1. (1)

    First, perform linear fitting on the GCM series, establish a linear model and make predictions. Assuming that the GCM time series is \(X_{t}\), the least squares method is used to linearly fit it, and the model is expressed as:

    $$ X_{t} = \beta _{0} + \beta _{1} {\text{t}} $$
    (6)

Where \(\beta _{0}\) is a constant, and \(\beta _{1}\) represents the linear trend change rate.

  1. (2)

    MSSA is used to decompose the GCM series of the de-linear trend, and the appropriate main component terms are selected for the main component prediction of the GCM. The main component prediction method of MSSA is as follows:

  • The number of predictions of GCM is N, assuming that the time series of GCM without linear trend is \(Y_{t}\), N zeros are added at the end of \(Y_{t}\) to form a new prediction sequence \(Y_{{t + N}}\);

  • The new prediction sequence \(Y_{{t + N}}\) is decomposed by MSSA, and the N values at the end of the first RC (RC1) are used to replace the corresponding prediction values of the new sequence. This process is repeated until the RMS value of the two replacements data is less than 0.001 mas;

  • RC2 is added to reconstruct the prediction data, that is, the prediction data is obtained by linear superposition of RC1 and RC2. Step 2 is repeated until RC1…RCi is linearly added to the prediction data, and the predictions using MSSA can be obtained.

  1. (3)

    ARMA is used to model and forecast the residual components reconstructed by MSSA. Assuming that the sequence of residual items is \(Z_{t}\), the ARMA model is expressed as:

$$ Z_{t} = c + \sum\limits_{{i = 1}}^{p} {\phi _{i} Z_{{t - i}} + \varepsilon _{t} + \sum\limits_{{j = 1}}^{q} {\theta _{j} \varepsilon _{{t - j}} } } $$
(7)

Where p represents the order of the AR model, q represents the order of the MA model, c is the constant term of the ARMA model, \(\phi _{i}\) is the free regression coefficient at time t − i, \(\varepsilon _{t}\) is the error term at time t, and \(\theta _{j}\) is the moving average coefficient at time t − j. The extended autocorrelation function (EACF) is selected to determine the order p and q of the AR model and the MA model through the maximum likelihood estimation method. The model parameters are estimated to build the ARMA model and predict the residual items.

Finally, the GCM predictions are obtained by adding the predictions of the linear model, MSSA and ARMA models.

4.2 Results and Analysis

This paper selects the time series of GCM from 1993 to Feb. 2017 provided by CSR for prediction research. Taking the data from January 1993 to the starting time of forecast as the training data, seven 2-year forecast experiments were carried out. The starting time was January, May and September 2013, January, May and September 2014 and January 2015, respectively. Through the analysis in Sect. 3, the MSSA window and main component were selected as 108 and 10. Through EACF analysis, it is found that ARMA(0,6), ARMA(0,11) and ARMA(3,9) have good applicability to the residual items in the X, Y and Z directions of GCM, respectively.

Table 4. The statistical precision of LM+MSSA+ARMA model (mm)

Figure 9 shows the comparison between the predictions and the true values of GCM in the three directions. The black line in the figure represents the true values of GCM, and the red line represents the predicted values. The statistical results of the prediction precision of the LM+MSSA+ARMA model are shown in the Table 4. As shown in Fig. 8 and Table 4 that the combined model has the best prediction precision when the prediction length is 6 months. The RMSE in X, Y, and Z directions are 1.29 mm, 1.03 mm and 3.29 mm, respectively, and the precision of X and Y directions are all within 1.5 mm. The precision of the Z direction is relatively poor, which may be caused by the large change in the amplitude of GCM in the Z direction. As the prediction time increases, the predicted precision shows a downward trend as a whole, reaching the maximum when the prediction length is 2 years, and the prediction precision in the three directions is 1.53 mm, 1.08 mm and 3.46 mm respectively. In general, the LM+MSSA+ARMA model has good performance in short-term, medium-term and long-term predictions of GCM, and can provide prediction precision of 1.5 mm, 1.1 mm and 3.5 mm in the X, Y and Z directions respectively.

Fig. 9.
figure 9

Comparison of GCM predictions

5 Conclusions

The GCM products estimated from SLR data released by CSR are analyzed to determine the variation of geocenter motion. MSSA is used to analyze the GCM series in the X, Y and Z directions from 1993 to 2017.2. The seasonal variation periods are 0.99 years, 0.5 years, 0.58 years and 1.47 years, and the long period term is 6.09 years in the X, Y and Z directions, which shows that the MSSA can well display and amplify the main periodic signals of the original time series of GCM. The annual characteristics in the three directions are the most obvious and the wave oscillation is stable. The GCM in the X, Y and Z directions are directly analyzed by using MSSA to determine its trends. The long-term trends of the three directions are 0.05 mm/yr, 0.04 mm/yr and −0.10 mm/yr, respectively. The migration velocity in Z direction is obviously higher than that in X and Y directions, which may be mainly caused by the water mass redistribution in the northern and southern hemispheres.

To obtain high-precision geocentric motion prediction parameters, the method combining LM and MSSA and ARMA models is used to predict GCM with ahead 2 years. The results show that the LM+MSSA+ARMA model can effectively predict GCM parameters, and can provide prediction precision of 1.5 mm, 1.1 mm and 3.5 mm in X, Y and Z directions, respectively.