1 Introduction

Earth orientation parameters (EOPs) include the nutation and precession parameters, the polar motion parameters and the length of day (LOD). EOPs contain a wealth of geodynamics information and play a significant role in applications including the determination of high-precision satellite orbits, spacecraft tracking, laser measurements, and deep space exploration. These parameters are needed to achieve mutual conversion between celestial and terrestrial reference frames. Thanks to developments in modern measurement techniques, such as very long baseline interferometry (VLBI), satellite laser ranging (SLR), global navigation satellite system (GNSS), and doppler orbitography by radio-positioning integrated on satellite (DORIS) (Dow et al. 2005), increasingly accurate observations of EOPs promote advances in celestial dynamics.

Due to the complexity of the data processing involved, EOP products cannot be calculated in real time using the data obtained by modern earth observation technologies. Thus, the parameters are usually provided after a delay of hours to days. However, for some practical applications, the EOPs must be acquired in advance. For example, in satellite navigation systems, only long-term predictions of EOPs can be used to achieve mutual conversion between the celestial and terrestrial reference frames when satellites enter the autonomous orbit mode. Therefore, a reliable and high precision prediction model is needed. However, EOPs are affected by many factors (Völgyesi 2006) that can be divided into two groups, specifically the effects of (1) different levels and periodic excitation sources and (2) high-frequency variations. These factors cause enormous challenges in the precise prediction of EOPs.

To check the accuracy of different methods of predicting EOPs, the Institute of Geodesy and Geophysics of the University of Vienna conducted two prediction competitions, the Earth Orientation Parameters Prediction Comparison Campaign (EOP PCC) in October 2005 (http://users.cbk.waw.pl/~kalma/EOP_PCC/) and the Earth Orientation Parameters Combination of Prediction Pilot Project (EOPCPPP) in October 2010 (http://eopcppp.cbk.waw.pl). The purposes of these activities were to encourage scholars worldwide to utilize different methods to predict EOPs and to assess the forecasting accuracy and applicability of various prediction methods. The prediction methods can be divided into two categories, specifically single models (including both linear and nonlinear models) and combinations of multi models (Freedman et al. 1994; Schuh et al. 2002; Kosek et al. 2004, 2008; Xu 2012; Guo et al. 2013; Xu and Zhou 2015). One major conclusion that was reached as a result of these competitions is that no single prediction technique is suitable for all EOPs over their entire ranges of variation. The following two conclusions were also reached. (1) Least-squares extrapolation of harmonic models and autoregressive prediction (LS + AR), spectral analysis and LS extrapolation, and neural networks display relatively good performance in predicting PM. (2) Kalman filters, wavelet decomposition and auto-covariance prediction, and adaptive transformation of the atmospheric angular momentum (AAM) yield relatively high-quality results in predicting UT1-UTC and LOD (Kalarus et al. 2007, 2010).

Many scholars have carried out studies on methods of predicting polar motion (Zhang 2012; Sun 2013; Lei 2016). Of these methods, LS + AR models represent a relatively stable type of model that is used to predict PM parameters; moreover, many scholars have made different improvements to this method (Sun and Xu 2012; Xu et al. 2012a, b; Yao et al. 2013). Note that the high calculation precision of the PM parameters is almost negligible compare with the prediction accuracy; thus, the effects of model error on the predictions should be emphasized. Therefore, how to address the residuals that result from fitting LS models to data has been examined by different scholars. Differential method processing LS + AR models are used mainly in short-term forecasts (Yan and Yao 2012), whereas WLS + AR models are employed in medium-and-long-term predictions of PM parameters. The main principle used by previous studies to construct the weight matrix is that relatively high weights are assigned when the predicted values lie close to the data; although this practice effectively improves the prediction accuracy, it depends considerably on the experience of the operator (Sun and Xu 2012). Therefore, in this paper, we propose employing the calculation precision of the PM parameters extracted from EOP products as weighting factors to determine the weight matrices of the vectors of observation used in the least-squares extrapolation model. In this study, the fitting parameters of the weighted least-squares model are first calculated; the AR model parameters are then determined from the residuals; and finally, the extrapolated values obtained from the WLS and AR models are combined to obtain the final predictions. Having extracted PM parameters and their precisions from the data products published by the analysis centres of the IGS and the IERS, which are taken to represent basic data, this study predicts medium-term changes in PM and compares them with the results predicted by the LS + AR model to verify the feasibility of the method presented here.

2 Methods

2.1 Weighted least-squares model

The method is based on LS + AR. An introduction to the construction of and the calculations performed by the model is first presented. Existing studies have shown that the major trends of the PM contain a linear term and periodic terms, including the Chandler wobble and the annual and semi-annual terms (Sun 2013). The fitting equation of the LS model can be expressed as:

$$ {\text{X}}\left( t \right) = a_{0} + a_{1} \cdot t + a_{2} \cdot cos\left( {\frac{2\pi t}{{T_{1} }}} \right) + a_{3} \cdot sin\left( {\frac{2\pi t}{{T_{1} }}} \right) + a_{4} \cdot cos\left( {\frac{2\pi t}{{T_{2} }}} \right) + a_{5} \cdot sin\left( {\frac{2\pi t}{{T_{2} }}} \right) + a_{6} \cdot cos\left( {\frac{2\pi t}{{T_{3} }}} \right) + a_{7} \cdot sin\left( {\frac{2\pi t}{{T_{3} }}} \right) $$
(1)

where X(t) is the PM at time t; t is the UTC time (in years); \( a_{0} ,a_{1} ,a_{2} ,a_{3} ,a_{4} ,a_{5} ,a_{6} ,a_{7} \) need to be estimated; \( T_{1} ,T_{2} ,T_{3} \) are the periods of the semi-annual, annual and Chandler wobble terms, respectively. In this paper, \( T_{1} = 0.5a,T_{2} = 1a \) and \( T_{3} = 1.183a \). Estimates of the model parameters can be calculated using formula (2):

$$ \upalpha = \left( {A^{T} PA} \right)^{ - 1} A^{T} PL $$
(2)
$$ \upalpha = \left[ {\begin{array}{*{20}c} {a_{0} } & {a_{1} } & {a_{2} } & {a_{3} } & {a_{4} } & {a_{5} } & {a_{6} } & {a_{7} } \\ \end{array} } \right]^{T} $$
(3)
$$ {\text{L}} = \left[ {\begin{array}{*{20}c} {X\left( {t_{1} } \right)} & {X\left( {t_{2} } \right)} & \cdots & {X\left( {t_{n} } \right)} \\ \end{array} } \right]^{T} $$
(4)
$$ {\text{A}} = \left[ {\begin{array}{*{20}c} 1 & {t_{1} } & {cos\left( {\frac{{2\pi t_{1} }}{{T_{1} }}} \right)} & {sin\left( {\frac{{2\pi t_{1} }}{{T_{1} }}} \right)} & {cos\left( {\frac{{2\pi t_{1} }}{{T_{2} }}} \right)} & {sin\left( {\frac{{2\pi t_{1} }}{{T_{2} }}} \right)} & {cos\left( {\frac{{2\pi t_{1} }}{{T_{3} }}} \right)} & {sin\left( {\frac{{2\pi t_{1} }}{{T_{3} }}} \right)} \\ \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots \\ 1 & {t_{n} } & {cos\left( {\frac{{2\pi t_{n} }}{{T_{1} }}} \right)} & {sin\left( {\frac{{2\pi t_{1} }}{{T_{1} }}} \right)} & {cos\left( {\frac{{2\pi t_{n} }}{{T_{2} }}} \right)} & {sin\left( {\frac{{2\pi t_{n} }}{{T_{2} }}} \right)} & {cos\left( {\frac{{2\pi t_{n} }}{{T_{3} }}} \right)} & {sin\left( {\frac{{2\pi t_{n} }}{{T_{3} }}} \right)} \\ \end{array} } \right] $$
(5)

where \( \upalpha \) is the vector of estimated parameter; A is a coefficient matrix; L is a vector of observation; and P is the weight matrix, which is a diagonal matrix. The residuals are then processed using an AR model.

2.2 Determination of the weight matrix

The key feature of the WLS method is that it incorporates a weight matrix into the estimation of parameters based on LS models. An appropriate weighting method effectively improves the precision of the model parameters, thus resulting in a more stable base sequence for the AR model. In the actual process of surveying adjustment, using the calculated precision of the parameters as the posterior factor is a commonly weighting method. Therefore, this paper extracts the precision of PM parameters from the EOP products released by different analysis centers for use as a variance factor in building the weight matrix. The weight matrix is expressed as follows:

$$ {\text{P}}_{w} = \left[ {\begin{array}{*{20}c} {\frac{{\sigma_{0}^{2} }}{{\sigma_{1,w}^{2} }}} & 0 & 0 & 0 \\ 0 & {\frac{{\sigma_{0}^{2} }}{{\sigma_{2,w}^{2} }}} & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & {\frac{{\sigma_{0}^{2} }}{{\sigma_{n,w}^{2} }}} \\ \end{array} } \right] $$
(6)

where w denotes the polar motion (PMX, PMY); n is the length of the basic data; the unit weight variance \( \sigma_{0}^{2} \) is assigned a value of 1; and \( \sigma_{n,w}^{2} \) represents the variances of the calculated PM parameters on the nth day.

2.3 AR model

An AR model describes the relationship among the variables of a random series \( z_{t} \left( {t = 1,2, \ldots N} \right) \), which can be expressed as follows:

$$ z_{t} = \sum_{i = 1}^{p} \varphi_{i} z_{t - i} + \varepsilon_{t} $$
(7)

where \( \varphi_{1} ,\varphi_{2} , \ldots \varphi_{p} \) are autoregressive coefficients obtained using the least squares method; p is the order of the model; and \( \varepsilon \) represents zero-mean white noise. In this paper, the AR model is constructed by the fitting residuals of the WLS model.

A key step in applying AR model is determining the order p. Three main methods of estimating the orders of AR model exist. Specifically, these rules are the final prediction error (FPE) criterion, the akaike information criterion (AIC), and the criterion autoregressive transfer (CAT). In practice, these three methods are virtually equivalent to each other. The FPE criterion is adopted to determine the order of AR model.

$$ {\text{FPE}}\left( p \right) = \frac{{\left( {N + p} \right)}}{{\left( {N - p} \right)}}P_{p} $$
(8)

where

$$ P_{p} = \frac{1}{N - p}\mathop \sum \limits_{t = p + 1}^{N} \left( {z_{t} - \mathop \sum \limits_{j = 1}^{p} \varphi_{j} z_{t - j} } \right)^{2} $$
(9)

Here \( p = 1,2, \ldots N \), and P p is the residual mean squared deviation of the model fitting sequence in AR(p). When FPE (p) reaches its minimum value, p is taken to represent the order of the best model.

2.4 Prediction error estimates

The mean absolute error (MAE) is an indicator of prediction accuracy. It is calculated as:

$$ {\text{MAE}}_{i} = \frac{1}{n} \sum_{j = 1}^{n} \left| {S_{j}^{i} - O_{j}^{i} } \right| $$
(10)

where i is the prediction interval; \( S_{j}^{i} ,O_{j}^{i} \) are the predicted and released values, respectively, on day j; and n is the number of predictions used to calculate the statistics.

3 Examples and analysis

The products released by IGS and IERS contain the values and precision of PM parameters, and the time interval used in the products is 1 day. However, the EOP products released by the IGS Analysis Centre are determined using GNSS technology, whereas those of the IERS Analysis Centre are calculated using multiple technologies (such as GNSS, VLBI, SLR, and DORIS); the precisions of the PM parameters are not the same. In this study, the EOP products released by IGS and IERS are used as basic experimental data to predict the medium-term PM and to verify the effectiveness of the method proposed. As recent studies have shown (Sun et al. 2015), the highest accuracies are obtained in predictions using of the LS + AR models when an input sequence of 10 years is used. Therefore, this thesis takes 10 years as an elementary sequence to predict PM using an interval of 1 day and a span of 1–360 days.

3.1 Experiments based on products released by IGS

The values and the precisions derived from the PM parameters products issued by IGS (ftp://cddis.gsfc.nasa.gov/pub/gps/products/), which extend from January 1, 2002 to December 31, 2011, are employed to construct the prediction model and build the weight matrix. For each prediction (1–360 days), 600 and 1000 experiments are performed using the LS + AR and WLS + AR methods. The predictions extend from January 1, 2012 to August 18, 2014 and January 1, 2012 to September 22, 2015, and the sampling interval is 1 day. Tables 1 and 2 list the MAE values for the different prediction intervals, and these values are shown graphically in Figs. 1 and 2.

Table 1 Statistical accuracy results in 600 IGS forecasts
Table 2 Statistical of accuracy results in 1000 IGS forecasts
Fig. 1
figure 1

MAE values calculated for 600 IGS forecasts over different intervals

Fig. 2
figure 2

MAE values calculated for 1000 IGS forecasts over different intervals

3.2 Experiments based on products released by IERS

As another example, this study chooses the values and the precision of PM parameters issued by IERS 08 C04 EOP product (ftp://ftp.iers.org/products/eop/long-term/c04_08/iau2000/), which extends from January 1, 2002 to December 31, 2011. These data are used to construct the prediction model and build the weight matrix. 600 and 1000 experiments are performed using the LS + AR and WLS + AR methods for each prediction (1–360 days), and the sampling interval is uniformly 1 day. The prediction dates are the same as the previous two experiments. The results are presented in Tables 3, 4 and Figs. 3, 4.

Table 3 Statistical of accuracy results in 600 IERS forecasts
Table 4 Statistical of accuracy results for 1000 IERS forecasts
Fig. 3
figure 3

MAE values calculated in 600 IERS forecasts

Fig. 4
figure 4

MAE values calculated for 1000 IERS forecasts

The above results indicate that the prediction accuracy of the WLS + AR model shows improvement at different levels in both the PMX and PMY directions. The improvements in the forecasting accuracy for short-term predictions (1–30 days in the future) are not obvious; the maximum improvement in the polar motion parameters in the PMX direction is less than 8.19%, whereas that in the PMY direction is within 8.587%. However, when the prediction span exceeds 30 days, the accuracy of the prediction displays clear improvements. For the group of 600 experiments, the prediction accuracy in the PMX direction calculated using the IGS and IERS data increases by an average of 16.09 and 21.43%, whereas that in the PMY direction using the IGS and IERS data increases by 23.64 and 19.19%, respectively. In the group of 1000 experiments, the prediction accuracy in the PMX direction calculated using the IGS and IERS data increases by an average of 24.14 and 19.04%, whereas that in the PMY direction using the IGS and IERS data increases by 22.94 and 11.07%, respectively. Compared with the LS + AR model, the WLS + AR model generally better in predicting polar motion parameters and yields more accurate medium-term forecasts.

4 Conclusion

This paper proposes a method of producing medium-term predictions of polar motion parameters using a WLS + AR model. In this method, the calculation precision of the PM parameters is employed to produce weighting factors to build the weight matrix of the vector of observations in the least-squares extrapolation model. More accurate parameters and extrapolated values of the LS model while and more stable residuals to form and calculate the AR model can be obtained using this method. The EOP products released by the analysis center of IGS and IERS are used as the basic data to predict the polar motion parameters in groups of 600 and 1000 experiments with a 1-day prediction interval and a span of 360 days. The results of the experiments show that, compared with the LS + AR model, the WLS + AR model proposed in this paper effectively improves the accuracy with which polar motion parameters can be predicted.