1 Introduction

Traditional ballistic missiles and flying missiles have not been able to achieve full penetration. Therefore, with the improvement of missile defence system, the new missile called boost-gliding-guided projectile which combines the advantages of both ballistic missiles and flying missiles is developed [1]. This kind of missile is launched by barrel weapon. Once coming through the muzzle, it opens the tail wing, which enables it to spin at a specified speed and to maintain stable fight. After a few seconds, the small rocket booster engine on the missile begins working to assist the missile in climbing. Once the rocket shuts down, the missile detect system starts to work. When the missile reaches somewhere near the vertex, the duck rudders are opened. Then the guiding system continuously adjusts the deflections of the rudders to control the attitude of the projectile. By this way, the range is extended and precise strike can be achieved [2]. Due to the rotating characteristics of the projectile and some other factors, such as the delay between the control instructions and the response of the rudder system, there is significant cross coupling between the yaw channel and the pitch channel. What’s more, the gliding-guided projectile control model which is always constructed under some assumptions differs from the actual one, and the projectile suffers from uncertain disturbance and dynamic parameters perturbation in flight. In other words, the attitude control problem studied in this paper can be classified as the control problem of nonlinear multiple-input multiple-output (MIMO) uncertain system. To deal with such problem, researchers have proposed many methods, such as feedback linearization method, fuzzy method, variable structure method [3,4,5,6,7,8,9,10] and so on. Among them, variable structure method is widely used to solve nonlinear control problems [11,12,13,14,15], for its merits of simplicity in implementation, fast dynamic response, and robustness against parameter variations. However, traditional sliding mode control suffers from the chattering problem [16]. As a remedy to this drawback, higher order sliding mode control approach is proposed [17,18,19,20]. Furthermore, to cope with the uncertain terms in model plant, estimator should be used. Artificial neural network (ANN) is an efficient tool to estimate disturbance due to its major advantages such as strong self-learning skills, and powerful ability to map nonlinear system. Usually, ANNs adopt Hebb or learning rule to adjust weights of networks [21], but that can only guarantee the system is convergent around the ideal value. So, researchers propose online adaptive neural network [22,23,24,25,26], which is based on Lyapunov stable theory, to ensure the stability of closed-loop system.

In general, estimator is assigned to figure out the uncertainties in plant parameters, as well as coupling inputs and external disturbance [27, 28].

In this paper, we take the gliding guided missile as a research object, and a composite control strategy is proposed to solve the problem of cross coupling and uncertain disturbance. Specifically, the coupled control inputs are regarded as a single channel input, while the rest part of the plant is treated as lumped uncertainty. Model following control approach is adopted in the framework of full-order sliding mode control, in this way, it is feasible to get the desired control quality. The uncertainties are cancelled by an adaptive RBF neural network estimator, and the convergence of system is demonstrated through Lyapunuov stable theory. The effectiveness of the control method proposed in this paper is verified through simulation. Simulation results show that the controller proposed can achieve high control precision, and is robust to internal and external disturbances.

The rest of this study is organized as follows. In Sect. 2, the problem to be solved is described. In Sect. 3, the decoupling control is designed based on model following SMC and an adaptive RBF network estimator, and the stability of system is proved. In Sect.  4, comparison simulation is performed, and the improved attitude controller is given.

2 Gliding-Guided Projectile Attitude Kinematics Model

Consider that the gliding-guided projectile is gliding along a short flight path, assuming the flight speed and rotational speed of the projectile are constant, and neglect the variation of the dynamic pressure and Mach number as well as other physical parameters. Under the assumption that the attack angle and side slip angle are small, the attitude kinematics model of the gliding-guided projectile can be constructed as [23]

$$\begin{aligned} \left\{ \begin{aligned} {\dot{\theta }}&=a_1\alpha -a_2\beta +a_3\delta _z\\ -\dot{\varphi }\cos \theta&=a_1\beta -a_2\alpha -a_3\delta _y\\ \ddot{\vartheta }&=b_1\alpha -b_2\beta +b_4\delta _z-b_3{\dot{\vartheta }}+b_5\cos \vartheta \cdot {\dot{\psi }}\\ \ddot{\psi } \cos \vartheta&=b_1\beta +b_2\alpha +b_4\delta _y-b_3\cos \vartheta \cdot {\dot{\psi }}-b_5{\dot{\vartheta }}, \end{aligned}\right. \end{aligned}$$
(1)

where \(\alpha \) represents angle-of-attack; \(\beta \) represents side slip angle; \(\theta \) and \(\varphi \) are trajectory inclination angle and trajectory deflection angle ,respectively; \(\vartheta \) and \(\psi \) are angle of pitch and angle of yaw, respectively; \(\delta _z\) and \(\delta _y\) are equivalent rudder angle of pitch channel and equivalent rudder angle of yaw channel, respectively; \(a_i\) and \(b_i\) are kinetic coefficients, \(a_1=\frac{QS}{mv}(C^{\alpha }_L+C^{\delta }_L)\) ,\(a_2=\frac{QS}{mv}C^{\prime \prime }_\mu ({\dot{\gamma }} d/v)\),\(a_3=\frac{QS}{mv}(C^{\alpha }_L\),\(b_1=\frac{QS}{E}(m^{\prime }_z+m^{\prime }_\sigma )\),\(b_2=\frac{QSl{\dot{\gamma }} D}{Ev}m^{\prime \prime }_y\),\(b_3=\frac{QSlD}{Ev}m^{\prime }_{zz}\),\(b_4=\frac{QSl}{E}m^{\prime }_\sigma \),\(b_5=\frac{C{\dot{\gamma }}}{E}\), where m is mass, v is velocity, D is missile diameter, S and l are characteristic area and characteristic length, respectively, Q is dynamic pressure, \({\dot{\gamma }}\) is rotational speed of the missile, E and C are equatorial damping coefficient and polar damping coefficient, respectively, \(C^\alpha _L\) and \(C^\delta _L\) are derivation of lift induced by missile and that induced by canard, respectively, \(C^{\prime \prime }_\mu \) and \(m^{\prime \prime }_y\) are Magnus force and the derivation of Magnus moment, respectively, \(m^{\prime }_z\) is derivation of the static moment, \(m^{\prime }_{zz}\) is the derivation of equatorial damping moment, and \(m^{\prime }_\sigma \) is the derivation of control moment induced by canard.

When studying the attitude control problem of the projectile, we can think the attitude change speed is faster than the change speed of velocity vector direction of projectile center of mass [24]. Without loss of generality, we assume that \({\dot{\vartheta }}\thickapprox {\dot{\alpha }}\), \({\dot{\varphi }}\thickapprox {\dot{\beta }}\). From (1) and the assumption of \(\alpha =\vartheta -\theta \), \(\beta =(\psi -\varphi )\cos \theta \), \(\cos \theta \thickapprox \cos \vartheta \), we obtain

$$\begin{aligned} \left\{ \begin{aligned} \ddot{\beta }&=-k_1{\dot{\beta }}-k_2{\dot{\alpha }}+k_3\beta +k_4\alpha \\&\quad \quad +k_5\delta _y-k_6\delta _z-k_7{\dot{\delta }}_y\\ \ddot{\alpha }&=-k_1{\dot{\alpha }}-k_2{\dot{\beta }}+k_3\alpha -k_4\beta \\&\quad \quad + k_5\delta _z-k_6\delta _y-k_7{\dot{\delta }}_z, \end{aligned}\right. \end{aligned}$$
(2)

where \(k_1=b_3+a_1\), \(k_2=b_5+a_2\), \(k_3=b_1-b_3a_1+b_5a_2\), \(k_4=b_2-b_3a_2-b_5a_1\), \(k_5=b_4-b_3a_3\), \(k_6=b_5a_3\), \(k_7=a_3\).

Here the steering gear system is considered to be second order, the steady output of equivalent rudder deflection of a spinning gliding missile is denoted as [25]:

$$\begin{aligned} \begin{bmatrix} \delta _y \\ \delta _z\end{bmatrix}=k_\mathrm{s}k_\mathrm{r}\begin{bmatrix} \cos \gamma _d&\sin \gamma _d \\sin\gamma _d&\cos \gamma _d\end{bmatrix}, \end{aligned}$$
(3)

where

$$\begin{aligned} \left\{ \begin{aligned} k_\mathrm{s}&=\frac{1}{\sqrt{(1-T^2_\mathrm{s}{\dot{\gamma }}^2)^2+(2\mu _\mathrm{s}T_\mathrm{s}{\dot{\gamma }})^2}} \\ k_\mathrm{r}&=\tau {\dot{\gamma }}+\mathrm{arccos}\frac{1-T^2_\mathrm{s}{\dot{\gamma }}^2}{\sqrt{(1-T^2_\mathrm{s}{\dot{\gamma }}^2)^2+(2\mu _\mathrm{s}T_\mathrm{s}{\dot{\gamma }})^2}}, \end{aligned}\right. \end{aligned}$$
(4)

where \(\delta _{yc}\) and \(\delta _{zc}\) are equivalent rudder deflect angle control signal for pitching channel and yaw channel, respectively, \(k_\mathrm{s}\) is gain of canard, \(T_\mathrm{s}\) is time constant of steer gear system, \(\mu _\mathrm{s}\) is system damping ratio, \(\tau \) is delay time, \(\gamma _\mathrm{d}\) is delay phase angle.

By omitting \({\dot{\delta }}_y\) and \({\dot{\delta }}_z\), from (1), (2) and (3), we get

$$\begin{aligned} \left\{ \begin{aligned} \ddot{\beta }&=-k_1{\dot{\beta }}-k_2{\dot{\alpha }}+k_3\beta +k_4\alpha \\&\quad \quad +\kappa _5\delta _yc-\kappa _6\delta _zc\\ \ddot{\alpha }&=-k_1{\dot{\alpha }}-k_2{\dot{\beta }}+k_3\alpha -k_4\beta \\&\quad \quad +\kappa _5\delta _{zc}+\kappa _6\delta _{yc}, \end{aligned}\right. \end{aligned}$$
(5)

where

$$\begin{aligned} \begin{aligned} \kappa _5&=k_5k_\mathrm{s}k_\mathrm{r}\cos \gamma _d-k_6k_\mathrm{s}k_\mathrm{r}\sin \gamma _d,\\ \kappa _6&=k_5k_\mathrm{s}k_\mathrm{r}\sin \gamma _d+k_6k_\mathrm{s}k_\mathrm{r}\cos \gamma _d. \end{aligned} \end{aligned}$$
(6)

Taking parameter variation and external disturbance into consideration, with denoting \(u_1=\delta _{yc}, u_2=\delta _{zc}, x_1=\beta , x_2={\dot{\beta }}, x_3=\alpha , x_4={\dot{\alpha }}\), we get

$$\begin{aligned} \begin{aligned} {\left\{ \begin{array}{ll} \dot{x}_1=&{}x_2\\ \dot{x}_2=&{}f_1(x,t)+\varDelta f_1(x,t)+g_{11}(x,t)u_1+\varDelta g_{11}(x,t)u_1\\ &{}+\,g_{12}(x,t)u_2+\varDelta g_{12}(x,t)u_2+d_1\\ \dot{x}_3=&{}x_4\\ \dot{x}_4=&{}f_2(x,t)+\varDelta f_2(x,t)+g_{21}(x,t)u_1+\varDelta g_{21}(x,t)u_1\\ &{}\quad +\,g_{22}(x,t)u_2+\varDelta g_{22}(x,t)u_2+d_2,\\ \end{array}\right. } \end{aligned} \end{aligned}$$
(7)

where \(f_1(x,t)=-k_1x_1-k_2x_2+k_3x_1+k_4x_3\), \(f_2(x,t)=-k_1x_4+k_2x_2+k_3x_3-k_4x_1\), \(g_{11}=\kappa _5\), \(g_{12}=-\kappa _6\), \(g_{21}=\kappa _5\), and \(g_{22}=\kappa _6\).

In (7), we can see that \(x_1\), \(x_2\) are all influenced by the coupled inputs, which makes the control problem very complicating. To solve this problem, we denote a nominal input vector \(U=\begin{bmatrix} U_1&U_2\end{bmatrix}^T\), and it satisfies

$$\begin{aligned} U=F\begin{bmatrix}u_1\\u_2\end{bmatrix}=\begin{bmatrix}g_{11}&g_{12}\\g_{21}&g_{22}\end{bmatrix}\begin{bmatrix}u_1\\u_2\end{bmatrix}. \end{aligned}$$
(8)

The rest part beside control U in (7) is treated as a lumped uncertainty, which is denoted as

$$\begin{aligned} \begin{aligned} d&=\begin{bmatrix}d_p\\d_q\end{bmatrix}\\&=\begin{bmatrix} \begin{aligned}f_1(x,t)+&\varDelta f_1(x,t)+g_{11}(x,t)u_1+\varDelta g_{11}(x,t)u_1\\&+\,g_{12}(x,t)u_2+\varDelta g_{12}(x,t)u_2+d_1\end{aligned}\\ \begin{aligned}f_2(x,t)+&\varDelta f_2(x,t)+g_{21}(x,t)u_1+\varDelta g_{21}(x,t)u_1\\&+\,g_{22}(x,t)u_2+\varDelta g_{22}(x,t)u_2+d_2\end{aligned}\end{bmatrix}.\\ \end{aligned} \end{aligned}$$
(9)

Then (7) can be transformed into two decoupled subsystems:

$$\begin{aligned} \begin{aligned} \dot{x}_p&=A_px_p+B_pU_1+B_pd_p,\\ y_p&=x_1, \end{aligned} \end{aligned}$$
(10)

and

$$\begin{aligned} \begin{aligned} \dot{x}_q&=A_qx_q+B_qU_2+B_qd_q,\\ y_q&=x_3, \end{aligned} \end{aligned}$$
(11)

where \(x_p=\begin{bmatrix} x_1&x_2\end{bmatrix}^T\), \(x_q=\begin{bmatrix} x_3&x_4\end{bmatrix}^T\), \(A_p=\begin{bmatrix} 0&1\\0&0\end{bmatrix}^T\), \(B_p=\begin{bmatrix} 0&1\end{bmatrix}^T\), \(A_q=\begin{bmatrix} 0&1\\0&0\end{bmatrix}^T\), \(B_q=\begin{bmatrix} 0&1\end{bmatrix}^T\), and \(y=\begin{bmatrix} y_p&y_q\end{bmatrix}^T\) are output signals.

The object is to design a control input U so as to drive the output vector \(y=\begin{bmatrix} y_p&y_q\end{bmatrix}^T\) to follow the control command \(y_c=\begin{bmatrix} y_{1c}&y_{2c}\end{bmatrix}^T=\begin{bmatrix} y_{pc}&y_{qc}\end{bmatrix}^T\) in spite of the uncertainties and external disturbances.

3 Design of Control

In this section, we design a controller based on model following control method in the framework of a full-order sliding mode control suggested in [32]. Besides, the uncertainties are compensated by an adaptive RBF neural network estimator. Because the subsystems (10), (11) have similar forms, we only take(10) as an example to display the design process of control.

3.1 Model Following Control with Full-Order Sliding Mode

Let

$$\begin{aligned} \dot{x}_{pm}=A_{pm}x_{pm}+B_{pm}y_{1c}, \end{aligned}$$
(12)

be a stable model satisfying certain structural conditions stated by the following assumption:

Assumption 1

\(A_{pm}\) and \(B_{pm}\) in (12) satisfy

$$\begin{aligned} A_p-A_{pm}=B_pL,\quad B_{pm}=B_pM, \end{aligned}$$
(13)

where L and M are known matrixes.

Define a sliding surface [25]

$$\begin{aligned} \sigma =B_p^Tx_p+z_p, \quad z_p(0)=-B_p^Tx_p(0), \end{aligned}$$
(14)

where

$$\begin{aligned} \dot{z}_p=-B_p^TA_{pm}x_p-B_p^TB_{pm}y_{1c}, \end{aligned}$$
(15)

where the chosen sliding surface offers the full-order sliding mode, i.e., the order of the sliding surface is identical to the origin system’s [32]. The dynamics of auxiliary \(z_p\) provides perfect model following once \(\sigma \) goes to zero and the subsystem (10) tracks the desired model dynamic given by (12) [28]. Differentiating (14) and using (10) with (15) give

$$\begin{aligned} \begin{aligned} {\dot{\sigma }}&=B_p^TA_px_p+B_p^TB_pU_1+B_p^TB_pd_p-B_p^TA_{pm}x_p\\&\quad -B_p^TB_{pm}y_{1c}\\&=B_p^TLx_p+B_p^TB_pU_1-B_p^TB_pMy_{1c}+\\&\quad B_p^TB_pd_p. \end{aligned} \end{aligned}$$
(16)

Let the control \(U_1\) be split as

$$\begin{aligned} U_1=U_\mathrm{eq}+U_n, \end{aligned}$$
(17)

where the equivalent control \(U_\mathrm{eq}\) is put in charge of the known dynamics and \(U_n\) takes care of the lumped uncertainty in (16). Choosing

$$\begin{aligned} U_\mathrm{eq}=-Lx_p+My_{1c}-(B_p^TB_p)^{-1}k_p\sigma , \end{aligned}$$
(18)

where \(k_p\) is a positive constant. From (16), (17), and (18), we obtain

$$\begin{aligned} {\dot{\sigma }}=-k_p\sigma +B_p^TB_pU_n+B_p^TB_pd_p. \end{aligned}$$
(19)

Next, the compensator \(U_n\) will be designed.

3.2 Design of Adaptive Compensator

The uncertainties in parameter and external disturbance are the main factors influencing the existence of the sliding mode for \(\sigma \). To neglect the uncertain term in (19), RBF neural network is used to design an adaptive compensator.

The compensator \(U_n\) is designed as

$$\begin{aligned} U_n=-\rho , \end{aligned}$$
(20)

where \(\rho \) is the upper bound of the uncertain term in (19) which is hard to be predicted. Thus, we use RBF neural network to estimate its value dynamically. \(\chi =\begin{bmatrix} x_1&x_2\end{bmatrix}^T\) is the input vector of RBF network, the estimation of \(\rho \) is defined as

$$\begin{aligned} {\hat{\rho }}={\hat{w}}^T\varphi (\chi ), \end{aligned}$$
(21)

where \({\hat{w}}^T\) is the weight for RBF network and \(\varphi (\chi )\) is chosen to be Gauss function with the form

$$\begin{aligned} \varphi (\chi )=\mathrm{exp}\left( -\frac{\Arrowvert \chi -c_i\Arrowvert ^2}{b_i^2}\right) , \quad i=1,2,\ldots ,n, \end{aligned}$$
(22)

where \(c_i\) denotes the center of the ith neuron, \(b_i\) denotes the width of the ith neuron, and n denotes the number of neurons.

Therefore, the compensator using RBF network for estimating the upper bound of the uncertainties is designed as follows:

$$\begin{aligned} U_n=-\,{\hat{\rho }}. \end{aligned}$$
(23)

From (19) and (23), we can have

$$\begin{aligned} {\dot{\sigma }}=-\,k_p\sigma +B_p^TB_p(d_p-{\hat{\rho }}). \end{aligned}$$
(24)

Because

$$\begin{aligned} \begin{aligned} d_p-{\hat{\rho }}&=d_p-\rho ^*-{\hat{\rho }}\\&=\eta +w^*\varphi (\chi )-{\hat{w}}\varphi (\chi )\\&=\eta +{\tilde{w}}\varphi (\chi ), \end{aligned} \end{aligned}$$
(25)

where \(\eta =d_p-\rho ^*\) represents approximate error of the optimal RBF network, the \(w^*\) represents the optimal weight for RBF network, \(\rho ^*=w^*\varphi (\chi )\) represents the optimal output of RBF network, and \({\tilde{w}}=w^*-{\hat{w}}\). Then (24) becomes

$$\begin{aligned} {\dot{\sigma }}=-\,k_p\sigma +B_p^TB_p(\eta +{\tilde{w}}\varphi (\chi )). \end{aligned}$$
(26)

Lyapunov function is defined as \(V=\frac{1}{2}\sigma ^2+\frac{1}{2N}{\tilde{w}}^2\), and weight value of the RBF network is adjusted online by adaptive algorithm. Choosing \(\dot{{\hat{w}}}=NB_p^TB_p\sigma \varphi (\chi )\), where N is positive constant, the following is derived:

$$\begin{aligned} \begin{aligned} \dot{V}&=\sigma {\dot{\sigma }}-N^{-1}{\tilde{w}}\dot{{\hat{w}}}\\&=\sigma (-k_p\sigma +B_p^TB_p(\eta +\tilde{w}\varphi (\chi )))-B_p^TB_p\sigma {\tilde{w}}\varphi ( \chi )\\&=-\,k_p\sigma ^2+B_p^TB_p\sigma \eta \\&\le -\,k_p|\sigma |^2+B_p^TB_p|\sigma |\eta \\&=-\,k_p|\sigma |(|\sigma |-k_p^{-1}B_p^TB_p\eta ). \end{aligned} \end{aligned}$$
(27)

To lead \(\dot{V}\le 0\), \(k_p\) should satisfy \(k_p\ge \frac{B_p^TB_p\eta }{|\sigma |}\). If and only if \(|\sigma |=k_p^{-1}B_p^TB_p\), \(\dot{V}=0\). When \(\dot{V}=0\), \(|\sigma |\equiv k_p^{-1}B_p^TB_p\eta \). According to LaSalle invariance principle [33], system (14) is asymptotic stable, which means \(|\sigma |\rightarrow k_p^{-1}B_p^{-1}\eta \) when \(t\rightarrow \infty \). Besides, the convergence rate depends on \(k_p\).

Due to \(V\ge 0\), \(\dot{V}\le 0\), V is bounded when \(t\rightarrow \infty \). Therefore, \({\tilde{w}}\) is bounded. It is noted that the larger the \(k_p\) is, the smaller approximate error the optimal RBF network has, which will make \({\sigma }\) have a smaller convergence radius and as a result, the effect of model tracking will be improved.

4 Simulation and Results

In this section, the validity of the proposed control scheme is demonstrated by applying it to the attitude control problem of a certain type of gliding guided projectile.

4.1 Simulation Parameters

The parameters of the gliding-guided projectile are given in Table  1.

Table 1 Parameters for a gliding-guided projectile

The attitude angle tracking command is given by

$$\begin{aligned} {\left\{ \begin{array}{ll}{} \begin{aligned} \beta _c=y_{1c}={\left\{ \begin{array}{ll}1^{\circ },\quad 1<t\le 2\\ -1^{\circ },\quad t>2\\ 5^{\circ }, \quad t\le 1.6 \end{array}\right. }\\ \alpha _c=y_{2c}={\left\{ \begin{array}{ll}4^{\circ },\quad 1.6<t\le 3\\ 7^{\circ },\quad t>3.\end{array}\right. } \end{aligned} \end{array}\right. } \end{aligned}$$
(28)

The controller parameters are set as Table 2 shows. The initial value of weight w of RBF network is \([0.1\; 0.1\; 0.1\; 0.1\;0.1]^T\), \(c_p=[-1^{\circ }\quad -0.5^{\circ }\quad 0^{\circ }\quad 0.5^{\circ }\quad 1^{\circ }]^T\), \(c_q=[-7^{\circ }\quad -3.5^{\circ }\quad 0^{\circ }\quad 3.5^{\circ }\quad 7^{\circ }]^T\), and \(b=2\).

Table 2 The parameters of controller

Since the second-order system in (7) is split into two subsystems, \(A_p=A_q=\begin{bmatrix}0&1\\0&0\end{bmatrix}\), \(B_p=B_q=\begin{bmatrix}0&1\end{bmatrix}^T\).

The reference model to be followed is chosen as

$$\begin{aligned} \dot{x}_m=A_mx_m+B_mu_m, \end{aligned}$$
(29)

where \(A_m=\begin{bmatrix}0&1&0&0\\ -\,400&-\,36.6&0&0\\ 0&0&0&1\\ 0&0&-\,200&-\,24.6\end{bmatrix}\).

Besides, \(B_m=\begin{bmatrix}0&0\\ 400&0\\ 0&0\\ 0&200\end{bmatrix}\). The initial values for the plant and model are

$$\begin{aligned} \begin{aligned}&x(0)=\begin{bmatrix}0.01&0&0.01&0\end{bmatrix}^T,\\&x_m(0)=\begin{bmatrix}0&0&0&0\end{bmatrix}^T.\\ \end{aligned} \end{aligned}$$
(30)

Next, we perform two kinds of experiments: 1, the closed-loop system of attitude suffers from strong internal and external disturbance; 2, Monte Carlo experiments with parameters variation.

4.2 Attitude Decoupling Control Simulation Under Strong Internal and External Disturbances

4.2.1 Comparison Experiment Between the Proposed Decoupling Controller and ESO-PD Controller

In this simulation, severe disturbances are considered. \(\varDelta f_1(x,t)\), \(\varDelta f_2(x,t)\), \(\varDelta g_{11}(x,t)\), \(\varDelta g_{12}(x,t)\), \(\varDelta g_{21}(x,t)\), and \(\varDelta g_{22}(x,t)\) are \(40\%\) uncertainty in \(f_1(x,t)\), \(f_2(x,t)\), \(g_{11}(x,t)\), \(g_{12}(x,t)\), \(g_{21}(x,t)\) and \(g_{22}(x,t)\), respectively. The external disturbance vector is

$$\begin{aligned} d=\begin{bmatrix}4x_1\sin (0.5t)\\3x_3\cos (t)\end{bmatrix}. \end{aligned}$$
(31)

For performance comparison, the ESO (extended state observer) − PD (proportional and differential) controller proposed in [10] is also simulated under the same conditions.

Simulation results are shown as Fig. 1. From Fig. 1a, b, we can see that both the proposed method and ESO-PD method can track the desired output with smooth trajectories. It should be noted that the proposed method responses are relative more quick than ESO-PD method. As shown in Fig. 1c, d, the sliding surfaces start from zero point and have a smooth trajectory with no chattering phenomenon, which demonstrates the characteristics of full-order SMC. And this can be verified from the attitude tracking trajectory with reference model attitude as shown in Fig. 1e, f. Figure 1g, h shows the estimation of lumped uncertainties. A deviation from the trajectory of lumped uncertainties is observed whenever there is a step change in \(\alpha \) and \(\beta \). But the output of the adaptive compensator is able to response to the change timely. From Fig. 1i, j, we can observe that ESO-PD method always demands much larger control commands to response to the change of attitude angle tracking instructions. Despite that, the controller proposed in this paper performs better in dealing with the coupling terms contained in (7), which has been demonstrated by Fig. 1a, b. Figure 1k, l shows the equivalent canard angle. It should be noted that both of the two methods change sharply at certain time, which may do harm to the actuator. To prevent this phenomenon, it is essential to arrange a transition process for the reference inputs. The method is described in the following.

Fig. 1
figure 1

Comparison analysis between the proposed decoupling controller and ESO-PD controller: a tracking effect of yaw commands; b tracking effect of pitch commands; c sliding surface \(\sigma _p\); d sliding surface \(\sigma _q\); e tracking of \(x_{pm}\); f tracking of \(x_{qm}\); g lumped uncertainty estimation of \(d_p\); h lumped uncertainty estimation of \(d_q\); iequivalent canard angle commands in yaw channel; j equivalent canard angle commands in pitch channel; k equivalent canard angle in yaw channel; l equivalent canard angle in yaw channel

4.2.2 Tracking Differentiator (TD)

In this paper, a discrete tracking differentiator (TD) is used to provide transition process for \(y_{ic},i=1,2\):

$$\begin{aligned} {\left\{ \begin{array}{ll}{} \begin{aligned} v_{i1}(k+1)&{}=v_{i1}(k)+hv_{i2}(k)\\ v_{i2}(k+1)&{}=v_{i2}(k)-rhsat(v_{i1}(k)-y_{ic}(k),\\ &{}\quad v_{i2}(k),r_0,h_0), \end{aligned} \end{array}\right. } \end{aligned}$$
(32)

where \(v_{i1}\) represents transition signal, whose differential is \(v_{i1}\), h is integration step, \(v_{i1}(k)\), \(v_{i2}(k)\) and \(y_{ic}(k)\) denotes their states at \(T=kh\), \(h_0\) is filter factor, which plays an important role in noise suppression. Here, it is not needed to filter the control command, so \(h_0=h\). \(r_0\) represents speed factor, and it is used to adjust tracking speed. Let \(T_d\) represent the time of the transition process, then the following equation holds:

$$\begin{aligned} r_0=4(y_{ic}-v_{i1}(t_0))/T_d^2, \end{aligned}$$
(33)

where \(v_{i1}(t_0)\) represents the initial value of \(v_{i1}\).

In (32), \(sat(\xi _1,\xi _2,r,h)\) is denoted as

$$\begin{aligned} {\left\{ \begin{array}{ll}{} \begin{aligned} a&{}=rh,\quad a_1=ha,\quad z=\xi _1+h\xi _2\\ g&{}={\left\{ \begin{array}{ll}{}\begin{aligned}v_{i2}-sign(z)\frac{r(h-\sqrt{\frac{8|z|}{r}+h^2})}{2},|z|&{}\ge a_1\\ v_{i2}+\frac{z}{h}, |z|&{}<a_1\end{aligned} \end{array}\right. } \\ &{}sat(\xi _1,\xi _2,r,h)=a, \end{aligned} \end{array}\right. } \end{aligned}$$
(34)

where \(sign(\cdot )\) denotes the standard sign function.

Fig. 2
figure 2

The overview attitude decoupling control of the gliding guided projectile

Fig. 3
figure 3

Comparison analysis between the improved controller and proposed method without TD: a tracking effect of yaw commands; b tracking effect of pitch commands; c sliding surface \(\sigma _p\); d sliding surface \(\sigma _q\); e tracking of \(x_{pm}\); f tracking of \(x_{qm}\); g lumped uncertainty estimation of \(d_p\); h lumped uncertainty estimation of \(d_q\); i equivalent canard angle commands in yaw channel; j equivalent canard angle commands in pitch channel; k equivalent canard angle in yaw channel; l equivalent canard angle in yaw channel

Fig. 4
figure 4

Attitude tracking of \(\beta \) and \(\alpha \): a tracking effect of yaw commands; b tracking effect of pitch commands

4.2.3 The Overview Attitude Decoupling Control of the Gliding-Guided Projectile

The attitude decoupling control of the gliding guided projectile based on full-order sliding mode control and adaptive RBF network compensator is depicted as Fig. 2. As shown, TD provides gradient signal for the reference inputs \(y_{1c}\) and \(y_{2c}\), by which the large jumping phenomenon and saturation of the control signal is avoided. Model following controllers are designed in the framework of full-order sliding mode control. What’s more, an adaptive compensator based on RBF network is proposed to enhance system robustness against parameter variation and external disturbance.

Fig. 5
figure 5

Estimation errors: a lumped uncertainty estimation error of \(d_p\); b lumped uncertainty estimation error of \(d_q\)

4.2.4 Simulation and Analysis for the Improved Attitude Controller

In this subsection, the improved attitude controller is simulated under the same conditions used in Sect. 4.2.1. The simulation results are shown in Fig. 3.

As Fig. 3a, b shows, the improved method can also track the desired output precisely, despite it responding slower than that without TD. From Fig. 3c, d, we can see that the sliding surface profile generated by the improved method is flatter. The attitude tracking with reference model attitude is depicted in Fig. 3e, f. From Fig. 3g, h, we can note that the improved method can still respond to the change of lumped uncertainties timely. The transition process arranged by the TD greatly reduces the control commands around the time of reference inputs switching just as Fig. 3i, j shows. Because the improved method generates smaller control commands, the equivalent canard angles needed are smaller, which greatly reduce the burden on the actuator and means a lot to the stability of the system.

4.3 Monte Carlo Experiments with Parameter Variation

To demonstrate the robustness of our method, we perform Monte Carlo experiments 100 times. The external disturbance is the same as that in Sect. 4.1. The standard deviation of the error of the aerodynamic parameters is set to be 30% of the nominal value. As Fig. 4 shows, attitude response curve is constrained within a very narrow envelope, which means the proposed control method is insensitive to parameter variation. Figure 5 shows the estimation errors for lumped uncertainty. The curve appears to be ladder shape due to the relatively large sampling period, but it does not prevent the RBF networks from estimating the uncertainties precisely as the figure exhibits.

5 Conclusion

In this paper, an attitude decoupling control strategy is proposed for gliding-guided projectile based on model following SMC and RBF neural network. The coupling inputs are kept as a nominal input instead of being included in lumped uncertainty, so high control value is avoided. Also, control signal jumping phenomenon is avoided by using of TD, which provides a transition process for the reference inputs. The coupling effects have been weakened completely using of RBF network, which is demonstrated by simulations.