Abstract
An integrated guidance and control scheme is developed for next generation of reusable launch vehicle (RLV) with the aim to improve the flexibility, safety and autonomy. Firstly, an outer-loop optimal feedback reentry guidance law with online trajectory reshaping capability is designed. Then, a novel reentry attitude control strategy is proposed based on multivariables smooth second-order sliding mode controller and disturbance observer. The proposed control scheme is able to guarantee that the guidance commands generated from the guidance system can be tracked in finite time. Furthermore, a control allocation is integrated in the system in order to transform the control moments to control surface deflection. Finally, some representative simulation tests are conducted to demonstrate the effectiveness of the proposed integrated guidance and control strategy for six-degree-of-freedom RLV.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Since RLV could allow for far lower per flight costs than expendable vehicle, intensive research has been investigated at NASA’s Marshall Space Flight Center in order to improve safety, reliability and affordability for RLV. To this end, the system must have capabilities of optimality, robustness and reconfiguration. As pointed in [1], reentry guidance algorithms that are not based on optimal control and high fidelity model compromise safety. Thus, the guidance technique based on optimal feedback and high fidelity model is absolute necessary for RLV. In addition, the RLV, especially in the reentry phase, suffers from severe model parameter uncertainties and unknown external disturbances which require that the system has enough robustness. The reconfiguration capability means that the system can recover from some unexpected events such as retargeted trajectory and control effector failures. From a guidance and control perspective, the optimality, robustness and reconfiguration for next generation of RLV cause enormous challenges for researchers. The reasons can be summarized as [2–4]: (1) path constraints: Heat rate, structural loads and dynamic pressure restrict the reentry flight corridor in very narrow scope; (2) high accuracy and rapid attitude tracking requirements: The attitude control with high accuracy and rapid convergence is essential but not easy to be realized due to the influence of uncertainties and external disturbances. Although the difficulties mentioned above, guidance and control technologies for RLV have been intensively studied during the last decades.
Reentry guidance is tasked to generate guidance commands which guide vehicle from its initial position to reach desired landing point with specified accuracy. In general, reentry guidance methods can be classified as trajectory following guidance and predictive guidance. In trajectory following guidance, a nominal trajectory needs to be pre-computed. Then, a linear time-varying system is obtained along the nominal trajectory. Based on the obtained linear time-varying system, many methods, such as LQR [5], indirect Legendre pseudospectral [6] and dynamic inversion [7] can be used to design reentry guidance law which controls RLV back to the nominal trajectory in the presence of uncertainties and external disturbances. The advantages of this method are that the path constraints can be dealt with carefully and it requires minimal onboard computational effort. In addition, there is no issue with respect to the convergence of solution due to the fact that the nominal trajectory can be generated prior to flight. Many reentry missions were performed using this guidance method such as Space Shuttle guidance [8] and Apollo guidance in the final phase of reentry flight [9]. Although the trajectory following guidance method is proved to be successful, it has to consider enormous pre-mission planning to account for unexpected conditions which may be difficult to be known in practice. As a result, a large deviation from nominal trajector may be observed in the presence of disturbance. Predictive guidance methods compute trajectory onboard repeatedly based on current state and expected final condition [10, 11]. The main advantage of predictive guidance is able to adapt to different conditions which may differ significantly from nominal conditions. However, an obstacle of this method in practice is the computational burden. Therefore, in order to ensure fast and reliable convergence, guidance parameters to be found must be kept at minimum. Zimmerman et al. [12] presented an effective reentry guidance method. In the method, the reentry trajectory is divided into two parts. In the first one, the vehicle flies on a constant heating-rate trajectory, and then, two parameters predictor-corrector guidance algorithm is used in the second part. The number of total guidance parameters to be solved in guidance cycle is four which reduced computational burden to some extent. A constrained predictor-corrector guidance method based on quasi-equilibrium-glide condition is proposed in [13] where the guidance is divided into longitudinal and lateral channel. In the longitudinal channel, the path constraints are converted into bank angle constraints. Then, the magnitude of bank angle is determined via solving a root-finding problem which ensures that RLV reaches the expected landing site at specified energy level. The lateral guidance determines the sign of bank angle which can be achieved via bank reversals logic. As a result, only one guidance parameters to be iteratively found online during a guidance cycle which further reduces the computational burden comparing with that provided in [12]. Recently, with the advance in onboard computational capability, the reentry guidance methods based on real-time trajectory optimization have received much attention. Combining model predictive control and approximate dynamic programming, an efficient model predictive static programming (MPSP) is proposed by Padhi et al. [14]. The method brings in the philosophy of real-time trajectory into the framework of guidance design which in turn yields an effective guidance scheme which has already many applications in aerospace engineering [15–17]. Another alternative optimization technique is pseudospectral method (PM), especially the Gauss PM (GPM) [18–20]. In comparison with MPSP, GPM provides a complete covector mapping theorem which ensures that the obtained trajectory is optimal [21]. As mentioned previously, the optimality is an important requirement for RLV. Taking into account this feature, this method is included in proposed scheme. Also, the implementation of this method under architecture of integrated guidance and control for six-degree-of-freedom (6DOF) RLV is to be discussed.
The reentry attitude control becomes the focus of integrated guidance and control system when the guidance law is obtained. The main aim of attitude control system is to achieve attitude tracking in the presence of model parameter uncertainties and unknown external disturbances. To implement the integrated system successfully, reentry attitude controller needs to have enough robustness with respect to uncertainties. During the last decades, many control methods have been applied in the design of attitude controller, such as gain scheduling [22, 23], backstepping control [24, 25], trajectory linearization control [26, 27] and dynamic inversion technique [28, 29]. Although many control methods have been proposed, the sliding mode control stays the main choice for nonlinear dynamic system with bounded uncertainties due to its inherent insensitivity to uncertainties and external disturbances [30–34].
Shtessel has done excellent work in the field of attitude control based on sliding mode technique [35–40]. In [35, 36], sliding mode control is used for RLV attitude control in ascent and reentry phase. In the method, a discontinuous bang–bang type control is used to drive system state to sliding manifold and then rapid switching control is applied to keep it on manifold. In order to reduce control chattering, the discontinuous sign function is replaced by saturation function which is a trade-off between the smoothness of control action and robustness. With the aim to improve control performance, a disturbance observer technique is investigated in satellite formation attitude control [37]. Subsequently, to improve high-gain switching control because of conservative estimation of disturbance bounds, Hall and Shtessel [38] combine gain adaptation with disturbance observer to provide the least gain needed for X-38 attitude control. The similar technique has also been applied in attitude controller design for launch vehicle [39] and quadrotor vehicle [40]. However, in all these developments, a single-input control structure is used which means multivariables attitude control problem has to be transformed into decoupled single-input problem and then is solved via single-input single-output sliding mode technique. As pointed in [41], multivariables control scheme provides a more elegant solution than trying to employ a decoupled single variable structure. In addition, only asymptotic convergence can be achieved via the methods mentioned above. It is well known that the finite time convergent property is able to provide better robustness and higher tracking accuracy. To this end, it is necessary to design reentry controller under multivariables control scheme with finite time convergence despite of uncertainties.
The main contributions of the paper are stated as follows: (I) A guidance and control architecture with online trajectory design, multivariables controller and control allocation is integrated for 6DOF RLV with the aim to improve RLV’s robustness, autonomy and safety. (II) A Lyapunov-based multivariables smooth second-order sliding mode controller and disturbance observer is proposed to achieve reentry attitude tracking in finite time. The paper is organized as follows: the problem studied in the research is formulated in Sect. 2. In Sect. 3, an integrated guidance and control scheme is provided. In Sect. 4, some representative simulation tests are carried out. The conclusions and future work are presented in Sect. 5.
2 Problem formulation
2.1 Translational equations of motion
During the entry phase, most applications assume steady, coordinated turns such that the sideslip angle is kept to zero. Thus, the equations of translational motion of the RLV in three-dimensional space over a spherical, rotating Earth are described by the following equations
where \(h\) represents the flight altitude; \(V\) is velocity; \(\phi \) is latitude; \(\theta \) is longitude; \(\gamma \) is flight path angle (FPA); \(\chi \) is heading angle; \(L\) is lift force; \(D\) is drag force; \(g\) is gravity (\(g=\mu /\left( {R_E +h} \right) ^{2}\) with \(\mu \) being the Earth gravity constant); \(\Omega \) is Earth angular speed; \(R_E \) is radius of Earth. The control input used for trajectory optimization and guidance law design are angle of attack (AOA) \(\alpha \) and the bank angle \(\sigma \) (BA).
2.2 Rotational equations of motion
The rotational motion is caused by the moments about the center of mass that acts on the vehicle and is used to design the altitude controller during the entry flight, is given as:
where \(p,\,q,\,r\) denote roll, pitch and yaw angular rate, respectively; \(\beta \) denotes sideslip angle (SA); \(I_{ij} \left( {i=x,y,z;j=x,y,z} \right) \) denotes moments inertia; \(M_x,\,M_y ,\,M_z\) are roll, pitch and yaw moments.
The objective of the work is to develop an guidance and control scheme including real trajectory, multivariable attitude controller and control allocation for 6DOF RLV. Through the proposed scheme, a RLV can be guided from an initial position to terminal area energy management (TAEM) interface to be given later with minimizing final latitude and without deviating from path constraints provided in [6]. More importantly, it is able to adapt to some unexpected events, such as the retargeting of TAEM.
3 Integrated guidance and control scheme
To achieve objective of the research, an integrated guidance and control architecture is proposed in Fig. 1 which is an extension of the result in [42]. It can be seen from Fig. 1 that the system consists of real-time optimal trajectory, multivariables attitude controller–disturbance observer and control allocator. The first part is used to generate reentry trajectory online and provides feasible guidance commands, including AOA and bank angle, for attitude control system. The multivariables controller and disturbance observer are designed in second part to achieve high accuracy and rapid tracking for guidance commands. The control allocator is used to convert control moments into control surface commands. Finally, the 6ODF RLV model provided in (1)–(12) is used as the simulation model. The implementations of each part are described in detail in the remainder of this section.
3.1 Real-time optimal trajectory design
The implementation of real-time optimal trajectory is composed of trajectory optimization algorithm and real-time feedback logic. As noted earlier, the PM has been widely used over last few years to solve a variety of trajectory optimization problems. Recently, the adaptive GPM is proposed in [43] and it is also used in our separated paper [44]. For brevity, the details for this method are omitted here. From the previous results, it has been known that the reentry trajectory can be obtained quickly if initial reentry condition, path constraints, terminal condition, cost function and initial guess are provided properly. Next, the emphasis of the section is paid on real-time feedback logic. The feedback logic is based on the recently proposed pseudospectral optimal feedback theory [45]. In this theory, the closed-loop feedback solution is formulated via real-time optimal open-loop control. Since the guidance commands need to be tracked by attitude loop, it must be smooth enough. However, it is known that open-loop optimal control generated via real-time trajectory optimization is discontinuous. In order to address the issue, the derivatives of AOA and bank angle, namely \({\mathbf {u}}=\left[ {\dot{\alpha },\dot{\sigma }} \right] ^{T}\in R^{2}\), are used as virtual control for trajectory optimization. Then, the augmented states become \({\mathbf {x}}=\left[ {h,\,\phi ,\,\theta ,\,V,\,\gamma ,\,\chi ,\,\alpha ,\,\sigma } \right] ^{T}\in R^{8}\). As a result, continuous guidance commands are generated via integrating virtual control \({\mathbf {u}}\) (see Fig. 1). Combining adaptive GPM and pseudospectral optimal feedback theory, the real-time optimal trajectory under the integrated guidance and control architecture is implemented as follows:
-
Step 1 (Before \(t_0\)) Calculate reentry trajectory based on given initial reentry condition, TAEM interface, cost function and path constraints using adaptive GPM. Then, the off-line trajectory and control, \({\mathbf {x}}_0^{*} \left( {t\!\ge \! t_0 } \right) {\mathbf {u}}_0^* \left( {t\ge t_0 } \right) \), are obtained and will be used as initial value guess for the first real-time trajectory design.
-
Step 2 (\([{t_0,t_1}])\) Track guidance commands \(\alpha _\mathrm{cmd\_0}\,\hbox {and}\,\sigma _\mathrm{cmd\_0} \) given in (13) through control input generated from attitude controller and control allocation to be developed soon. Then, the actual trajectory and attitude at \(t=t_1\) (\(t_1 =15\) which can be defined by the user) can be obtained from the output and it will be used as the initial value guess for next calculation.
-
Step 3 \(([{t_i,t_{i+1}}]({i=1,2,\ldots }))\) Two programs run simultaneously in this step. One is optimization program which is used to generate new reentry trajectory from current state \({\mathbf {x}}\left( {t=t_i } \right) \) to TAEM interface. The other is control program including attitude controller and control allocation which make the attitude track the guidance commands \(\alpha _\mathrm{cmd\_\left( {i-1} \right) }\,\hbox {and}\,\sigma _\mathrm{cmd\_\left( {i-1} \right) } \). During this phase, the program stops if landing accuracy given in (35) is satisfied; otherwise, set \(i=i+1\), repeat Step 3.
Similar to [46], the following proportional controller is included in proposed scheme to improve \(\alpha \) and \(\sigma \) tracking
where \(\alpha _\mathrm{des\_i} \) and \(\sigma _\mathrm{des\_i} \) are obtained via integrating \({\mathbf {u}}_{i-1}^*\). The parameters \(\alpha \) and \(\sigma \) denote actual flight attitude obtained from the output of 6DOF RLV model.
Remark 1
In step 3, \(t_{i+1} =t_i +\Delta T_i \) where \(\Delta T_i \) is the time consumed to generate new reentry trajectory. It should be noted that the guidance commands are \(\alpha _\mathrm{des\_i-1}\,\hbox {and}\,\sigma _\mathrm{des\_i-1} \) during the time interval \(\left[ {t_i,t_{i+1} } \right] \). That is because the new guidance commands \(\alpha _\mathrm{des\_i}\,\hbox {and}\,\sigma _\mathrm{des\_i} \) are not available until time \(t_{i+1} \).
Remark 2
The TAEM interface in step 3 is alterable, and this feature is beneficial for RLV to reach any feasible landing site. As a result, it will enhance flexibility, safety and autonomy of RLV significantly.
3.2 Multivariables attitude controller–disturbance observer synthesis
3.2.1 Preliminaries
This subsection introduces some useful lemmas and theorem to be used in controller and disturbance observer design.
Lemma 1
[47] Consider a system in the form of \(\dot{\mathbf{x}}=f\left( \mathbf{x} \right) ,f\left( 0 \right) =0\) with \(\mathbf{x}\in R^{m}\). It is assumed that there exists a continuous positive definite function \(V\) such that \(\dot{V}\left( \mathbf{x} \right) +c\left[ {V\left( \mathbf{x} \right) } \right] ^{a}\le 0\) with \(c>0\,\hbox {and}\,a\in \left( {0,1} \right) \). Then, the origin is finite time stable. Moreover, the convergent time can be calculated by \(T\left( {\mathbf{x}_0 } \right) \,\le \frac{\left[ {V\left( {\mathbf{x}_0 } \right) } \right] ^{1-a}}{c\left( {1-a} \right) }\) for any initial value \(\mathbf{x}_0 \).
Lemma 2
[41] For the following multivariables system
where \(\mathbf{x}_1,\mathbf{x}_2 \in R^{m}\) and \(\varvec{\Delta }_1,\varvec{\Delta }_2 \in R^{m}\) are bounded uncertainties satisfying \(\left\| {\varvec{\Delta }_1 } \right\| \le \delta _1 \left\| {\mathbf{x}_1 } \right\| ,\,\left\| {\varvec{\Delta }_2 } \right\| \le \delta _2 \) with known constants \(\delta _1\,\hbox {and}\,\delta _2 \). Then, vectors \(\mathbf{x}_1\,\hbox {and}\,\mathbf{x}_2 \) converge to zero in finite time if the following conditions hold
where \(k_3^\Omega =3\delta _2 +\frac{2\delta _2^2 }{k_1^2 },\,k_3^\Psi =\frac{9\left( {k_1 \delta _1 } \right) ^{2}}{16k_2 \left( {k_2 -2\delta _1 } \right) }+\frac{k_1^2 \delta _1 -4k_1^2 k_2 +2k_2 \delta _2 }{2\left( {k_2 -2\delta _1 } \right) },\,k_4^\Omega =\frac{\left( {1.5k_1^2 k_2 +3k_2 \delta _2 } \right) ^{2}}{k_1^2 k_3 -2\delta _2^2 -3k_1^2 \delta _2}+2k_2^2 +\frac{3}{2}k_2 \delta _1 \) and \(k_4^\Psi =\frac{\alpha _1 }{\alpha _2 \left( {k_2 -2\delta _1 } \right) }+\frac{k_2 \delta \left( {8k_2 +\delta _1 } \right) }{4\left( {k_2 -2\delta _1 } \right) }\) with \(\alpha _1=\frac{9\left( {k_1 \delta _1 } \right) ^{2}\left( {k_2 +0.5\delta _1 } \right) ^{2}}{16k_2^2 }\,\hbox {and}\,\alpha _2=k_2 \left( {k_3 +2k_1^2 -\delta _2 } \right) -\left( 2k_3 +0.5k_1^2 \right) \delta _1 -\frac{9\left( {k_1 \delta _1 } \right) ^{2}}{16k_2 }\).
In addition to Lemmas 1 and 2, the following theorem that is an extension (from single variable to multivariables) of the result in [48] is also needed in order to obtain multivariables controller–disturbance observer.
Theorem 1
Suppose that the uncertain terms \(\varvec{\Delta }_1\,\hbox {and}\,\varvec{\Delta }_2 \) in (16) are bounded with known constants \(\delta _1 \,\hbox {and}\,\delta _2 \) such that \(\left\| {\varvec{\Delta }_1} \right\| \le \delta _1 \) and \(\left\| {\varvec{\Delta }_2} \right\| \le \delta _2 \), then the solution of system (16) is globally bounded if constants \(0.5<p<1,\,k_1 >0\,\hbox {and}\,k_2 >0\). Furthermore, \(\mathbf{x}_1\,\hbox {and}\,\mathbf{x}_2\) converge to zero in finite time for any \(0.5<p<1,k_1 >0\,\hbox {and}\,k_2 >0\) if \(\varvec{\Delta }_1 =\varvec{\Delta }_2 =0\) in (16).
Proof
See “Appendix”. \(\square \)
3.3 Control-oriented model
Since RLV’s rotational motion is much faster than translational motion, the translational terms and angular velocity of the Earth are neglected in attitude controller design [50]. This simplified rotational equations are described by
where \({\varvec{\upomega }}=\left[ {p,\,q,\,r} \right] ^{T},\, \varvec{\Theta } =\left[ {a,\,\beta ,\,\sigma } \right] ^{T},\, \mathbf{M}=\left[ {M_l,\,M_m,\,M_n } \right] ^{T},\,\Delta \mathbf{F}=\left[ {\Delta f_1,\Delta f_2,\Delta f_3} \right] ^{T}\) and \(\varvec{\Delta } \mathbf{D}=\left[ {\Delta D_1,\Delta D_2,\Delta D_3} \right] ^{T}\). \(\Delta \mathbf{D}\) denotes model parameters uncertainties and unknown external disturbances. The matrices \(\mathbf{I},\,\varvec{\Omega },\,\mathbf{R}\in R^{3\times 3}\) are defined as follows:
The controller design is based on double-loops architecture where the outer-loop generates the desired attitude angular rate and the inner-loop produces the control moments commands.
3.3.1 Attitude controller–disturbance observer design
The problem of interest can be stated to design an attitude controller such that guidance command \(\varvec{\Theta }^{*}=\left[ {\alpha _\mathrm{cmd}\,\beta _\mathrm{cmd}\,\sigma _\mathrm{cmd} } \right] ^{T}\) can be tracked by attitude angle \(\varvec{\Theta }\) in finite time in the presence of uncertainties \(\Delta \mathbf{F}\) and \(\Delta \mathbf{D}\). The system (17) and (18) can be considered as multivariables cascaded system and the proposed attitude control scheme is composed of two nested control loops. For inner-loop, \({\varvec{\upomega }}\) is chosen as the virtual control input to be designed to make \(\left( {\varvec{\Theta }-\varvec{\Theta }^{*}} \right) \rightarrow 0\) in finite time. For outer-loop, let \(\mathbf{M}\) be actual input with the aim at steering \(\left( {{\varvec{\upomega }}-{\varvec{\upomega }}^{*}} \right) \rightarrow 0\) in finite time. To this end, define tracking errors \(\varvec{\upsigma }_o =\varvec{\Theta }-\varvec{\Theta }^{*}\,\hbox {and}\,\varvec{\upsigma }_i ={\varvec{\upomega }}-{\varvec{\upomega }}^{*}\), then the error dynamics of (17) and (18) can be described as following multivariables cascade system
where \(\Delta \mathbf{F}_1 =\Delta \mathbf{F},\, \Delta \mathbf{F}_2 =\mathbf{I}^{-1}\Delta \mathbf{D}\) and \({\varvec{\upomega }}^{*}\) denotes desired attitude angular rates to be developed.
Assumption 1
For system (20) and (21), suppose that \(\Delta \mathbf{F}_1 \) and \(\Delta \mathbf{F}_2 \) are continuously differentiable and there exist known constants \(\delta _1\,\hbox {and}\,\delta _2 \) such that \(\left\| {\dot{\Delta }\mathbf{F}_1 } \right\| \le \delta _1 ,\left\| {\dot{\Delta }\mathbf{F}_{_2 } } \right\| \le \delta _2 \).
The proposed multivariables controller and disturbance observer for attitude cascade system can be summarized via the following theorem.
Theorem 2
For system (20) and (21), inner-loop and outer-loop controllers and disturbance observers are designed as follows:
-
(a1) Outer-loop disturbance observer
$$\begin{aligned} \dot{\mathbf{z}}_1^o&= -k_1^o \frac{\mathbf{e}_1^o }{\left\| {\mathbf{e}_1^o } \right\| ^{1/2}}-k_2^o \mathbf{e}_1^o +\mathbf{z}_2^o \nonumber \\&\quad +\,\left( \mathbf{R}\,{\varvec{\upomega }}^{*}+\mathbf{R}\,\varvec{\upsigma }_i -\dot{\varvec{\Theta }}^{*} \right) ,\nonumber \\ \dot{\mathbf{z}}_2^o&= -k_3^o \frac{\mathbf{e}_1^o }{\left\| {\mathbf{e}_1^o } \right\| }-k_4^o \mathbf{e}_1^o \end{aligned}$$(22) -
(a2) Outer-loop attitude controller
$$\begin{aligned} \mathbf{R}{\varvec{\upomega }}^{*}&= \dot{\varvec{\Theta }}^{*}-K_1^o \left\| {\varvec{\upsigma }_o } \right\| ^{p-1}\varvec{\upsigma }_o +\mathbf{x}_2^o -\hat{{\Delta }}\mathbf{F}_1,\nonumber \\ \dot{\mathbf{x}}_2^o&= -K_2^o p\left\| {\varvec{\upsigma }_o } \right\| ^{2\left( {p-1} \right) }\varvec{\upsigma }_o \end{aligned}$$(23) -
(b1) Inner-loop disturbance observer
$$\begin{aligned} \dot{\mathbf{z}}_{1}^i&= -k_{1}^i \frac{\mathbf{e}_{1}^i }{\left\| {\mathbf{e}_{1}^\mathrm{I} } \right\| ^{1/2}}-k_2^i \mathbf{e}_{1}^i +\mathbf{z}_2^i\nonumber \\&\quad +\,\left( {-\mathbf{I}^{-1}\varvec{\Omega }\,\mathbf{I}\,{\varvec{\upomega }}-\dot{{\varvec{\upomega }}}^{*}+\mathbf{I}^{-1}\mathbf{M}} \right) ,\nonumber \\ \dot{\mathbf{z}}_2^i&= -k_3^i \frac{\mathbf{e}_{1}^i }{\left\| {\mathbf{e}_{1}^i } \right\| }-k_4^i \mathbf{e}_{1}^i \end{aligned}$$(24) -
(b2) Inner-loop attitude controller
$$\begin{aligned}&\mathbf{I}^{-1}\mathbf{M}=-\left( {-\mathbf{I}^{-1}\varvec{\Omega }\,\mathbf{I}\,{\varvec{\upomega }}-\dot{{\varvec{\upomega }}}^{*}} \right) -K_1^i \left\| {\varvec{\upsigma }_i } \right\| ^{p-1}\varvec{\upsigma }_i \nonumber \\&\quad +\,\mathbf{x}_{2}^\mathrm{I} -\hat{{\Delta }}\mathbf{F}_2,\quad \dot{\mathbf{x}}_{2}^\mathrm{I} =-K_2^i p\left\| {\varvec{\upsigma }_i } \right\| ^{2\left( {p-1} \right) }\varvec{\upsigma }_i \end{aligned}$$(25)where \(\hat{{\Delta }}\mathbf{F}_1 =\mathbf{z}_2^o,\hat{{\Delta }}\mathbf{F}_2 =\mathbf{z}_2^i,\,\mathbf{e}_1^j =\mathbf{z}_1^j -\varvec{\upsigma }_j\,\left( {j=i,o} \right) \). If Assumption 1 holds and the parameters are chosen as, \(K_m^j >0\left( {m=1,2;j=i,o} \right) \) and \(k_n^j \left( {n=1,2,3,4;j=i,o} \right) \) are determined according to Lemma 2, then \(\varvec{\upsigma }_o\,\hbox {and}\,\varvec{\upsigma }_i \) converge to zero in finite time.
Proof
The proof inspired from [51] can be split into three steps. In the first step, we will show that system (21) is finite time convergent with inner-loop disturbance observer (24) and controller (25). Substituting (25) into (21) yields
Define \(\mathbf{e}_2^i =\mathbf{z}_2^i -\Delta \mathbf{F}_2 \). Taking into account \(\mathbf{e}_1^i =\mathbf{z}_1^i -\varvec{\upsigma }_i\), the error dynamics for inner-loop disturbance observer (24) can be transformed into the following form
Taking into account Assumption 1 and lemma1, it can be observed that if the parameters \(k_n^i \left( {n=1,2,3,4} \right) \) are chosen properly, then \(\mathbf{e}_1^i,\mathbf{e}_2^i \rightarrow 0\) in finite time, namely \(T_1 \). Clearly, \(\mathbf{e}_2^i =\hat{{\Delta }}\mathbf{F}_2 -\Delta \mathbf{F}_2 \) is bounded for \(t\in \left[ {0,T_1 } \right] \). It follows from Theorem 1 that \(\varvec{\upsigma }_i \left( t \right) \) in (26) is bounded for \(t\in \left[ {0,T_1 } \right] \). After \(T_1 \), system (26) is reduced to \(\dot{\varvec{\upsigma }}_i =-K_1^i \left\| {\varvec{\upsigma }_i } \right\| ^{p-1}\varvec{\upsigma }_i +\mathbf{x}_2^i,\,\dot{\mathbf{x}}_2^i =-K_2^i p\left\| {\varvec{\upsigma }_i } \right\| ^{2\left( {p-1} \right) }\varvec{\upsigma }_i \) which is a special case for (16). In view of Theorem 1, \(\varvec{\upsigma }_i \left( t \right) \) converge to zero from any bounded initial value \(\varvec{\upsigma }_i \left( {T_1 } \right) \) in finite time, namely \(T_2 \). Therefore, it follows that \(\varvec{\upsigma }_i \left( t \right) \) is bounded and \(\varvec{\upsigma }_i \left( t \right) =0\) for \(t\ge T_1 +T_2\).
Next, we need to prove that \(\varvec{\upsigma }_o \left( t \right) \) is bounded. Substituting (23) into (20) results in
Using outer-loop disturbance observer and similar analysis in the previous, it is easily found that \(\mathbf{z}_1^i \rightarrow \Delta \mathbf{F}_1 \) in finite time, namely \(T_3 \), which means \(\mathbf{z}_1^i -\Delta \mathbf{F}_1 =\hat{{\Delta }}\mathbf{F}_1 -\Delta \mathbf{F}_1 \) is bounded and converge to zero after \(T_3 \). Therefore, the item \(\mathbf{R}\,\varvec{\upsigma }_i +\left( {\Delta \mathbf{F}_1 -\hat{{\Delta }}\mathbf{F}_1 } \right) \) can be viewed as a bounded perturbation for (28). It follows from Theorem 1 that \(\varvec{\upsigma }_o \) is bounded.
Finally, the proof of finite time stability for (20) and (21) is provided. In view of the analysis in the previous, it can be seen that there exists \(T_4 =\max \left( {T_1 +T_2,T_3 } \right) \) such that \(\varvec{\upsigma }_i \rightarrow 0\,\hbox {and}\,\left( {\Delta \mathbf{F}_1 -\hat{{\Delta }}\mathbf{F}_1 } \right) \rightarrow 0\) for \(t\ge T_4 \). After that, (28) is reduced to \(\dot{\varvec{\upsigma }}_o =-K_1^o \left\| {\varvec{\upsigma }_o } \right\| ^{p-1}\varvec{\upsigma }_o +\mathbf{x}_2^o,\,\dot{\mathbf{x}}_2^o =-K_2^o p\left\| {\varvec{\upsigma }_o } \right\| ^{2\left( {p-1} \right) }\varvec{\upsigma }_o \). It follows from Theorem 1 that \(\varvec{\upsigma }_o \rightarrow 0\) from any bounded initial value \(\varvec{\upsigma }_o \left( {T_4 } \right) \) in finite time, namely \(T_5 \). As a result, both \(\varvec{\upsigma }_i \) and \(\varvec{\upsigma }_o \) converge to zero in finite time \(T_4 +T_5 \). This completes the proof.\(\square \)
Remark 3
The controller (23) and (25) can be interpreted as a smooth multivariables controller. If parameter \(p=1/2\) is used in (26) and (28), the control law is continuous but not smooth (the detailed analysis can be found in [52, 53]). However, the smoothness is necessary for multiloop system. Therefore, \(p=1/2\) is not included in proposed control scheme.
3.4 Control allocation
For completeness, control allocation algorithm provided in [54] is used here to transform control moment commands into control surface deflections. The aerodynamic moment expressions used in the research are expressed as
where \(C_l,\,C_m \,\hbox {and}\,C_n \) denote roll, pitch and yaw moment coefficients. In order to implement control allocator, it is assumed that the relationship between moments and control surface is linear [48]
where vector \(\varvec{\updelta }=\left[ {\delta _\mathrm{REI}\,\delta _\mathrm{REO} \,\delta _\mathrm{LEI}\,\delta _\mathrm{LEO}\,\delta _\mathrm{RF} \,\delta _\mathrm{LF}\,\delta _\mathrm{RR}\,\delta _\mathrm{LF} } \right] \in R^{8}\) represents eight control surfaces, namely right elevon inboard, right elevon outboard, left elevon inboard, left elevon outboard, right body flap, left body flap, right rudder and left rudder. In addition, the control surface vector satisfies the following constraints
Substituting (30) into (29) yields
The resulting mapping from control moments \(\mathbf{M}\) to control surface \(\varvec{\updelta }\) can be rewritten as matrix form
where \(\Lambda =0.5\rho V^{2}Sb\). Now, we hope to determine, at each sampling instant, a control command \(\varvec{\updelta }\) that is feasible with respect to the actuator constraints (31). The solution of (33) can be obtained via the following optimization problem:
where \(W_\delta \) and \(W_M \) denote the weight matrix which allow designer to prioritize between the control surfaces. The equation (34) can be interpreted as follows: given \(\Theta \), the set of feasible control input satisfies position constraints minimizing \(\left[ {\Lambda \mathbf{B}\,\varvec{\updelta }-\left( {\mathbf{M}-\Lambda \mathbf{C}_B } \right) } \right] \) and then picks control surface that minimizes \(\varvec{\updelta }^{T}W_\delta \varvec{\updelta }\) weighted by \(W_\delta \). Finally, the resulting optimization problem can be efficiently solved using active set methods provided in [55].
4 Results and discussion
The reentry initial position is chosen as \(h_0 =183.33\,\hbox {kft},\phi _0 =60.43,\,\theta _0 =14.55^{\circ },\,V_0 =17{,}147.04\,\hbox {ft/s},\,\gamma _0 =-0.30^{\circ }\) and \(\chi _0 =57.51^{\circ }\). The aerodynamic coefficients and path constraints are same with those provided in [6]. The elements of inertia matrix are \(I_{xx} =434{,}270\,\hbox {slug-ft}^{2},\, I_{xz} =17{,}880\,\hbox {slug-ft}^{2},\, I_{yy} =961{,}200\,\hbox {slug-ft}^{2},\, I_{zz} =1{,}131{,}541\,\hbox {slug-ft}^{2}\) and \(I_{xy} =I_{yz}=0\,\hbox {slug-ft}^{2}\). In order to verify the robustness of the proposed control scheme, the disturbances \(\Delta \mathbf{D}_i =\left[ {1+\sin \left( {\pi t/125} \right) +\sin \left( {\pi t/250} \right) } \right] \times 10^{2} \quad \left( {i=1,2,3} \right) \) are added. In addition, the parameter uncertainty \(\Delta \mathbf{I}=30\,\% \mathbf{I}\) is also included in the simulation. The control surface constraints are \(\varvec{\updelta }_{\min } =\left[ {-40,-40,-40,-40,-40,-40,-40,-40} \right] \deg \) and \(\varvec{\updelta }_{\max } =-\varvec{\updelta }_{\min } \). The controller and disturbance observer parameters are set as follows: \(K_1^o =0.04,K_2^o =0.01,K_1^i =0.2,K_2^i =0.01,\, p=0.8\) and \(k_1^o =k_1^i =0.15,\,k_2^o =k_2^i =0.2,\, k_3^o =k_3^i =0.35 \quad k_4^o =k_4^i =0.01\), the weight matrixes in control allocation are given as: \(W_\delta =\hbox {diag}\left( {1,1,1,1,1,1,1,1} \right) ,\, W_M =\hbox {diag}\left( {1,1,1} \right) \). In addition, the simulation stops when landing accuracy satisfies
where \(h_d,\,V_d\) and \(\gamma _d \) are actual final flight altitude, velocity and FPA, respectively. The TAEM interface will be provided in the next. The sampling time of rotational motions is 5 ms. The updated cycle of guidance commands is not fixed, and it depends on the time consumed to generate the trajectory onboard which can be seen from Fig. 2a, b. Next, two representative cases are discussed.
Case 1 The TAEM interface is fixed and defined as \(h_f =80{,}000\,\hbox {ft},\, V_f =2{,}500\,\hbox {ft},\,\gamma _f =-5^{\circ }\) namely original TAEM interface (see Fig. 2). The performance of proposed system in guiding RLV from initial position to TAEM interface is to be evaluated in this case. The simulation results are provided in Figs. 2, 3, 4 and 5. Figure 2a shows the feedback number and time consumed to obtain new trajectory during reentry phase. From that, it can be seen that a new trajectory can be generated between 0.4 and 1.85 s. Furthermore, the feedback number during flight is 1,818. The average time used to update guidance commands is about 0.5806 s which are also provided in Table 1. The guidance commands including angle of attack and bank angle are shown in Fig. 2c, d where the continuous commands can be observed. The tracking errors for reentry attitudes and attitude angular rates are provided in Fig. 3. From the simulation results, it can be observed that both the attitudes and attitude angular rates can be tracked well with relatively small errors using the proposed control scheme. The curves for path constraints plotted in Fig. 4a–c show that they satisfy the predefined constraints in [6]. Furthermore, it follows from the results in Fig. 4b, c that the dynamic pressure and load factor increase with the decreasing altitude. In fact, the increase of dynamic pressure and load factor is caused by the increase of atmospheric density which is inversely proportional to altitude. The curves for control moments and control surface deflections are given in Figs. 4 and 5. At the end of reentry phase, control chattering can be observed from these results. Further analysis shows that it may be caused by rapid change of angle of attack which can be seen in Fig. 2c. Figure2e shows the actual reentry flight trajectory from initial position to TAEM interface. Finally, the simulation results demonstrate that the proposed integrated guidance and control scheme is able to guide RLV to the desired TAEM interface in the presence of model parameter uncertainties and unknown external disturbance. In addition, the landing accuracy is also provided in Table 1. The terminal errors for altitude, velocity and FPA with respect to predefined TAEM interface are about 12.3438 ft, 2.9226 ft/s and \(0.0985^{\circ }\), respectively.
Case 2 In this case, the capacity of trajectory reshaping is verified. Suppose that the TAEM interface is \(h_f =80{,}000\,\hbox {ft},\,V_f =2{,}500\,\hbox {ft}\,\hbox {and}\,\gamma _f =-5^{\circ }\) prior to flight. During flight, assume that the TAEM interface is changed to \(h_f =80{,}000\,\hbox {ft},\,V_f =2{,}000\,\hbox {ft}\,\hbox {and}\,\gamma _f =-6^{\circ }\), namely updated TAEM interface. For simulation purpose, suppose that the moment when new TAEM interface is available for system is determined by \(t_r =100\left( {1+\varepsilon } \right) \) with a random number \(0<\varepsilon <1\). The same initial conditions, constraints and controller parameters with case 1 are used. The feedback calculation time over entire reentry trajectory is shown in Fig. 2b. From the simulation results, it can be seen that the maximum calculation time is about 3.1316 s. In fact, the original trajectory is still used as the initial value guess for calculation of the first reentry trajectory with updated TAEM interface being final condition. However, the original trajectory does not provide a good guess for updated trajectory. Therefore, it needs a longer time to obtain the first updated trajectory. After that, the calculation time reduced rapidly which can be seen from Fig. 2b. The attitude tracking error and corresponding control information including control moments as well as control surface deflections are also provided in Figs. 3, 4 and 5. In comparison with original trajectory in case 1, the updated trajectory in this case needs more time to reach updated TAEM interface. That is because the terminal velocity in case 2 is lower than that in case 1. Hence, RLV needs enough time to dissipate energy to ensure that RLV is able to enter updated TAEM interface. Furthermore, it follows from Fig. 2b, c the dynamic pressure and load factor decrease rapidly with the decreasing velocity. The opposite trends for dynamic pressure and load factor at the end of reentry phase in case 1 and case 2 can be explained as follows: the increase of atmospheric density is dominant in case 1, whereas the velocity decreases so quickly that it become dominant effect on dynamic pressure and load factor. The good tracking performance provided in Fig. 3 and flight trajectory in Fig. 2e demonstrate that the successful landing for RLV even in the alteration of unpredictable TAEM interface is achieved.
5 Conclusion and future work
An integrated guidance and control scheme with trajectory reshaping capacity is proposed for 6DOF RLV. Firstly, a reentry guidance law is designed based on real-time trajectory ensuring the flexibiligy and autonomy of RLV. Then, a multivariables smooth second-order sliding mode attitude controller and disturbance observer scheme is proposed to ensure that the guidance commands can be tracked in finite time. Also, the control allocation is included in the integrated guidance and control system. Finally, some representative simulations are provided for 6DOF RLV in order to verify the effectiveness of proposed integrated guidance and control scheme. If the aerodynamic moment coefficients uncertainties are included in the system, the system performance would be degraded. Therefore, the robust control allocation will be discussed in future.
References
Fahroo, F., Doman, D.: A direct method for approach and landing trajectory reshaping with failure effect estimation. In: AIAA Guidance, Navigation and Control Conference, AIAA-2004-4772 (2004)
Alexandre, F., David, H., Ali, Z.: Robust fault diagnosis for atmospheric reentry vehicles: a case study. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 40(5), 886–899 (2010)
Morio, V., Cazaurang, F., Zolghadri, A., et al.: Onboard path planning for reusable launch vehicles application to the shuttle orbiter reentry mission. Int. Rev. Aerosp. Eng. 1(6), 492–503 (2008)
Jiang, Z.S., Ardonez, R.: On-line robust trajectory generation on approach and landing for reusable launch vehicles. Automatica 45(7), 634–646 (2009)
Gao, Y.: Linear feedback guidance for low-thrust many-revolution earth-orbit transfers. J. Spacecr. Rockets 46(6), 1320–1325 (2010)
Tian, B.L., Zong, Q.: Optimal guidance for reentry vehicles based on indirect Legendre pseudospectral method. Acta Astronaut. 68(7–8), 1176–1184 (2011)
Mease, K.D., Kremer, J.P.: Shuttle entry guidance revisited using nonlinear geometric methods. J. Guid. Control Dyn. 17(6), 1350–1356 (1994)
Harpold, J.C., Graves, C.A.: Shuttle Program: Shuttle Entry Guidance; NASA-TM-79949 (1979)
Bogner, I.: Description of Apollo Entry Guidance. Technical memorandum, NASA (1966)
Braun, R.D., Powell, R.W.: Predictor–corrector guidance algorithm for use in high-energy aerobraking system studies. J. Guid. Control Dyn. 15(3), 672–678 (1992)
Mease, K.D., Chen, D.T., Tandon, S., et al.: A Three-Dimensional Predictive Entry Guidance Approach. AIAA Paper 2000-3959 (2000)
Zimmerman, C., Dukeman, G., Hanson, J.: Automated method to compute orbital reentry trajectories with heating constraints. J. Guid. Control Dyn. 26(4), 523–529 (2003)
Xue, S.B., Lu, P.: Constrained predictor–corrector entry guidance. J. Guid. Control Dyn. 33(4), 1273–1281 (2010)
Padhi, R., Kothari, M.: Model predictive static programming: a computationally efficient technique for suboptimal control design. Int. J. Innov. Comput. Inf. Control 5(2), 399–411 (2009)
Chawla, c, Sarmah, P., Padhi, R.: Suboptimal reentry guidance of reusable launch vehicles using pitch plane maneuver. Aerosp. Sci. Technol. 14(6), 377–386 (2010)
Dwived, P.N., Bhattacharya, A., Padhi, R.: Suboptimal midcourse guidance of interceptors for high speed targets with alignment angle constraint. J. Guid. Control Dyn. 34(3), 860–877 (2011)
Halbe, O., Raja, R.G., Padhi, R.: Robust reentry guidance of a reusable launch vehicle using model predictive static programming. J. Guid. Control Dyn. 37(1), 134–148 (2011)
Ross, I.M., Karpenko, M.: A review of pseudospectral optimal control: from theory to flight. Annu. Rev. Control 36(2), 182–197 (2012)
Darby, C.L., Hager, W.W., Rao, A.V.: Direct trajectory optimization using a variable low-order adaptive pseudospectral method. J. Spacecr. Rockets 48(3), 433–455 (2011)
Zong, Q., Tian, B.L., Dou, L.Q., et al.: Ascent phase trajectory optimization for vehicle with space restricted. Trans. Jpn. Soc. Aeronaut. Space Sci. 54(183), 37–43 (2011)
Benson, D.A., Huntington, G.T., Thorvaldsen, T.P.: Direct trajectory optimization and costate estimation via an orthogonal collocation method. J. Guid. Control Dyn. 29(6), 1435–1440 (2006)
Hodel, A.S., Hall, C.E.: Variable-structure PID control to prevent integrator windup. IEEE Trans. Ind. Electron. 48(2), 442–451 (2001)
Smith, R., Ahmed, A., Hadaegh, F.Y.: The design of a robust parametrically varying attitude controller for the X-33 vehicle. In: AIAA Guidance, Navigation, and Control Conference and Exhibit, Denver, CO, 14–17 August 2000, paper no. AIAA-2000-4158 (2000)
Lian, B.H., Bang, H.C., Hurtado, J.E.: Adaptive backstepping control based autopilot design for reentry vehicles. In: AIAA Guidance, Navigation, and Control Conference, Providence, Rhode Island, 16–19 August 2004, paper no. AIAA-2004-5328 (2004)
Farrell, J., Sharma, M., Polycarpou, M.: Backstepping-based flight control with adaptive function approximation. J. Guid. Control Dyn. 28(6), 1089–1102 (2005)
Zhu, J.J., Hodel, A., Scott, A,. et al.: X-33 entry flight controller design by trajectory linearization-a singular perturbational approach. In: American Astronautical Society Guidance and Control Conference, Breckenridge, CO, January 31–February 4 2001, pp. 151–170 (2001)
Bevaqua, T., Best, E., Huizenga, A., et al.: Improved trajectory linearization flight controller for reusable launch vehicles. In: AIAA Aerospace Sciences Meeting and Exhibit, Reno, Nevada. 5–8 January 11 2004, paper no. AIAA-2004-875 (2004)
Georgie, J., Valasek, J.: Evaluation of longitudinal desired dynamics for dynamic Inversion controlled generic re-entry vehicles. J. Guid. Control Dyn. 26(5), 811–819 (2003)
Ito, D., Ward, D.T., Valasek, J.: Robust dynamic inversion controller design and analysis for the X-38. In: AIAA Conference on Guidance, Navigation and Control, Canada, 6–9 August, 2001, paper no. AIAA-2001-4380 (2001)
Huang, J., Sun, L.N., Han, Z.Z., et al.: Adaptive terminal sliding mode control for nonlinear differential inclusion systems with disturbance. Nonlinear Dyn. 72(1–2), 221–228 (2013)
Sun, H.B., Li, S.H., Sun, C.Y.: Finite time integral sliding mode control of hypersonic vehicles. Nonlinear Dyn. 73(1–2), 229–244 (2013)
Gao, G., Wang, J.Z.: Observer-based fault-tolerant control for an air-breathing hypersonic vehicle model. Nonlinear Dyn. 76(1), 409–430 (2014)
Niu, Y.J., Wang, X.Y.: A novel adaptive fuzzy sliding-mode controller for uncertain chaotic systems. Nonlinear Dyn. 73(3), 1201–1209 (2013)
Zhang, Y.X., Sun, M.W., Chen, Z.Q.: Finite-time convergent guidance law with impact angle constraint based on sliding-mode control. Nonlinear Dyn. 70(1), 619–625 (2012)
Shtessel, Y.B., Hall, C.E., Jackson, M.: Reusable launch vehicle control in multiple-time-scale sliding modes. J. Guid. Control Dyn. 23(6), 1013–1020 (2000)
Shtessel, Y.B., Hall, C.E.: Multiple time scale sliding mode control of reusable launch vehicles in ascent and descent modes. In: American Control Conference, Piscataway, NJ 2001, pp. 4357–4362 (2001)
Massey, T., Shtessel, Y.B.: Continuous traditional and high-order sliding modes for satellite formation control. J. Guid. Control Dyn. 28(4), 826–831 (2005)
Hall, C.E., Shtessel, Y.B.: Sliding mode disturbance observer-based control for a reusable launch vehicle. J. Guid. Control Dyn. 29(6), 1315–1328 (2006)
Stott, J.E., Shtessel, Y.B.: Launch vehicle attitude control using sliding mode control and observation techniques. J. Frankl. Inst. 349(2), 397–412 (2012)
Besnard, L., Shtessel, Y.B., Landrum, B.: Quadrotor vehicle control via sliding mode controller driven by sliding mode disturbance observer. J. Frankl. Inst. 349(2), 658–684 (2012)
Nagesh, I., Edwards, C.: A multivariable super-twisting sliding mode approach. Automatica 50(3), 984–988 (2014)
Bollino, K., Ross, I.M., Doman, D.: Optimal nonlinear feedback guidance for reentry vehicle. In: AIAA Guidance, Navigation and Control Conference, Keystone Co, AIAA 2006-6074 (2006)
Darby, C.L., Hager, W.W., Rao, A.V.: An hp-adaptive pseudospectral method for solving optimal control problems. Optim. Control Appl. Methods 32(4), 476–502 (2011)
Tian, B.L., Zong, Q.: 3DOF ascent phase trajectory optimization for aircraft based on adaptive Gauss pseudospectral method. 2012 Third International Conference on Intelligent Control and Information Processing, Dalian, China, July 15–17, 2012, pp. 431–435 (2012)
Ross, I.M., Sekhavat, P., Fleming, A.: Optimal feedback control: foundations, examples and experimental results for a new approach. J. Guid. Control Dyn. 31(2), 307–321 (2008)
Bollino, K.P.: High fidelity real-time trajectory optimization for reusable launch vehicles. Ph.D Thesis, Naval Postgraduate School (2006)
Bhat, S.P., Bernstein, D.S.: Finite-time stability of continuous autonomous systems. SIAM J. Control Optim. 38(3), 751–766 (2000)
Moreno, J.A., Osorio, M.: Strict Lyapunov functions for the super-twisting algorithm. IEEE Trans. Autom. Control 57(4), 1035–1040 (2012)
Khalil, H.K.: Nonlinear System, 3rd edn. Prentice Hall, Englewood Cliffs (2001)
Recasens, J.J., Chu, Q.P., Mulder, J.A.: Robust model predictive control of a feedback linearized system for a lifting-body reentry vehicle. In: AIAA Guidance, Navigation, and Control Conference and Exhibit, San Francisco, California, AIAA-2005-6147 (2005)
Estrada, A., Loria, A., Santiesteban, R., et al.: Cascaded-based stabilization of time-varying systems using 2nd-order sliding modes. IMA J. Math. Control Inf. 30(1), 115–128 (2013)
Shkolnikov, I.A., Shtessel, Y.B.: A multiple loop sliding mode control system with second-order boundary layer dynamics. In: 15th World Congress of the International Federation of Automatic Control, Barcelona (2002)
Shtessel, Y.B., Shkolnikov, I.A., Levant, A.: Smooth second-order sliding modes: missile guidance application. Automatica 43, 1470–1476 (2007)
Harkegard, O., Glad, S.T.: Resolving actuator redundancy-optimal control vs control allocation. Automatica 41(1), 137–144 (2005)
Harkegard, O.: Efficient active set algorithms for solving constrained least squares problems in aircraft control allocation. In: Proceedings of the 41st IEEE Conference on Decision and Control. Las Vegas, NV, pp. 1295–1300 (2002)
Acknowledgments
This work has been supported by National Natural Science Foundation of China (61203012, 61304018, 61273092), Key Grant Project of Chinese Ministry of Education (311012) and Tianjin Research Program of Application Foundation and Advanced Technology (12JCZDJC30300). Independent Innovation Fund of Tianjin University (2013XQ-0022),.
Author information
Authors and Affiliations
Corresponding author
Appendix: Proof of Theorem 1
Appendix: Proof of Theorem 1
Proof
For brevity, we present system (16) in a form convenient for Lyapunov analysis. To this end, a new state vector is introduced
Furthermore, system (16) can be rewritten as
where \(\mathbf{I}_m \) denotes m-dimensional unit matrix. Taking into account the definition in (36), we obtain \(\left\| {\mathbf{z}_1 } \right\| =\left\| {\mathbf{x}_1 } \right\| ^{p}\) and \(\mathbf{x}_1 =\mathbf{z}_1 /\left\| {\mathbf{x}_1 } \right\| ^{p-1}=\mathbf{z}_1 /\left\| {\mathbf{z}_1 } \right\| ^{\frac{p-1}{p}}\). Therefore, system (37) can be equivalently written as
It follows from the definition in (36) that \(\mathbf{z}_1,\,\mathbf{z}_2 \rightarrow 0\) in finite time implies that \(\mathbf{x}_1,\,\mathbf{x}_2 \rightarrow 0\) in finite time. Next, a Lyapunov function candidate is proposed as
It is easy to verify that \(V\left( {\mathbf{z}_1,\mathbf{z}_2 } \right) \) is positive definite and radially unbounded for arbitrary positive constants \(k_1 >0\,\hbox {and}\,k_2 >0\). The derivative of \(V\left( {\mathbf{z}_1,\mathbf{z}_2 } \right) \) is given by
Substituting (38) into (40) yields
Taking into account \(p\in \left( {0.5,1} \right) \) and using Cauchy–Schwarz inequality on the inner product terms, we have
Let \(\mathbf{z}=\left[ {\left\| {\mathbf{z}_1 } \right\| \,\left\| {\mathbf{z}_2 } \right\| } \right] ^{T}\), it is obvious that the first item on the right-hand side of (42) can be rewritten as \(-p\left\| {\mathbf{z}_\mathbf{1} } \right\| ^{\frac{p-1}{p}}\mathbf{z}^{T}\mathbf{Qz}\) with positive definite matrix \(\mathbf{Q}=\left[ {{\begin{array}{ll} {\left( {k_2 +k_1^2 } \right) k_1 }&{} {-k_1^2 } \\ {-k_1^2 }&{} {k_1 } \\ \end{array} }} \right] \) for any positive constants \(k_1 >0\,\hbox {and}\,k_2 >0\). In addition, it is easily verified that \(\lambda _\mathrm{mim} \left( \mathbf{Q} \right) \left\| \mathbf{z} \right\| ^{2}\le \mathbf{z}^{T}\mathbf{Qz}\le \lambda _{\max } \left( \mathbf{Q} \right) \left\| \mathbf{z} \right\| ^{2}\) with \(\lambda _\mathrm{mim} \left( \mathbf{Q} \right) \) and \(\lambda _{\max } \left( \mathbf{Q} \right) \) being the minimum and maximum eigenvalue of matrix \(\mathbf{Q}\), respectively. Furthermore, in view of bounds on the terms \(\left\| {\varvec{\Delta }_1 } \right\| \) and\(\left\| {\varvec{\Delta }_2 } \right\| \), we have
Next, we want to use part of \(-p\left\| {\mathbf{z}_\mathbf{1} } \right\| ^{\frac{p-1}{p}}\lambda _{mim} \left( \mathbf{Q} \right) \left\| \mathbf{z} \right\| ^{2}\) to dominate the last two items on the right-hand side of (43) for large \(\left\| \mathbf{z} \right\| \). To this end, we rewrite the inequality (43) as
where \(\theta _1\,\hbox {and}\,\theta _2 \) are positive constants and satisfy \(0<\theta _1+\theta _2 <1\). Furthermore, let \(\mathbf{Z}=\left[ {\mathbf{z}_1;\,\mathbf{z}_2 } \right] \). It follows from the definition of Lyapunov function in (39) that it can be rewritten as \(V\left( {\mathbf{z}_1,\,\mathbf{z}_2 } \right) =\mathbf{Z}^{T}\mathbf{PZ}\) for an appropriate symmetric positive definite matrix \(\mathbf{P}\in R^{2m\times 2m}\). Therefore, the following inequality is satisfied for any vector \(\mathbf{Z}\)
where \(\lambda _{\min } \left( \mathbf{P} \right) \) and \(\lambda _{\max } \left( \mathbf{P} \right) \) represent the minimum and maximum eigenvalue of matrix \(\mathbf{P}\), respectively. In addition, it can be observed from the definition of \(\mathbf{z}\,\hbox {and}\,\mathbf{Z}\) that \(\left\| {\mathbf{z}} \right\| =\left\| \mathbf{Z} \right\| \). Therefore, based on (45), we have inequality \(\frac{V^{1/2}}{\left[ {\lambda _{\mathrm{max}} \left( \mathbf{P} \right) } \right] ^{1/2}}\le \left\| \mathbf{z} \right\| =\left\| \mathbf{Z} \right\| =\le \frac{V^{1/2}}{\left[ {\lambda _{\min } \left( \mathbf{P} \right) } \right] ^{1/2}}\). Since \(0.5<p<1\), it can be concluded that \(\left\| {\mathbf{z}_\mathbf{1} } \right\| ^{\frac{p-1}{p}}\ge \left\| \mathbf{z} \right\| ^{\frac{p-1}{p}}\ge \frac{V^{\left( {p-1} \right) /\left( {2p} \right) }}{\left[ {\lambda _{\max } \left( \mathbf{P} \right) } \right] ^{\left( {p-1} \right) /\left( {2p} \right) }}\). Then, inequality (44) satisfies
Then,
It follows from (47) and theorem 4.18 in [49] that the solution for [A3] is globally bounded. Furthermore, inequality (A8) is reduced to
if \(\varvec{\Delta }_1 =\varvec{\Delta }_2 =0\). It follows from \(0.5<p<1\) that we obtain \(\frac{3p-1}{2p}\in \left( {0.5,1} \right) \). According to Lemma 1, it can be concluded that \(\mathbf{z}_1 \), \(\mathbf{z}_2 \rightarrow 0\) in finite time. This completes the proof.\(\square \)
Rights and permissions
About this article
Cite this article
Tian, B., Fan, W. & Zong, Q. Integrated guidance and control for reusable launch vehicle in reentry phase. Nonlinear Dyn 80, 397–412 (2015). https://doi.org/10.1007/s11071-014-1877-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11071-014-1877-0