Abstract
This study presents a robust self-learning proportional-integral-derivative (RSPID) control system design for nonlinear systems. This RSPID control system comprises a self-learning PID (SPID) controller and a robust controller. The gradient descent method is utilized to derive the on-line tuning laws of SPID controller; and the \( \, H_{\infty } \, \) control technique is applied for the robust controller design so as to achieve robust tracking performance. Moreover, in order to achieve fast learning of PID controller, a particle swarm optimization (PSO) algorithm is adopted to search the optimal learning-rates of PID adaptive gains. Finally, two nonlinear systems, a two-link manipulator and a chaotic system are examined to illustrate the effectiveness of the proposed control algorithm. Simulation results show that the proposed control system can achieve favorable control performance for these nonlinear systems.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
The proportional-integral-derivative (PID) controller has been practically applied in industries for 60 years due to its simple architecture and easy design properties. Until today, the PID controller is still used in different control applications, even though lots of new control techniques have been proposed. However, the traditional PID controller needs some manual tuning before it is used to practical application in industries. When the PID controllers are applied to complicated systems such as nonlinear systems, different tuning algorithms of PID controllers have been proposed [1–5]. In recent decade, several intelligent tuning algorithms have been applied to tune the PID controllers. The PID controller automatic tuning methods have been proposed by using a genetic algorithm [6], an immune algorithm [7] and a fuzzy-genetic algorithm [8]. However, the learning speed of these algorithms is slow; thus, they are not suitable for real time control systems. Harinath and Mann [9] proposed a fuzzy PID controller for multivariable process systems. However, it takes a two-level tuning algorithm; thus, it also takes too much computation time. Therefore, in this paper, another simple optimization algorithm called particle swam optimization (PSO) algorithm will be used for the optimal parameter search of the PID controller.
The PSO algorithm, a new evolutionary computation technique, is proposed by Kennedy and Eberhart [10]. It was developed by the research of the social behavior of animals, e.g., bird flocking. Unlike a typical GA, PSO algorithms have memorial capability without complicated evolutionary process such as crossover and mutation in GA, and each particle can memorize its best solution. In addition, if another particle discovers a better solution, it will be shared among other particles. The best solution is thus memorized and every particle will move toward this best solution. In past decade, PSO algorithms have been widely used to solve the modeling and control problems of application systems [11–14]. Moreover, the PID control based on PSO algorithms have been proposed in [15, 16]. However, it is difficult to solve the inequality for the optimal solution in [15]; and it has not given the stability analysis in [16].
In this paper, a robust self-learning PID (RSPID) control system is proposed for nonlinear systems. This RSPID control system comprises a self-learning PID (SPID) controller and a robust controller. The SPID controller is utilized to approximate an ideal controller, and the robust controller is designed to recover the residual approximation error between the ideal controller and SPID controller. The gradient descent method and \( \, H_{\infty } \, \) control technique are utilized to derive the on-line tuning laws of SPID controller and robust controller, so that the robust stability of the system can be obtained. Furthermore, the PSO algorithm is adopted to auto-search the optimal learning-rates of PID controller to increase the learning speed. Finally, two nonlinear systems are presented to support the validity of the proposed control method.
This study is organized as follows. Problem formulation is described in Sect. 2. The PSO algorithm is briefly reviewed in Sect. 3. The design procedure of the proposed RSPID control system is constructed in Sect. 4. In Sect. 5, simulations are performed to verify the effectiveness of the proposed control method. Finally, conclusions are drawn in Sect. 6.
2 Problem formulation
Consider a class of nth-order multi-input multi-output (MIMO) nonlinear systems expressed in the following form:
where
- \( \user2{u}(t) = [u_{1} (t),u_{2} (t), \ldots , \, u_{m} (t)]^{T} \in \Re^{m} \, \) :
-
the control input vector of the system
- \( \user2{y} = \user2{x}(t) = [x_{1} (t),x_{2} (t), \ldots , \, x_{m} (t)]^{T} \in \Re^{m} \) :
-
the system output vector
- \( \underline{\user2{x}} (t) = [\user2{x}^{T} (t),\dot{\user2{x}}^{T} (t), \ldots ,\user2{x}^{(n - 1)T} (t)]^{T} \in \Re^{mn} \) :
-
the state vector of the system
- \( \user2{f}(\underline{\user2{x}} (t)) \in \Re^{m} \, \) :
-
unknown bounded nonlinear function
- \( \user2{G}(\underline{\user2{x}} (t)) \in \Re^{m \times m} \, \) :
-
unknown bounded nonlinear matrix
- \( \user2{d}(t) = [d_{ 1} (t) ,d_{2} (t) ,\ldots , { }d_{m} (t)]^{T} \in \Re^{m} \) :
-
unknown bounded external disturbance.
When neglecting the modeling uncertainty and external disturbance, the nominal system of Eq. 1 can be obtained as
where \( \, \user2{f}_{n} (\underline{\user2{x}} (t)) \in \Re^{\text{m}} \) is the nominal function of \( \user2{f}(\underline{\user2{x}} (t)) \), and the constant matrix \( \user2{G}_{n} = diag(g_{n1} , \, g_{n2} , \ldots , \, g_{nm} ) \in \Re^{m \times m} \, \) is the nominal functions of \( \user2{G}(\underline{\user2{x}} (t)) \). Without losing generality, assume the constant \( g_{ni} \ge 0 \) for \( i = 1 \ldots m \). Assume that the nonlinear system of Eq. 2 is controllable and \( \user2{G}_{n}^{ - 1} \) exists for all \( \underline{\user2{x}} (t) \). If the external disturbance and modeling uncertainty exist, the MIMO nonlinear systems Eq. 1 can be reformulated as
where \( \user2{l}(\underline{\user2{x}} (t),t) \) is referred to as the lumped uncertainty, including system’s uncertainty and external disturbance.
The control problem is to find a suitable controller for the MIMO nonlinear systems Eq. 1 so that the system output vector \( \user2{x}(t) \) can track desired reference trajectory vector \( \user2{x}{}_{d}(t) = [x_{d1} (t),x_{d2} (t), \ldots , \, x_{dm} (t)]^{T} \in \Re^{m} \) closely.
A lot of control techniques have been presented to achieve reference trajectory tracking. However, in this paper, a simple adaptive PID control scheme will be proposed to deal with the uncertain MIMO nonlinear system. Moreover, the \( \, H_{\infty } \, \) control technique will be used to guarantee the robust tracking performance.
Define the tracking error as
then the system tracking error vector is defined as
If the system dynamics \( \user2{f}_{n} (\underline{\user2{x}} (t)) \) and \( \user2{G}_{n} \), and the lumped uncertainty \( \user2{l}(\underline{\user2{x}} ,t) \) are exactly known, an ideal controller can be designed as
where \( \underline{\user2{H}} = [\user2{H}_{n} , \ldots ,\user2{H}_{2} ,\user2{H}_{1} ]^{T} \in \Re^{mn \times m} \) is the feedback gain matrix which contains real numbers. \( \user2{H}_{i} = diag(h_{i1} ,h_{i2} , \ldots ,h_{im} ) \in \Re^{m \times m} \, \) is a nonzero positive constant diagonal matrix. Substituting the ideal controller Eq. 6 into Eq. 3, gives the error dynamic equation
In Eq. 7, if H is chosen to correspond to the coefficients of a Hurwitz polynomial, it implies \( \mathop {\lim }\limits_{t \to \infty } ||\underline{\user2{e}} (t)|| = 0 \). However, in practical application, the system uncertainties and external disturbance of nonlinear systems are generally unknown, so that the idea controller \( \user2{u}^{*} \) in Eq. 6 is always unobtainable. Thus, a SPID controller is designed to mimic the idea controller. And then, based on the \( H_{\infty } \) control technique, the robust controller is developed to attenuate the effect of the approximation error between SPID controller and the ideal controller so that the robust tracking performance can be achieved.
3 Robust self-learning PID (RSPID) control system design
The block diagram of the nonlinear control system is shown in Fig. 1. The RSPID control system is assumed to take the following form:
where \( \user2{u}_{SPID} (t) \) is a self-learning PID controller utilized to approximate the ideal controller \( \user2{u}^{*} \), and \( \user2{u}_{R} (t) \) is the robust controller designed to suppress the influence of residual approximation error between the ideal controller and SPID controller.
3.1 SPID controller design
The SPID controller can be described as
where \( \hat{\user2{K}}_{P} ,\; \, \hat{\user2{K}}_{I} \) and \( \hat{\user2{K}}_{D} \) are the adaptive parameters of proportional gain, integral gain and derivative gain matrices, respectively; and \( \hat{\user2{K}}_{P} = diag(\hat{k}_{P1} ,\hat{k}_{P2} , \ldots ,\hat{k}_{Pm} ) \in \Re^{m \times m} , \) \( \hat{\user2{K}}_{I} = diag(\hat{k}_{I1} ,\hat{k}_{I2} , \ldots ,\hat{k}_{Im} ) \in \Re^{m \times m} , \) \( \hat{\user2{K}}_{D} = diag(\hat{k}_{D1} ,\hat{k}_{D2} , \ldots \hat{k}_{Dm} ) \in \Re^{m \times m} . \)
An integrated error function is defined as
where \( \user2{s}(\underline{\user2{e}} ,t) = [s_{1} (t),s_{2} (t), \ldots ,s_{m} (t)]^{T} \). From Eq. 9, the control law Eq. 8 can be rewritten as
Taking the time derivative of both sides of Eq. 10 and using Eq. 3, it can be obtained that
Substituting Eq. 9 into Eq. 12 and multiplying both sides by \( \user2{s}^{T} (\underline{\user2{e}} ,t) \), yields
By defining \( \frac{1}{2}\user2{s}^{T} (\underline{\user2{e}} ,t)\user2{s}(\underline{\user2{e}} ,t) \) as a cost function, then its derivative is \( \user2{s}^{T} (\underline{\user2{e}} ,t)\dot{\user2{s}}(\underline{\user2{e}} ,t) \). According to the gradient descent method, the gains of \( \hat{\user2{K}}_{P} ,\,\hat{\user2{K}}_{I} \) and \( \hat{\user2{K}}_{D} \) are updated by the following tuning laws
where \( u_{{SPID_{i} }} \) is the ith element of \( \user2{u}_{SPID} \); \( \eta_{P} ,\,\eta_{I} \) and \( \eta_{D} \) are the learning-rates, which will be auto-searched by PSO algorithm.
3.2 Robust controller design
In case of the existence of an approximation error, the ideal controller can be reformulated as the summation of SPID controller and the approximation error:
where \( \varepsilon (t) = [\varepsilon_{1} (t),\varepsilon_{2} (t), \ldots , \, \varepsilon_{m} (t)]^{T} \in \Re^{m} \) denotes the approximation error.
Substituting Eq. 8 into Eq. 3, yields
From Eqs. 6, 10, 18 and after some straightforward manipulation, it can be obtained that
Now, the robust controller can be developed to attenuate the effect of the approximation error between the ideal controller and SPID controller so that the \( \, H_{\infty } \, \) tracking performance can be achieved. In case of the existence of \( \, {\mathbf{\varepsilon }}(t) , \) consider a specified \( \, H_{\infty } \, \) tracking performance [17]
where \( r_{i} \, \) is a prescribed attenuation constant. The robust controller is designed as
where \( \, \user2{R} = diag (r_{ 1} ,r_{2} ,\ldots ,r_{m} )\in \Re^{m \times m} . { } \) Then the following theorem can be stated and proven.
Theorem 1: Consider the nth-order MIMO nonlinear systems represented by Eq. 1 . The RSPID control law is designed as Eq. 11 , where \( \user2{u}_{SPID} (t) \) is given in Eq. 9 with the on-line parameter tuning algorithms given as Eqs. 14–16, and the robust controller is designed as Eq. 21 . Then the desired \( H_{\infty } \) tracking performance in Eq. 20 can be achieved for the specified attenuation levels \( \, r_{i} ,i = 1,2, \ldots ,m. \)
Proof: The Lyapunov function is given by
Taking the derivative of the Lyapunov function and using Eqs. 17, 19 and 21, yields
Assuming \( \, \varepsilon_{i} (t) \in L_{2} [0,T], \, \forall T \in [0,\infty ), \) integrating the above equation from t = 0 to t = T, yields
Since \( V(T) \ge 0 \), the above inequality implies the following inequality
Using Eq. 22, the above inequality is equivalent to the following
Thus the proof is completed.
4 Particle swarm optimization (PSO) algorithm
The learning-rates of the tuning laws in SPID controller are usually selected by trial-and-error process. In order to achieve the best learning speed, the PSO algorithm is adopted to search the optimal learning-rates \( \eta_{P} ,\,\eta_{I} \) and \( \eta_{D} \) in the SPID controller.
In 1995, Kennedy and Eberhart [10] initially proposed the particle swarm concept and PSO algorithm; this algorithm is one of optimization methods. It has been proven to be efficient in solving optimization problem. In the PSO algorithm, each particle represents a candidate solution to the optimization problem. The particle keeps track of its coordinates in the problem space which are associated with the personal best solution. Another is the global best value that is tracked by the global version of the particle swarm optimizer. At each time step, the PSO algorithm consists of changing the velocity that accelerates each particle toward its personal best and global best locations. Acceleration is weighted by a random term with separate random numbers being generated for acceleration toward personal best and global best locations, respectively [11]. From then on, several PSO algorithms have been proposed with slightly different versions [12–16].
The flowchart of the utilized PSO algorithm is drawn in Fig. 2.
4.1 Fitness function
In order to maintain the control characteristic of SPID controller, a fitness function is chosen as
It means that if the error states \( \underline{\user2{e}} (t) \) is forced to zero then the expected value of fitness will be \( fit = 10 \).
4.2 Velocity and position update law
In PSO algorithm, a population of particles is randomly generated initially; each particle adjusts self-position with velocity according to its own experience and the experiences of other particles. The particle velocity and position update law is adopted as [16]
where iw is called the inertia weight which balances the global and local search, \( v_{q}^{l} (n) \) and \( p_{q}^{l} (n) \) denote current velocity and current position, respectively; \( rand_{1} ( \cdot ) \), \( rand_{2} ( \cdot ) \) and \( rand_{3} ( \cdot ) \) denote random variables between 0 and 1; ξ 1, ξ 2, and ξ 3 denote acceleration factor_1, acceleration factor_2 and acceleration factor_3, respectively; \( L_{{best_{q} }}^{l} \), \( G_{{best_{q} }}^{l} \), and \( S_{{best_{q} }}^{l} \) are the personal best index, the global best index, and the sub-population best index of the qth particle, respectively. Additionally, \( q = 1,2, \ldots ,n_{p} \), in which \( n_{p} \) is the population size and \( \ell = 1,2, \ldots ,n_{d} \), in which n d is the dimension of each particle and iw is given by
where iw max, iw min, N max and N n are iteration maximum value, iteration minimum value, total iteration number and current iteration number of inertia weight.
This PSO algorithm is used to on-line tune the learning-rates η P , η I and η D in Eqs. 14–16 to achieve fast learning speed of PID gains.
5 Simulation results
Two uncertain nonlinear systems, a two-link manipulator control system and a unified chaotic system are examined to illustrate the effectiveness of the proposed design method. The parameters of PSO are set as: the population size z = 20, the dimension of the particle h = 2, the acceleration factors ξ 1 = 0.75, ξ 2 = 3.25, ξ 3 = 0.1, the total iteration number N max = 10, the iteration maximum value iw max = 0.9, the iteration minimum value iw min = 0.4, the initial states of velocity \( v_{q}^{l} (n) \) and position \( p_{q}^{l} (n) \) of each particle are randomly generated.
Example 1. Two-link manipulator system
In this example, the proposed control system is applied to control a two-link robot manipulator. Figure 3 depicts this two-link manipulator, the angles of the second and the third links were considered to be θ 1 and θ 2, respectively [18]. In addition, the numerical values of parameters of the robot model were specified as that in [19]. The dynamic equation is given as follows
where \( {\varvec{\theta}} \in \Re^{n} \) is the joint position vector; \( \user2{M} ({\varvec{\theta}} )\in \Re^{n \times n} \) is a symmetric positive definite inertia matrix; \( \user2{A}_{m} ({\varvec{\theta}} ,\dot{{\varvec{\theta}} }) \) is a vector of Coriolos and centripetal torques; \( \user2{B}({\varvec{\uptheta}}) \in \Re^{n} \) representing the gravitational torques; \( \user2{Z} = \user2{K}_{\omega } + \user2{V}_{f} \in \Re^{n \times n} \) is a diagonal matrix consisting of the back emf coefficient matrix \( \user2{K}_{\omega } \) and the viscous friction coefficient matrix \( \user2{V}_{f} \); \( \tau_{d} \in \Re^{n} \) is the unmodeled disturbances vector; \( \tau \in \Re^{n} \) is the vector of control input torques. By defining the state vector \( \underline{x} (t) = [x_{1} (t),x_{2} (t)]^{T} = [\theta_{1} ,\theta_{2} ]^{T} \), the dynamic Eq. 30 can be expressed as
where the unknown nonlinear function \( \user2{G} (\user2{x} (t ) )= \user2{M}^{ - 1} ({\varvec{\theta}} ), \) and \( \user2{f} (\underline{\user2{x}} (t) )= \user2{M}^{ - 1} ({\varvec{\theta}} ) - \left[ {\user2{A}_{m} ({\varvec{\theta}} ,\dot{{\varvec{\theta}} })\dot{{\varvec{\theta}} } - \user2{B}(\dot{{\varvec{\theta}} }) - \user2{Z}\dot{{\varvec{\theta}} } - \tau_{d} } \right], \) where \( \user2{A}_{m} ({\varvec{\theta}} ,\dot{{\varvec{\theta}} }) = \sin (\theta_{2} )(a_{2} + p_{2} a_{9} )\left[ {\begin{array}{*{20}c} { - \dot{\theta }_{2} } & { - (\dot{\theta }_{1} + \dot{\theta }_{2} )} \\ {\dot{\theta }_{1} } & 0 \\ \end{array} } \right],\quad \user2{B}({\varvec{\theta}} ) = \left[ {\begin{array}{*{20}c} {a_{2} } & 0 \\ 0 & {a_{8} } \\ \end{array} } \right], \) \( \user2{Z} = \left[ {\begin{array}{*{20}c} {a_{5} \cos (\theta_{1} ) + a_{6} \cos (\theta_{1} + \theta_{2} ) + p_{4} \cos (\theta_{1} ) + p_{5} \cos (\theta_{1} + \theta_{2} )a_{9} } \\ {a_{6} \cos (\theta_{1} + \theta_{2} ) + p_{5} \cos (\theta_{1} + \theta_{2} )a_{9} } \\ \end{array} } \right],M = \left[ {\begin{array}{*{20}c} {M_{11} } & {M_{12} } \\ {M_{21} } & {M_{22} } \\ \end{array} } \right] \)
where \( \begin{aligned} M_{11} & = a_{1} + 2a_{2} \cos (\theta_{2} ) + (p_{1} + 2p_{2} \cos (\theta_{2} )a_{9} ),M_{12} = (a_{3} + a_{2} \cos (\theta_{2} )) + (p_{3} + p_{2} \cos (\theta_{2} )a_{9}), \\ M_{21} & = (a_{3} + a_{2} \cos (\theta_{2} )) + (p_{3} + p_{2} \cos (\theta_{2} )a_{9} ),\,M_{22} = a_{1} + p_{3} a_{9} \\ \end{aligned} \) [19].
In this example, the initial conditions of the states are given as \( a_{1} = 6.33,a_{2} = 0.14,a_{3} = 0.11,a_{4} = 27.6,a_{5} = 31.9,a_{6} = 3.3,a_{7} = 0.94,a_{8} = 4.54,a_{9} = 1.25, \) \( p_{1} = 0.37,p_{2} = 0.18,p_{3} = 0.18,p_{4} = 4.23,p_{5} = 4.15,\theta_{1} = 50{\varvec{\pi}} /180,\theta_{2} = 10{\varvec{\pi}} /180, \) \( \dot{\theta }_{1} = 0,\dot{\theta }_{2} = 0. \)
For demonstrating the tracking performance of the proposed control system, the desired trajectories for \( x_{d1} (t) \) and \( x_{d2} (t) \) are set as
where \( \underline{\user2{x}}_{d} (t) = \left[ {x_{d1} (t),x_{d2} (t)} \right]^{T} = \left[ {\theta_{d1} ,\theta_{d2} } \right]^{T} \) is the reference trajectory vector.
For the proposed RSPID control system, the feedback gain matrix is designed as \( \underline{\user2{H}} = diag(40,4) \); the initial conditions of the learning-rates are chosen as η P = 400, η I = 300 and η D = 400, respectively; all the PID gains are set as zero initially; the initial states of two-link manipulator are specified as \( x_{1} (0) = 20\pi /180,x_{2} (0) = 10\pi /180,\dot{x}_{1} (0) = 0{\text{ and }}\dot{x}_{2} (0) = 0, \) and the initial states of desired trajectories \( x_{d1} (0) = 0,x_{d2} (0) = 0,\dot{x}_{d1} (0) = 0{\text{ and }}\dot{x}_{d2} (0) = 0. \) In order to study the robustness of the proposed control system, assume that the two-link manipulator control system has external disturbance \( \tau_{d} = [40,25] \) at \( t = 5\,\text{s} \). Besides, the attenuation level is chosen as \( \user2{R} = diag(0.5,0.5) \)
In order to compare the control performance, the adaptive fuzzy control (AFC) presented in [18] is also applied to this manipulator system. Using this control system, the tracking trajectories are shown in Fig. 4a, b, respectively. The simulation results of the proposed RSPID control system for this two-link manipulator control system are shown in Figs. 4, 5, 6, 7. Figure 4c, d represents the tracking responses of \( x_{1} (t) \) and \( x_{2} (t) \) by the proposed RSPID control scheme, respectively. Moreover, the control inputs and tracking errors of \( x_{1} (t) \) and \( \, x_{2} (t) \) are plotted in Fig. 5a–d, respectively. In addition, the fitness function and learning-rates are shown in Fig. 6a–d, respectively. The PID gains \( \user2{K}_{P} ,\;\user2{K}_{I} \) and \( \user2{K}_{D} \) are shown in Fig. 7a–c, respectively. Comparing Fig. 4c, d with Fig. 4a, b, it can be seen the tracking error have been much reduced by using the RSPID controller. Moreover, from Fig. 6, it is also seen that the learning-rates of PID controller converge after the 5th second. The simulation results also show that the proposed RSPID control system can effectively achieve parameter tuning and favorable control for the two-link manipulator control system.
Example 2 . Unified chaotic system
Consider a general master–slave unified chaotic systems; the master system and slave system are given as [20].
where x di (t) and x i (t) are the system states variables of master system and slave system, respectively; d i , i = 1, 2, 3 denote the disturbances and u i , i = 1, 2, 3 are the control inputs. Assume \( \alpha_{1} = (25\theta_{c} + 10),\alpha_{2} = (28 - 35\theta_{c} ),\alpha_{3} = (29\theta_{c} - 1) \) and \( \alpha_{4} = \left( {\frac{{8 + \theta_{c} }}{3}} \right) \); then the master system Eq. 33 and the slave system Eq. 34 can be rewritten as
the master–slave system can be expressed as
and
where \( \user2{x}{}_{d}(t) = [x_{d1} (t),x_{d2} (t),x_{d3} (t)]^{T} ,\user2{x}(t) = [x_{1} (t),x_{2} (t),x_{3} (t)]^{T} , \) \( \user2{f} (\user2{x}{}_{d}(t) )= [\alpha_{1} (x_{d2} (t) - x_{d1} (t)),\alpha_{2} x_{d1} (t) - x_{d1} (t)x_{d3} (t) + \alpha_{3} x_{d2} (t),x_{d1} (t)x_{d2} (t) - \alpha_{4} x_{d3} (t)]^{T} \), \( \user2{f} (\user2{x}(t) )= [\alpha_{1} (x_{2} (t) - x_{1} (t)),\alpha_{2} x_{1} (t) - x_{1} (t)x_{3} (t) + \alpha_{3} x_{2} (t),x_{1} (t)x_{2} (t) - \alpha_{4} x_{3} (t)]^{T} , \) \( \user2{G} (\user2{x}(t) )= diag[1,1,1] \), \( \user2{d}(t) = [d_{1} (t),d_{2} (t),d_{3} (t)]^{T} \) and \( \user2{u}(t) = [u_{1} (t),u_{2} (t),u_{3} (t)]^{T} \).
The control objective is to find a suitable control law \( \user2{u}(t) \), so that the state trajectories of slave chaotic system \( \user2{x}(t) \) can follow the master chaotic system \( \user2{x}{}_{d}(t) \) under different initial conditions and subject to disturbances.
When \( \theta_{c} = [0\sim 0.8) \) the system is known to be the generalized Lorenz system; when \( \theta_{c} = 0.8 \) the system is called the Lu system; and when \( \theta_{c} = (0.8\sim 1] \) the system is called the Chen system. It is supposed that the initial conditions of the states are \( x_{d1} (0) = 3, \, x_{d2} (0) = 5, \, x_{d3} (0) = 7 \) for the master system, and \( x_{1} (0) = - 2, \, x_{2} (0) = 2, \, x_{2} (0) = 3 \) for the salve system. In addition, \( \theta_{c} = 1 \)and the external disturbance \( \user2{d}(t) = [0.2\cos (\pi t),0.1\cos (t),0.3\cos (2t)]^{T} \) are used. The attenuation level is chosen as \( \user2{R} = diag(0.6,0.6,0.6) \) and the gains of integrated error function are selected as \( \underline{\user2{H}} = diag(0.1,0.1,0.1). \, \)In addition, the initial values of the learning-rates of PID gains are set as \( \eta_{P} = 80,\eta_{I} = 20, \) and \( \eta_{D} = 0, \) respectively; all the PID gains are set as zero initially.
For comparison, an adaptive fuzzy sliding model control (AFSMC) [21] is also applied to this system, the tracking trajectories are shown in Fig. 8a–c, respectively. The simulation results of the proposed RSPID control are shown in Figs. 8, 9, 10, 11. Synchronization responses of \( x_{d1} (t),x_{1} (t),x_{d2} (t),x_{2} (t),x_{d3} (t) \) and \( x_{3} (t) \) are depicted in Figs. 8d–f, respectively. Figure 9 shows the synchronization errors and control efforts of chaotic system. The fitness function and learning-rates are shown in Fig. 10, it is shown when error state \( \underline{\user2{e}} (t) \) converges to zero, the fitness function also approximates to \( fit = 10 \). Figure 11 illustrates the \( \user2{K}_{P} ,\;\user2{K}_{I} \) and \( \user2{K}_{D} \) gains. Since this system is a first order system (i.e. PI controller is used); thus, \( \user2{K}_{D} \) gains approach to zero.
From Figs. 4 and 8, it’s easy to find that the proposed control system can achieve control performance better than AFSMC scheme, even under disturbance. By using the proposed design method, the tracking error can converge to zero quickly even in the presence of disturbance for different initial conditions.
6 Conclusion
This paper proposes a nonlinear system control scheme by using a robust self-learning PID (RSPID) control system. In the proposed control system, the self-learning PID (SPID) controller is the main controller used to mimic an ideal controller, and the robust controller is designed to compensate for the difference between the ideal controller and the SPID controller based on H ∞ control technology. In addition, a PSO algorithm has been adopted to search the optimal learning-rates of the tuning law in SPID controller. Simulation results have demonstrated that the proposed RSPID control system can achieve favorable control performance for the nonlinear systems. The major contributions of this study are: (1) The successful development of a self-learning PID controller. (2) The SPID controller with PSO algorithm and the robust controller are successful combined for MIMO nonlinear systems.
References
Astrom KJ, Hagglund T (1988) Automatic tuning of PID controllers. Instrument Society of America, Research Triangle Park
Ho WK, Hang CC, Zhou JH (1997) Self-tuning PID control of a plant with under-damped response with specifications on gain and phase margins. IEEE Trans Control Syst Technol 5(4):446–452
Ren TJ, Chen TC, Chen CJ (2008) Motion control for a two-wheeled vehicle using a self-tuning PID controller. Control Eng Pract 16:365–375
Vagia M, Tzes A (2008) Robust PID control design for an electrostatic micromechanical actuator with structured uncertainty. IET Control Theory Appl 2(5):365–373
Yu DL, Chang TK, Yu DW (2005) Fault tolerant control of multivariable processes using auto-tuning PID controller. IEEE Trans Syst Man Cybern B 35(1):32–43
Juang JG, Huang MT, Lin WK (2008) PID control using presearched genetic algorithms for a MIMO system. IEEE Trans Syst Man Cybern C 38(5):716–727
Wang J, Zhang C, Jing Y (2008) Fuzzy immune self-tuning PID control of HVAC system. IEEE Int Conf Mech Autom 678–683
Bandyopadhyay R, Chakraborty UK, Patranabis D (2001) Autotuning a PID controller: a fuzzy-genetic approach. J Syst Archit 47:663–673
Harinath E, Mann GKI (2008) Design and tuning of standard additive model based fuzzy PID controllers for multivariable process systems. IEEE Trans Syst Man Cybern B 38(3):667–674
Kennedy J, Eberhart RC (1995) Particle swarm optimization. IEEE Int Conf Neural Netw Perth, Australia 4:1942–1948
Eberhart RC, Shi Y (2001) Particle swarm optimization: developments, applications and resources. In: Proceedings of congress on evolution computation, Seoul, pp 81–75
Juang CF (2004) A hybrid of genetic algorithm and particle swarm optimization for recurrent network design. IEEE Trans Syst Man Cybern B Cybern 34(2):997–1006
Heo JS, Wang K, Lee Y, Garduno-Ramirez R (2006) Multiobjective control of power plants using particle swarm optimization techniques. IEEE Trans Energy Convers 21(2):552–561
Lin FJ, Teng LT, Chu H (2008) A robust recurrent wavelet neural network controller with improved particle swarm optimization for linear synchronous motor drive. IEEE Trans Power Electron 23(6):3067–3078
Kim TH, Maruta I, Sugie T (2008) Robust PID controller tuning based on the constrained particle swarm optimization. Automatica 44:1104–1110
Chang WD, shih SP (2010) PID controller design of nonlinear systems using an improved particle swarm optimization approach. Commun Nonlinear Sci Numer Simul 15:3632–3639
Chen BS, Lee CH (1996) H ∞ tracking design of uncertain nonlinear SISO systems: adaptive fuzzy approach. IEEE Trans Fuzzy Syst 4(1):32–43
Wang SD, Lin CK (2000) Adaptive tuning of the fuzzy controller for robots. Fuzzy Sets Syst 110:351–363
Erlic M, Lu WS (1995) A reduced-order adaptive velocity observer for manipulator control. IEEE Trans Robot Autom 11(2):293–303
Lu J, Chen G, Cheng D (2002) Bridge the gap between the Lorenz system and the Chen system. Int J Bifurcat Chaos 12:2917–2926
Lin CM, Hsu CF (2004) Adaptive fuzzy sliding-mode control for induction servomotor systems. IEEE Trans Energy Convers 19(2):362–368
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lin, CM., Li, MC., Ting, AB. et al. A robust self-learning PID control system design for nonlinear systems using a particle swarm optimization algorithm. Int. J. Mach. Learn. & Cyber. 2, 225–234 (2011). https://doi.org/10.1007/s13042-011-0021-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-011-0021-4