
1 Introduction

To solve many of the problems arising in the design, implementation, and operation of automatic control systems, relatively precise mathematical models of the static and dynamic behavior of industrial processes are required. If the underlying physical laws are unknown or only partially known, or if significant parameters are not known precisely enough, one has to resort to experimental modeling, which is called process or system identification. There are different ways to identify a system when its input and output are known. In real systems, the signals are always more or less corrupted by noise, so the expected values can only be estimated. Preferably, identification algorithms should be fast enough to allow the identification of dynamic systems around the operating point in real time. Different methods model different situations with respect to the noise. This study is based on identification methods using least squares estimation (LSE) [1, 2] for the black-box model (Fig. 1). The work is of use in the field of control systems engineering. The innovation in this paper is a fast system for the high-precision identification of linear dynamics that is independent of the initial conditions.

2 Problem Formulation

Describing the process by equations with an acceptable error margin is very difficult due to the complexity of the system structure and the noise distortion. The data [1] have been obtained from an experiment. For identification, we assume that the structure of the black-box system under test (Fig. 1) is approximated by a parametric ARMAX model with the same dynamic properties in terms of system input/output.

Fig. 1. Output error model

For this reason, we define a cost function that minimizes the error between the tested system and the approximating model (i.e., the model used to approximate the system dynamics). The test signal satisfies the assumptions [1] of zero initial conditions, and the system is embedded in nearly Gaussian noise [1,2,3]. The test signal u(k) varies persistently, and the samples are taken at uniform time intervals [1]. Under the above assumptions, we can solve the problem of fast and robust identification of linear dynamic systems.

3 Least Squares Estimator (LSE)

Assuming the parametric model (1),

$$\begin{aligned} y(t)=\varphi ^{T}(t)\theta \end{aligned}$$
(1)

where y(t) is the measured output value, \(\varphi (t)\) stands for the n-dimensional vector of the data samples, and \(\theta \) represents the n-dimensional vector of the unknown coefficients. For the measurement data, we can write Eq. (2) as follows:

$$\begin{aligned} Y=\varPhi ^{T}\theta . \end{aligned}$$
(2)

To account for noise and the inaccuracy of the model, it is better to use a large number of samples, as additional data improves the accuracy of the estimation. For \(N\gg n\), the system is overdetermined, and there is no exact solution. With oversized samples, the data matrix is no longer square; in this case, it can be replaced by a pseudo-square matrix. Taking into account the inaccuracy of the samples (3) [1,2,3],

$$\begin{aligned} \varepsilon (k)=y(k)-\varphi ^{T}(k)\theta ,\,k\in \mathbb {N},\,k>0 \end{aligned}$$
(3)

The least squares error (LSE) estimator \(\hat{\theta }\) is defined as a vector that minimizes the cost function (4):

$$\begin{aligned} V(\theta )=\frac{1}{2}\sum _{k=1}^{N}\varepsilon ^{2}(k)=\frac{1}{2}\varepsilon ^{T}\varepsilon =\frac{1}{2}||\varepsilon ||^{2} \end{aligned}$$
(4)

where \(||\centerdot ||\) is the Euclidean vector norm. For the positive definite matrix \(\varPhi ^{T}\varPhi \), the cost function (4) has a minimum:

$$\begin{aligned} \min V(\theta )=V(\hat{\theta })=\frac{1}{2}[Y^{T}Y-Y^{T}\varPhi (\varPhi ^{T}\varPhi )^{-1}\varPhi ^{T}Y], \end{aligned}$$
(5)
$$\begin{aligned} E=Y-\varPhi \theta , \end{aligned}$$
(6)
$$\begin{aligned} 0=\frac{dV}{d\theta }=-Y^{T}\varPhi +\theta ^{T}(\varPhi ^{T}\varPhi ), \end{aligned}$$
(7)
$$\begin{aligned} \hat{\theta }=(\varPhi ^{T}\varPhi )^{-1}\varPhi ^{T}Y. \end{aligned}$$
(8)
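The estimator (8) can be computed directly. The following minimal sketch (in Python with NumPy, an assumption of this illustration; the data matrix is arranged with one row per sample, the transpose of the convention in (2)) compares the explicit normal-equation form of (8) with a numerically safer least-squares solver:

```python
import numpy as np

# Minimal sketch of the LSE estimator (8): theta_hat = (Phi^T Phi)^{-1} Phi^T Y.
# Phi has one row per measurement (N rows) and one column per parameter
# (n columns), so N >> n gives an overdetermined system solved in the
# least-squares sense.
rng = np.random.default_rng(0)
theta_true = np.array([2.0, -0.5, 1.5])     # unknown coefficient vector
Phi = rng.standard_normal((100, 3))         # data matrix, N = 100 samples
Y = Phi @ theta_true                        # noise-free measurements, Eq. (2)

# Direct normal-equation form of (8); np.linalg.lstsq is numerically safer.
theta_hat = np.linalg.inv(Phi.T @ Phi) @ Phi.T @ Y
theta_lstsq, *_ = np.linalg.lstsq(Phi, Y, rcond=None)

print(np.allclose(theta_hat, theta_true))   # True: theta recovered exactly
print(np.allclose(theta_lstsq, theta_true))
```

With noise-free data, both forms recover \(\theta \) exactly; with noisy data, the solver form is preferred because it avoids explicitly inverting the Gramian.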

In the field of control systems engineering, Eq. (8) can be assessed for good or bad numerical conditioning of the computation via (9):

$$\begin{aligned} (\mathbb {\varPhi }^{T}\mathbb {\varPhi })(\mathbb {\varPhi }^{T}\mathbb {\varPhi })^{-1}=\tilde{I}\approx I. \end{aligned}$$
(9)
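The check (9) can be evaluated numerically. A minimal sketch, assuming NumPy and an illustrative near-dependence between two columns of the data matrix:

```python
import numpy as np

# Minimal numerical sketch of the conditioning check (9): the product
# (Phi^T Phi)(Phi^T Phi)^{-1} should reproduce the identity; off-diagonal
# residue signals an ill-conditioned, nearly rank-deficient Gramian.
# The 1e-6 coupling between the last two columns is an illustrative assumption.
rng = np.random.default_rng(1)
Phi = rng.standard_normal((200, 4))
Phi[:, 3] = Phi[:, 2] + 1e-6 * rng.standard_normal(200)  # nearly dependent

G = Phi.T @ Phi                               # Gramian matrix Phi^T Phi
I_tilde = G @ np.linalg.inv(G)                # I-tilde of Eq. (9)
off_diag = np.abs(I_tilde - np.eye(4)).max()  # perturbation outside the diagonal

print(off_diag > 1e-12)                       # noticeably above machine epsilon
print(np.linalg.cond(G) > 1e8)                # large condition number agrees
```

A well-conditioned \(\varPhi \) would leave the off-diagonal residue near machine epsilon; the nearly dependent columns inflate it by many orders of magnitude.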

Perturbations outside the main diagonal indicate poor conditioning of the numerical task. With significant off-diagonal perturbations obtained numerically, the pseudo-square matrix can be close to losing rank, even though it is invertible. Equation (9) thus returns predictive indices for the probability of good identification results with the LSE (8). The matrix \(\varPhi ^{T}\varPhi \) is known as the Gramian matrix of \(\varPhi \) and possesses several useful properties, such as being positive semi-definite; the matrix \(\varPhi ^{T}Y\) is known as the moment matrix. Finally, \(\hat{\theta }\) is the coefficient vector of the least-squares hyperplane, expressed as (8). We therefore consider a discrete-time equation on a moving window. Systems can be described by the autoregressive moving average model with exogenous inputs (ARMAX) (10):

$$\begin{aligned} y(k)=z^{-n}\frac{B(z^{-1})}{A(z^{-1})}u(k)+\frac{C(z^{-1})}{A(z^{-1})}\varepsilon (k) \end{aligned}$$
(10)

The model does not require a preliminary assumption of system stability, as shown in Sect. 4; this is a contribution of the new identification algorithm. A necessary and sufficient condition for identifying the system is that it satisfies the controllability condition in the bounded-input bounded-output sense (b.i.b.o.), i.e., a bounded input generates a bounded output over a limited time range, and that a sufficient amount of data is available. We assume the following:

$$\begin{aligned} \varepsilon =\frac{C(z^{-1})}{A(z^{-1})}\varepsilon (k) \end{aligned}$$
(11)

where \(y(k),\;u(k)\), and \(\varepsilon (k)\) are series of discrete data equally spaced in time. By describing the system with a difference equation, the following is obtained:

$$\begin{aligned}&y(k)+a_{1}y(k-1)+...+a_{n}y(k-n)+\varepsilon =b_{1}u(k-1)+...+b_{m}u(k-m); \nonumber \\&\;\qquad \qquad \qquad k\gg n,\,n\ge m;\,m,n\in \mathbb {N};\,u\in \mathbb {R};\,y\in \mathbb {R};\;\varepsilon \in \mathbb {R} \end{aligned}$$
(12)

where the linearization error \(\varepsilon =0\), and \(b_{0},b_{1},...,b_{m};a_{0},a_{1},...,a_{n}\) are the coefficients sought. Applying the discrete Z-transform, with zero initial conditions and \(\varepsilon =0\), yields the following:

$$\begin{aligned} \hat{G}(z)=\frac{Y(z)}{U(z)}=\frac{\hat{b}_{1}z^{m-1}+\hat{b}_{2}z^{m-2}+...+\hat{b}_{m-1}z+\hat{b}_{m}}{z^{n}+\hat{a}_{1}z^{n-1}+\hat{a}_{2}z^{n-2}+...+\hat{a}_{n-1}z+\hat{a}_{n}}, \end{aligned}$$
(13)

The discrete transfer function, from the definition of the discrete “z” operator, requires the assumption that the signal does not grow faster than an exponential function (14):

$$\begin{aligned}&Z[f^{*}(t)]=Z[f(kT)]=F(z),\;F(z)=\sum _{k=-\infty }^{\infty }f(kT)z^{-k} \nonumber \\&\qquad \qquad k\in \mathbb {\mathbb {N}},\;T\in \mathbb {R},\;f(k)<k!,\;f(k)<e^{ak^{2}};\,a>0,\,a\in \mathbb {R}. \end{aligned}$$
(14)
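The difference equation (12) translates directly into rows of the data matrix used by the LSE (8). A minimal sketch, assuming NumPy and a hypothetical helper name build_regression; the row-per-sample layout is the transpose of the convention in (2):

```python
import numpy as np

def build_regression(y, u, n, m):
    """Build Y and Phi for the difference equation (12), rearranged as
    y(k) = -a1*y(k-1) - ... - an*y(k-n) + b1*u(k-1) + ... + bm*u(k-m).
    Rows start at k = n so every regressor entry exists."""
    rows, targets = [], []
    for k in range(n, len(y)):
        past_y = [-y[k - i] for i in range(1, n + 1)]  # signs fold a_i rightward
        past_u = [u[k - i] for i in range(1, m + 1)]
        rows.append(past_y + past_u)
        targets.append(y[k])
    return np.array(targets), np.array(rows)

# Tiny check on a first-order system y(k) = 0.5*y(k-1) + 2*u(k-1),
# i.e. a1 = -0.5 and b1 = 2 in the notation of (12):
u = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
y = np.zeros(5)
for k in range(1, 5):
    y[k] = 0.5 * y[k - 1] + 2.0 * u[k - 1]
Y, Phi = build_regression(y, u, n=1, m=1)
theta, *_ = np.linalg.lstsq(Phi, Y, rcond=None)
print(theta)  # a1_hat = -0.5, b1_hat = 2.0
```

Stacking the regressors this way reduces the ARMAX identification of Sect. 3 to the standard LSE solve (8).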

4 Non-zero Initial Condition

A linear system without noise fulfills the principle of causality and can be identified by the LSE in any state. For a non-zero initial condition, we have a discontinuous function. The problem appears when the system is exposed to noise, because such a system does not fulfill the principle of causality. The goal is to satisfy the zero initial condition on the u(k) signal in Eq. (12); when the zero initial condition is arbitrarily imposed on the input signal, an output error is added to the noise. For this reason, the discontinuity on the input is modeled as a nonlinearity f(.) (Fig. 2):

Fig. 2. The system structure of the identification model.

The nonlinear function f(.) (Fig. 2) is estimated by the proposed delay function (15), which optimally carries the \(\tilde{u}(k)\) signal (16) from the zero initial condition to its actual state over the range of data samples used for identification and has an insignificant impact on the dynamics of the system (18). The function is used to satisfy the zero initial condition on the input signal \(\tilde{u}(k)\) for the computation algorithm (LSE) of the error Eq. (24). A strongly nonlinear function [12, 13] corrects the discrete input value in Eqs. (8) and (12) by imposing the zero initial condition on the optimal first samples via the delay time (15). The discrete output signals in Eqs. (8) and (12) are unchanged, and the original values are retained, as modifying them would break the principle of causality through the delay time of the input signal (15).

$$\begin{aligned} h(z)=\frac{1}{z^{\eta }};\,\eta \in \mathbb {N},\,\eta >1 \end{aligned}$$
(15)

The proposed function (15) is defined as the zero input initial reconstructor (ZIIR).

$$\begin{aligned} \tilde{u}(k)=Z^{-1}[h(z)U(z)];\;k\in \mathbb {N};\,k>0;\,\tilde{u}\in \mathbb {R} \end{aligned}$$
(16)

where u[1, ..., j) is arbitrary, assuming the imposition of the zero initial condition:

$$\begin{aligned} u[1,...,j)\equiv 0 \end{aligned}$$
(17)
$$\begin{aligned} k=j,\,j+1,...,N;\,N\in \mathbb {N};\,j\in \mathbb {N}, \end{aligned}$$
(18)
$$\begin{aligned} N-j\gg n, \end{aligned}$$
(19)
$$\begin{aligned} y(k)=z^{-n}\frac{B(z^{-1})}{A(z^{-1})}\tilde{u}(k)+\frac{C(z^{-1})}{A(z^{-1})}\varepsilon (k), \end{aligned}$$
(20)
$$\begin{aligned} \varepsilon =\frac{C(z^{-1})}{A(z^{-1})}\varepsilon (k), \end{aligned}$$
(21)

where (21) includes an equation error.

$$\begin{aligned} y(k)+a_{1}y(k-1)+...+a_{n}y(k-n)+\varepsilon =b_{1}\tilde{u}(k-1)+...+b_{m}\tilde{u}(k-m);\;y\in \mathbb {R},\;\varepsilon \in \mathbb {R} \end{aligned}$$
(22)
$$\begin{aligned} \hat{\theta }_{i}=(\tilde{\varPhi }_{i}^{T}\tilde{\varPhi }_{i})^{-1}\tilde{\varPhi }_{i}^{T}Y_{i}. \end{aligned}$$
(23)

By applying the discrete Z-transform, the following is obtained:

$$\begin{aligned} \hat{G}_{i}(z)=\frac{\hat{b}_{1}z^{m-1}+\hat{b}_{2}z^{m-2}+...+\hat{b}_{m-1}z+\hat{b}_{m}}{z^{n}+\hat{a}_{1}z^{n-1}+\hat{a}_{2}z^{n-2}+...+\hat{a}_{n-1}z+\hat{a}_{n}} \end{aligned}$$
(24)
$$\begin{aligned} \hat{y}(k)=Z^{-1}[\hat{G}(z)\tilde{U}(z)] \end{aligned}$$
(25)

The mean square error (MSE) is based on the window (27):

$$\begin{aligned} e_{i}=\frac{1}{N-j}\sum _{k=0}^{N-j}(y_{j+k}-Ey_{j+k})^{2} \end{aligned}$$
(26)
$$\begin{aligned} \hat{e}_{i}=\frac{1}{N-j}\sum _{k=0}^{N-j}(y_{j+k}-E\hat{y}_{j+k})^{2} \end{aligned}$$
(27)

The optimality function (15) can be calculated as follows:

$$\begin{aligned} \eta =f(inf(e_{i}(1(t))),inf(e_{i}(\delta (t)))), \end{aligned}$$
(28)

and the optimal identification is obtained for the minimum of the error (29):

$$\begin{aligned} \hat{G}(z)=\underset{\hat{G}_{i}(z)}{\arg \inf }\,(e_{i}). \end{aligned}$$
(29)
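The scheme (15)-(29) can be summarized algorithmically. A minimal sketch, assuming NumPy, an ARX-style regression for (22)-(23), and the equation-error MSE standing in for the simulated-response error (27); ziir_identify is a hypothetical helper name:

```python
import numpy as np

def ziir_identify(u, y, n, m, etas):
    """Sketch of the ZIIR scheme (15)-(29): for each candidate delay eta,
    impose zero initial conditions on the input by shifting it with
    h(z) = z^{-eta}, run the LSE on the windowed data, and keep the model
    with the smallest error, in the spirit of (29)."""
    best = None
    for eta in etas:
        u_tilde = np.concatenate([np.zeros(eta), u[:-eta]])   # (16)-(17)
        rows, targets = [], []
        # Window starts after the imposed zeros, cf. (18)-(19):
        for k in range(max(n, eta + m), len(y)):
            rows.append([-y[k - i] for i in range(1, n + 1)] +
                        [u_tilde[k - i] for i in range(1, m + 1)])
            targets.append(y[k])
        Phi, Y = np.array(rows), np.array(targets)
        theta, *_ = np.linalg.lstsq(Phi, Y, rcond=None)       # (23)
        mse = np.mean((Phi @ theta - Y) ** 2)   # equation-error stand-in for (27)
        if best is None or mse < best[0]:
            best = (mse, eta, theta)
    return best

# Demo on a first-order plant y(k) = 0.7*y(k-1) + u(k-1) with a step input:
u = np.ones(200)
y = np.zeros(200)
for k in range(1, 200):
    y[k] = 0.7 * y[k - 1] + u[k - 1]
mse, eta, theta = ziir_identify(u, y, n=1, m=1, etas=[2, 5, 10])
print(np.allclose(theta, [-0.7, 1.0], atol=1e-6))   # True: a1, b1 recovered
```

Because the identification window (18)-(19) excludes the artificially zeroed samples, the delayed input leaves the estimated dynamics essentially untouched while restoring the zero initial condition the LSE assumes.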

5 Numerical Experiments

5.1 Example System Identification

A discrete example system is described by Eq. (30), where the sampling discretization step \(\Delta t=0.1\,[s]\).

$$\begin{aligned} G(z)=\frac{-0.3832z^{2}-0.2338z+0.06683}{z^{3}-1.127z^{2}+0.494z-0.1129} \end{aligned}$$
(30)

Using relationship (20), the model of the system can be identified for the different cases demonstrated below.

System Without Noise. A plant not subject to noise is identified by the LSE in any state from a minimum number of samples (Fig. 3). Oversizing the data relative to the system dimensions results in numerical errors.
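For the noise-free case, the coefficients of the example system (30) can be recovered exactly. A minimal sketch, assuming NumPy, a random exciting input, and zero initial conditions:

```python
import numpy as np

# Minimal sketch: simulate the noise-free plant (30) as its difference
# equation and recover all six coefficients with the plain LSE (8).
a = [-1.127, 0.494, -0.1129]      # denominator coefficients a1..a3 of (30)
b = [-0.3832, -0.2338, 0.06683]   # numerator coefficients b1..b3 of (30)

rng = np.random.default_rng(3)
N = 100
u = rng.standard_normal(N)        # exciting input, zero initial conditions
y = np.zeros(N)
for k in range(3, N):
    y[k] = (-a[0]*y[k-1] - a[1]*y[k-2] - a[2]*y[k-3]
            + b[0]*u[k-1] + b[1]*u[k-2] + b[2]*u[k-3])

rows = [[-y[k-1], -y[k-2], -y[k-3], u[k-1], u[k-2], u[k-3]]
        for k in range(3, N)]
theta, *_ = np.linalg.lstsq(np.array(rows), y[3:], rcond=None)

print(np.allclose(theta, a + b))  # True: coefficients recovered exactly
```

This illustrates the claim above: without noise, a modest window of samples suffices, and adding more data only accumulates rounding error.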

Fig. 3. Response-identified model without noise.

System with Unit Distorted Input. If unit noise is introduced at the system input (e.g., \(u(N-5)=0\) for \(k>N-5\) samples), the principle of causality for the dynamic system is not satisfied. The results of such a disturbance are presented in Table 1; their quality depends on the amount of data gathered before the disturbance. Here, \(e_{1(t)}\) is the MSE (26) of the response-identified model for the step signal, \(e_{\delta (t)}\) is the MSE (26) of the response-identified model for the impulse signal, \(e_{u(1..N)}\) is the MSE (26) of the u signal distorted by the function h(z), and \(e_{(j..N)}\) is the MSE (26) of the response-identified model.

Comparing the results of Tables 1 and 2 shows that zeroing the first position of the input signal, given a large number of samples, fulfills the principle of causality and yields a good response-identified model over the identification window.

Table 1. Distorted input \(u(N-5)=0\) for \(k>N-5\) samples.
Table 2. Distorted input \(u(N-j)=0\) on the first position input samples.

System with Noise on Output. The next experiment identifies the system (30) embedded in Gaussian noise on the output. It was assumed that the output signal was exposed to noise of covariance (32) (Fig. 4).

$$\begin{aligned} ErrCov=\frac{1}{N}\sum _{i=1}^{N}(Ex_{i}-x_{i})^{2} \end{aligned}$$
(31)
$$\begin{aligned} y_{ErrCov}=0.0494 \end{aligned}$$
(32)
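The error covariance (31) can be checked on synthetic data. A minimal sketch, assuming NumPy and an arbitrary sinusoidal reference signal standing in for the expectation \(Ex_{i}\):

```python
import numpy as np

# Sketch of the error covariance (31): the true (noise-free) signal stands
# in for the expectation E x_i, and the noise level mimics (32).
rng = np.random.default_rng(4)
x_true = np.sin(0.1 * np.arange(500))                 # illustrative signal
x = x_true + rng.normal(0.0, np.sqrt(0.0494), 500)    # noisy measurement
err_cov = np.mean((x_true - x) ** 2)                  # Eq. (31)
print(err_cov)   # close to the assumed variance 0.0494
```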

Table 3 shows the comparison error (26) of the response-identified model by the Matlab System Identification Toolbox (MSIT) and the proposed algorithm.

Table 3. Response error of the identified model with noise on the output.
Table 4. Dependence of the estimated coefficients on the discrete samples.
Fig. 4. Response-identified model with noise on the output.

Table 4 displays the dependence of the estimated model coefficients on the horizon of the data and the dependence of the noise on the output.

System with Noise on Input and Output. The next experiment identifies the system (30) embedded in Gaussian noise on the input and output. It was assumed that the input and output system signals were exposed to a noise of covariance (33) (Fig. 6 and Tables 5 and 6).

$$\begin{aligned} u_{ErrCov}=0.0494,\;y_{ErrCov}=0.0494. \end{aligned}$$
(33)

Fig. 5 shows the time constants of the identified system based on the proposed algorithm and MSIT.

Table 5. Response error of the identified model with noise on the input and output.
Fig. 5. The time constants of the response-identified model based on the proposed algorithm and MSIT.

Table 6. Dependence of the estimated coefficients on the discrete samples.
Fig. 6. Response-identified model with noise on the input and output.

5.2 Laboratory System Distillation Column

Identification of the Laboratory Subsystem of the Distillation Column. Identification with a non-Gaussian noise distribution is performed using data recorded from the measurement level point \(u=L175\) to the measurement level point \(y=L176\), where \(\eta =3\), \(k=6000\) samples, and the discretization step \(\Delta t=0.1\,[s]\) (Figs. 7, 8, 9 and 10).

Fig. 7. Direct identification at the technology operating point.

Fig. 8. Identification with operating points biased to the zero initial condition.

Fig. 9. Identification biased to the zero initial condition with optimal filtering.

Fig. 10. Identification biased to the operating point with optimal filtering.

Fig. 11. Comparison of the response-identified models between the proposed algorithm with optimal filtering and MSIT, biased to the zero initial condition (deviation model).

Comparison of the Proposed Algorithm and the Matlab System Identification Toolbox (MSIT). A comparison of the identified model transfer function responses between the proposed algorithm and MSIT is shown in Fig. 11.

$$\begin{aligned} G_{F1}(z)=\frac{-0.0001766z^{2}+2.856\cdot 10^{-5}z+0.0001745}{z^{3}-2.953z^{2}+2.928z-0.9745} \end{aligned}$$
(34)
$$\begin{aligned} G_{MSIT\,F1}(z)=\frac{0.0001183z^{2}-0.0002365z+0.0001183}{z^{3}-3z^{2}+3z-0.9998} \end{aligned}$$
(35)

where the error of identification (27) is obtained as

$$\begin{aligned} e_{F1}=8.3425,\;e_{MSIT\,F1}=140.7838 \end{aligned}$$
(36)
Fig. 12. Comparison of the response-identified models between the proposed algorithm with optimal filtering and MSIT, biased to the operating point.

A comparison of the identified transfer functions of the laboratory subsystem between the proposed algorithm and MSIT around the operating point is presented in Fig. 12. If the operating point is biased to the neighborhood of the zero initial condition and optimal filters are used, the proposed algorithm produces acceptable results. The state matrix changes very little, but a coefficient of the control matrix changes, which has an impact on the system.

$$\begin{aligned} G_{F2}(z)=\frac{2.671\cdot 10^{-5}z^{2}-6.632\cdot 10^{-7}z+5.943\cdot 10^{-7}}{z^{3}-2.954z^{2}+2.93z-0.976} \end{aligned}$$
(37)
$$\begin{aligned} G_{MSIT\,F2}(z)=\frac{-0.3832z^{2}-0.2338z+0.06683}{z^{3}-1.127z^{2}+0.494z-0.1129} \end{aligned}$$
(38)

where the error of identification (27) is obtained as

$$\begin{aligned} e_{F2}=14.5630,\;e_{MSIT\,F2}=153.6657 \end{aligned}$$
(39)

6 Conclusion

The proposed algorithm gives exact and repeatable results for systems embedded in nearly Gaussian noise on the input and output. The results of the identification are independent of the initial conditions. The algorithm allows for the correction of the time constants in the identified model through the modulation of the function (15). The literature contains many theoretical proposals of concepts based on the mathematics of dynamic systems [4,5,6,7,8,9,10,11]. The problem appears when these concepts are used in control systems engineering, which requires more generalized assumptions: limited precision of data representation and a perturbed Gaussian distribution on the input and output, as shown in Sect. 5.2. The study demonstrated that the proposed algorithm is an innovation in the fields of control systems engineering and applied mathematics. It returns acceptable quality indices for online real-system identification, is independent of the system state and preliminary parametrization [10, 11], and can be used for a wide range of test signals under the assumptions given in [1]. Direct calculation of the identification results for the window data range allows for robust identification of the optimal LSE model. A new achievement of the presented algorithm is the ability to identify unstable systems that satisfy the controllability condition in the b.i.b.o. sense. The proposed algorithm also opens up new possibilities in process diagnostics, enabling high-precision fault detection and the reconstruction of damaged data using mathematical models and linear regression.