1 Introduction

Most of the physical systems in real-life applications are nonlinear in nature. Safe operational practice of various industrial units needs mathematical modeling of the physical system, optimization and design of the control system. In the past, researchers contributed a lot of work in obtaining the mathematical model of a nonlinear system from its first principle concepts. The First Principle (FP) concept was desirable for only systems where adequate knowledge of it is available. Moreover, if the assumptions made in the derived models are inadequate or unrealistic, then the FP-based model might fail to capture the essential process dynamics. An alternative proposition to build a nonlinear model directly from the observation of measured data of the process is called system identification. In many of the process industries, whose internal functions are complex to understand, formulate and compute, parametric and nonparametric nonlinear system identification are adopted, by providing the measured input and output data. The main advantage of nonlinear system identification is the fact that, even, if an unknown system is given, just with the measured input and output data, the nonlinear dynamics of the process can be retrieved accurately. In recent years, many literatures have brought out the features, pros and cons of the usage and complexity of many notable identification algorithms for nonlinear systems. To manifest a few, Schoukens and Ljung presented a review of identification methods of linear and nonlinear systems [1]. The article also indexed an exemplary summary of many parametric identification methods. Block-oriented nonlinear models can be classified under (i) Hammerstein (ii) Wieners (iii) Voltera-series. The review article conferred by [2], not only portrayed the block-oriented identification methods, but also delivered a deep thought on the most prominent nonlinear control schemes of recent times. A non-stochastic subspace algorithm was considered for multi-dimensional nonlinear system identification based on measured output data. However, the procedure was not tested for systems with different structural nonlinearities [3]. The autoregressive models with exogenous inputs are employed in applications where state transitions are triggered by external events [4]. Stochastic gradient parametric estimation using moving window data was presented in [5] to estimate the system’s response to discrete measured data. However, the effectiveness of the method was shown only by using numerical examples and not on physical systems. The identification of LPV time-delay systems with missing output data using multiple-model approach is framed in [6]. Output-error (OE) model representing the process dynamics of CSTR and continuous fermenter, are recovered using the expectation–maximization (EM) algorithm to obtain the final global model. Reference [7] is concerned with the parametric identification of a special class of nonlinear systems called as bilinear state space systems. Parametric identification of time-delay systems was discussed in [8, 48]. Multi-innovation theory is put forward in stochastic gradient algorithm based on state observer and recursive least-squared identification algorithm to improve their accuracy and convergence rate. In another work by [9], a generalized identification scheme for integral-order systems is utilized for the identification of fractional-order nonlinear systems with both non-chaotic and chaotic behaviors. Being under the class of black-box modeling, Hammerstein-Wiener models can be employed for the identification of complex nonlinear systems with static nonlinearity as well as dynamic linear regions [10, 11]. Machine learning approaches are very powerful tools to identify a variety of highly nonlinear systems. The approaches come out with high fidelity models, that reflect the underlying physics of the nonlinear system. Many standard machine learning methods have shown spectacular performance in predicting dynamics of any interpolated system, but the resulting models usually lack generalizability and interpretability [12, 13]. Recently, in one article by [14], the authors reviewed system identification in context to powerful tools of computational intelligence methods which include genetic algorithm, particle swarm optimization and differential evolution. A variety of highly nonlinear occurrences are contemplated to assess the competence and the fast computing intelligence of genetic programming in [15]. Control of pH using adaptive nonlinear model-based control was implemented where process parameters were estimated [49]. Takagi–Sugeno (TS) fuzzy modeling with an unscented Kalman filter was carried over for a practical heat exchanger process [16]. Yet, the real challenge lies in the choice of fuzzy rule numbers on the output precision. In another research, the authors of [17] have put forth a Reliable Fuzzy Neural Network (ReFNN) which can handle reliability of nonlinear systems using an information reliability measure. The Stone-Weierstrass theorem was used to prove the universal approximation property of ReFNNs. Results showed that ReFNNs outperformed traditional Feed forward Neural Networks in terms of error and sensitivity, especially in the presence of noise. Nevertheless, an integrated framework using data driven system identification tools and nonlinear metric has not executed by the researchers up to this point. In this article, we propose an integrated framework relating these two concepts. Nonetheless, there are several identification methods where the real challenge lies in developing a parsimonious model with the smallest possible number of parameters that can adequately describe the dynamics of the physical system. Also, the confrontation lies in determining the underlying dynamics of the process from the measured data. It becomes difficult to select suitable identification techniques for a given system with unknown dynamics and unmeasured nonlinearity. In this article, we emphasize the importance of quantifying nonlinearity in order to choose an appropriate identification method. A control-relevant nonlinearity measure (CRNM) was proposed for measuring the nonlinear degree of a system when a linear control strategy is selected [18]. The CRNM method is an integrated multi-model control framework based on the gap metric and the gap metric stability margin. In spite of the investigations on two CSTR systems, the nonlinear metric failed to classify them based on nonlinearity level. Jiang et al. proposed a nonlinearity measure-based damage location method for beam-like structures [19]. Nonlinearity degree of the characteristic points in undamaged and damaged structures was compared to identify the location of damage by finding positions of maximal change. Nonlinear approaches are more sensitive in detecting breathing cracks in blades. A bicoherence-based nonlinearity measurement method was intended for identifying the location of breathing cracks in blades, which evaluated the extent of nonlinearity in the responses of blades under random excitations [20]. The Total Nonlinearity Index was used to establish the indicators of the cracks in blades, and the crack location was identified by finding the maximum components of the indicators. Generally, tools for analyzing nonlinear systems, like, describing function, phase portrait, perturbation, stability criteria (Lyapunov or Popov), and passivity are well-established. However, some of the existing units demand fault diagnosis and model order reduction of complex systems. Another salient aspect in the analysis of nonlinear physical systems, is the synthesis of closed loop systems with advanced control techniques. Zhaou [21], in his article focused on stability analysis and controller synthesis for stochastic networked control systems (NCS) under aperiodic denial-of-service (DoS) jamming attacks. An observer was constructed to estimate unmeasurable states, and a new adaptive event-triggered mechanism was proposed to reduce the transmission burden and mitigate the effects of DoS attacks. Furthermore, an observer-based controller was designed, and a switched system with time-varying delays was introduced in a mass-spring-damper mechanical system. In another work, a new control strategy for stochastic nonlinear systems (SNS) with state constraints and time-varying delays was presented [22]. The strategy used an event-triggered adaptive artificial neural network (AANN) and a barrier Lyapunov function (BLF) to handle the state constraints. The constructed AANN control scheme guaranteed stability and did not violate predefined constraints. The developed method also enriched the AANN control design of SNS. Alternatively, the design of the controller or achieving better closed loop-performance requires a pertinent identification scheme.

1.1 Motivation

The above discussion reveals that there are indigenous number of articles that discuss the concepts of system identification and measurement of nonlinear metric in separate attempts. However, there exists a break in the continuity between these two concepts for over years. There exist enumerable identification methods and methods for quantification of nonlinearity. However, well-designed directions or guidelines for the selection of identification algorithms (based on nonlinearity-measure) are rare and need to be established. The motivation for this research comes up with a bang by readdressing the issues of system identification and the concept of nonlinear metrics in a joint venture. The research idea discussed in this article will overcome the existing disruption by formulating an integrated framework that relates the nonlinear metric (\(\Delta_{0}\)) with three noteworthy system identification tools.

1.2 Major contributions

Research studies in the past have not attempted to employ the identification schemes (kSINDYc, NL2SQ and N3ARX) for mild, medium and highly nonlinear systems in the process engineering domain. The readers may refer to our earlier research contribution on the nonlinear metric CANM [23], to understand the detailed concepts of quantification of nonlinearity. Furthermore, in this article, we have made a reminiscent improvement from our earlier work on CANM [23], by introducing the \(\Delta_{0}\) criteria for stable, unstable and marginally stable systems. In this research, we have proposed a new framework for making an appropriate choice of system identification method by inspecting the degree of nonlinearity of the nonlinear system under investigation. The substructure of the proposed framework involves three steps.

  1. 1.

    To conduct nonlinear system identification of the physical system using (a) proposed data-driven kSINDYc identification (b) Neural network-based data-driven N3ARX method (c) parametric NL2SQ identification method.

  2. 2.

    To measure the degree of nonlinearity of the physical system at the specified operating region using the nonlinearity measure CANM \(\left( {\Delta_{0} } \right)\).

  3. 3.

    To make a suitable choice of system identification by mapping the nonlinearity level \(\Delta_{0}\) with the identification method that dispenses the least RMSE.

  4. 4.

    Furthermore, the nonlinear metric, namely CANM method, is upgraded in this article by recommending certain directives, on the computation of \(\Delta_{0}\) for stable, unstable and marginally stable systems.

Besides, assimilating the proposed framework for five different physical systems from chemical engineering units with different nonlinear levels has not been carried out in the existing literature. The data driven SINDYc identification proposed in [24], does not provide descriptive library terms based on the nonlinearity, which in turn makes it ill-conditioned in the prediction of complex nonlinear processes. This major concern is drenched here, by choosing a fewer number of relevant key terms in the candidate library of the kSINDYc scheme. This paper also addresses this issue by providing the relevant choice of key terms based on the degree and type of nonlinearity of dynamic nonlinear systems. The paper is divided into six Sects. Section 1 has introduced the literature review of many system identification routines and nonlinear metric tools. Section 2 investigates the nonlinearity metric CANM for stable, unstable and marginally stable systems. Section 3 elucidates the concept of the proposed framework consisting of the three identification methods kSINDYc, NL2SQ and N3ARX and nonlinear metric CANM. It is followed by simulation results in Sect. 4 which show that the computed nonlinearity \(\Delta_{0}\) as well as evaluation index RMSE witness a major lively role in deciding the choice of nonlinear identification method for the five dynamic systems with contrasting nonlinearity. Besides, Sect. 4 also adds increased flavor to the current study by suggesting a suitable parsimonious model for every physical system under study and Sect. 5 concludes the article. Section 6 presents the future directions of research of this article.

2 Nonlinearity metric–CANM

The nonlinearity of the physical systems is an important characteristic to be inscribed in controller design, bifurcation and uncertainty analysis. It varies with respect to the initial condition of state variables, excitation signals given, and input constraints associated with it. This research brings out the strength of the nonlinearity of typical industrial processes and their impacts on popular system identification schemes. Nevertheless, there are several nonlinear indices to mark the value of nonlinearity in dynamic systems [25,26,27]. The concept of Convergence-area-based-nonlinearity measure (CANM) proposed in [23] has been endorsed in the current study. Additionally, the calculation of the nonlinear metric \(\Delta_{0}\) for stable, unstable and marginally stable systems is, refurbished in this article by recommending some directives. Without loss of generality, consider a nonlinear dynamic system of the form

$$ \frac{{{\text{d}}x(t)}}{{{\text{d}}t}} = f\left( {x(t),u(t)} \right) $$
(1)

in which \(x(t) \in {\mathbb{R}}^{m}\) denotes the state variables of a system at a time \(t\).Eq. (1) also generalizes the first principle model of nonlinear systems. If \(y_{{{\text{True}}}} (t)\) and \(y_{{{\text{lin}}}} (t)\) represent the measured output (True output) of the nonlinear system and its linearized response at the \(j{\text{th}}\) operating point \(P_{j}\), then the nonlinear metric \(\Delta_{0j}\) quantifies the level of nonlinearity as given in Eq. (2).

$$ \Delta_{0j} = \frac{{\left| {\left| {\int\limits_{0}^{{t_{f} }} {y_{{{\text{True}}}} {\text{d}}t} } \right| - \left| {\int\limits_{0}^{{t_{f} }} {y_{{{\text{lin}}}} {\text{d}}t} } \right|} \right|}}{{\left| {\int\limits_{0}^{{t_{f} }} {y_{{{\text{True}}}} {\text{d}}t} } \right|}} $$
(2)

where \(t_{f}\) represents the settling time of the nonlinear system. For a nonlinear process with \(m\) number of operating sectors such as \(P_{1} ,P_{2} \ldots P_{j} \ldots P_{m}\), the overall nonlinearity \(\Delta_{{\text{0 nom}}}\) is shown in Eq. (3)

$$ \Delta_{{\text{0 nom}}} = \frac{{\Delta_{{0P_{1} }} + \Delta_{{0P_{2} }} + \cdots \Delta_{{0P_{j} }} + \cdots \Delta_{{0P_{m} }} }}{m} $$
(3)

The CANM method conferred in this work stands distinct for its amenability in dealing with wide range of nonlinear dynamic systems. The method uses Jacobian linearization to find out linear approximation \(y_{{{\text{lin}}}} (t)\) which thoroughly depends on analysis of an operating point. The stability of the operating point decides the current dynamic behavior of the plant. A nonlinear system has multiple operating points, unlike a linear system which has only one operating point with zero initial condition. In a nonlinear system, with multiple operating points, the initial condition by itself is an operating point, which may be stable or unstable. Another class of nonlinear systems are chaotic processes, which don’t have initial conditions. The scope of study in this manuscript does not include any chaotic system. Moreover, CANM is an operating point-dependent nonlinear metric. So to maintain a standard consistency in the nonlinear metric, the examples explored in Sect. 4 of this article are subjected to initial conditions and excitation inputs at nominal operating points referring to the concerned literature. The effect of the initial condition and the type of excitation signal applied to a physical system will definitely affect \(\Delta_{0}\). Considering this characteristic, the nonlinear systems elaborated in Sect. 4 are subjected to step (\(u_{{{\text{nom}}}}\)) and PRBS (\(u_{{{\text{prbs}}}}\)) inputs, and the effect of \(\Delta_{0}\) over the excitation signals is also investigated.

The simulations for the computation of nonlinearity CANM are restricted only to SISO systems. CANM method proposed in [23], is upgraded in this article by recommending the following directives, on computation of \(\Delta_{0}\) for stable, unstable and marginally stable systems.


Case (i): \(\Delta_{0}\) for stable systems.

For any stable system, the eigen values of the Jacobian linearized model will have their eigen values on L.H.S of ‘s’ plane. If \(t_{f}\) represents the settling time of the nonlinear system around the vicinity of the stable steady state operating point \(P\), then \(\Delta_{0P}\) is operating point dependent and is given as

$$ \Delta_{{{\text{0\_stable}}}} = \frac{{\left| {\left| {\int\limits_{0}^{{t_{f} }} {y_{{{\text{True}}}} {\text{d}}t} } \right| - \left| {\int\limits_{0}^{{t_{f} }} {y_{{{\text{lin}}}} {\text{d}}t} } \right|} \right|}}{{\left| {\int\limits_{0}^{{t_{f} }} {y_{{{\text{True}}}} {\text{d}}t} } \right|}} $$
(4)

Case (ii): \(\Delta_{0}\) for marginally stable systems.

A crucial point in CANM is finding the nonlinearity for systems with transient states (marginally stable system and unstable systems). In a marginally stable system, the eigen values of the linearized model \((y_{{{\text{lin}}}} )\) are located on the imaginary axis. The response \(y\) will display sustained oscillations and there is no steady state \(t_{f}\). While finding \(\Delta_{{0}}\),instead of choosing \(t_{f}\), it is suggested to use \(t_{{{\text{cycle}}}}\) as the sustained oscillations repeat with the same time period after every cycle. Then \(\Delta_{{0}}\) becomes

$$ \Delta_{{{\text{0\_marg}}{\text{.stable}}}} = \frac{{\left| {\left| {\int\limits_{0}^{{t_{{{\text{cycle}}}} }} {y_{{{\text{True}}}} {\text{d}}t} } \right| - \left| {\int\limits_{0}^{{t_{{{\text{cycle}}}} }} {y_{{{\text{lin}}}} {\text{d}}t} } \right|} \right|}}{{\left| {\int\limits_{0}^{{t_{{{\text{cycle}}}} }} {y_{{{\text{True}}}} {\text{d}}t} } \right|}} $$
(5)

Case (iii): \(\Delta_{0}\) for unstable systems:

In an unstable system, the eigen values of the Jacobian linearized model will occur on the R.H.S of the ‘s’ plane. But there is no steady state \(t_{f}\) for unstable system. (Ex: Batch and transient processes in Chemical Reactors). The unstable response shows a transient behavior. Moreover \(t_{f}\) cannot be chosen as infinite. In such cases, local nonlinearity analysis will be an alternative solution. The nonlinear metric \(\Delta_{0}\) for unstable system can be obtained by making a trajectory dependent analysis of the measured output. To attain this feature, the whole sequence of output \(y_{{{\text{True}}}}\) is considered a trajectory which can be broken into many short time intervals \(\left( {t_{1} ,t_{2} , \ldots t_{x} \ldots t_{n} } \right)\) with \(n\) number of regions such that we can obtain piece-wise models. \(t_{n}\) corresponds to the time instant applied by the user to sort out the dynamic transient response. Then the nonlinearity metric \(\Delta_{0}\) becomes trajectory dependent and is computed by taking a cumulative mean from all regions \((R_{1} ,R_{2} , \ldots R_{x} \ldots R_{n} )\) as follows.

$$ \Delta_{{0R_{x} }} = \frac{{\left| {\left| {\int\limits_{{t_{x - 1} }}^{{t_{x} }} {y_{{{\text{True}}\_x}} {\text{d}}t} } \right| - \left| {\int\limits_{{t_{x - 1} }}^{{t_{x} }} {y_{{{\text{lin}}\_x}} {\text{d}}t} } \right|} \right|}}{{\left| {\int\limits_{{t_{x - 1} }}^{{t_{x} }} {y_{{{\text{True}}\_x}} {\text{d}}t} } \right|}} $$
(6)
$$ \Delta_{{{\text{0\_unstable}}}} = \frac{{\Delta_{{0R_{1} }} + \Delta_{{0R_{2} }} + \cdots \Delta_{{0R_{x} }} + \cdots \Delta_{{0R_{n} }} }}{n} $$
(7)

The main difference between \(\Delta_{{{\text{0\_stable}}}}\),\(\Delta_{{{\text{0\_marg}}{\text{.stable}}}}\) and \(\Delta_{{{\text{0\_unstable}}}}\) lies in the time interval limit \(t_{f}\),\(t_{{{\text{cycle}}}}\) and \(t_{n}\). This sort of analysis can also be applied to batch processes in many chemical reactor units. In many batch processes, the eigen values are stable only at the beginning, but as the batch process continues, the eigen values become unstable.

$$ {\text{Nonlinearity}}\;{\text{level}} = \left\{ \begin{gathered} \Delta_{0} \le 0.3,\;{\text{mild}}\;{\text{nonlinear}} \hfill \\ 0.3 < \Delta_{0} \le 0.7,\;{\text{medium}}\;{\text{nonlinear}} \hfill \\ \Delta_{0} > 0.7,\;{\text{highly}}\;{\text{nonlinear}} \hfill \\ \end{gathered} \right. $$
(8)

Equation (8), implies the classification of nonlinearity as mild, medium or highly nonlinear using the CANM metric where the value of \(\Delta_{0}\) for any nonlinear system is consigned between 0 and 1. Table 1 gives an outright summary on the computation of \(\Delta_{0}\) for stable, unstable and marginally stable systems, working at a single operating point \(P\).

Table 1 Computation of \(\Delta_{0}\) for nonlinear systems

Over and above, that the nonlinearity level of any dynamic system will have a serious impact on identifying the dynamics of the complex process. As many chemical, biomedical and biological processes often operate on a predesigned operating region with multiple operating points, this CANM method will be most beneficial to them.

3 System identification in coherence with nonlinearity measure

So far, literatures have discussed the idea of nonlinear system identification and nonlinearity measurement in individual research studies. Moreover, to this notch, research on the usage of nonlinear system identification based on classification of nonlinearity remains very limited. This implication necessitates the requirement of a mathematical tool to bridge the gap between nonlinearity measurement and nonlinear system identification. The primary spotlight of present research is to encapsulate the nonlinear dynamics identified for any process with its nonlinearity level through a mathematical measurement tool. Viewed in this way, we have proposed a single framework ‘System identification in coherence with nonlinear measure’ with the assorted combination of (a) nonlinear identification schemes (kSINDYc, NL2SQ, N3ARX) and (b) CANM nonlinear metric. This combined framework will ensure an appropriate choice of system identification by inspecting the degree of nonlinearity of the nonlinear system under investigation. Among the three identification methods, kSINDYc identification is proposed in this article and its accuracy is compared with NL2SQ and N3ARX methods available in existing literature.

3.1 System identification using kSINDYc

The recent impeccable SINDYc (Sparse Identification of Nonlinear Dynamics with control) algorithm is a celebrated parsimonious system identification technique introduced by Brunton [24]. Abundant collection of technical records is garnered with widespread curiosity on the remarkable progress made in sparse dynamics in many disciplines ranging from biology to control engineering [29,30,31,32]. The SINDy (Sparse Identification of Nonlinear Dynamics) algorithm is a symbolic sparse regression problem, to identify nonlinear systems. It uses a candidate library with higher order polynomials, trigonometric terms, logarithmic functions etc., in Eq. (9) to identify any unknown process. The term ‘candidate library’ refers to a set of diversity of many functions to determine the learned SINDYc models.

$$ \Theta (X,U) = \left[ {\begin{array}{*{20}c} | \\ 1 \\ | \\ \end{array} \;\begin{array}{*{20}c} | \\ X \\ | \\ \end{array} \;\begin{array}{*{20}c} | \\ U \\ | \\ \end{array} \;\begin{array}{*{20}c} | \\ {X \otimes X} \\ | \\ \end{array} \;\begin{array}{*{20}c} | \\ {X \otimes U} \\ | \\ \end{array} ...\begin{array}{*{20}c} | \\ {\sin (X)} \\ | \\ \end{array} ...\begin{array}{*{20}c} | \\ {e^{X} } \\ | \\ \end{array} ...\begin{array}{*{20}c} | \\ {\log (X)} \\ | \\ \end{array} } \right] $$
(9)

where \(X \otimes U\) denotes the vector of all product combinations in \(X\) and U. The use of higher order polynomials or trigonometric nonlinearities or other mathematical functions in Eq. (9) without observing the system nonlinearity might cause numerical problems, which in turn engenders unnecessary oscillations in the predicted model outputs. Moreover, without descriptive library terms, the size of the \(\Theta (X,\;U)\) grows rapidly, which sequentially drives the SINDYc library to be ill-conditioned. Recent literature by [28] have substantiated that SINDYc may be susceptible to over fitting problem if care is not taken to balance the model complexity and polynomial order in its candidate library. In this article, this major concern is drenched, by the introduction of kSINDYc identification scheme for nonlinear systems. In kSINDYc method, we have refined the selection of the candidate library \(\Theta (X,\;U)\) with the inclusion of key nonlinear terms \(\left( {k_{{{\text{nl}}}} } \right)\) that describe the system dynamics. The term \(k_{nl}\) is chosen as a basic nonlinear function that plays a vital role in deciding the nonlinear dynamics of the physical system. Our research concentrates on establishing a streamlined library function \(\Theta (X,\;U)\) for kSINDYc, where library terms are chosen based on the nonlinearity that contemplates the system dynamics rather than choosing the library elements in trial and error criteria as in SINDYc. Even though kSINDYc is a data-driven approach, dynamic equations describing the identified model can be obtained from Eq. (11) after the inclusion of \(k_{{{\text{nl}}}}\) in the library \(\Theta ({\text{X,}}\;{\text{U}})\). Consequently, the learned model using the data-driven kSINDYc approach predicts the nonlinear dynamics of the physical system, with a lesser number of parametric terms in \(\Theta (X,\;U)\) without overfitting issues. Backdrop in this Section, we provide a brief retrospect to the SINDYc algorithm, which forms the bottom line of the proposed ‘kSINDYc’ system identification methodology. Consider a nonlinear dynamic system of the form given in Eq. (10).

$$ \dot{x}(t) = f(x) + g(x,u) $$
(10)

where \(x \in {\mathbb{R}}^{m}\) denotes the state of the system at time \(t\) and \(u \in {\mathbb{R}}^{p}\) gives the manipulated input vector. The function \(f( \cdot )\) and \(g( \cdot )\) represent the system parameters that capture the physics-based dynamics of the system. Inspired by its application to many physical systems, kSINDYc is formulated to determine the nonlinear dynamics of Eq. (10) using the measured input and output data.\(\dot{X}\) is a data matrix that gives the time derivatives of state variables in the sparse regression problem in Eq. (11). \(\Theta\) is the augmented library matrix with all the candidate terms in kSINDYc library. For a time period \(t = \left[ {t_{1} ,t_{2} \ldots t_{f} } \right]^{T}\), consider the input matrix \(U = \left[ {u_{1} (t),u_{2} (t) \ldots u_{p} (t)} \right]^{T} \in {\mathbb{R}}^{p}\), the state vector (data matrix) \(X = \left[ {x_{1} (t),x_{2} (t) \cdots x_{m} (t)} \right]^{T} \in {\mathbb{R}}^{m}\), then the sparse regression becomes

$$ \dot{X} = \Theta \left( {X,U} \right)\xi $$
(11)
$$ \xi = \left[ {\xi_{1} \;\xi_{2} \; \ldots \;\xi_{m} } \right] $$
(12)

Equation (12) is vector that has the sparse co-efficient \(\xi_{1} \xi_{2} \ldots \xi_{n}\) corresponding to \(\Theta (X,U)\).

$$ X = \left[ {\begin{array}{*{20}c} {x^{T} (t_{1} )} \\ {x^{T} (t_{2} )} \\ \vdots \\ {x^{T} (t_{f} )} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {x_{1} (t_{1} )} & {x_{2} (t_{1} )} & \ldots & {x_{m} (t_{1} )} \\ {x_{1} (t_{2} )} & {x_{2} (t_{2} )} & \cdots & {x_{m} (t_{2} )} \\ \vdots & \vdots & \ddots & \vdots \\ {x_{1} (t_{f} )} & {x_{2} (t_{f} )} & \cdots & {x_{m} (t_{f} )} \\ \end{array} } \right] $$
(13)
$$ {\varvec{U}} = \left[ {\begin{array}{*{20}c} {u^{T} (t_{1} )} \\ {u^{T} (t_{2} )} \\ \vdots \\ {u^{T} (t_{f} )} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {u_{1} (t_{1} )} & {u_{2} (t_{1} )} & \ldots & {u_{p} (t_{1} )} \\ {u_{1} (t_{2} )} & {u_{2} (t_{2} )} & \cdots & {u_{p} (t_{2} )} \\ \vdots & \vdots & \ddots & \vdots \\ {u_{1} (t_{f} )} & {u_{2} (t_{f} )} & \cdots & {u_{p} (t_{f} )} \\ \end{array} } \right] $$
(14)
$$ \dot{X} = \left[ {\begin{array}{*{20}c} {\dot{x}^{T} (t_{1} )} \\ {\dot{x}^{T} (t_{2} )} \\ \vdots \\ {\dot{x}^{T} (t_{f} )} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {\dot{x}_{1} (t_{1} )} & {\dot{x}_{2} (t_{1} )} & \ldots & {\dot{x}_{m} (t_{1} )} \\ {\dot{x}_{1} (t_{2} )} & {\dot{x}_{2} (t_{2} )} & \cdots & {\dot{x}_{m} (t_{2} )} \\ \vdots & \vdots & \ddots & \vdots \\ {\dot{x}_{1} (t_{f} )} & {\dot{x}_{2} (t_{f} )} & \cdots & {\dot{x}_{m} (t_{f} )} \\ \end{array} } \right] $$
(15)

The key terms in the kSINDYc library function matrix \(\Theta (X,U)\) are given in Eq. (16)

$$ \Theta (X,U) = \left[ {\begin{array}{*{20}c} | \\ 1 \\ | \\ \end{array} \begin{array}{*{20}c} | \\ X \\ | \\ \end{array} \begin{array}{*{20}c} | \\ U \\ | \\ \end{array} \begin{array}{*{20}c} | \\ {X \otimes X} \\ | \\ \end{array} \begin{array}{*{20}c} | \\ {X \otimes U} \\ | \\ \end{array} ...\begin{array}{*{20}c} | \\ {k_{nl} } \\ | \\ \end{array} } \right] $$
(16)

\(X \otimes U\) in Eq. (16) denotes the vector of all product combinations in \(X\) and \(U\).It also indicates the quadratic nonlinearities in the unknown system.

$$ \xi_{k} = \mathop {{\text{argmin}}\frac{1}{2}}\limits_{{\hat{\xi }_{k} }} \left\| {\dot{X}_{k} - \Theta (X,U)\hat{\xi }_{k} } \right\|_{2}^{2} + \lambda \left\| {\hat{\xi }_{k} } \right\|_{1} $$
(17)

The \(l_{1}\) regularized optimization problem given in Eq. (17) can be evaluated using sparsity promoting scheme called \({\text{STLS}}\) (Sequential Threshold least square method). Equation (17) penalizes the number of active terms in the candidate library \(\Theta (X,U)\). The second part of Eq. (17) has the penalty term with the tunable weighing parameter \(\lambda \ge 0\) to establish model parsimony.\(\xi_{k}\) represents \(k{\text{th}}\) row of \(\xi\) and \(\dot{X}_{k}\) represents \(k{\text{th}}\) row of \(\dot{X}\). Equation (17) is solved iteratively till the coefficients converge. A notable development is made on the SINDYc candidate library function, by introducing the ‘key nonlinear term’ from the plant dynamics, apart from the other higher-order polynomials of the processes. The algorithmic pseudocode for the proposed kSINDYc identification is given below:

Algorithm 1
figure a

Key term SINDYc (kSINDYc)

A coarse sweep of the tunable parameter \(\lambda\) in the range \(\lambda_{\min } < \lambda \le \lambda_{\max }\) is considered in Algorithm 1 to attain an optimal solution with minimum error and maximum convergence rate. The specific convergence criteria for \({\text{STLS}}\) algorithm in sparse regression framework is provided in [33]. kSINDYc can handle large candidate library with the regularizing tuner \(\lambda\). Not limitingly, the choice of the sparsity knob \(\lambda\) is made in such a way that there is a tradeoff between accuracy and complexity of the kSINDYc algorithm.

A comparison between the existing SINDy method and the proposed kSINDYc is summarized in Table 2. The descriptive library \(\Theta (X,U)\) built for each nonlinear system is unique and the same is discussed elaborately in Sect. 4 of this article.

Table 2 Overview of SINDy and kSINDYc

3.2 System identification using NL2SQ

Many works on parametric system identification have used least square method to estimate the numerical values of parameters [33, 34]. NL2SQ with Levenberg–Marquardt algorithm is another established method used for optimizing the process parameters in the field nonlinear system identification [35, 36]. An optimization problem on any nonlinear system targets on maximizing profit or minimizing the overall cost. The optimization problem so formed will have a set of independent variables with some restrictions called constraints. The solution to any optimal problem depends on the allows set of variables where the objective function \(f(x)\) is minimized and attains an optimal value. For physical systems with many nonlinear functions, the objective function \(f(x)\) is framed as the sum of squares of the nonlinear function \(r(x)\) as follows

$$ f(x) = \frac{1}{2}\sum\limits_{j = 1}^{m} {\left( {r_{j} (x)} \right)}^{2} = \frac{1}{2}\left\| {r(x)} \right\|_{2}^{2} $$
(18)

In Eq. (17) the objective function \(f(x)\) has to be minimized. Then Eq. (18) becomes

$$ \mathop {\min }\limits_{x} f(x) = \sum\limits_{j = 1}^{m} {\left( {r_{j} (x)} \right)^{2} } $$
(19)

In Eq. (18), the sum of least squares of the nonlinear function \(r(x)\) is minimized, and hence the optimization problem with this methodology is called Nonlinear Least Square method (NL2SQ). If a nonlinear system has a model function \(\phi (x)\) and the measured output be \(y_{{{\text{True}}}}\) then

$$ r_{j} (x) = \phi \left( {x,t_{j} } \right) - y_{True(j)} $$
(20)
$$ r(x) = \left( {r_{1} (x),r_{2} (x), \cdots r_{m} (x)} \right)^{T} $$
(21)

The residual vector \(r(x)\) has \(m\) number of components. To solve the least square problem, the most common algorithms used are Gauss Newton method and Levenberg Marquardt (LM) methods. Gradient of the objective function \(\nabla f(x)\) is expressed from Eq. (22) as

$$ \nabla f(x) = \sum\limits_{j = 1}^{m} {r_{j} (x)} \nabla r_{j} (x) = J(x)^{T} r(x) $$
(22)

where the Jacobian term \(J(x)\) is given as

$$ J(x) = \left( {\frac{{\partial r_{j} }}{{\partial x_{i} }}} \right)_{j = 1,...m;\;i = 1,...n} = \left( {\begin{array}{*{20}c} {\nabla r_{1} (x)^{T} } \\ {\nabla r_{2} (x)^{T} } \\ \vdots \\ {\nabla r_{m} (x)^{T} } \\ \end{array} } \right) $$
(23)

Hessian matrix \(H\left( {f(x)} \right)\) consists of second-order partial derivatives of \(r(x)\) and is given by \(\nabla^{2} f(x)\)

$$ \begin{aligned} \nabla^{2} f(x) = & \sum\limits_{j = 1}^{m} {\nabla r_{j} (x)\nabla r_{j} (x)^{T} } + \sum\limits_{j = 1}^{m} {r_{j} (x)\nabla^{2} r_{j} (x)} \\ & = J(x)^{T} J(x) + \sum\limits_{j = 1}^{m} {r_{j} (x)} \nabla^{2} r_{j} (x) \\ \end{aligned} $$
(24)

The Hessian matrix observed in Eq. (24) must be positive definite in all the least square solutions. Equation (23) can be approximated to have the first term of Jacobian function alone, eliminating the second term \(\nabla^{2} r(x)\), when the residual is very close to the actual solution. This approximation is followed in LM method adopted in this article. The Jacobian matrix \(J(x)\) of \(H(x)\) has to be found out to optimize \(x\) for \(m\) number of samples. Using LM algorithm in [35], the objective function for NL2SQ method is modified as \(h_{{{\text{LM}}}} (x)\).

$$ (J(x)^{T} J(x) + \mu I)h_{{{\text{LM}}}} (x) = - J(x)^{T} f(x) $$
(25)
$$ h_{{{\text{LM}}}} (x) = - (J(x)^{T} J(x) + \mu I)^{ - 1} J(x)^{T} f(x) $$
(26)

The damping factor is always \(\mu \ge 0\),for which the following effects are observed. When \(\mu > 0\), the co-efficient \((J(x)^{T} J(x) + \mu I)\) is positive definite and so \(h_{LM} (x)\) is in descent direction. If \(\mu\) is very large \(h_{{{\text{LM}}}} (x) = - \frac{1}{\mu }J(x)^{T} f(x)\) and goes into the steepest descent direction. On the other hand, if \(\mu\) is very small \(h_{{{\text{LM}}}} (x) = h_{{{\text{GN}}}} (x)\), the LM algorithm converges with the Gauss Newton method. In the gradient descent method, the \(h_{{{\text{LM}}}} (x)\) is minimized by updating the parameters in the steepest-descent direction. The gradients of the process are calculated using automatic differentiation. On the other hand, in the Gauss–Newton method \(h_{{{\text{LM}}}} (x)\) is reduced by considering the least square module to be locally quadratic to its parameters and sorting out the minimum value from this quadratic term. The LM algorithm operates similar to gradient-descent method when the parameters are away from their optimal value, and behaves more like a Gauss–Newton scheme when the parameters are very near to the optimal point. It can be concluded that the LM algorithm involves the cross-combination of gradient descent and Gauss–Newton methods.

3.3 System identification using N3ARX

Neural Networks is another computational intelligence approach for identifying nonlinear systems in real world scenario with accurate estimations [37]. Neural networks are well-suited for nonlinear modeling tasks because they can learn complex patterns and relationships in the data, without knowing any prior knowledge about the dynamics of the system. N3ARX combines exogenous inputs (X) using feed forward neural networks to capture complex nonlinear relationships in time-series data and make predictions based on both past values of the series and external inputs. The exogenous inputs (X) represent additional factors or variables that may influence the time series but are not directly part of the series itself are fed into the neural network along with the autoregressive inputs (AR). The N3ARX method is a standard identification technique and is found in enormous literatures [38,39,40,41]. A novel optimal identification algorithm is presented for NMPC based on the Neural network model for different operating regions of highly nonlinear dynamic processes in [42]. Hybrid combination of Neural network algorithm with NARX method is investigated in this research to make a strong comparison with the kSINDYc method of identification. The N3ARX model employs neural networks to capture nonlinear relationships between the autoregressive and exogenous inputs and the target variable. The hidden layers of the neural network enable the model to capture and represent these nonlinear relationships. The number of neurons required to identify each process will differ depending upon the nonlinearity and operating region. The number of hidden layer nodes in N3ARX method is chosen iteratively. A simple Neural network structure is taken with 1 hidden layer, and nonlinear Rectified linear Activation function (RELU) for the simulation studies carried out in this article. The regressor equation for the N3ARX model is given by

$$ y(t) = F\left( {y(t - 1),y(t - 2), \ldots ,y(t - n),u(t),u(t - 1), \ldots ,u(t - m)} \right) $$
(27)

where \(y(t)\) refers the target variable at time \(t\). \(y(t - 1),y(t - 2), \ldots ,y(t - n)\) are the past value of the target variable also called AR inputs. \(u(t),u(t - 1), \ldots ,u(t - m)\) are the exogenous inputs (X) at time \(t\) and their past values. The function \(F\) is the feedforward neural network architecture with the weights and bias terms learned during the training process. It is evident from Eq. (27) that \(F\) maps the inputs to the target variable \(y(t)\). The N3ARX model is trained using the historical time-series data with corresponding exogenous inputs. Followingly, the predicted model is optimized by adjusting the weights and biases of the neural network to minimize the prediction error between the model’s output and the actual target values. This is typically done using gradient descent optimization algorithm. Given the previous values of the time series and the corresponding exogenous inputs, the N3ARX model can generate predictions for the future values of the target variable.

3.4 Proposed framework

In Fig. 1, the terms \(k_{{{\text{para}}}}\) and \(x_{{{\text{init}}}}\) denote the nominal input parameters and initial states of the nonlinear system, respectively. The excitation signal \(u\) and the nonlinear output \(y_{{{\text{True}}}} (t)\) from the nonlinear process are treated as measured input and output data. Region I contain the proposed kSINDYc algorithm to learn the dynamics of \(y_{{{\text{True}}}} (t)\).

Fig.1
figure 1

Proposed framework- System identification in coherence with nonlinear measure

The most essential term in the kSINDYc library, which plays a critical role in determining the nonlinear dynamics, is weighed from the governing equations of the physical system. The predicted output \(y_{{{\text{pred}}}} (t)\) of Region I and measured output \(y_{{{\text{True}}}} (t)\) are used to calculate performance using \({\text{RMSE}}\) criteria. On the flip side, the nonlinear system under study is linearized about its operating point \(P_{j}\) using Jacobian linearization and is expressed as \(y_{{{\text{lin}}}} (t)\). The metric \(\Delta_{0}\) is computed using CANM method as given in Eq. (2). On completion of the learned dynamics using kSINDYc, Region I of Fig. 1 is replaced by N2LSQ and N3ARX identification methods, that makes \(y_{{{\text{pred}}}} (t) = \left\{ {y_{{{\text{kSINDYc}}}} ,y_{{{\text{NL2SQ}}}} ,y_{{{\text{N3ARX}}}} } \right\}\). The steps involved in the proposed view of nonlinear system identification and nonlinearity quantification are as follows:

  1. 1.

    Measure the nonlinearity \(\Delta_{0}\) of the nonlinear dynamic system (Plant -P) under study using the CANM metric.

  2. 2.

    Identify (Predict) the dynamics of the plant \(y_{{{\text{pred}}}} (t)\) using (a)kSINDYc (b)NL2SQ and (c) N3ARX methods.

  3. 3.

    Create a graphical mapping between \(\Delta_{0}\) obtained in step 1 and \(y_{{{\text{pred}}}} (t)\) acquired in step 2 using a performance index.

  4. 4.

    Sort out the suitable choice of system identification from kSINDYc, NL2SQ and N3ARX, by mapping the nonlinearity level \(\Delta_{0}\) with the identification method which dispenses least evaluation index.

The crucial step in the proposed framework is the selection of an applicable identification from kSINDYc, NL2SQ and N3ARX methods. A graphical plot is made between \(\Delta_{0}\) and RMSE to fill the leveraging gap between computation of nonlinearity and the suitable choice of identification for nonlinear dynamic physical systems. This article differs from the existing literature by providing guidelines for suitable nonlinear identification from the three methods based on \(\Delta_{0}\). The proposed substructure also serves as a bridge to fill the leveraging gap between the computation of nonlinearity and the suitable choice of nonlinear system identification for nonlinear dynamic physical systems.

4 Simulation study

A reliable quantitative analysis is exemplified to cohere the nonlinear metric \(\Delta_{0}\) with the above said identification methods for five nonlinear systems with divergent nonlinear strengths. The simulation study is carried out on five Industrial physical systems with different nonlinearity levels, ranging from chemical to biological domain from the process engineering field. The nonlinear metric \(\Delta_{0}\) subjected to excitations \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\) are figured out for all the examples of Sect. 4. The computed values of \(\Delta_{0}\) can be inspected in Tables 4 and 5, to check whether the nonlinear system falls under mild, medium or highly nonlinear. The measured data \(\left( {y_{{{\text{True}}}} } \right)\) for all the five case studies are obtained from the first principle equations (also called true models). The measured outputs \(\left( {y_{{{\text{True}}}} } \right)\) are utilized in the data driven identification of kSINDYc and N3ARX methods. The models developed using kSINDYc, N3ARX and NL2SQ identification techniques are called predicted models (learned models) whose outputs are indicated as \(\left( {y_{{{\text{pred}}}} } \right)\). The nominal operating data of all the simulation examples discussed in this Section can be referred from the relevant literatures cited inside the article. In order to acquire an accurate estimate of the learned models, two test signals namely the step \((u_{{{\text{nom}}}} )\) and PRBS \((u_{{{\text{prbs}}}} )\) signals are input excited on all the examples. Consider the set of input output data \(Z^{N}\) obtained using first principle equations.

$$ Z^{N} = [u(1),y(1),u(2),y(2)...u(N),y(N)] $$
(28)

\(u\) and \(y\) corresponds to the excitation signal and the response of the SISO system, \(t_{f}\) denotes the final time for the \(N{\text{th}}\) sample. The data set \(Z^{N}\) is subjected to pre-processing before proceeding with the prediction, by splitting it into training set \(\left( {Z^{{N_{{{\text{train}}}} }} } \right)\) and testing set \(\left( {Z^{{N_{{{\text{test}}}} }} } \right)\)

$$ Z^{{N_{{{\text{train}}}} }} = 0.7Z^{N} $$
(29)
$$ Z^{{N_{{{\text{test}}}} }} = 0.3Z^{N} $$
(30)

where

$$ N = N^{{{\text{train}}}} + N^{{{\text{test}}}} ; $$
(31)

\(N^{{{\text{train}}}}\) and \(N^{{{\text{test}}}}\) conform to the training and testing data samples. As observed from Eq. (29) and (30), 70% of \(Z^{N}\) is taken randomly for training and remaining data (30%) for testing purpose. All the case studies are simulated with a sampling time of \(T_{{\text{s}}} = 0.01\;s\) in \(N\) number of sample space.\(Z^{{N_{{{\text{train}}}} }}\) intends the input–output data taken for training kSINDYc, N3ARX and NL2SQ algorithms and remaining \(Z^{{N_{{{\text{test}}}} }}\) corresponds to testing dataset. The following Section exemplifies five industrial systems with divergent nonlinear dynamics and their time response to nonlinear system identification methods at \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\).

Example 1: Three tank process

A three-tank hydraulic process with the configuration of first pump supplying a liquid to first tank is considered in the present work. The objective is to control liquid levels in 3rd tank by measuring the level in each tank. The dynamic equations and the associated process parameters \(c_{12} ,c_{23} ,c_{3} ,A_{1}\) are acquired from [43]. The terms \(h_{1} ,h_{2} {\text{ and }}h_{3}\) stand for the individual level of each tank in the cascaded-arrangement of three-tank process, respectively. The state variable \(h_{3}\) is the only measurable output when the input flow rate to Tank 1 is \(q_{1} \,m^{3} \,s^{ - 1}\). The nonlinear differential equations of the three-tank system are given by Eq. (3234). The initial level of the all the tanks is assumed to be zero. \(q_{1}\) denotes the inflow rate of the liquid in the first tank with the constraint \(u = q_{1} \,m^{3} \,s^{ - 1}\) \(q_{1} \in [0 - 1e^{ - 5} ]m^{3} s^{ - 1}\) and \(u_{{{\text{nom}}}} = 0.5e^{ - 5} m^{3} s^{ - 1}\).

$$ \dot{h}_{1} = \frac{{q_{1} }}{{A_{1} }} - c_{12} \sqrt {h_{1} - h_{2} } $$
(32)
$$ \dot{h}_{2} = c_{12} \sqrt {h_{1} - h_{2} } - c_{23} \sqrt {h_{2} - h_{3} } $$
(33)
$$ \dot{h}_{3} = c_{23} \sqrt {h_{2} - h_{3} } - c_{3} \sqrt {h_{3} } $$
(34)
$$ y_{{{\text{True}}}} = h_{3} $$
(35)

The step response for the tank level \(y_{{{\text{True}}}} (t)\) from Eq. (34) is compared with that \(y_{{{\text{pred}}}} (t)\) obtained from the predicted models identified by kSINDYc, N3ARX and NL2SQ where the training dataset \(Z^{{N\;{\text{train}}}}\) is presented in Fig. 2.The nonlinearity of the three-tank using CANM adheres to The three-tank process has a large settling time of around 5000 s with a weak nonlinear behavior.

Fig. 2
figure 2

True and learned model response \(h_{3}\) of three tank process for step input at \(u_{{{\text{nom}}}} = 0.5e^{ - 5} {\text{m}}^{{3}} {\text{s}}^{{ - {1}}}\)

A \(\pm 10\%\) variation in feed flow rate from \(u_{{{\text{nom}}}}\) also termed as \(u_{{{\text{prbs}}}}\) is adopted (through PRBS mode) to check the open loop response of tank level \(h_{3}\) in Fig. 3.The height of \(h_{3}\) swirls over a level band between \((0 - 1.5){\text{m}}\) when subjected to \(u_{{{\text{prbs}}}}\) unlike the response to \(u_{{{\text{nom}}}}\) that attains the steady state at \(h_{3} = 0.7{\text{m}}\). The system suffers from mild nonlinearity where the value of \(\Delta_{0} < 0.3\) using CANM. It has been observed that NL2SQ identification approach outperforms kSINDYc in predicting the dynamics of \(y_{{{\text{True}}}} = h_{3}\) for a sluggish nonlinear system like three-tank process.

Fig. 3
figure 3

True and learned model response of three-tank process for input at \(u_{{{\text{prbs}}}}\)

Example 2: CSTR

An exothermal, continuous stirred tank reactor (CSTR) is widely used to convert reactants to products \((A \to P)\). The reactor suffers from operational difficulties like complex behavior, output multiplicity (as it shows multiple steady states), oscillations and chaos due to its nonlinear dynamics. Here we consider a uniformly mixed CSTR which undergoes a single irreversible, exothermic reaction. Rate of reaction and heat transfer from heating media to reactor wall impose nonlinearity to the system. Sometimes, due to economic reason, the reactor is preferred to be operated at an unstable steady state. The rate of heat generation and rate of heat removal should be balanced by rate of cooling for efficient control so that dynamic disturbances can be safely handled. The standard state variable representation of the reactor is given in Eqs. (36, 37). The coolant flow rate \(q_{c} = u_{{{\text{nom}}}}\) is considered as manipulated input and temperature of the reactor \(T\) is the output variable. The states are concentration of reactants \(C_{a}\) and temperature of reactor \(T\). The nominal operating data for the reaction is available in [44]. The initial states and steady state points of the Concentration gradient \(C_{a}\) of the species A and the effluent Temperature of the reactor \(T\) are assumed to be the same where \((C_{{\text{a nom}}} ,T_{{{\text{nom}}}} ) = (0.08235\;{\text{mol}}\;{\text{l}}^{{ - 1}} {,441}{\text{.81K}})\).The open loop study obtained for a nominal input \(u_{{{\text{nom}}}} = 102\;l\min^{ - 1}\) can be viewed from Fig. 4.

$$ \dot{C}_{a} = \frac{q}{V}\left( {C_{af} - C_{a} } \right) - k_{0} C_{a} \exp \left( { - \frac{E}{{{\text{RT}}}}} \right) $$
(36)
$$ \dot{T} = \frac{q}{V}\left( {T_{f} - T} \right) + \frac{{\left( { - \Delta H} \right)k_{0} C_{a} }}{{\rho C_{p} }}\exp \left( { - \frac{E}{{{\text{RT}}}}} \right) + \frac{{\rho_{c} C_{pc} }}{{\rho C_{p} V}}q_{c} \left[ {1 - \exp \left( { - \frac{hA}{{q_{c} \rho_{c} C_{pc} }}} \right)} \right]\left( {T_{c} - T} \right) $$
(37)
$$ y_{{{\text{True}}}} = T $$
(38)
Fig. 4
figure 4

True and learned model response of CSTR process for step input \(u_{{{\text{nom}}}}\)

By carefully observing Eqs. (36) and (37), we can clearly understand that the activation energy level \(E\) has an effect on rate-constant of reaction which further influences the outputs of the CSTR, and depends upon the operating conditions and mechanism of species \(A\) undergoing the reaction \((A \to P)\). Therefore, the key term for the kSINDYc identification method in an exothermal CSTR appears in the term \(\exp \left( { - \frac{E}{{{\text{RT}}}}} \right)\).

The open loop (temperature) responses, with the jacketed-coolant flowrate at \(u_{{{\text{nom}}}} = 102\;l\min^{ - 1}\) of the CSTR studied in Fig. 4, ensures that both kSINDYc and NL2SQ expedite the process dynamics more accurately. Moreover, the N3ARX method shows notable deviations in predicted temperature (\(y_{{{\text{pred}}}} (t)\)) from the true reactor temperature \(y_{act} (t)\) where accuracy falls down with a value of RMSE = 3.2382. The input PRBS region, \(u_{{{\text{prbs}}}} \in [90,110]\;{\text{l}}\;\min^{ - 1}[90,110]\)\(\;{\text{l}}\;\min^{ - 1}\) is near the vicinity of the steady state point \(u_{{{\text{nom}}}}\). The results are obtained with respect to reactor temperature, \(T(K)\). The open loop simulations for \(\pm 10\% \;u_{{{\text{nom}}}}\) type of changes on the coolant flow rate at \(u_{{{\text{prbs}}}} = (90 - 110)\,{\text{l}}\min^{ - 1}\) are presented in Fig. 5. These graphs, showing outlet temperatures, in Fig. 4 and Fig. 5 prove that the dynamic characteristics of CSTR undergo wide variations when it is operated at input regions \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\). Identification using N3ARX at \(u_{{{\text{nom}}}}\) resulted in a small offset error from the steady state at \(y_{{{\text{True}}}} = 440.31\;{\text{K}}\) with \(y_{{{\text{N3ARX}}}} = 440.22K\). The learned dynamics from kSINDYc \((y_{{{\text{kSINDYc}}}} )\) and NL2SQ \((y_{{{\text{NL2SQ}}}} )\) identification method outperformed N3ARX for the medium nonlinear CSTR process \((0.3 < \Delta_{0} \le 0.7)\) by tracking the steady state at \(y_{{{\text{True}}}}\).

Fig. 5
figure 5

True and learned model response of CSTR process for input \(u_{{{\text{prbs}}}}\)

Example 3: Heat exchanger

A Heat Exchanger (HE) is a device where a cold fluid is heated by another hot stream mostly by convection principle. Recently, first principle modeling of a heat exchanger for a high temperature milk pasteurization unit was enumerated using log mean temperature difference approach [45]. In our research, a nonlinear physical model of a fluid–fluid HE processes is detailed is adopted from [46]. Here, we consider the outlet temperature of the process fluid \(T_{{{\text{po}}}}\) as the controlled variable and flow rate \(F_{c}\) of the heating fluid as the manipulated variable. The operating conditions and parameter of the heat exchanger are acquired from [46]. The steady state values and the initial states \((T_{{{\text{co\_nom}}}} ,T_{{{\text{po\_nom}}}} )\) are found to be \((115,\,150)\;^\circ {\text{F}}\). The nonlinear material balance equations of the process are given in Eqs. (39, 40).

$$ \dot{T}_{{{\text{co}}}} = \frac{2}{{M_{c} }}\left[ {F_{c} \left( {T_{{{\text{ci}}}} - T_{{{\text{co}}}} } \right) - \left( {UA\Delta T_{{{\text{lm}}}} /C_{{{\text{pp}}}} } \right)} \right] $$
(39)
$$ \dot{T}_{{{\text{po}}}} = \frac{2}{{M_{p} }}\left[ {F_{p} \left( {T_{{{\text{pi}}}} - T_{{{\text{po}}}} } \right) - \left( {UA\Delta T_{{{\text{lm}}}} /C_{{{\text{pp}}}} } \right)} \right] $$
(40)
$$ \Delta T_{{{\text{lm}}}} = \frac{{\left( {T_{{{\text{po}}}} - T_{{{\text{ci}}}} } \right) - \left( {T_{{{\text{pi}}}} - T_{{{\text{co}}}} } \right)}}{{\log \left( {T_{{{\text{po}}}} - T_{{{\text{ci}}}} } \right) - \log \left( {T_{{{\text{pi}}}} - T_{{{\text{co}}}} } \right)}} $$
(41)

\(\Delta T_{{{\text{lm}}}}\) is the logarithmic mean temperature difference of the Heat exchanger system.

$$ y_{{{\text{True}}}} = T_{{{\text{po}}}} $$
(42)

The process fluid temperature from the Heat Exchanger \(y_{{{\text{True}}}} (t) = T_{{{\text{po}}}} (t)\) also called, outlet fluid temperature depicted in Fig. 6, reveals that HE model is highly nonlinear where \(T_{{{\text{po}}}}\) values drop drastically from \(150^\circ {\text{F}}\) to a new steady state at \(T_{{{\text{po}}}} = 44.62^\circ {\text{F}}\) at \(u_{{{\text{nom}}}}\). The predicted response of the fluid temperature \(y_{{{\text{pred}}}} (t)\) of kSINDYc method reached the steady state surpassing other identification schemes for a highly nonlinear heat exchanger at \(u_{{{\text{nom}}}} = 40\;{\text{lbm}}\;{\text{m}}in^{ - 1}\). The output variable \(T_{po}\) is plotted for all the three methods kSINDYc, NL2SQ and N3ARX in Fig. 7 when the excitation signal is \(u_{{{\text{prbs}}}} \in \left[ {36,44} \right]\;{\text{lbm}}\;{\text{m}}in^{ - 1}\). The measured temperature \(y_{{{\text{True}}}}\) for PRBS excitation displays a steady state at \(y_{{{\text{True}}}} = 44.7^\circ {\text{F}}\) which is relatively closer to the response at \(u_{{{\text{nom}}}}\). The predicted response of \(y_{{{\text{kSINDYc}}}}\) and \(y_{NL2SQ}\) follows \(y_{{{\text{True}}}}\) accurately compared to \(y_{N3ARX}\) when excited at \(u_{{{\text{prbs}}}}\). The high level of nonlinearity \(\Delta_{0} > 0.7\) for step and PRBS inputs of the Heat Exchanger can be noticed from Tables 5 and 6.

Fig. 6
figure 6

True and learned model temperature responses \(T_{{{\text{po}}}}\) of heat exchanger process for step input at \(u_{{{\text{nom}}}}\)

Fig. 7
figure 7

True and learned model response of heat exchanger process for step input at \(u_{{{\text{prbs}}}}\)

Example 4: Bio reactor

A bioreactor otherwise called a fermenter, a special type of heterogeneous reactor, is an essential automated system used in food processing and pharmaceutical industries. A fed-batch reactor with the manipulated input of dilution rate \(D\) and the process output, biomass concentration \(X\) is adopted from [47]. The mass balance equations representing the kinetic model of the bioreactor are given in Eqs. (4345). At high substrate concentration, \(S\), rate of product formation is independent of \(S\) due to limited amount of enzyme; at low substrate concentration, the rate of product formation becomes proportional to \(S\) and follows first-order kinetics. Fermenters generally produce heat respiration and maintenance of bio-chemical pathways by microbes. Control becomes essential in large scale installations. However, lack in proper knowledge behind kinetic pathways, calculation of cooling, aeration, pH, and agitations need attention. Here growth rate \((\mu_{\max } )\), yield factor \((Y_{XS} )\), rate constant for conversion of substrate to product \((K_{m} )\) and rate of inhibition \((K_{1} )\) are the vital process parameters. The manipulated input \(D\) occupies the region \([0,0.6]\,{\text{hr}}^{{ - 1}}\). The initial values and nominal (operating) points of the state variables are \((X_{{{\text{nom}}}} ,S_{{{\text{nom}}}} ) = (1.530,0.174)\;{\text{g}}\;{\text{l}}^{{ - {1}}}\). The density of microbial cells also called biomass concentration \(X\) of any microorganism grows by consuming the substrate \(S\) fed to it.

$$ \dot{S} = \frac{ - 1}{{Y_{XS} }}\mu \left( S \right)X + D\left( {S_{{{\text{in}}}} - S} \right) $$
(43)
$$ \dot{X} = (\mu (S) - D)X $$
(44)

where a Haldane type of specific growth rate is given by

$$ \mu (S) = \mu_{\max } \frac{S}{{\left( {S + K_{m} + K_{1} S^{2} } \right)}} $$
(45)
$$ y_{{{\text{True}}}} = X $$
(46)

The nonlinearity of the bioreactor varies w.r.t the specific growth rate \(\mu (S)\), the type of excitation given \((u)\), initial states of \((S,\,X)\) and the operating region of dilution rate \((u = D)\).Therefore a bioreactor can be contemplated as a very sensitive nonlinear system, subjected to the above actors. The Bioreactor is highly sensitive, whose nonlinearity may switch from mild to medium based on the excitation input \(u = D\).The input constraints of the Bioreactor lies in the range \(u = (0.1 - 0.3){\text{hr}}^{{ - {1}}}\). For dilution rate \(D < 0.3\), the Bioreactor system remains in the mild nonlinear category. The step input at \(D_{c} = 0.3\) is called critical dilution rate where the biomass concentration \(\left( X \right)\) disappears, and the system becomes unstable [46]. In this example, we restrict our analysis with the operating point \(P\) at \(D = 0.27\), as the microbial cell growth gets affected beyond \(D_{c}\).

Figure 8 represents the response \(y_{{{\text{True}}}} = X({\text{g}}\;{\text{l}}^{{ - 1}} )\) at steady state, when the dilution rate is operated at \(u_{{{\text{nom}}}} = 0.27\,{\text{hr}}^{ - 1}\). The bioreactor is operated at the stable operating point \(u_{{{\text{nom}}}}\). It can be observed that the predicted response generated by NL2SQ,N3ARX and kSINDYc methods, show a pattern that follows the nonlinear dynamics of the bioreactor very accurately for \(y_{{{\text{pred}}}} = \left\{ {y_{{{\text{kSINDYc}}}} ,y_{{{\text{NL2SQ}}}} ,y_{{{\text{N3ARX}}}} } \right\}\) with the steady state of biomass concentration \((X)\) at \(u_{{{\text{nom}}}} = 0.27\;{\text{hr}}^{ - 1}\) to settle at \(y_{{{\text{True}}}} (t) = 1.547\;{\text{g}}\;{\text{l}}^{ - 1}\). The response of \(X\;{\text{(g/litre)}}\) due to PRBS input which has a feed flow \(u_{{{\text{prbs}}}} = \pm 10\% \;u_{{{\text{nom}}}}\) is portrayed in Fig. 9. It can be noticed that the three methods of identification \((y_{{{\text{kSINDYc}}}} ,y_{{{\text{NL2SQ}}}} ,y_{{{\text{N3ARX}}}} )\) showed excellent tracking of the biomass-concentration with very sharp variations when excited with PRBS signal also. However, the evaluation criteria RMSE, is very small for \(y_{{{\text{N3ARX}}}}\) than kSINDYc and NL2SQ identification both for \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\) excitations. The N3ARX identification is also computationally faster compared to the other two methods for the bioreactor, which exhibits mild nonlineatity of \(\Delta_{0} < 0.3\) when operated below the critical dilution rate \(u < 0.3\;{\text{hr}}^{{ - {1}}}\).

Fig. 8
figure 8

True and learned model response of bioreactor process for input \(u_{{{\text{nom}}}}\)

Fig. 9
figure 9

True and learned model response \(X({\text{g/litre}})\) of bioreactor process for input \(u_{{{\text{prbs}}}}\)

Example 5: Distillation column

A 9 stage (\(n_{s} = 9\)) binary Distillation Column (DC), to separate methanol–water mixture, operated in the LV (liquid–vapour) configuration with the manipulated variable as reflux rate to the column \(u_{{{\text{nom}}}} = 2.704\,{\text{kmol}}\,{\text{min}}^{{ - {1}}}\) is taken for the study from [47]. The distillate composition \(x_{D}\) (mole fraction) which is the top most product is the output variable \(y\). The feed mixture containing 50% Methanol has to be rectified continuously to 98% purity. The common problems are vapor cross-flow channeling, foaming and unaccounted interactions. The presence of many state-variables and process parameters make the simulation of DC model more complex. Accordingly, certain process assumptions are made as follows for an easier analysis: A uniform binary mixture with constant pressure, no vapor holdup, and constant relative volatility, on all stages are considered. The ordinary differential equations governing the DC are Eqs. (4751).

Condenser Stage:

$$ \dot{x}_{1} = \frac{1}{{M_{D} }}\left[ {V_{R} \left( {y_{2} - y_{1} } \right)} \right] $$
(47)

Rectifying section above feed tray (i=2 to NF-1)

$$ \dot{x}_{i} = \frac{1}{{M_{T} }}\left[ {L_{R} x_{i - 1} + V_{R} y_{i + 1} - L_{R} x_{i} - V_{R} y_{i} } \right] $$
(48)

Feed stage (NF)

$$ \dot{x}_{NF} = \frac{1}{{M_{T} }}\left[ {L_{R} x_{NF - 1} + V_{R} y_{NF + 1} + Fz_{F} - L_{S} x_{NF} - V_{R} y_{NF} } \right] $$
(49)

Rectifying section below feed tray (i=NF+1 to NS-1)

$$ \dot{x}_{i} = \frac{1}{{M_{T} }}\left[ {L_{S} x_{i - 1} + V_{S} y_{i + 1} - L_{S} x_{i} - V_{S} y_{i} } \right] $$
(50)

Re-boiler Stage

$$ \dot{x}_{NS} = \frac{1}{{M_{B} }}\left[ {L_{S} x_{NS - 1} - Bx_{NS} - V_{S} y_{NS} } \right] $$
(51)
$$ y_{{{\text{True}}}} = x_{D} $$
(52)

The DC model when operated in a wider operating region instead of a fixed input at \(u{}_{nom}\) imparts a massive nonlinear phenomenon and does not suit the normal operation. Therefore, the initial conditions of distillate \(x_{{\text{D}}}\) and bottoms composition \(x_{{\text{B}}}\) are carefully chosen to be \((0.005,1.05)\). The nominal operating values so obtained are \((x_{{{\text{Dnom}}}} ,x_{{{\text{Bnom}}}} ) = (0.775,0.2225)\). The time response of \(x_{D}\) in Fig. 10 is noticed when the DC is operated at reflux rate \(u_{{{\text{nom}}}} = 2.704\,{\text{kmol}}\,\min^{ - 1}\).Fig. 11 flaunts the time response of \(x_{{\text{D}}}\) when excited in the PRBS range \(u_{{{\text{prbs}}}} \in (2.43,3)\,{\text{kmol}}\,\min^{ - 1}\). Except for N3ARX method, the learned models \(y_{{{\text{pred}}}} (t)\) from kSINDYc and NL2SQ approaches bear a very close follow up to the true output of the distillate composition \(y_{{{\text{True}}}} = x_{D} = 0.775\,\). The DC exhibits medium nonlinearity using CANM method with \(\Delta_{0}\) in the range \(0.3 < \Delta_{0} \le 0.7\) when excited by both \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\).

Fig. 10
figure 10

True and learned model response \(x_{D}\) of distillation column process for step input at \(u_{{{\text{nom}}}}\)

Fig. 11
figure 11

True and learned model response \(x_{D}\) of distillation column process for step input at \(u_{{{\text{prbs}}}}\)

4.1 Selection of key nonlinear terms \(k_{{{\text{nl}}}}\)

The key nonlinear term \(k_{{{\text{nl}}}}\) in every system taken for the study is the predominant nonlinear function that determines the nonlinear dynamics of the physical system. In each Example, the dynamic equations have some mathematical terms that will have the most significant influence on the system’s behavior. This could be found out by careful examination of the Ordinary Differential Equation (ODE) of every Example from (Eqs. 3252). Nonlinear terms typically include products, powers, trigonometric functions, exponential functions, and other nonlinear operations. In the examples considered for the present study, the significant terms in each ODE equation that involve nonlinear functions of the dependent variable(s) or their derivatives are carefully observed based on prior knowledge of the system. These terms often have larger coefficients or play a central role in the dynamics. Alternatively, numerical simulation tools can also be adopted (e.g., differential equation solvers) to simulate the dynamic response of the nonlinear system to different inputs and initial conditions. In such cases, the transient behavior of all examples has to be carefully analyzed for any observation of any nonlinear effects, oscillations, or stability issues. Moreover, interest readers may refer Global Sensitivity Analysis (GSA) methods reported in [50] for sorting out \(k_{{{\text{nl}}}}\) of complex nonlinear systems. Applying GSA will definitely assess how changes in various parameters (e.g., reaction rate constants, flow rates, temperature) affect the system’s response. Furthermore, this method will also reveal the predominant terms that have the most significant impact on the system’s nonlinear behavior. Table 3 shows the kSINDYc library terms obtained for all the five examples elaborated in Sect. 4. kSINDYc is an empirical data driven method, which needs only the measured input and output data to identify any nonlinear system. Even if kSINDYc is data driven identification, using the sparse regression and key nonlinearity terms \(\left( {k_{{{\text{nl}}}} } \right)\) in its library function, the governing equation can be found out along with the parameters.

Table 3 Key term-based SINDYc (kSINDYc)

The parametric coefficients of each example can be found out after solving Algorithm A1 in iterative steps. Table 3 summarizes the basis terms to be carefully placed inside the candidate library for all the five case studies. The nonlinear function of each process is decided by the rudimentary key term \(k_{{{\text{nl}}}}\). The candidate library \(\Theta\) has relatively a fewer functional terms. \(k_{{{\text{nl}}}}\) term is chosen carefully by checking the influencing terms from the dynamic equations of every process. The active terms in the dynamics from the library \(\Theta (X,U)\) are identified using the sparse regression algorithm defined in Eq. (17). It is evident that compared to SINDYc identification, the proposed kSINDYc scheme requires only a smaller number of relevant key terms in the candidate library, thereby reducing the number of parameters required to identify the nonlinear system. Using the candidate library terms of kSINDYc in Table 3, every nonlinear process can be identified using higher-order polynomial equations. Moreover, introducing the key nonlinear terms in the candidate library function of kSINDYc is intended to build models of dynamic physical systems with diverse nonlinear behavior. It is always interesting to observe how the dynamics of nonlinear system changes in response to different excitations/test signals.

4.2 Performance evaluation

In our proposed framework we intend to validate the performance of kSINDYc, NL2SQ and N3ARX for all examples subjected to step and PRBS excitation. By looking at Table 4, a major conclusion can be brought over in the concept of data driven modeling. A careful observation of the \(y_{{{\text{pred}}}}\) at \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\) in all the examples signify that N3ARX does not mimic the system dynamics accurately for CSTR and Heat exchanger process. This inspection ensures that N3ARX identification can be adopted for systems with mild nonlinearity. However, it predicts the PRBS response of all the examples with an acceptable RMSE. Furthermore, the identified model using kSINDYc and NL2SQ under \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\), predicted the nonlinear dynamics of all the examples with high accuracy. The predicted model accuracy of each identification method (kSINDYc, NL2SQ, N3ARX) are validated using the performance Index RMSE. By inspecting the level of nonlinearity using CANM, a user can flexibly choose the appropriate identification method among the three methods investigated in the study. To emphasize this point, in all the five case studies, the RMSE at \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\) are summarized in Table 3 and \(\Delta_{0}\) computed using CANM are graphically related using scatter plot in Figs. 12 and 13.The quantitative analysis in Figs. 12 and 13 are very important graphical representations that relates the nonlinearity of each system with the three nonlinear system identification methods in terms of performance evaluation criteria.

$$ {\text{RMSE}} = \sqrt {\frac{1}{N}\sum\limits_{i = 1}^{N} {(y_{{{\text{True}}}} (t) - y_{{{\text{pred}}}} (t))^{2} } } $$
(53)
Table 4 Comparison of RMSE for System identification using kSINDYc, N3ARX and NL2SQ methods
Fig. 12
figure 12

RMSE of learned models of all systems with its metric \(\Delta_{0}\) at input \(u_{{{\text{nom}}}}\)

Fig. 13
figure 13

RMSE of learned models of all systems with its metric \(\Delta_{0}\) at input \(u_{{{\text{prbs}}}}\)

The RMSE for \(N\) samples are found from Eq. (53) for all the dynamic physical systems described in this Sect. As seen from Figs.12 and 13, the crucial factor in determining the choice of system identification rests on the minimum RMSE between the actual and predicted dynamics of the identified model among all the three identified models \(\left\{ {y_{{{\text{pred}}}} = y_{{{\text{kSINDYc}}}} ,y_{{{\text{NL2SQ}}}} ,y_{{{\text{N3ARX}}}} } \right\}\). A graphical comparison is made between the RMSE of all the nonlinear system identification methods for \(u_{{{\text{nom}}}}\) is given in Fig. 12. The scatter plot in Fig. 13 maps the RMSE with \(\Delta_{0}\) for nonlinear systems excited at \(u_{{{\text{prbs}}}}\). The identification method which gives least RMSE under each class of \(\Delta_{0}\) imply a better estimate on all basis and is exclusively picked up for the accurate choice of system identification.

CANM is an operating point dependent nonlinear metric. All the five examples examined in this manuscript, are continuous processes working at a stable steady state operating point (xas, xbs). The effect of nonlinearity \(\Delta_{0}\) around the vicinity of the fixed point (xas, xbs) are carefully investigated in this section. The usage of NL2SQ method is preferred only for mild systems like three tank process and Bioreactor. Diversely, kSINDYc is the best opted non-parametric model for highly nonlinear processes like Heat Exchanger. A low value of RMSE ensures that \(y_{{{\text{pred}}}}\) is very close to \(y_{{{\text{True}}}} (t)\) capturing the underlying patterns and the nonlinear dynamics with high precision. In an application like DC, RMSE is very low in the order of \(10^{ - 4}\) or lesser in the predicted models kSINDYc and NL2SQ. The RMSE precision of all the learned models in the Bioreactor implies that even though the predicted data has high fidelity, the model is highly sensitive to noise that leads to overfitting of \(y_{{{\text{True}}}} (t)\). By using this combined framework, any user can find out the conducive system identification method based on the nonlinearity in the desired operating region. CANM is an operating point dependent nonlinear metric.

The physical quantities of each process addressed in Tables 5 and 6, have different orders of magnitude. To sustain a uniform scale in measuring the nonlinearity, the time period \(t\), input \(u\) and output variable \(y_{{{\text{True}}}} (t)\) of all nonlinear physical process are normalized between 0 and 1, and thereafter the CANM method \(\Delta_{0}\) is intended from Eq. (2). The performance index (RMSE) of all the system identification methods are correlated along with the degree of nonlinearity \(\Delta_{0}\) of each nonlinear system and the outcomes are enumerated. To verify the impact of excitation inputs on \(\Delta_{0}\), \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\) are applied to all the examples. Even though the \(\Delta_{0}\) values were different for \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\) signals operated at operating point \(P\), the class of nonlinearity (mild, medium, high) will remained the same. This perception reveals an important remark that irrespective of the excitation inputs, the class of nonlinearity will not change but remains the same using CANM. Tables 5 and 6 will provide the inference made on the choice of the appropriate system identification method, on the basis of \(\Delta_{0}\) and the RMSE of every Example. The special features of kSINDYc method lies in its concrete algorithm to develop an exact model even with limited number of samples, without involving in under fitting or over fitting of highly nonlinear systems. kSINDYc proves to be a better choice for system identification in high nonlinear process whereas the mild nonlinear systems can follow N3ARX method and medium nonlinear units can adopt NL2SQ to learn the process dynamics as observed from Tables 5 and 6. The effectiveness of N3ARX approach trails behind kSINDYc and NL2SQ approaches in terms of RMSE and execution time. From the detailed system identification analysis carried out on all case studies, some assertive conclusive remarks are presented below over the selection of appropriate identification method based on degree of nonlinearity \(\Delta_{0}\).

Table 5 System identification from \(\Delta_{0}\) for excitation input \(u_{{{\text{nom}}}}\)
Table 6 System identification from \(\Delta_{0}\) for excitation input \(u_{{{\text{prbs}}}}\)

Remark 1

N3ARX method requires large number of training data set to provide accurate solution. The learned dynamics using this approach did not meet the acceptable limit at \(u_{{{\text{nom}}}}\).Consequently, it can be used for systems with broader range of excitation signals \(u_{{{\text{prbs}}}}\). The prime hindrance of N3ARX method compared to NL2SQ and kSINDYc schemes is its long computation time in MATLAB.

Remark 2

The learned dynamics using NL2SQ is satisfactory for mild and medium nonlinear systems. As it is a parametric identification scheme it requires a healthy knowledge of the process parameters and the nominal operating regions. However, this approach crashes to identify the process models with less measured I-O data.

Remark 3

kSINDYc is computationally attractive, requires less data, assumes a few numbers of candidate terms in \(\Theta\) to make an interpretable efficient model at \(u_{{{\text{nom}}}}\) and \(u_{{{\text{prbs}}}}\). The method outstrips NL2SQ and N3ARX by accurately following the plant dynamics of highly nonlinear systems.

The presented adaptation of these advisable system identification methods is therefore considered important for all users who are interested in finding an interpretable, identification method for complex and diverse nonlinear systems. The use of the proposed kSINDYc identification necessitates the knowledge of the system dynamics to acquire the key term \(k_{{{\text{nl}}}}\) which appear to be a hindrance in nonlinear systems whose governing equations are completely unknown and unpredictable. In such cases, SINDy algorithm performs the same task without involving the system dynamics in the form of ODEs.

5 Conclusion

Selection of an appropriate identification method is very decisive for any complex nonlinear system. The proposed framework ‘System identification in coherence with nonlinearity measure’ indisputably accomplishes this task by mathematically relating the nonlinearity level with the applicable identification method. An integrated framework comprising three identification methods (kSINDYc, N3ARX and NL2SQ) and a nonlinearity measure called CANM is devised in this research study. In particular, the proposed data driven kSINDYc scheme, identifies nonlinear processes under the sparse dimensional space using key nonlinear terms in its candidate library. A notable development is made in the ‘kSINDYc candidate library’, by introducing the ‘key nonlinear terms’ from the plant dynamics, along with the polynomial terms. The kSINDYc identification uses a sparcification knob \(\lambda\) set between 0 and 10, to identify nonlinear dynamics of five physical systems with divergent nonlinear strengths. The method surmounts NL2SQ and N3ARX by meticulously adopting the process dynamics of highly nonlinear processes. Additionally, this research comprehends the nonlinear metric CANM that targets to find out the degree of nonlinearity between 0 and 1 and subclassifies the five examples to fit in mild, medium or highly nonlinear category. This article exemplifies a contemporary quantitative analysis that correlates the nonlinear metric with the system identification schemes. The proposed framework is tested for five nonlinear systems with diverse nonlinear strengths. This article differs from the existing literature by providing a configuration for suitable system identification from the three methods based on the computed \(\Delta_{0}\) using CANM metric.

6 Scope

The nonlinear systems considered in this study, are classified as mild, medium and highly nonlinear using CANM method. However, the measure may be deficient, when the process is operated in a region far beyond the operating point. Such deprivation issues have to be addressed in the sequel while measuring nonlinearity. Extending the proposed framework to more complex MIMO process structures should be carried out without losing the dynamic behavior of the system. The choice of nonlinear control schemes based on the computation of \(\Delta_{0}\) is another decisive study that has to be devised in the future research. Conclusively, this research study will be definitely instrumental for the researchers and academicians of nonlinear dynamics community but needs to be further tested in real-world physical systems.