Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors

Zhu, Rui; Chen, Sufang; Jiang, Dong; Xie, Shitao; Ma, Lei; Marchesiello, Stefano; Anastasio, Dario

doi:10.1007/s42417-023-01030-3

Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors

Original Paper
Published: 07 June 2023

Volume 12, pages 3021–3031, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Vibration Engineering & Technologies Aims and scope Submit manuscript

Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors

Download PDF

Rui Zhu¹,
Sufang Chen²,
Dong Jiang³,
Shitao Xie³,
Lei Ma⁴,
Stefano Marchesiello⁵ &
…
Dario Anastasio⁵

221 Accesses
Explore all metrics

Abstract

Purpose

Nonlinear system identification heavily relies on the accuracy of nonlinear unit model selection. To improve identification accuracy, the Sparse Bayesian Learning method is incorporated into the nonlinear subspace. Enhanced nonlinear subspace identification is proposed.

Methods

The nonlinear term in the system is treated as an internal excitation. By applying low-level excitation, the response of the structure can be approximated as linear, allowing for the determination of the linear frequency response function of the structure. High-level excitation is then applied to separate the response caused by intrinsic nonlinear force excitation. The type of nonlinearity is evaluated using Spike-and-Slab Priors for Sparse Bayesian Learning. Finally, the screened nonlinear elements are substituted into subspace identification to determine nonlinear parameters.

Results

The effectiveness of this method in dealing with nonlinear stiffness and damping is verified through a simulation example and its robustness is further discussed. Experiments on negative stiffness systems also demonstrate the method's good applicability when dealing with complex damping.

Conclusion

Incorporating the Sparse Bayesian Learning method into the nonlinear subspace significantly improves the accuracy of nonlinear system identification. The proposed approach effectively deals with nonlinear stiffness and damping, as validated by simulation results. The method's robustness is further demonstrated through extensive discussions, while experiments on negative stiffness systems showcase its applicability in complex damping scenarios.

Integrating automatic order determination with response prediction error minimization for nonlinear subspace identification in structural dynamics

Article 04 September 2024

A modified time domain subspace method for nonlinear identification based on nonlinear separation strategy

Article 09 August 2018

Dynamic Characterization of Structures from Limited Measurements Using a Subspace System Identification Method

Article 18 November 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Nonlinear system identification is the process of determining the mathematical model of a nonlinear system from input − output data [1, 2]. The goal is to find a mathematical representation of the system that accurately describes its behavior over a range of input conditions [3]. The identified model can then be used for a variety of purposes, such as control, prediction, or simulation of the system’s behavior [4].

The process of nonlinear system identification involves three key steps: detection, characterization, and parameter estimation [5,6,7]. Detection is the first step in building a structural model with accurate predictions [8, 9]. When designing and analyzing a structure, it is essential to consider its nonlinear dynamic characteristics for performance and reliability. Once nonlinear detection is performed, the next step is to determine the mathematical form of the nonlinearity, also known as model selection [10]. Finally, parameter estimation [11, 12] is the final step in creating a structural model with good predictive accuracy. The nonlinear subspace identification (NSI) approach proposed by Marchesiello et al. [13] has opened new possibilities for identifying nonlinear mechanical systems due to its robustness. Further, a negative stiffness oscillator is modeled and tested to exploit its nonlinear dynamical characteristics [14]. However, the model selection remains a crucial step in the identification process, as it is necessary to test the accuracy of candidate models to select the appropriate one. There is currently no universal method for selecting a suitable nonlinear model, and it’s often selected based on the characteristics of the specific identification problem. Improper selection of identification parameters often leads to multiple different models, resulting in an incomplete description of the structure. To find a reasonable model, methods for model selection must be developed to evaluate the model and its parameters. A hybrid nonlinear identification approach [15] is proposed by integrating the restoring force surface (RFS) [16] method and the NSI method. The distinct feature of the presented method is that it has the excellent characterization ability of the RFS and the recognition validity and robustness of the subspace algorithm. Bayesian model selection methods [17] have been developed to evaluate the model and its parameters, taking into account uncertainty in the selection process. These methods can simultaneously consider the impact of multiple models on the prediction of structural response, automatically constrain complex models, and calculate the probability of selecting each model. Based on the evidence obtained, the most likely model can be chosen. Nayek et al. [18] present the use of spike-and-slab (SS) priors [19] for discovering governing differential equations of motion of nonlinear structural dynamic systems, where displacement, velocity, and acceleration response data need to be measured. The use of displacement differentiation or acceleration integration to estimate response data may lead to distorted results due to noise propagation, making it challenging to accurately identify true nonlinear terms.

In this paper, a Spike-and-Slab Priors for Sparse Bayesian Learning is introduced to solve the model selection problem in the nonlinear subspace identification method. Enhanced nonlinear subspace identification (ENSI) is proposed. The method treats nonlinearity as an internal excitation and separates the response caused by it. The type of nonlinearity is then evaluated using Spike-and-Slab Priors for Sparse Bayesian Learning. The selected nonlinear models are then used in subspace identification to determine nonlinear parameters. The paper is structured as follows: “Framework of Enhanced Nonlinear Subspace Identification” introduces the proposed method. “Separate Response Caused by Internal Nonlinear Force Excitation” covers the process of extracting underlying linear characteristics. “Using Spike-and-Slab Priors for Bayesian Basis function Selection” and “Nonlinear Subspace Identification” briefly explain the theory of Spike-and-Slab Priors and nonlinear subspace identification. The numerical simulation and robustness are discussed in “Numerical Simulation”. “Experimental Study” presents an experimental study. The paper concludes in “Conclusion”.

Theory

Framework of Enhanced Nonlinear Subspace Identification

The purpose of this method is to separate the response caused by the internal nonlinear force excitation from the nonlinear structural response, determine the nonlinear basis function model through the Bayesian regression method, and finally bring it into the nonlinear subspace method for parameter identification as shown in Fig. 1. The main steps are: (1) By applying low-level excitation, the response of the structure can be obtained, and the frequency response function can be obtained, from which the unit impulse response function can be obtained through the inverse Fourier transform to obtain the structural characteristics of the underlying system; Apply high-level excitation, to separate the response caused by the internal nonlinear force excitation from the response; (2) Construct a nonlinear unit function library and use Bayesian regression to complete the nonlinear characterization; (3) Substitute the nonlinearity into the nonlinear subspace algorithm for parameter identification.

Separate Response Caused by Internal Nonlinear Force Excitation

To describe the process, a single Dof system with nonlinearity is written by

$$m\ddot{x} + c\dot{x} + kx + f_{n} \left( t \right) = f\left( t \right)$$

(1)

where m is the mass matrix, c is the damping, k is the linear stiffness, f is the excitation, and f_n is the internal nonlinear force. The nonlinear force f_n can be shifted to the right, which can be regarded as an internal excitation to the underlying linear system.

$$\underbrace {{m\ddot{x} + c\dot{x} + kx}}_{{\text{Underlying linear part}}} = f\left( t \right) - f_{n} \left( t \right)$$

(2)

When the type of nonlinear function is unknown, the nonlinear restoring forces can be represented by selecting a complete set of candidate nonlinear functions.

$$\begin{aligned} f_{n} & = \left[ {\begin{array}{*{20}c} {G_{1} \left( t \right)} & {G_{2} \left( t \right)} & \cdots & {G_{m_{f}} \left( t \right)} \\ \end{array} } \right]\left\{ {\begin{array}{*{20}c} {\mu _{1} } & {\mu _{2} } & \cdots & {\mu _{m_{f}} } \\ \end{array} } \right\}^{T} \\ & = \left[ {\begin{array}{*{20}c} {G_{1} } & {G_{2} } & \cdots & {G_{m_{f}} } \\ \end{array} } \right]\mu \\ & = G\mu \\ \end{aligned}$$

(3)

where m_f is the number of candidate nonlinear functions, G is the library of nonlinear functions, and μ is the corresponding coefficient vector. Since the number of functions in the nonlinear library exceeds the number of real nonlinearities in the system, only a small number of nonlinearities are active, resulting in a sparse coefficient vector, μ.

When a nonlinear structure is subjected to a low-level excitation f_low, the effects of nonlinearities can be generally assumed to be minimal, leading to a response that approximates the linear characteristics of the structure [20]. Therefore, Eq. (2) can be rewritten by

$$m\ddot{x} + c\dot{x} + kx \approx f_{{{\text{low}}}} \left( t \right)$$

(4)

Based on the application of low-level excitation, the response of the structure approximates the linear characteristics of the structure, which allows the linear frequency response function H of the structure to be determined. The unit impulse response function h can then be obtained by performing an inverse Fourier transform [21].

$${\varvec{h}} = \frac{1}{2\pi }\int_{ - \infty }^{ + \infty } {{\varvec{H}}\left( \omega \right)e^{i\omega t} d\omega }$$

(5)

When a high-level excitation f_high is applied to the structure, the effects of intrinsic nonlinear forces cannot be ignored. The nonlinearity is treated as an additional force applied to the underlying linear structure. The total response of the structure is the result of the combined action of both linear and nonlinear forces. By using the Duhamel integral [22], the response can be expressed as:

$$x\left( t \right) = \underbrace {{h\left( t \right) \otimes f_{{{\text{high}}}} \left( t \right)}}_{{{\text{linear response }}x_{l} }} - h\left( t \right) \otimes f_{n} \left( t \right)$$

(6)

⊗ represents the convolution operation. The response of the structure due to the internal nonlinear force fn can be expressed as x_n, which is the result of a convolution operation between the unit impulse response function and the nonlinear force

$$x_{n} = h\left( t \right) \otimes f_{n} \left( t \right) = h\left( t \right) \otimes f_{{{\text{high}}}} \left( t \right) - x\left( t \right)$$

(7)

Substituting Eq. (3) into Eq. (7) formula, we can get

$$\begin{gathered} x_{n} = h\left( t \right) \otimes f_{n} \left( t \right) \\ = h\left( t \right) \otimes \left( {\left[ {\begin{array}{*{20}c} {G_{1} \left( t \right)} & {G_{2} \left( t \right)} & \cdots & {G_{m_{f}} \left( t \right)} \\ \end{array} } \right]\left\{ {\begin{array}{*{20}c} {\mu_{1} } & {\mu_{2} } & \cdots & {\mu_{m_{f}} } \\ \end{array} } \right\}^{T} } \right) \\ = \mu_{1} \underbrace {{h\left( t \right) \otimes G_{1} \left( t \right)}}_{{G_{1}^{^{\prime}} }} + \mu_{2} \underbrace {{h\left( t \right) \otimes G_{2} \left( t \right)}}_{{G_{2}^{^{\prime}} }} + \cdots + \mu_{m_{f}} \underbrace {{h\left( t \right) \otimes G_{m_{f}} \left( t \right)}}_{{G_{m_{f}}^{^{\prime}} }} \\ = G^{^{\prime}} \mu \\ \end{gathered}$$

(8)

This equation shows that nonlinear representation is a variable selection problem, which can be approached as a regression problem. To estimate the best nonlinear terms in the system, the Bayesian variable selection method based on the spike and slab prior can be used, by calculating the marginal posterior probability through Gibbs sampling [23].

Using Spike-and-Slab Priors for Bayesian Basis Function Selection

In linear regression problems with high-dimensional covariates, the goal is to find a sparse solution to the model by identifying the best subset of relevant covariates. This is done by removing irrelevant covariates and finding a solution that has only a few nonzero coefficients in the regression function.

In the Bayesian framework, variable selection can be achieved by using shrinkage priors. The Spike and slab prior is a common approach that assigns a latent variable to each nonlinear function in the library, indicating whether the function is active in the model. By estimating the posterior probabilities for all possible models and the marginal posterior probabilities for individual functions, the true nonlinear terms in the system can be identified. This approach is useful for high-dimensional covariates, as it tends to find sparse solutions to the model by removing irrelevant functions.

Consider a standard multiple linear regression equation:

$$Y = X\beta + \varepsilon$$

(9)

where Y = (Y₁,Y₂,…, Y_n)^T is the n-dimensional target vector, X = (X₁,X₂,…,X_n)^T is the n × p matrix of predictors, β = (β₁, β₂,…, β_p) is the vector of coefficients, and ε = (ε₁, ε₂,…, ε_n)^T is the error term. Assuming that the elements in the error vector ε are independently and identically distributed following a Gaussian distribution with a mean of 0 and a variance of σ², the likelihood function of Y can be expressed as

$${\varvec{Y}}|{\mathbf{X}},{{\varvec{\upbeta}}},\sigma^{2} \sim {\mathcal{N}}\left( {{\mathbf{X\beta }},\sigma^{2} {\mathbf{I}}} \right)$$

(10)

where represents the Gaussian distribution and I is the n × n-dimensional identity matrix.

According to the spike and slab prior method, each covariate in the multiple linear regression equation is assigned a latent binary variable, s = (s₁,s₂,…,s_p). This variable, s_i (where s_i ∈ {0, 1}), indicates whether the ith covariate is included in the model. When s_i = 0, the prior distribution of the corresponding regression coefficient (β_i) is a spike with a point peak at zero. On the other hand, when s_i = 1, the prior distribution of β_i is a slab with a flat density. In this method, the spike distribution is set to δ(β_i), where δ represents the Dirac function. The slab distribution is set to (0, τσ²). Therefore, the prior density of each component (β_i) in the coefficient vector β can be expressed as:

$$\beta_{i} |s_{i} \sim \left( {1 - s_{i} } \right)\delta \left( {\beta_{i} } \right) + s_{i} {\mathcal{N}}\left( {0,\tau \sigma^{2} } \right)$$

(11)

Define the following prior distributions for the other parameters:

$$s_{i} |p_{0} \sim {\text{Bernoulli}}\left( {p_{0} } \right)$$

(12)

$$p_{0} \sim {\text{Beta}}\left( {a_{p} ,b_{p} } \right)$$

(13)

$$\tau \sim {\text{Inverse}}\,{\text{Gamma}}\left( {a_{\tau } ,b_{\tau } } \right)$$

(14)

$$\sigma^{2} \sim {\text{Inverse}}\,{\text{Gamma}}\left( {a_{\sigma } ,b_{\sigma } } \right)$$

(15)

The prior distributions for the other parameters in the model are defined as follows: a_p and b_p for the latent binary variables, a_τ and b_τ for the slab density, and a_σ and b_σ for the variance of the error term. These hyperparameters control the shape of their respective priors.

Based on Bayes' theorem, the joint posterior distribution of the parameters can be expressed as:

$$\begin{gathered} p\left( {\beta ,s,p_{0} ,\tau ,\sigma^{2} |Y,X} \right) = \frac{{p\left( {Y|\beta ,\sigma^{2} } \right)p\left( {\beta |s,\tau ,\sigma^{2} } \right)p\left( {s|p_{0} } \right)p\left( {p_{0} } \right)p\left( \tau \right)p\left( {\sigma^{2} } \right)}}{p\left( Y \right)} \\ \propto p\left( {Y|\beta ,\sigma^{2} } \right)p\left( {\beta |s,\tau ,\sigma^{2} } \right)p\left( {s|p_{0} } \right)p\left( {p_{0} } \right)p\left( \tau \right)p\left( {\sigma^{2} } \right) \\ \end{gathered}$$

(16)

To solve this Bayesian inference problem involving multiple hyperparameters, Markov−Monte Carlo techniques are usually used to sample from the posterior. In this case, the Gibbs sampler is used to sample from the posterior distribution described by Eq. (16). The Gibbs sampler requires fully conditional distributions for all random variables, which can be analytically derived using conjugate priors. By using the Gibbs sampler, a Markov chain for the parameters β, σ², and s can be obtained, and a large number l of samples for variable selection can be obtained by discarding a small number of samples before the burn-in period. The sample of each component s_i in s indicates the selection of the variable; s_i = 1 indicates that the variable is selected and s_i = 0 indicates that the variable is discarded. Therefore, for a large number l of samples of s_i, the following marginal posterior probability can be defined:

$$p\left( {s_{i} = 1|{\mathbf{Y}},{\mathbf{X}}} \right) = \frac{1}{l}\sum\limits_{j = 1}^{l} {I_{f} \left( {s_{i}^{j} = 1} \right)}$$

(17)

where I_f is the indicator function. When the marginal posterior probability of a variable, p(s_i = 1|Y, X) > 0.5 is greater than 0.5, it should be considered for inclusion in the final model.

Nonlinear Subspace Identification

Based on the theory presented in “Using Spike-and-Slab Priors for Bayesian Basis Function Selection”, it is determined that the nonlinear basis functions of the end structure have a total of v items (v < m_f). The state-space equation at this time can be expressed as:

$$\left\{ {\begin{array}{*{20}c} {\dot{x}} \\ {\ddot{x}} \\ \end{array} } \right\} = \left[ {\begin{array}{*{20}c} 0 & 1 \\ { - \frac{k}{m}} & { - \frac{c}{m}} \\ \end{array} } \right]\left\{ {\begin{array}{*{20}c} x \\ {\dot{x}} \\ \end{array} } \right\}{ + }\left[ {\begin{array}{*{20}c} 0 & 0 & \cdots & 0 \\ {m^{ - 1} } & {m^{ - 1} \mu_{1} } & \cdots & {m^{ - 1} \mu_{v} } \\ \end{array} } \right]\left\{ {\begin{array}{*{20}c} {f\left( t \right)} \\ { - G_{1} \left( t \right)} \\ \vdots \\ { - G_{v} \left( t \right)} \\ \end{array} } \right\}$$

(18)

Based on the authors’ previous work about the nonlinear subspace identification [24], the “extended” frequency response function (FRF) matrix is expressed as

$$H_{E} \left( \omega \right) = \left[ {\begin{array}{*{20}c} H & {H\mu_{1} G_{1} } & \cdots & {H\mu_{q} G_{v} } \\ \end{array} } \right]$$

(19)

where H is the underlying linear system FRF and nonlinear parameters μ_i can be identified

Numerical Simulation

The system with nonlinear stiffness and damping is investigated. The system parameters are: m = 10 kg, k₁ = 800 N/m, c = 2 Ns/m, k₃ = 2 × 10⁵ N/m³, c_n = 20 Ns²/m². The dynamic equation is as follows

$$m\ddot{z} + c\dot{z} + k_{1}z + k_{3} z^{3} + c_{n} \dot{z}^{2} { = }\,f\left( t \right)$$

(20)

The system is excited by a low-level force 2Nrms and a high-level force 100Nrms. As shown in Fig. 2, the response is calculated by the Runge– Kutta fourth-order method. The sampling frequency is 1000 Hz, and the acquisition length is t = 50 s.

It is widely recognized that when structures are subjected to low-level excitations, they display linear characteristics, and the underlying linear system frequency function of the structure can be determined equivalently. As shown in Fig. 3, the frequency response function under low-level excitation is largely consistent with the theoretical linear frequency response function, indicating that nonlinearity is not excited. The estimated impulse response function is calculated.

The estimated unit impulse response function is primarily utilized to distinguish the response caused by nonlinear internal forces and construct a nonlinear function library. Under high-level excitation, the excitation force and the unit impulse response function are convolutionally integrated. Then, the nonlinear response by the internal nonlinear force f_n is estimated using Eq. (7), as is depicted in Fig. 4. The results demonstrated that the estimated nonlinear response curve is in close agreement with the true value curve.

In addition, a nonlinear type construction function library G can be utilized in the construction of a set of systems. In this example, the polynomial type (•)ⁿ, the symbol type sgn(•), and the absolute value type |(•)| are considered, as illustrated in Table 1. The function library contains 10 basis functions, and 2¹⁰ nonlinear functions can be constructed by cross-permutation and combination. The unit impulse response function is convolved with each nonlinear vector in G to construct an extended library of response functions G′.

Table 1 Nonlinear function library

Full size table

Gibbs based on the Bayesian regression is used for sampling, and hyperparameters and initial values of each random variable are set. The hyperparameters are set as follows: a_p = 0.1, b_p = 1, a_τ = 0.5, b_τ = 0.5, a_σ = 5 × 10^–4, b_σ = 5 × 10–4. The initial values of β, s, p₀, τ, σ² are set as: p₀ (1) = 0.1, τ(1) = 10, σ² (1) = 0. Utilize the Gibbs sampler method to run two Markov chains simultaneously, each containing 5000 samples, and discard a certain number of samples before the convergence period.

Figure 5 shows the procedure of basis function selection based on the marginal posterior inclusion probability (PIP). When p (s_i = 1|Y, X) = 1, it implies that the ith basis function had been selected in all Gibbs posterior samples, while p (s_i = 1|Y, X) = 0 implies the ith basis function had never been selected. As mentioned in “Using Spike-And-Slab Priors For Bayesian Basis Function Selection”, only those basis functions are included in the final estimated model whose corresponding marginal PIPs are greater than the set threshold of 0.5. The results show that x³ is selected with a marginal posterior probability close to 1, another true relevant variable ẋ² draws marginal PIPs of around 0.9, and the remaining nonlinearities are discarded with a marginal posterior probability less than 0.2, which is consistent with the real situation and verifies the effectiveness of the basis function selection in this paper.

When measuring the real structural response, there will inevitably be measurement noise, which may lead to the inaccuracy of the separated nonlinear response and affect the subsequent nonlinear characterization. To further discuss the robustness of the method, 2% and 5% noise were added to the above-mentioned responses under low-level and high-level excitations, respectively. As shown in Fig. 6, in the case of noise, the reconstructed nonlinear response trend is still consistent with the theoretical value. When the noise is large, the curve will show more glitches. Furthermore, the corresponding nonlinear basis function selection results are given in Fig. 7. The results show that in the case of noise, the marginal posterior probability of nonlinear stiffness is still close to 1, while the marginal posterior probability of damping will decrease with the increase of noise, but it is still greater than 0.85, which can accurately determine the nonlinear type of structure, to verify that the model selection has better robustness.

Substituting the above model selection results into the nonlinear subspace, the nonlinear coefficients of the structure can be identified in Table 2. In the case of 5% noise, the absolute value of the identification error is also less than 2%, which verifies that the method has good robustness.

Table 2 Identified nonlinear parameters and errors under different noise levels

Full size table

Experimental Study

To assess the effectiveness of the proposed method in real-world structures, this section presents an experimental test on a negative system. The device being tested consists of a U-shaped frame connected to a central moving mass via rods. The structure is subjected to base motion by a shaking table. The assumption made is that the inertia of the moving parts can be concentrated into a single central point, encompassing the mass of the central bush and the equivalent inertia of the rods. Relevant theoretical findings on this structure have been reported in Ref. [25]. The equation of motion along the vertical direction in the x variable can be expressed as

$$m\ddot{x} + f_{d} - \tilde{k}_{1} x + \tilde{k}_{2} x^{2} + \tilde{k}_{3} x^{3} { + }mg = f\left( t \right)$$

(21)

where $- \tilde{k}_{1}$ < 0 is the so-called negative stiffness and f_d is the nonlinear damping force.

Figure 8 illustrates the experimental setup with two photos, showing the device stable equilibrium positions. The system is excited with a shaking table to provide random excitation. The sampling frequency is 512 Hz, and the acquisition length is 135 s. The equilibrium positions are measured using a laser vibrometer and are found to be $x_{ - }^{ * }$ = − 0.0301 m and $x_{ + }^{ * }$ = 0.0242 m.

A cross-validation method is adopted: the data set is divided into a training set (first 100 s) and a validation set (the last 35 s) to verify the generalization ability of the model in Fig. 9.

As previously reported in detail in [25], a new displacement variable z(t) = x(t)-$x_{ + }^{ * }$ can be defined when a positive reference position $x_{ + }^{ * }$ is taken into consideration. In this scenario, the system is defined as a stable underlying linear system. Consequently, the equation of motion can be expressed as:

$$m\ddot{z} + c\dot{z} + k_{1} z + k_{2} z^{2} + k_{3} z^{3} + f_{d} = f\left( t \right)$$

(22)

For this experimental system, the nonlinear damping form is challenging to determine a priori. Based on the experimental results in [25], the friction force in this negative stiffness oscillator is dependent on position and velocity. Friction is related to normal force p_r, which can be represented by:

$$\begin{gathered} f_{d} \left( {\dot{z},z} \right) = \sum\nolimits_{j = 0}^{{n_{d} }} {c_{j} sign\left( {\dot{z}} \right)} \dot{z}^{j} \left| {\underbrace {{\alpha z^{3} + \beta z^{2} + \gamma z + \delta }}_{{p_{r} \left( z \right)}}} \right| \hfill \\ \, = \sum\nolimits_{j = 0}^{{n_{d} }} {c_{j} sign\left( {\dot{z}} \right)} \dot{z}^{j} \left| {p_{r} \left( z \right)} \right| \hfill \\ \end{gathered}$$

(23)

where the three coefficients (α, β, γ, δ) related to two stable equilibrium positions $x_{ \pm }^{ * }$ are uniquely determined.

$$\alpha = \frac{{x_{ - }^{3} - x_{ + }^{3} }}{{x_{ - }^{2} - x_{ + }^{2} }} - x_{ + }^{3} - 1$$

(24)

$$\beta = - \alpha \frac{{x_{ - }^{3} - x_{ + }^{3} }}{{x_{ - }^{2} - x_{ + }^{2} }}$$

(25)

$$\gamma = 0,\delta = 1$$

(26)

Based on Ref. [14] or the underlying linear parameter identification method [26], the underlying linear FRFs can be obtained. Then, the nonlinear response by the internal nonlinear force f_n is estimated using Eq. (7), as is depicted in Fig. 10.

Then, it is important to determine the nonlinear function. The following set of nonlinear basis functions is considered: z³; z²; $\sum\nolimits_{j = 0}^{{n_{d} }} {c_{j} sign\left( {\dot{z}} \right)} \dot{z}^{j} \left| {p_{r} \left( z \right)} \right|$, n_d = 4; As per the proposed method, the results of basis function selection are illustrated in Fig. 11. The results indicate that the marginal posterior probability of selecting z³ and z² is close to 1. In terms of the damping term, the marginal PIP of Coulomb and quadratic friction is about 0.7, these two terms dominate the damping, which is in agreement with the results of the previous literature and verifies the effectiveness of the nonlinear basis functions selection of the proposed method.

By incorporating the results of the model selection into the nonlinear subspace identification method, the estimated coefficients are illustrated in Fig. 12. The results show that the real part of the parameters maintains a horizontal line and is independent of the frequency, thereby verifying the effectiveness and stability of the nonlinear identification. The imaginary parts of the coefficients are consistently lower than the real parts, however, the difference is generally decreased for the damping-related coefficients, indicating that the identified model structure is still impacted by a combination of nonlinear modeling errors and noise. The average ratio between real and imaginary parts E[R/I] is in any case at least one order of magnitude. The complete list of the estimated coefficients is presented in Table 3.

Table 3 Identified coefficients for the positive reference equilibrium positions

Full size table

The identified restoring force estimated by the proposed method is compared with the experimental value obtained by the restoring force surface method in Fig. 13, and the agreement is good. The friction is related to position and velocity, and a 2D graph like velocity force is unable to evaluate the damping identification. In Fig. 14, the complete restoring surface is obtained to include the damping force in a 3D graph. The experimental restoring surface is calculated by the restoring force surface method, and the experimental restoring surface overlaps with the identified one. The results demonstrate that the proposed method can effectively identify nonlinear stiffness and damping.

Further, the effectiveness of the proposed method is verified by the verification set data. Based on the previous identification results, the response signal of the system is constructed, as shown in Fig. 15. The results show that the estimated response is in good agreement with the experimental signal curve as a whole, but there are some local deviations.

Conclusion

This paper presents an enhanced nonlinear subspace identification method. The proposed method in this paper utilizes Spike-and-Slab Priors for Sparse Bayesian Learning in nonlinear subspace identification to solve the model selection problem. The nonlinear term in the system is treated as an internal excitation. The first step is to extract the impulse response function of the underlying linear system of the structure. This is followed by applying a high-level excitation to the structure, which allows for the separation of the response caused by the nonlinear term from the overall response. The type of nonlinearity is evaluated using Spike-and-Slab Priors for Sparse Bayesian Learning. Then, the nonlinear models are substituted into subspace identification to determine nonlinear parameters. The effectiveness of this method in dealing with nonlinear stiffness and damping is verified by simulation analysis and robustness is further discussed. The method is also shown to have good adaptability in dealing with complex nonlinear damping conditions through experimental study.

Data availability

Data will be made available on request.

References

Marchesiello S, Garibaldi L (2008) Identification of clearance-type nonlinearities. Mech Syst Signal Process 22(5):1133–1145. https://doi.org/10.1016/j.ymssp.2007.11.004
Article Google Scholar
Nelles O (2020) Nonlinear system identification: from classical approaches to neural networks, fuzzy models, and gaussian processes. Springer Nature, Cham
Book Google Scholar
Stender M, Oberst S, Hoffmann N (2019) Recovery of differential equations from impulse response time series data for model identification and feature extraction. Vibration 2(1):25–46
Article Google Scholar
Chen D, Gu C, Fang K et al (2021) Vortex-induced vibration of a cylinder with nonlinear energy sink (NES) at low Reynolds number. Nonlinear Dyn 104(3):1937–1954
Article Google Scholar
Kerschen G, Worden K, Vakakis AF, Golinval J (2006) Past, present and future of nonlinear system identification in structural dynamics. Mech Syst Signal Process 20:505–592. https://doi.org/10.1016/j.ymssp.2005.04.008
Article Google Scholar
Noël JP, Kerschen G (2017) Nonlinear system identification in structural dynamics: 10 more years of progress. Mech Syst Signal Process 83:2–35. https://doi.org/10.1016/j.ymssp.2016.07.020
Article Google Scholar
Zhu R, Jiang D, Marchesiello S et al (2023) Automatic nonlinear subspace identification using clustering judgment based on similarity filtering. AIAA J. https://doi.org/10.2514/1.J062816
Article Google Scholar
Hot A, Kerschen G, Foltête E et al (2012) Detection and quantification of non-linear structural behavior using principal component analysis. Mech Syst Signal Process 26:104–116
Article Google Scholar
Sun W, Paiva ARC, Xu P et al (2020) Fault detection and identification using Bayesian recurrent neural networks. Comput Chem Eng 141:106991
Article Google Scholar
Peng ZK, Lang ZQ (2007) Detecting the position of non-linear component in periodic structures from the system responses to dual sinusoidal excitations. Int J Non-Linear Mech 42(9):1074–1083
Article Google Scholar
Jin M, Kosova G, Cenedese M et al (2022) Measurement and identification of the nonlinear dynamics of a jointed structure using full-field data; part II-nonlinear system identification. Mech Syst Signal Process 166:108402
Article Google Scholar
Ji Y, Zhang C, Kang Z et al (2020) Parameter estimation for block-oriented nonlinear systems using the key term separation. Int J Robust Nonlinear Control 30(9):3727–3752
Article MathSciNet Google Scholar
Marchesiello S, Garibaldi L (2008) A time domain approach for identifying nonlinear vibrating structures by subspace methods. Mech Syst Signal Process 22:81–101. https://doi.org/10.1016/j.ymssp.2007.04.002
Article Google Scholar
Anastasio D, Fasana A, Garibaldi L, Marchesiello S (2020) Nonlinear dynamics of a duffing-like negative stiffness oscillator: modeling and experimental characterization. Shock Vib. https://doi.org/10.1155/2020/3593018
Article Google Scholar
Zhu R, Fei Q, Jiang D et al (2021) Identification of nonlinear stiffness and damping parameters using a hybrid approach. AIAA J 59(11):4686–4695
Article Google Scholar
Al-Hadid MA, Wright JR (1989) Developments in the force-state mapping technique for non-linear systems and the extension to the location of non-linear elements in a lumped-parameter system. Mech Syst Signal Process 3(3):269–290
Article Google Scholar
Simoen E, Papadimitriou C, Lombaert G (2013) On prediction error correlation in Bayesian model updating. J Sound Vib 332(18):4136–4152
Article Google Scholar
Nayek R, Fuentes R, Worden K et al (2021) On spike-and-slab priors for Bayesian equation discovery of nonlinear dynamical systems via sparse linear regression. Mech Syst Signal Process 161:107986
Article Google Scholar
Koch B, Vock DM, Wolfson J et al (2020) Variable selection and estimation in causal inference using Bayesian spike and slab priors. Stat Methods Med Res 29(9):2445–2469
Article MathSciNet Google Scholar
Zhu R, Fei Q, Jiang D et al (2019) Removing mass loading effects of multi-transducers using Sherman-Morrison-Woodbury formula in modal test. Aerosp Sci Technol 93:105241
Article Google Scholar
Folland GB (2009) Fourier analysis and its applications. American Mathematical Soc, New York
Google Scholar
Dempsey KM, Irvine HM (1978) A note on the numerical evaluation of Duhamel’s integral. Earthquake Eng Struct Dynam 6(5):511–515
Article Google Scholar
Huang Y, Beck JL, Li H (2017) Bayesian system identification based on hierarchical sparse Bayesian learning and Gibbs sampling with application to structural damage assessment. Comput Methods Appl Mech Eng 318:382–411
Article MathSciNet Google Scholar
Zhu R, Fei Q, Jiang D et al (2022) Bayesian model selection in nonlinear subspace identification. AIAA J 60(1):92–101
Google Scholar
Zhu R, Marchesiello S, Anastasio D et al (2022) Nonlinear system identification of a double-well Duffing oscillator with position-dependent friction. Nonlinear Dyn. https://doi.org/10.1007/s11071-022-07346-1
Article Google Scholar
Liu Q, Zhang Y, Hou Z et al (2023) Optimal Hilbert transform parameter identification of bistable structures. Nonlinear Dyn 111(6):5449–5468

Download references

Acknowledgements

The research results were supported by the General Project of Natural Science Research in Jiangsu Universities (No. 20KJB410001) and the National Science Research Program Cultivation Fund (No.2022NCF004 & 2022NCF005) of Southeast University Chengxian College.

Author information

Authors and Affiliations

Technical University of Munich, 85748,, Munich, Germany
Rui Zhu
School of Civil and Transportation Engineering, Southeast University Chengxian College, Nanjing, 210088, China
Sufang Chen
School of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing, 210037, China
Dong Jiang & Shitao Xie
Shanghai Institute of Spacecraft Equipment, Shanghai, 200240, China
Lei Ma
Dipartimento Di Ingegneria Meccanica Ed Aerospaziale, Politecnico Di Torino, 10121, Turin, Italy
Stefano Marchesiello & Dario Anastasio

Authors

Rui Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Sufang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Dong Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Shitao Xie
View author publications
You can also search for this author in PubMed Google Scholar
Lei Ma
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Marchesiello
View author publications
You can also search for this author in PubMed Google Scholar
Dario Anastasio
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sufang Chen.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest concerning the publication of this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhu, R., Chen, S., Jiang, D. et al. Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors. J. Vib. Eng. Technol. 12, 3021–3031 (2024). https://doi.org/10.1007/s42417-023-01030-3

Download citation

Received: 30 January 2023
Revised: 24 April 2023
Accepted: 16 May 2023
Published: 07 June 2023
Issue Date: March 2024
DOI: https://doi.org/10.1007/s42417-023-01030-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors