Keywords

1.1 Introduction

Engineering structures in service suffer from long-term environmental effects and operational loads, which cause unexpected structural damage. Monitoring structural damage is important for preserving and sustaining the service life of engineering structures. It is necessary to monitor structural damage by developing noise-robust and accurate damage identification methods. Recently, machine learning and especially deep learning algorithms have been studied widely in damage identification methods, thanks to the rapid development of graphics processing units (GPUs) and sensing technology in the last decade, including acoustic emission-based methods [1], guided wavefield-based methods [2], vibration-based structural damage identification methods [3, 4], etc.

Guided wavefield-based methods have been studied widely for damage Identification, owing to the capability of scanning laser Doppler vibrometers (SLDVs), which can measure velocities of a structure on each point of a predefined grid. It allows the visualization and post-processing of a guided wavefield by reconstructing measured velocities [5]. Joseph et al. [6] proposed a guided wavefield-based method for damage detection in thin metal plates. It uses full wavefield data acquired by SLDV, half from the damaged state and another half from the undamaged state, to train a deep neural network with two convolutional layers and two fully connected layers. It was reported that their proposed method has a better performance than support vector machine-based methods. Abdalraheem et al. [2] proposed guided wavefield-based method for damage detection by using deep fully convolutional neural networks (CNNs) FCN-DenseNet. It was fed by a large dataset of guided wavefields, which contains 475 cases of guided wavefields resulting from interaction with delaminations of random shape, size, and location. The results reveal that their proposed method performs better than the adaptive wavenumber filtering method. Besides, the same dataset has been used in a comparative study for delamination identification by using several deep fully CNNs models, including VGG16 encoder-decoder, PSPNet, residual UNet, and FCN-DenseNet [7]. It was reported that GCN model has the highest identification accuracy among the five implemented models. Song et al. [8] proposed guided wavefield-based method for identifying sub-wavelength defects. It combines two distinct fully CNNs, including a global detection network and a local super-resolution network. It was reported that their proposed method can not only locate sub-wavelength defects but also estimate the shape and size of the identified sub-wavelength defects. The aforementioned methods require labeled dataset which not only contains guided wavefields but also contains corresponding information of damage. However, labels are usually unavailable in practice, or acquiring these datasets are very time-consuming. Hence, it is necessary for researchers to come up with innovative methods using guided wavefields without labels under machine learning or deep learning applications.

Physics-informed neural networks (PINNs) are a mesh-free scientific machine learning technique used to solve equations, like nonlinear partial differential equations [9]. PINNs approximate solutions of equations by training a multilayer perceptron neural network to minimize a loss function, which includes terms reflecting the equations residual at selected points in the spatial-temporal domain, and the initial as well as boundary conditions along boundaries of the domain [10]. PINNs have been applied in several works with the ability of incorporating data and their governing physics equation, especially solving inverse problems, where unknown parameters of equation are estimated. Shukla et al. [11] used a PINN to identify a surface breaking crack in a metal plate. Measured ultrasonic surface acoustic waves of the metal were target of the PINN, and a wave velocity field of the metal plate was estimated as a damage index. Rathod et al. [12] identified the change of a wave velocity in a rod by a trained PINN, where 50% reduction in Young’s modulus in the mid part can be represented in the identified wave velocity along the rod, but the sharp variation in the wave velocity was not captured. Zhang et al. [13] presented a general framework based on PINNs for solving inverse problems involving unknown material properties and geometric in continuum solid mechanics. It was reported that the framework can accurately estimate the unknown material properties and geometric with a relative error in the order of \(10^{-2}\).

In this paper, a guided wavefields-based damage identification method based on PINNs is proposed for beam-like structures. In this proposed method, a pseudo-pristine beam model is constructed by a PINN from a measured flexural guided wavefield of a damaged beam-like structure. The residual of Euler–Bernoulli equation with an unknown parameter \(\alpha \), i.e., the ratio of bending stiffness and mass per unit length, is used in the PINN as a loss term. This parameter serves as a variable and is updated in each iteration of gradient descent during the training process of the pseudo-pristine beam model. A predicted flexural guided wavefield is obtained from the pseudo-pristine beam model when the PINN is trained. An accumulative two-dimensional damage index is calculated based on the difference between the measured flexural guided wavefield and the predicted flexural guided wavefield. The location and extent of damage can be identified by consistently high values of the damage index. Accuracy and noise-robustness of the proposed method are investigated in a numerical example of a damaged beam-like structure with different noise levels.

The rest of the paper is arranged as follows. In Sect. 1.2, the proposed damage identification method based on PINNs is presented. In Sect. 1.3, the numerical investigation is described. Conclusions of this work are presented in Sect. 1.4.

1.2 Methodology

1.2.1 Euler–Bernoulli Beam

Consider a narrow, long beam with a lateral load applied, and, consequently, its displacement is off the centerline. The type of dynamic behavior is called flexural motion [14]. In this study, the Euler–Bernoulli beam model is considered. It is a simplification of the linear theory of elasticity which governs the flexural motion of beams. Assuming a pristine beam-like structure is homogeneous undamped, hence its governing equation of flexural motion on a bounded spatial-temporal domain is defined by [15]

$$\displaystyle \begin{aligned} EI\frac{\partial^{4}u\left(x,t\right)}{\partial x^{4}}+\mu\frac{\partial^{2}u\left(x,t\right)}{\partial t^{2}}=p\left(x,t\right),\quad x\in\left[0,L\right],\:{and}\:t\in\left[0,T\right]{} \end{aligned} $$
(1.1)

where \(u\left (x,t\right )\) is the spatial-temporal flexural motion of the beam with L spatial lengths and T temporal lengths, E is the elastic modulus, I is the second moment of area of the beam’s cross sections, \(\mu \) is the mass per unit length of the beam, and \(p\left (x,t\right )\) is the external force. Divide by \(\mu \) in each term of Eq. (1.1), and one has

$$\displaystyle \begin{aligned} \alpha\frac{\partial^{4}u\left(x,t\right)}{\partial x^{4}}+\frac{\partial^{2}u\left(x,t\right)}{\partial t^{2}}=\frac{p\left(x,t\right)}{\mu}{} \end{aligned} $$
(1.2)

where

$$\displaystyle \begin{aligned} \alpha:=\frac{EI}{\mu}{} \end{aligned} $$
(1.3)

It is observed that Eqs. (1.1) and (1.2) are fourth-order partial differential equations in space, and second order in time.

1.2.2 PINNs for the Euler–Bernoulli Beam

Let \(\mathcal {N}\left (\mathbf {s};\boldsymbol {\theta }\right )\): \(\mathbb {R}^{d_{\mathbf {s}}}\rightarrow \mathbb {R}^{d_{\mathbf {y}}}\) be a multilayer perceptron neural network with input vector \(\mathbf {s}\), H hidden layers, an output vector \(\mathbf {y}\), and trainable network parameters \(\boldsymbol {\theta }\). In the h-th hidden layer, \(N_{h}\) number of neurons is present. Each hidden layer receives an input \({\mathbf {s}}^{h-1}\in \mathbb {R}^{N_{h-1}}\) from the output of the previous layer through the following nested transformations:

$$\displaystyle \begin{aligned} \mathcal{Q}_{h}:=\sigma^{h}\left({\mathbf{w}}^{h}{\mathbf{s}}^{h-1}+{\mathbf{b}}^{h}\right),\quad h=1,\ldots,H{} \end{aligned} $$
(1.4)

where \(\sigma ^{h}\) is the nonlinear activation function in the h-th hidden layer, \({\mathbf {w}}^{h}\in \mathbb {R}^{N_{h}\times N_{h-1}}\) and \({\mathbf {b}}^{h}\in \mathbb {R}^{N_{h}}\) are weight matrices and bias vectors in the h-th hidden layer, respectively. Output layer has the similar transformations in Eq. (1.4) but without nonlinear activation. Hence, the final neural network representation can be expressed as

$$\displaystyle \begin{aligned} \mathcal{N}\left(\mathbf{s};\boldsymbol{\theta}\right)={\mathbf{w}}^{H+1}\left(\mathcal{Q}_{H}\circ\mathcal{Q}_{H-1}\circ\cdot\cdot\cdot\circ\mathcal{Q}_{1}\right)+{\mathbf{b}}^{H+1}{} \end{aligned} $$
(1.5)

where the operator \(\circ \) is the composition operator, \({\mathbf {w}}^{H+1}\in \mathbb {R}^{N_{H}}\) and \({\mathbf {b}}^{H+1}\in \mathbb {R}\) are weight and bias parameters of the output layer, respectively. Hence, all trainable network parameters can be represented as \(\boldsymbol {\theta }=\left \{ {\mathbf {w}}^{1},{\mathbf {b}}^{1},...,{\mathbf {w}}^{H+1},{\mathbf {b}}^{H+1}\right \} \). With reference to the universal approximation theorem [16], \(u\left (\mathbf {x},\mathbf {t}\right )\) can be approximated by

$$\displaystyle \begin{aligned} u\left(\mathbf{x},\mathbf{t}\right)\approx{\mathcal{N}}\left(\mathbf{x},\mathbf{t};\boldsymbol{\theta}\right){} \end{aligned} $$
(1.6)

where \(\mathbf {x}=\left \{ x:x\in \left [0,L\right ]\right \} \) and \(\mathbf {t}=\left \{ t:t\in \left [0,T\right ]\right \} \) are input vectors of \(\mathcal {N}\left (\mathbf {x},\mathbf {t};\boldsymbol {\theta }\right )\). Based on the governing equation of the beam, i.e., Eq. (1.2), the partial differential equation residual \(\mathcal {R}\left (\mathbf {x},\mathbf {t}\right )\) is defined as

$$\displaystyle \begin{aligned} {\mathcal{R}}\left(\mathbf{x},\mathbf{t}\right):=\alpha\frac{\partial^{4}\mathcal{N}\left(\mathbf{x},\mathbf{t};\boldsymbol{\theta}\right)}{\partial{\mathbf{x}}^{4}}+\frac{\partial^{2}\mathcal{N}\left(\mathbf{x},\mathbf{t};\boldsymbol{\theta}\right)}{\partial{\mathbf{t}}^{2}}-\frac{p\left(\mathbf{x},\mathbf{t}\right)}{\mu}{} \end{aligned} $$
(1.7)

The value of \(\alpha \) is unknown, which updates in each iteration of gradient descent in the training process. Note that \(\boldsymbol {\theta }\) can be learned by minimizing the following composite loss function:

$$\displaystyle \begin{aligned} \mathcal{L}\left(\boldsymbol{\theta}\right)=\lambda\mathcal{L}_{d}\left(\boldsymbol{\theta}\right)+\mathcal{L}_{r}\left(\boldsymbol{\theta}\right){} \end{aligned} $$
(1.8)

where

$$\displaystyle \begin{aligned} \mathcal{L}_{d}\left(\boldsymbol{\theta}\right)=\frac{1}{N_{d}}\sum\left(\hat{u}\left({\mathbf{x}}_{d},{\mathbf{t}}_{d}\right)-\mathcal{N}\left({\mathbf{x}}_{d},{\mathbf{t}}_{d};\boldsymbol{\theta}\right)\right)^{2}{} \end{aligned} $$
(1.9)
$$\displaystyle \begin{aligned} \mathcal{L}_{r}\left(\boldsymbol{\theta}\right)=\frac{1}{N_{d}}\sum\left(\mathcal{R}\left({\mathbf{x}}_{d},{\mathbf{t}}_{d}\right)\right)^{2}{} \end{aligned} $$
(1.10)

are residual loss and data loss, respectively. Here, \(\lambda \) is a hyperparameter that is larger than 0, and \(N_{d}\) denotes the number of inputs \(\left \{ {\mathbf {x}}_{d},{\mathbf {t}}_{d}\right \} \) in each channel, which can be randomly sampled from the spatial-temporal domain \(\left [0,L\right ]\times \left [0,T\right ]\), and \(\hat {u}\left ({\mathbf {x}}_{d},{\mathbf {t}}_{d}\right )\) is a sampled dataset from the measured flexural guided wavefield with corresponding coordinates \(\left \{ {\mathbf {x}}_{d},{\mathbf {t}}_{d}\right \} \). The finally learned trainable network parameters are defined as \(\boldsymbol {\theta }_{l}\). A block diagram describing the PINN model used in this study is shown in Fig. 1.1.

Fig. 1.1
A network model explains how the input nodes of N of x, t semicolon theta with a hierarchy of levels are processed via a hierarchy of differential nodes. Both the outputs L r of theta and L d of theta are combined under loss and minimized to get the output theta l.

Block diagram of the PINN model for the Euler–Bernoulli beam

1.2.3 Damage Identification Process

Damage in a homogeneous beam-like structure can cause structural flexural motion anomalies at locations of damage although such local anomalies may not be apparent from the measured flexural motion [17]. In the piecewise form, \(\hat {u}\left (x,t\right )\) can be written as a sum of three components, including the flexural motion for corresponding the pristine structure \(\hat {u}_{t}\left (x,t\right )\), local anomalies \(\hat {u}_{p}\left (x,t\right )\), if existing, and white noise \(\hat {u}_{n}\left (x,t\right )\), if existing, and \(\hat {u}\left (x,t\right )\) can be expressed by

$$\displaystyle \begin{aligned} \hat{u}\left(x,t\right)=\hat{u}_{t}\left(x,t\right)+\hat{u}_{p}\left(x,t\right)+\hat{u}_{n}\left(x,t\right){} \end{aligned} $$
(1.11)

When a surrogate model learned from \(\hat {u}\left (x,t\right )\) by PINNs, \(\mathcal {N}\left (x,t;\boldsymbol {\theta }_{l}\right )\) is close to \(\hat {u}_{t}\left (x,t\right )\). Hence, the difference between \(\hat {u}\left (x,t\right )\) and \(\mathcal {N}\left (x,t;\boldsymbol {\theta }_{l}\right )\) can eliminate most effect from \(\hat {u}_{t}\left (x,t\right )\):

$$\displaystyle \begin{aligned} \triangle u\left(x,t\right)=\hat{u}\left(x,t\right)-\mathcal{N}\left(x,t;\boldsymbol{\theta}_{l}\right){} \end{aligned} $$
(1.12)

When \(\hat {u}\left (x,t\right )\) is measured from a pristine beam, \(\triangle u\left (x,t\right )\) will contain errors of the surrogate model and \(\hat {u}_{n}\left (x,t\right )\). On the other side, \(\triangle u\left (x,t\right )\) will contain one more term, i.e., \(\hat {u}_{p}\left (x,t\right )\), than the former one, when \(\hat {u}\left (x,t\right )\) is measured from a damaged beam. Hence, unapparent local anomalies in \(\hat {u}\left (x,t\right )\) will become dominated in \(\triangle u\left (x,t\right )\) when damage exists.

An accumulative two-dimensional damage index can be constructed for damage identification for accumulating local anomalies, and alleviating the adverse effect of noise and errors in \(\triangle u\left (x,t\right )\), which is defined as

$$\displaystyle \begin{aligned} \delta\left(x,m\right)=\left(\frac{\sum_{t=0}^{T}\left|\triangle u\left(x,t\right)\right|}{\text{{$\max\left(\sum_{t=0}^{T}\left|\triangle u\left(x,t\right)\right|\right)$}}}\right)^{m},\quad m\in\left[1,\infty\right){} \end{aligned} $$
(1.13)

where \(\left |\cdot \right |\) denotes the absolute value of a function and \(\max \left (\cdot \right )\) is the maximum of a function. With increasing the value of m, the small values of \(\delta \left (x,m\right )\) responding to noise and errors can be eliminated ultimately. While too large value of m can also lower the value of \(\delta \left (x,m\right )\) in locations of damage. In this study, m is chosen in \(\left [1,10\right ]\) for all cases of \(\delta \left (x,m\right )\). Note that \(\delta \left (x,m\right )\in [0,1]\) and damage can be identified in neighborhoods with consistently high values of \(\delta \left (x,m\right )\) with different values of m. A flowchart summarizing the proposed damage identification method is shown in Fig. 1.2.

Fig. 1.2
A process flow describes how the color gradient graph of u cap of x, t is transformed into a vivid graph of the delta of x, m. It undergoes a hierarchy of network structure of N of x, t semicolon theta and two color gradient graphs of N of x, t semicolon theta l and delta u of x, t.

Flowchart of the proposed damage identification method

1.3 Numerical Investigation

In this section, flexural guided wavefields obtained from numerical simulations of two cantilever beams: one is a pristine beam and the other is a damage beam which is generated from the pristine beam with damage in form of a thickness reduction area. The former one is used to study the capability of modeling by PINNs in constructing a surrogate model for the pristine beam. The latter one is used to study the effectiveness of the proposed damage identification method. Besides, the noise-robustness of the method is also investigated and discussed.

1.3.1 Numerical Models of Two Cantilever Beams

For the pristine cantilever beam, it has a height of 6 mm, a width of 3 mm, and a length of 300 mm. The damaged cantilever beam has the same dimensions as the pristine beam, and damage is introduced by a one-sided through-width thickness reduction area with a length of 10 mm and a depth of \(0.3\) mm. Dimensions of these two beams and the location of the damage are shown in Fig. 1.3. The pristine beam and damaged beam are made of aluminum 6061-T6 with a mass density of 2700 \(\text{kg/m}^{3}\), Young’s modulus of \(68.9\) GPa, and Poisson’s ratio of \(0.33\). The ratio of bending stiffness and mass per unit length, i.e., \(\alpha \), is \(19.1389\) \(\left (\mathrm{m}^{4}/\mathrm{s}^{2}\right )\) for the pristine beam. Their numerical model is modeled by the finite element method in finite element analysis software ABAQUS. The models of these two beams are built with the linear eight-node brick (C3D8R) elements. The size of elements is \(0.3\) mm so that a total of 1001 nodes are evenly distributed along the length of the intact side of each beam. These two cantilever beams are under zero initial conditions, and their free ends are subjected to a five-count wave packet force, which is expressed by

$$\displaystyle \begin{aligned} h\left(t\right)=A\left(H\left(t\right)-H\left(t-\frac{5}{f_{c}}\right)\right)\left(1-\cos\left(\frac{2\pi f_{c}t}{5}\right)\right)\sin\left(2\pi f_{c}t\right){} \end{aligned} $$
(1.14)

where A defines the magnitude of the force, H is Heaviside function expressed by

$$\displaystyle \begin{aligned} H\left(t\right)=\begin{cases} 1, & t\geqslant0\\ 0, & t<0 \end{cases}{} \end{aligned} $$
(1.15)

and \(f_{c}\) is the central frequency of the force. The values of A and \(f_{c}\) are chosen to be \(0.5\) N and 3 kHz, respectively. The force is only applied in the free end nodes of two beams, and other nodes of beams are zeros. Hence, the force field, i.e., the rightmost item, is approximated by zeros in \(\mathcal {R}\left (\mathbf {x},\mathbf {t}\right )\) of Eq. (1.7).

Fig. 1.3
A 3-part diagram of a beam. The rectangular beam of length 300 has a damaged block of length 6 has a width of 3. The damaged beam is placed on an X-Y plane. The damaged length is 10 and is from 160 from the left and 130 from the right with a width of 3. All dimensions are in millimeters.

Dimensions of the pristine beam and damaged beam, and the location of the damage on the damaged beam

Two flexural guided wavefields for the pristine beam and damaged beam are obtained on a grid with 1001 nodes that are evenly distributed along the length of the intact side of each beam and 2501 steps in 2 ms after the excitation force is applied. These flexural guided wavefields are shown in Fig. 1.4. Figure 1.4a shows the flexural guided wavefield consists of a wave packet that is generated from the excitation point and propagating toward the fixed end, and reflecting back to the free end. In Fig. 1.4b, similar observations can be made regarding the resulted flexural guided wavefield for the damaged beam. Besides, reflections and/or local anomalies cannot be directly observed near the damage area.

Fig. 1.4
Two contour graphs plot t versus m. Both graphs have color gradients and a scale reads from negative 1 to 1. Both have spherical patterns of dark and light colors with values between negative 1 and 1. Graph B has 2 dashed vertical lines at the center.

Flexural guided wavefields for (a) the pristine beam and (b) the damaged beam. The amplitudes of the flexural guided wavefield shown in each figure are normalized for better comparison. Edges of the damage area are depicted by vertical dotted lines

1.3.2 Numerical Verification Process

The capability of modeling by PINNs is first investigated based on the flexural guided wavefield obtained from the pristine beam. PINNs are notoriously hard to train especially when high-frequency or multi-scale behavior exists [18]. In this study, the spatial-domain frequency is lowered by scaling \(\mathbf {x}=\left \{ x:x\in \left [0,0.3\right ]\right \} \) with unit of meter to \(\bar {\mathbf {x}}=\left \{ \bar {x}:\bar {x}\in \left [0,1\right ]\right \} \). In the temporal domain, only part of ranges are used and then scale \(\mathbf {t}=\left \{ t:t\in \left [t_{1},t_{2}\right ]\right \} \) with unit of second to \(\bar {\mathbf {t}}=\left \{ \bar {t}:\bar {t}\in \left [t_{1}/t_{2},1\right ]\right \} \). Besides, the amplitudes of corresponding flexural motion are scaled to the interval \(\left [-1,1\right ]\in \mathbb {R}\) as follows

$$\displaystyle \begin{aligned} \bar{u}\left(x,t\right)=-1+2\times\frac{\hat{u}\left(x,t\right)-\min\left(\hat{u}\left(x,t\right)\right)}{\max\left(\hat{u}\left(x,t\right)\right)-\min\left(\hat{u}\left(x,t\right)\right)},\quad t\in\left[t_{1},t_{2}\right]{} \end{aligned} $$
(1.16)

where \(\min \left (\cdot \right )\) is the minimum of a function. Two models, i.e., model #1 and #2, are built based on two different ranges of the temporal domain to verify the consistence of modeling by PINNs, and they are \(t\in \left [0.5,0.7\right ]\times 10^{-3}\) s and \(t\in \left [1,1.2\right ]\times 10^{-3}\) s, respectively. A three-layer fully connected neural network with 100 neurons per hidden layer is used in each model. Two input channels exist in the neural network and number of \(N_{d}=20{,}000\) points \(\left \{ \bar {x}_{i},\bar {t}_{i}\right \} _{i=1}^{N_{d}}\) which are randomly sampled from the selected spatial-temporal domain. The corresponding \(\left \{ \bar {u}\left (\bar {x}_{i},\bar {t}_{i}\right )\right \} _{i=1}^{N_{d}}\) are sampled from the scaled flexural guided wavefield \(\bar {u}\left (\bar {\mathbf {x}},\bar {\mathbf {t}}\right )\). The target function is used as the nonlinear activation function in the two models. The hyperparameter \(\lambda \) in the loss function Eq. (1.8) is set to \(10^{5}\). A two-phase optimizer is used to minimize the loss function. In the first phase, the Adam optimizer [19] with a learning rate of \(10^{-3}\) and a weight decay of \(10^{-5}\) is used via \(50{,}000\) iterations of gradient descent. In the second phase, the L-BFGS optimizer [20] with a learning rate of \(10^{-1}\) is used until the two models are converged.

The capability of the proposed damage identification method is investigated based on the flexural guided wavefield from the damaged beam. Two pseudo-pristine beam models, i.e., model #3 and #4, are trained under the same setups as the model #1 and #2 with different \(\left \{ \bar {u}\left (\bar {x}_{i},\bar {t}_{i}\right )\right \} _{i=1}^{N_{d}}\) sampled from the damaged beam, respectively. When the model #3 and #4 are trained, a damage index is calculated based on Eq. (1.13) for damage identification. Besides, the noise robustness of the proposed damage identification method is investigated. Model #5 is trained under the same setups as the model #4 with noise-contaminated \(\left \{ \bar {u}\left (\bar {x}_{i},\bar {t}_{i}\right )\right \} _{i=1}^{N_{d}}\), which is sampled from noise-contaminated flexural guided wavefield. It is generated by adding a Gaussian noise with a signal-to-noise ratio of 30 dB to the flexural guided wavefield obtained from the damaged beam.

1.3.3 Numerical Verification Results

The model #1 is trained after \(50{,}200\) iterations of gradient descent, and its results are shown in Fig. 1.5. It can be seen that \(\mathcal {L}_{r}\) is smaller than \(\mathcal {L}_{d}\) at the initial iterations, but \(\mathcal {L}_{r}\) increases radially under the effect of \(\lambda \) and then generally decreases as same to \(\mathcal {L}_{d}\) as shown in Fig. 1.5a. The estimated \(\alpha \) needs to be divided by a factor of \(\left (L^{4}/t_{2}^{2}\right )\) with \(L=0.3\) m and \(t_{2}=0.7\times 10^{-3}\) s due to the scaling in spatial and temporal coordinates in the model #1. The estimated \(\alpha \) with iterations is shown in Fig. 1.5b. It can be seen that the estimated \(\alpha \) is always nearby the ground truth after approximately \(0.5\times 10^{4}\) iterations. The final estimated \(\alpha \) is \(18.7446\) \(\left (\mathrm{m}^{4}/\mathrm{s}^{2}\right )\), and the relative error to the ground truth is \(2.06\%\). The prediction of flexural guided wavefield by the model #1 is shown in Fig. 1.5c, and the difference between the predicted wavefield and the corresponding measured wavefield is shown in Fig. 1.5d. Small magnitudes are dominated in the majority ranges, and only relative large magnitudes exist in boundaries of the spatial-temporal domain of the model #1.

Fig. 1.5
4 graphs. Line graphs A and B plot loss and alpha against iterations. A plots decreasing trends for L, L r, and L d. B plots increasing trends that grow as neutral fluctuating trends for estimated and ground truth. C and D of t versus x plot most regions with a value of 0. Both are contour graphs.

Results for the model #1: (a) the composite loss \(\mathcal {L}\), the residual loss \(\mathcal {L}_{r}\), and the data loss \(\mathcal {L}_{d}\) with iterations; (b) comparison of the estimated value of \(\alpha \) and the ground truth of \(\alpha \); (c) the prediction of flexural guided wavefield; and (d) the difference between the predicted wavefield and the corresponding measured wavefield

The model #2 is trained after \(63{,}400\) iterations of gradient descent, and its results are shown in Fig. 1.6. The loss and estimated \(\alpha \) have the similar trend as that of model #1, and their values become stable in iterations of gradient descent by the L-BFGS optimizer. The final estimated \(\alpha \) is \(18.7399\) \(\left (\mathrm{m}^{4}/\mathrm{s}^{2}\right )\), and the relative error to the ground truth is \(2.09\%\). The prediction of flexural guided wavefield by the model #2 is shown in Fig. 1.6c, and the difference between the predicted wavefield and the corresponding measured wavefield is shown in Fig. 1.6d. It can be seen that the prediction wavefield is close to the corresponding measured one. The results for model #1 and model #2 show that PINNs can model the pristine beam with high accuracy and consistency.

Fig. 1.6
Four graphs. Line graphs A and B plot loss and alpha against iterations. A plots decreasing trends for L, L r, and L d with fluctuations. B plots increasing trends that grow as neutral fluctuating trends for estimated and ground truth. Contour graphs C and D plot t versus x with low to high values.

Results for the model #2: (a) the composite loss \(\mathcal {L}\), the residual loss \(\mathcal {L}_{r}\), and the data loss \(\mathcal {L}_{d}\) with iterations; (b) comparison of the estimated value of \(\alpha \) and the ground truth of \(\alpha \); (c) the prediction of flexural guided wavefield; and (d) the difference between the predicted wavefield and the corresponding measured wavefield

The model #3 and #4 are trained after \(51{,}400\) and \(55{,}100\) iterations of gradient descent, respectively. The difference between the predicted wavefield and the corresponding measured wavefield for the model #3 is shown in Fig. 1.7a. Large magnitudes of the difference mainly exist in the areas of damage, but others exist nearly by the areas of damage and boundaries. The difference is used to calculate \(\delta \left (x,m\right )\) in a domain \(\left (x,m\right )\in \left [5,295\right ]\times \left [1,10\right ]\) with a step of \(0.01\) for m, which is shown in Fig. 1.7b. It can be seen that high \(\delta \left (x,m\right )\) values mainly exist in the damage area so that the locations and extent of the damage can be accurately identified by the consistently high \(\delta \left (x,m\right )\) values. Large magnitudes of the difference between the predicted wavefield and the corresponding measured wavefield for the model #4 also mainly exist in the areas of damage as shown in Fig. 1.7c. The difference is used to calculate \(\delta \left (x,m\right )\) in a domain \(\left (x,m\right )\in \left [0,300\right ]\times \left [1,10\right ]\) with a step of \(0.01\) for m, which is shown in Fig. 1.7d. Similarly, consistently high \(\delta \left (x,m\right )\) values mainly exist in the damage area. The damage identification results for model #3 and model #4 show that the proposed method can identify the locations and extent of the damage with high accuracy and consistency.

Fig. 1.7
4 contour graphs plotted against x in millimeters. Graphs A and C plot t while Graphs B and D plot m. All the graphs have color gradients. Graph A ranges from negative 0.01 to 0.015 while C ranges from negative 0.03 to 0.02. Graphs B and D range from 0 to 1 with dark regions and have low values.

(a) The difference between the predicted wavefield and the corresponding measured wavefield for model #3, (b) the damage identification result \(\delta \left (x,m \right )\) for model #3, (c) the difference between the predicted wavefield and the corresponding measured wavefield for model #4, and (d) damage identification results \(\delta \left (x,m \right )\) for model #4. Edges of the damage area are depicted by vertical dotted lines

It takes \(50{,}700\) iterations of gradient descent to train the model #5. The difference between the predicted wavefield and the corresponding measured wavefield for the model #5 is shown in Fig. 1.8a. Gaussian noise can be clearly detected in the difference, and large magnitudes of the difference exist in the damage area, boundaries, as well as nearby the damage area. The difference is used to calculate \(\delta \left (x,m\right )\) in a domain \(\left (x,m\right )\in \left [5,295\right ]\times \left [1,10\right ]\) with a step of \(0.01\) for m, which is shown in Fig. 1.8b. It can be seen that high \(\delta \left (x,m\right )\) values mainly exist in the damage area, and Gaussian noise is alleviated with increasing the value of m.

Fig. 1.8
Two contour graphs plotted against x. A plots t while B plots m. Both graphs have color gradients. A ranges from negative 0.1 to 0.05 while B ranges from 0 to 1. A has lighter shades between negative 0.03 to 0.02. B has more dark regions and low values. 2 dashed vertical lines are at the center.

(a) The difference between the predicted wavefield and the corresponding measured wavefield for model #5 and (b) the damage identification result \(\delta \left (x,m \right )\) for model #5. Edges of the damage area are depicted by vertical dotted lines

1.4 Concluding Remarks

In this paper, a baseline-free structural damage identification method is developed for beam-like structures based on PINNs. Pseudo-pristine beam models are built by PINNs from measured flexural guided wavefields of damaged beam-like structures. A two-dimensional damage index is calculated based on the difference of the predicted flexural guided wavefields by pseudo-pristine beams and their corresponding measured flexural guided wavefields. Damage can be identified in neighborhoods with consistently high values of the damage index. The capability of modeling by PINNs is studied on the pristine beam. The results demonstrate that PINNs can accurately model the pristine beam, providing highly accurate and consistent estimates of the ratio of bending stiffness to mass per unit length. The effectiveness of the proposed damage identification method is investigated in a damage cantilever beam. It was found that (1) the proposed method can accurately identify the location and extent of damage, and (2) the proposed method is robust for measurement noise. In future works, it is worthwhile to conduct investigations on experiments by the proposed method. Besides, segmentation is only used in the temporal domain of flexural guided wavefields for better training PINNs in this study. Segmentation in the spatial-temporal domain for flexural guided wavefields can be studied when higher excitation frequencies are applied to structures.