Keywords

1 Introduction

Modern society relies heavily on structural and mechanical systems. Many of them are presently close to the furthest limit of their design life. Because they cannot be economically replaced, techniques for damage detection are being developed and implemented so that they can continue to be safely used if or when their operation is extended beyond their design service life [1]. These conditions require the early detection of damage to increase structure’s reliability and reduce important economic consequences. Structural health monitoring (SHM) is a growing sector related with damaged detection. It has piqued the interest of the scientific community over the last decade. According to Ostachowicz et al. [2] the performance of the SHM methods is based on the quality of the information collected by the sensors. Even though there is a significant development in sensor technology over the past few years, there is a limit to their performance, so the need for research in other areas becomes significant. Optimal sensor placement is a solution that could lead to a reduction in cost of the SHM system without compromising on the quality of the monitoring approach [2] and is the main challenge of this study. Colombo et al. [3] provide a combined approach based on the Neyman-Pearson likelihood ratio test and Bayesian cost used in an optimization framework. According to the authors a multi-objective optimization scheme is vital to achieve a conjunct saving in costs and improvement in detection performances. Mallardo et al. [4] highlight the importance of impact location identification on composite structures and suggest an optimal sensor positioning on a composite-stiffened aircraft panel aiming to maximize the probability to detect the impact or minimize the error associated with a pre-assigned probability of detection by using Genetic Algorithms (GA). Another interesting work focused on marine applications is proposed in Silionis and Anyfantis [5], where the guiding principle of the damage identification strategy is based upon measuring, through a limited and constant number of sensors, the static strain redistribution caused by an extensive damage. The authors treat the problem as one of statistical pattern recognition, and apply methods derived from machine learning to tackle it. Other interesting works may be found in [6,7,8].

The present work addresses the important problem of optimizing sensor topology on a given structure. It provides an association of the sensor network arrangement that optimizes the detection performance based on the statistical patterns that govern the feature space, explored in a manual or automated fashion. More specifically, this paper focuses on a decision process for arriving at the topology for sensor placement over a domain of interest, under a variable operational environment. For research purposes, a fictitious SHM problem has been considered, which is described in the second section. In the third section, a description of the probabilistic numerical simulations is provided, followed by that of the employed detector in the fourth section. In the fifth section a presentation of the statistical response is provided. Detection results are given in the sixth section. Emphasis is given at the description of the optimal solution that has been derived by a GA. In the seventh section the conclusions drawn are presented along with some suggestions for future work.

2 Problem Definition

2.1 SHM Detection Basis

The considered SHM problem is focused on detecting corrosion-induced thickness loss (CITL), in a thin-walled structural component. This problem is of interest in marine structures, as these operate in a predominantly corrosive environment. Added thickness is commonly used in plates across vulnerable regions as a preventive measure. The SHM strategy proposed herein is oriented towards predictive, condition-based maintenance. The structure’s plate thickness is inferred, indirectly, from capturing load redistributions caused by stiffness reduction. A classifier, based on detection theory is introduced, which translates the strain-based feature vector into inferences about the structure’s condition.

2.2 Case Study Description

A thin plate subjected to probabilistically described static loads is considered. Hence, their magnitude and location are considered as random variables (RVs) that obey a given statistical structure. Strain measurements are extracted and used as the informative features. Damage is introduced as general uniform corrosion by simulating eleven damage levels, ranging from 1% to 90% thickness loss.

A Level I SHM system [9] is developed to discriminate between two states, one corresponding to the intact plate (healthy state), and the other to the corroded plate (damaged state). This binary classification task is treated using a detector based on the maximum likelihood ratio test. Τhe optimal feature vector, with respect to detector performance, is then chosen via an optimization strategy that optimizes both the sensor topology and the measured strain component.

The problem is numerically studied on a clamped square plate (a = 1000 mm and t0 = 10 mm) as shown in Fig. 1a. A non-uniform pressure profile, q(x, y), is considered with the mathematical description provided in Eq. (1).

$$q(x,y;Q_{0} ,X_{0} ,Y_{0} ) = Q_{0} \exp \left[ { - \frac{1}{2}\left( {\frac{{x - X_{0} }}{{\sigma_{{x_{0} }} }}} \right)^{2} } \right]\exp \left[ { - \frac{1}{2}\left( {\frac{{y - Y_{0} }}{{\sigma_{{y_{0} }} }}} \right)^{2} } \right]$$
(1)

Its variability (load uncertainty) is controlled through the amplitude Q0 at the controlled location \((X_{0} ,\,Y_{0} )\). These quantities are considered as normal RVs, with \(Q_{0} \,\, \sim N(q_{0} ;\mu_{{Q_{0} }} ,\sigma_{{Q_{0} }}^{2} )\), \(X_{0} \sim N(x_{0} ;\mu_{{x_{0} }} ,\sigma_{{x_{0} }}^{2} )\) and \(Y_{0} \,\,\, \sim N(y_{0} ;\mu_{{y_{0} }} ,\sigma_{{y_{0} }}^{2} )\). Parameters \(\mu_{{Q_{0} }} ,\,\sigma_{{Q_{0} }}\) are selected such that the plate remains within its elastic range. To constrain the non-zero magnitude area within the extreme locations of the plate, parameters \(\mu_{{x_{0} }}\) and \(\mu_{{y_{0} }}\) are taken equal to zero and parameters \(\sigma_{{x_{0} }}\) and \(\sigma_{{y_{0} }}\) are taken equal to a/12 (statistically symmetric loading). Some indicative realizations of X0, Y0 pairs are plotted in Fig. 1b.

Fig. 1.
figure 1

Considered problem (a) and realizations of the pressure’s profile peak (X0, Y0) (b).

3 Numerical Simulations

Strain measurements are numerically obtained using Finite Element Analysis (FEA). A mesh consisting of 2500 linear (4-node) shell elements was used and a linear elastic and isotropic material (E = 200 GPa, v = 0.3) was assigned. Εvery load realization (X0 = x0, Y0 = y0 and Q0 = q0) represents a virtual statistical experiment, as indicatively presented in Fig. 2.

Fig. 2.
figure 2

Indicative pressure distribution (a) and corresponding deflection of the plate (b).

Figure 3 presents an indicative strain distribution of the plate for the healthy state. The form of the response is similar for the damaged state due to the nature of the load. Eleven different damage states are investigated and simulated. Εach damaged condition results from the simulation of a FE model with a different thickness from 1% thickness wastage to 90% unrealistic upper bound.

Monte Carlo Simulations (MCS) were employed as the means for forward uncertainty quantification. Latin hypercube sampling is employed to generate independent realizations of Q0, X0 and Y0. Each set corresponds to a numerical statistical test (deterministic FEA). Convergence was achieved at ~104 samples. The three in-plane strain components (axial εxx, transverse εyy and shear εxy) of the top surface and at the element center (gauss point location) were stored for each sample.

Fig. 3.
figure 3

Contour plot of strain εxx (a), εyy (b) and εxy (c) for the healthy state for the realized case shown in Fig. 2.

4 The Employed Detector

Α detector can be regarded of as the transition from a feature vector to a decision [3]. Only binary decisions will be investigated in this research, where one must normally decide whether features received from a test case belong to the reference condition or to the corroded plate. The likelihood ratio test is the detector that is associated with a hypothesis test where a null hypothesis H0 is defined for the healthy state and the alternative hypothesis H1 is defined for a modeled damaged state, as follows:

$$\,H_{0} :{\mathbf{x}} \sim f_{{\text{H}}} ({\mathbf{x}};{{\varvec{\uptheta}}}_{{\text{H}}} ) = N({\mathbf{x}};{{\varvec{\upmu}}}_{{\text{H}}} ,{{\varvec{\Sigma}}}_{{\text{H}}} )$$
(2)
$$\,H_{1} :{\mathbf{x}} \sim f_{{\text{D}}} ({\mathbf{x}};{{\varvec{\uptheta}}}_{{\text{D}}} ) = N({\mathbf{x}};{{\varvec{\upmu}}}_{{\text{D}}} ,{{\varvec{\Sigma}}}_{{\text{D}}} )$$
(3)

where \({\mathbf{x}}\) is the n-dimensional feature vector \({\mathbf{x}} = \{ x_{1} ,x_{2,} ..,x_{n} \}^{T}\) that holds the strain components registered from a specific sensor network design. The random vector is considered to follow the multinormal distribution with a parameter vector \({{\varvec{\uptheta}}}_{{\text{H or D}}}\), that incorporates the mean vector \({{\varvec{\upmu}}}_{{\text{H or D}}}\) and the covariance matrix \({{\varvec{\Sigma}}}_{{\text{H or D}}}\). A test feature vector, \({\mathbf{x}}\), is associated with the damaged state (alternative hypothesis) if the likelihood ratio is larger than a threshold \(\gamma\), as stated below:

$$\,L\left( {\mathbf{x}} \right) = \frac{{N({\mathbf{x}};{{\varvec{\upmu}}}_{{\text{D}}} ,{{\varvec{\Sigma}}}_{{\text{D}}} )}}{{N({\mathbf{x}};{{\varvec{\upmu}}}_{{\text{H}}} ,{{\varvec{\Sigma}}}_{{\text{H}}} )}} > \gamma$$
(4)

The likelihood ratio test is written as:

$$L({\mathbf{x}}) = \frac{{\sqrt {\det {{\varvec{\Sigma}}}_{{\mathbf{H}}} } \exp \left[ { - \frac{1}{2}\left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{D}}} } \right)^{T} {{\varvec{\Sigma}}}_{{\mathbf{D}}}^{ - 1} \left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{D}}} } \right)} \right]}}{{\sqrt {\det {{\varvec{\Sigma}}}_{{\mathbf{D}}} } \exp \left[ { - \frac{1}{2}\left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{H}}} } \right)^{T} {{\varvec{\Sigma}}}_{{\mathbf{H}}}^{ - 1} \left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{H}}} } \right)} \right]}}$$
(5)

By taking the logarithm of both sides and rearranging the terms, the detector (hypothesis test) is expressed in terms of a test statistic, \(T({\mathbf{x}})\), that is in turn associated with a new threshold \(\gamma ^{\prime}\). The decision rule is:

$$\left\{ {\begin{array}{*{20}c} {{\text{decide}}\;H_{1} ,\,\,{\text{if}}\,\,T({\mathbf{x}}) > \log \gamma + \frac{1}{2}\log \frac{{\det {{\varvec{\Sigma}}}_{\rm H} }}{{\det {{\varvec{\Sigma}}}_{{\mathbf{D}}} }} \equiv \gamma ^{\prime}} \\ {{\text{decide}}\;H_{0} ,\,\,{\text{if}}\,\,T({\mathbf{x}}) < \log \gamma + \frac{1}{2}\log \frac{{\det {{\varvec{\Sigma}}}_{\rm H} }}{{\det {{\varvec{\Sigma}}}_{{\mathbf{D}}} }} \equiv \gamma ^{\prime}} \\ \end{array} } \right.$$
(6)

Magnitude \(T({\mathbf{x}})\) is a univariate random variable defined as follows:

$$T({\mathbf{x}}) \equiv \frac{{\left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{H}}} } \right)^{T} {{\varvec{\Sigma}}}_{{\mathbf{H}}}^{ - 1} \left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{H}}} } \right)}}{2} - \frac{{\left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{D}}} } \right)^{T} {{\varvec{\Sigma}}}_{{\mathbf{D}}}^{ - 1} \left( {{\mathbf{x}} - {{\varvec{\upmu}}}_{{\mathbf{D}}} } \right)}}{2}$$
(7)

The detector’s performance is measured in terms of the probability of detection PD:

$$P_{{\text{D}}} = \Pr \left\{ {T({\mathbf{x}}) > \gamma ^{\prime}\,;H_{1} } \right\} = \int\limits_{\gamma }^{ + \infty } {f_{{\text{D}}} ({\mathbf{x}};H_{1} )d{\text{x}}}$$
(8)

and the probability of false alarm, PFA, given by the following equation:

$$P_{{{\text{FA}}}} = \Pr \left\{ {T({\mathbf{x}}) > \gamma ^{\prime}\,;H_{0} } \right\} = \int\limits_{\gamma }^{ + \infty } {f_{{\text{H}}} ({\mathbf{x}};H_{0} ){\text{dx}}}$$
(9)

Under the Neyman-Pearson (NP) framework, given a specific value for PFA, the threshold for a selected sensor topology, i.e. given θH and θD, can be obtained by minimizing an error functional defined by comparing the constant \(P_{{{\text{FA}}}}\) with the numerically computed version of the same, \(P_{{{\text{FA}}_{{{\text{MC}}}} }}\), through MCS [3]:

$$\gamma ^{\prime}_{MC} = \arg \min\nolimits_{\gamma } (\ln ((P_{FA} - P_{{FA_{MC} }} (\gamma ))^{2} ))$$
(10)

5 Statistical Analysis of the Strain Response

The present chapter presents a statistical exploration of the strain pattern over the plate. The strain data set is extracted from the healthy state (H0) and eleven strain datasets from each corresponding damaged state (H1).

Every damaged state is considered as a distinct problem. A 80/20 sample ratio was considered for training/testing purposes. Some important statistics are computed based on a thickness loss of 20% of the nominal thickness of the plate, as a major corrosion scenario that reflects to a loss corresponding to the thickness margin introduced to marine plates for corrosion wastage purposes over the structure’s design life (20–25 years). Figure 4 presents contour maps of the mean shift between the undamaged and damaged states, over the plate domain for each strain component. The difference between mean values is proportional to the corrosion wastage level. The shift between sample standard deviations follows a similar pattern. The ratio of the standard deviation of the healthy state and the standard deviation of the damaged state is almost constant over the plate. The ratio increases with higher levels of corrosion and reduces with lower levels.

Fig. 4.
figure 4

Difference between the MC estimates of the expected values of εxx (a), εyy (b) and εxy (c) in [με] for 20% thickness wastage (damaged) and the reference (healthy) state.

6 Detector Results

6.1 Detector Performance Assessment

The detector’s performance is assessed for the 20% thickness wastage case (strain pattern in Fig. 4). A single strain component (εxx/εyy/εxy) is first selected as the unique informative feature (univariate problem). The element-wise evaluation of the performance measures (PD and PFA) is presented in Fig. 5 for a threshold γ = 1 used in Eq. (4). These plots reveal information regarding the effect of the location for sensor placement to the detector’s performance. There is a clear association between the regions where a high mean shift is recognized in Fig. 4 and the zones that exhibit high PD in Fig. 5. It is evident though, that PD and PFA maximize (or minimize) at the same zones within the plate, regardless the selected feature. It is noteworthy that we are aiming at a sensor design that maximizes PD and minimizes PFA, in the Neyman-Pearson setting. Nevertheless, an optimally placed single sensor yields poor performance metrics (PD ≈ 65% and PFA ≈ 21%), indicating the statistical complexity of the problem at hand.

Fig. 5.
figure 5

Distribution of detector’s performance over the plate for a univariate (1d) feature vector. Subplots εxx (a), εyy (b) and εxy (c) present measure PD and subplots εxx (d), εyy (e) and εxy (f) present the corresponding measure PFA. Paired subplots (a)–(d), (b)–(e) and (c)–(f) are to be considered for examination.

On an effort to increase the information content, all three strain components at each element are considered to construct the feature vector (tri-variate problem). The corresponding distribution of the performance metrics over the domain is presented in Fig. 6. A correlation between the distribution of the statistics obtained from single features (Fig. 4) and the detector’s performance distribution in Fig. 6 is intractable. Although, an increased performance is observed in some limited elements located at the boundary of the plate, this region is considered singular and is preferred to be disregarded. The remaining smooth pattern within the plate is the pool of candidate locations for sensor placement. However, the information gain from all strain components is marginal (PD ≈ 69.8% and PFA ≈ 18.7%).

Fig. 6.
figure 6

Distribution of detector’s performance over the plate for a tri-variate element-wise (3d) feature vector: PD (a) and PFA (b).

Figure 7 shows the aggregated detection performance metrics for each selected feature vector (univariate or tri-variate case) with respect to the examined corrosion levels. Each point corresponds to the maximum PD obtained within the plate and the corresponding PFA, which does not correspond to the Neyman-Pearson optimal detector, as prescribed by Eq. (10). It is evident that detection accuracy breaks the 50% threshold for corrosion levels higher than 10%. For corrosion wastage above 40%, the detection scheme becomes more powerful achieving a PD of 87%.

Fig. 7.
figure 7

Maximum probability of detection and corresponding false alarm rate per corrosion level.

6.2 Determining an Optimal Sensor Network

This section focuses on the determination of the optimal sensor topology and the corresponding strain component that maximizes the detector’s performance. This is a constrained optimization problem and is tackled through the employment of Genetic Algorithms (GA). The aim is to arrive at a N-dimensional feature vector (holding the design variables), x, corresponding to N sensors together with the specific strain component, k = xx, yy or xy measured at a location.

$${\mathbf{x}} = \left\{ {x_{1}^{k} , \ldots ,x_{j}^{k} \ldots x_{N}^{k} } \right\}$$
(11)

The features are selected from a database holding (2500 × 3 × 104) entries: (number of elements x number stain components x MC samples) for the healthy and the damaged state (20% corrosion). For each design vector, the parameters \({{\varvec{\uptheta}}}_{{\text{H or D}}}\) are calculated and used within Eq. (6) and Eq. (7) for detection performance evaluation. The fitness function was defined as:

$${\mathbf{x}}_{{{\text{opt}}}} = \arg \max\nolimits_{{\mathbf{x}}} \left( {P_{{\text{D}}} \left( {\mathbf{x}} \right)} \right)$$
(12)

The results from four optimization cases are presented in Fig. 8. Initially a case were ten sensors measuring a single strain component each was examined. The corresponding feature vector is described by a 10-dimensional multinormal distribution. The problem is solved over the entire plate domain with a resulting max(PD) equal to 99.8% and corresponding PFA equal to ~0%. The optimal sensor topology is presented in Fig. 8a. However, running the optimization by excluding the singular boundary zones (Fig. 6), detection performance drops to 92.4% for max(PD) and increases to ~2.2% for PFA. The optimal position for each single sensor is located at high PD zones over the plate according to Fig. 5.

Fig. 8.
figure 8

Optimal positions of single sensors (a) with 3 zones and (b) without 3 zones and positions of rosettes for 20% corrosion (c) with 3 zones and (d) without 3 zones.

In order to enhance the information content of the feature vector, all three strain components were considered at each candidate location, resulting at a 30-dimensional multinormal statistical description. The number of locations for sensor placement remains the same (N = 10). This optimization problem is formulated as in the previous cases (we maximize PD and evaluate the corresponding PFA – non NP) and additionally under the NP setting, where we maximize PD for a given PFA (NP). Equation (10) has been employed for implementing the NP optimal detector and the optimization problem was solved through the employment of GAs. Respective sensor topologies are presented in Fig. 8c–d. The performance for the non-NP cases is max(PD) = 99.9%/PFA =  ~0% and max(PD) = 98.4%/PFA = 0.4%, for the entire (Fig. 8c) and the reduced plate domain (Fig. 8d), respectively. On the other hand, the performance of the NP detector for PFA = 1% has arrived at a max(PD) = 99.9% by optimizing over the entire domain and at a max(PD) = 93.7% by optimizing over the reduced domain. It is evident that the boundary zones that are considered as singular, include highly informative content, however, as aforementioned, these locations may be sensitive to error and inaccuracies arising from e.g. sensor misplacement. The sensor locations are distributed over the plate when the boundary zones are included compared to the sensor concentration in their absence (Fig. 8c–d).

7 Conclusions

This study investigated the statistical pattern recognition for optimal sensor placement in damage detection due to thickness loss. The behavior of a detector under NP framework was examined for all possible sensor placement positions and different levels of corrosion. It proved that the number, locations, and measuring features of the sensors could not be selected manually, as such a choice is not immediately understandable. Thus, the selection of an appropriate optimization technique is considered necessary. A GA is used to locate optimal – in terms of PD – combinations of sensor locations and strain components measured for a constant number of sensors, in a short computational time, compared to the individual performance of the detector. The solved problem could be extended to investigate the minimum number of sensors that could be placed.