INTRODUCTION

Weak signals are extremely weak and difficult to be detected, and its detection technology has broad application prospects in radar, communication transmission, sea clutter processing and seismic detection [14]. The traditional weak signal detection methods mainly include stochastic resonance method [5, 6], lock-in amplifier method [7, 8] and Duffing oscillator method [9, 10], but these methods is basically invalid for weak signal detection under chaotic background, and has a high threshold for the signal-to-noise ratio (SNR) of weak signal detection. With the development of chaos theory, the method of combining chaos theory with weak signal does not need to consider the distribution characteristics of interference, and it also does not need a lot of prior knowledge to effectively achieve a lower SNR threshold [1114]. In addition, the pulse signal is a typical signal in communication, seismic detection and fault diagnosis and other fields [15, 16], so effectively improving the detection ability of weak pulse signal under chaotic background is of great significance to reduce the detection cost of equipment and enhance the anti-interference ability of detection system.

Therefore, scholars from home and abroad have carried out extensive research on the detection of weak pulse signal under chaotic background. Some nonlinear methods such as least squares support vector machine, Elman neural network and so on are used to detect weak pulse signals under chaotic background [1722], these methods have certain robustness and generalization ability, but the selection of the original data and factors has a great impact on the prediction results, and fail to use the chaotic nature, resulting in the low reliability of the prediction [12, 23]. Is there any other way? Broad learning system is a neural network structure independent of depth structure. Compared with “depth” structure, “width” structure is very concise because there is no coupling between layers [24]. Based on the broad learning method, a structured manifold width learning system improved the fitting effect of large-scale chaotic data [25]. Besides, a robust manifold broad learning system further improved the generalization ability and robustness of the model, and demonstrated that the model is superior to other broad learning models for chaotic data prediction [26]. In addition, the fusion model based on principal component analysis and broad learning algorithm is feasible in the fault diagnosis of rotor system [27]. In practice, since the broad learning method was proposed, it has been widely used in image classification, pattern recognition, EEG signal processing and other fields, it’s a promising machine learning algorithm [2833].

According to the broad learning method could better describe the properties of nonlinear characteristics, and in order to realize intelligence statistical detection, this paper proposed the robust manifold width adaptive detection (RMWAD) model, which is applied to obtain one-step prediction error, hypothesis testing is used to realize weak pulse signal detection under chaotic noise background.

The remainder of the paper is organized as follows. In Section 1, we provide the background for broad learning system and hypothesis test of weak pulse signal. In Section 2, we present the detection model. In Section 3, we illustrate the simulation experiments of the detection model. We summarize the results of simulation experiments in Conclusion section.

1 DESCRIPTION OF CONCEPTS

1.1 Broad Learning System

Broad learning system (BLS) based on random vector function connection network (RALFNN) is a neural network, the mainly difference is that BLS model extracts the features from original data and take the features as the input layer compared with RALFNN [24, 34]. Supposing there are \(n\) kinds of mapping methods, and each method has \(q\) nodes, we consider the mapped feature

$${\mathbf{Z}} = \varphi \left( {{\mathbf{SW}} + {\boldsymbol{\mathbf{\beta }}}} \right),$$
(1)

where \(\varphi (.){{\;}}\) is activation function, \({\mathbf{S}} \in {{{\mathbf{R}}}^{{r \times m}}}\) is the input data, \(r\) is the number of samples, \(m\) is the dimension of samples, \({\mathbf{Z}} = \left\{ {{{{\mathbf{Z}}}_{i}},i = 1,2, \ldots ,n} \right\}\), \({\mathbf{W}} = \left\{ {{{{\mathbf{W}}}_{i}},i = 1,2, \ldots ,n} \right\}\), \({{{\mathbf{W}}}_{i}} \in {{{\mathbf{R}}}^{{m \times q}}}\) is the weight of mapping features, \({\boldsymbol{\mathbf{\beta }}} = \left\{ {{{{\boldsymbol{\mathbf{\beta }}}}_{i}},i = 1,2, \ldots ,n} \right\}\) is bias.

The following are similar with RALFNN. Supposing there are \(p\) enhancement nodes, then we could receive enhanced information

$${\mathbf{H}} = \zeta \left( {{\mathbf{Z}}{{{\mathbf{W}}}_{{{\text{up}}}}} + {{{\boldsymbol{\mathbf{\beta }}}}_{{{\text{up}}}}}} \right),$$
(2)

where \({\mathbf{H}} = \left\{ {{{{\mathbf{H}}}_{j}},j = 1,2, \ldots ,p} \right\}\), \(~{{{\mathbf{W}}}_{{{\text{up}}}}},~{{{\boldsymbol{\mathbf{\beta }}}}_{{{\text{up}}}}}~\) are generated randomly, and \(\zeta ({{\;}}.{{\;)}}\) is activation function.

We could receive the output of BLS model

$${\mathbf{Y}} = \left[ {\left. {\mathbf{Z}} \right|{\kern 1pt} {\mathbf{H}}} \right]{{{\mathbf{W}}}_{{{\text{out}}}}},$$
(3)

where \({{{\mathbf{W}}}_{{{\text{out}}}}}\;\) are obtained by ridge regression algorithm [35].

Based on the BLS model, many other great new models are proposed, such as fuzzy broad learning model, structured manifold broad learning model and robust manifold broad learning model, these models have achieved satisfactory performance on chaotic time series, especially robust manifold broad learning model has great robustness and generalization ability, and is superior to other broad learning methods for chaotic time series prediction [2426, 36].

1.2 Hypothesis Test of Weak Pulse Signal

The weak pulse signal could be transformed in the chaotic background noise

$$\begin{array}{*{20}{c}} {{{H}_{0}}:s\left( t \right) = n\left( t \right) + e\left( t \right),} \\ {{{H}_{1}}:s\left( t \right) = n\left( t \right) + u\left( t \right) + e\left( t \right),} \end{array}~$$
(4)

where \(~s\left( t \right)~\) represents the observation signal, \(n\left( t \right)~\) represents the chaotic signal, \(u\left( t \right)~\)represents the weak pulse signal, \(e\left( t \right)~\) represents Gaussian white noise.

The weak pulse signal is often submerged in the chaotic noise, if the hypothesis test is directly inferred by equation (4), it’s prone to high false positive [17]. But we could obtain the estimation of the chaotic noise based on its chaotic nonlinear characteristics, and then the hypothesis testing problem is further transformed

$$\begin{array}{*{20}{c}} {{{H}_{0}}:s\left( t \right) - n\left( t \right) = e\left( t \right)~,} \\ {{{H}_{1}}:s\left( t \right) - n\left( t \right) = u\left( t \right) + e\left( t \right),} \end{array}$$
(5)

when \({{\;}}{{H}_{0}}{{\;}}\) is true, it shows that there is only white Gaussian noise in the observed signal, and there is no weak pulse signal; when the \(~{{H}_{1}}{{\;}}\) is true, it shows that there is a weak pulse signal in the observation signal.

In this paper, we consider construct a robust manifold width dynamic adaptive detection (RMWAD) model based on broad learning methods to detect the weak pulse signal from prediction error. The specific idea framework of this paper is shown in Fig. 1.

Fig. 1.
figure 1

The specific idea framework of weak pulse signal detection.

2 DETECTION MODEL

2.1 RMWAD Model

According to the Section 1, we need to obtain the estimation of the chaotic noise, then the prediction error could be computed, and the hypothesis test (5) is used to detect the weak pulse signal, so the detection model RMWAD is mainly divided into the following three steps: first, reconstructing the one dimensional observation signal into a high dimensional matrix; then, establishing the RMWAD model to obtain the one-step prediction error; finally, detecting weak pulse signal from prediction error by the hypothesis testing.

2.2 Reconstruction Observation Signal

Phase space is used to represent all possible states of a system. The dimension of nonlinear system is usually relatively high, but the time series collected in practice are generally one-dimensional and difficult to reflect the high-dimensional space of nonlinear system [37, 38]. So we need to extend the collected time series to a higher dimensional phase space. For the observation signal \(\left\{ {s\left( t \right),t = 1,2, \ldots ,n} \right\}\), we could receive its phase in high-dimensional space

$${\mathbf{S}}\left( t \right) = \left( {s\left( t \right),s\left( {t - \tau } \right), \ldots ,s\left( {t - \left( {m - 1} \right)\tau } \right)} \right),$$
(6)

where \(t = {{n}_{1}},\,\,{{n}_{1}} + 1,\,\, \ldots ,\,\,n;\,\,~{{n}_{1}} = 1 + \left( {m - 1} \right)\tau ;\) \(m\) is embedding dimension obtained by the method proposed by Cao [39], \(\tau \) is the delay time obtained by the auto-correlation method [40].

According to Takens’ theorem [37], If we could find a smooth mapping \(h{\kern 1pt} :{{{\mathbf{R}}}^{m}} \to {\mathbf{R}}{{\;}}\) or an estimation \(\hat {h}\) of \(h\), then the data of the next time could be predicted, that is \(s\left( {t + 1} \right) = h\left( {{\mathbf{S}}\left( t \right)} \right)\,\,{\text{or}}\,\,s\left( {t + 1} \right) \approx \hat {h}\left( {{\mathbf{S}}\left( t \right)} \right)\). Therefore, we could construct the RMWAD model [2426] as the approximate mapping \(\hat {h}\) to receive one-step prediction.

2.3 RMWAD Model

For the reconstructed observation signal \({\mathbf{S}}\left( t \right)\), supposing that the feature layer of the RMWAD model has \(n\) kinds of mapping methods, each method has \(q\) nodes, and then we could receive the mapped feature (1).

Because the time series in chaotic dynamic system usually evolve according to a certain manifold, it is necessary to represent the reconstructed observation signal by manifold [41], and we could express the objective function of manifold feature mapping.

$$\begin{array}{*{20}{c}} {\min {\text{tr}}\left( {{{{\left( {{\mathbf{SW}}} \right)}}^{T}}{\mathbf{D}}\left( {{\mathbf{SW}}} \right)} \right) + \alpha \left\| {{\mathbf{SW}} - {\mathbf{B}}} \right\|_{F}^{2} + \gamma {{{\left\| {\mathbf{W}} \right\|}}_{{2,1}}},} \\ {{\text{s}}.{\text{t}}.~{{{\left( {{\mathbf{SW}}} \right)}}^{T}}{\mathbf{G}}\left( {{\mathbf{SW}}} \right) = {\mathbf{{\rm I}}},} \end{array}$$
(7)

where \({\text{tr}}\) denotes the operation of finding the trace of a matrix; \({\mathbf{W}}~\) is the weight in the feature layer; \(\alpha \) and \(\gamma \) are regularization parameters; \({\mathbf{B}} \in {{{\mathbf{R}}}^{{r \times q}}}\) is randomly generated perturbation matrix; \(r\) is the number of rows of \({\mathbf{S}}\); \(q\) is the number of the mapping feature; \({\mathbf{D}} = {{{\mathbf{G}}}^{{ - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0em} 2}}}}\left( {{\mathbf{G}} - {\mathbf{Q}}} \right){{{\mathbf{G}}}^{{ - {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0em} 2}}}}\) is the normalized graph Laplacian matrix; \({\text{s}}.{\text{t}}.\) stands for the phrase of “subject to”; \({\mathbf{I}}\) is the unit matrix; T denotes the operation of transpose; \({\mathbf{G}}{{\;}}\) is the diagonal matrix denoted as \({{G}_{{ii}}}\) = \(\sum\nolimits_i {{{Q}_{{ij}}}} \), where

$${{Q}_{{ij}}} = \left\{ \begin{gathered} \exp \left( { - \frac{{{{{\left\| {{{{\mathbf{s}}}_{i}} - {{{\mathbf{s}}}_{j}}} \right\|}}^{2}}}}{{{{\sigma }^{2}}}}} \right),~\,\,{{{\mathbf{s}}}_{i}} \in {{N}_{k}}\left( {{{{\mathbf{s}}}_{j}}} \right),\,\,{{{\mathbf{s}}}_{j}} \in {{N}_{k}}\left( {{{{\mathbf{s}}}_{i}}} \right), \hfill \\ 0,\,\,\,\,{\text{otherwise}}, \hfill \\ \end{gathered} \right.$$
(8)

where \({{N}_{k}}\left( {\mathbf{S}} \right)\) represents the k nearest neighbor sets of \({\mathbf{S}}\).

\({\mathbf{S}}\) is reconstructed observation signal, \({\mathbf{D}}~\) and \({\mathbf{G}}\) could be obtained by \({\mathbf{S}}\) and \({{Q}_{{ij}}}\). But the objective function (7) is the non-convex and non-smooth function with generalized orthogonal constraints and it’s difficult to be optimized. Therefore, we need the following methods to optimize the objective function (7):

First, we use separating variables method [42] to obtain the new objective function

$$~\begin{array}{*{20}{c}} \begin{gathered} \min {\text{tr}}\left( {{{{\left( {{\mathbf{SW}}} \right)}}^{T}}{\mathbf{D}}\left( {{\mathbf{SW}}} \right)} \right) \\ + \,\,{{\alpha }}\left\| {{\mathbf{SW}} - {\mathbf{B}}} \right\|_{F}^{2} + {{\gamma }}{{\left\| {\mathbf{R}} \right\|}_{{2,1}}} + \xi \left( {\mathbf{H}} \right), \\ \end{gathered} \\ {{\text{s}}.{\text{t}}.~\,{\mathbf{H}} - {{{\mathbf{G}}}^{{{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0em} 2}}}}{\mathbf{SW}} = 0,\,\,\,\,~{\mathbf{R}} - {\mathbf{W}} = 0,} \end{array}$$
(9)

where \(\xi \left( {\mathbf{H}} \right) = \left\{ \begin{gathered} 0,\,\,\,\,{{{\mathbf{H}}}^{T}}{\mathbf{H}} = {\mathbf{{\rm I}}}, \hfill \\ \infty ,\,\,\,\,{\text{otherwise}}, \hfill \\ \end{gathered} \right.\)

and

$$~{{\left\| {\mathbf{R}} \right\|}_{{2,1}}} = \mathop \sum \limits_{i = 1}^m \sqrt {\mathop \sum \limits_{i = 1}^q R_{{ij}}^{2}} .$$
(10)

Second, we could receive the augmented Lagrange function of the objective function (9)

$$\begin{gathered} {\mathbf{L}}\left( {{\mathbf{W}},{\mathbf{R}},{\mathbf{H}},{\mathbf{\lambda }}1,{\mathbf{\lambda }}2;\rho } \right) \\ = {\text{tr}}\left( {{{{\mathbf{W}}}^{T}}{{{\mathbf{S}}}^{T}}{\mathbf{DSW}}} \right) + \alpha \left\| {{\mathbf{SW}} - {\mathbf{B}}} \right\|_{F}^{2} \\ + \,\,\gamma {{\left\| {\mathbf{R}} \right\|}_{{2,1}}} + \xi \left( {\mathbf{H}} \right) + \frac{\rho }{2}\left\| {{\mathbf{R}} - {\mathbf{W}} + \frac{{{\mathbf{\lambda }}1}}{\rho }} \right\|_{F}^{2} \\ + \,\,\frac{\rho }{2}\left\| {{\mathbf{H}} - {{{\mathbf{G}}}^{{{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0em} 2}}}}{\mathbf{SW}} + \frac{{{\mathbf{\lambda }}2}}{\rho }} \right\|_{F}^{2}. \\ \end{gathered} $$
(11)

Third, according to the augmented Lagrange function (11), the iterative equations [26, 43, 44] are as follows

$$\begin{gathered} \hfill \\ \left\{ \begin{gathered} {{{\mathbf{W}}}_{k}} = {{\left( {{\mathbf{part}}} \right)}^{{ - 1}}}\left( {\left( {2\alpha {{{\mathbf{S}}}^{T}}{\mathbf{B}} + \rho \left( {{{{\mathbf{R}}}_{k}} + \frac{{{\mathbf{\lambda }}{{1}_{k}}}}{\rho }} \right)} \right) + \rho {{{\mathbf{S}}}^{T}}{{{\mathbf{G}}}^{{{T \mathord{\left/ {\vphantom {T 2}} \right. \kern-0em} 2}}}}\left( {{{{\mathbf{H}}}_{k}} + \frac{{{\mathbf{\lambda }}{{2}_{k}}}}{\rho }} \right)} \right),\,\,\,\,~\rho > 0, \hfill \\ {{{\mathbf{R}}}_{{k + 1}}} = \frac{{{\text{max}}\left\{ {{{{\left\| {\mathbf{v}} \right\|}}_{2}} - \tau ,0} \right\}}}{{{\text{max}}\left\{ {{{{\left\| {\mathbf{v}} \right\|}}_{2}} - \tau ,0} \right\} + \tau }}\left( {{{{\mathbf{W}}}_{k}} - \frac{{{\mathbf{\lambda }}{{1}_{k}}}}{\rho }} \right),~\,\,\,\,\tau = \frac{\gamma }{\rho }, \hfill \\ {{{\mathbf{H}}}_{{k + 1}}} = {{\left( {{\mathbf{{\rm I}}} + \frac{\rho }{2}{\mathbf{L}}} \right)}^{{ - 1}}}\left( {{\mathbf{{\rm I}}} - \frac{\rho }{2}{\mathbf{L}}} \right){{{\mathbf{H}}}_{k}}, \hfill \\ \lambda {{1}_{{k + 1}}} = \lambda {{1}_{k}} + \rho \left( {{{{\mathbf{R}}}_{k}} - {{{\mathbf{W}}}_{k}}} \right), \hfill \\ {\mathbf{\lambda }}{{2}_{{k + 1}}} = {\mathbf{\lambda }}{{2}_{k}} + \rho \left( {{{{\mathbf{H}}}_{k}} - {{{\mathbf{G}}}^{{{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0em} 2}}}}{\mathbf{S}}{{{\mathbf{W}}}_{k}}} \right), \hfill \\ \end{gathered} \right. \hfill \\ \end{gathered} $$
(12)

where \({\mathbf{part}} = 2{{{\mathbf{S}}}^{T}}{\mathbf{DS}} + 2\alpha {{{\mathbf{S}}}^{T}}{\mathbf{S}} + \rho {\mathbf{{\rm I}}} + \rho {{{\mathbf{S}}}^{T}}{\mathbf{GS}}\), \({{{\mathbf{R}}}_{k}} \in {{{\mathbf{R}}}^{{m \times q}}}\), \({{{\mathbf{H}}}_{k}} \in {{{\mathbf{R}}}^{{r \times q}}},\,\,{{{\mathbf{R}}}_{1}}{{\;}}\) and \({{{\mathbf{H}}}_{1}}{{\;}}\) are generated randomly, \({\mathbf{\lambda }}{{1}_{k}} \in {{{\mathbf{R}}}^{{m \times q}}}\), \({\mathbf{\lambda }}{{2}_{k}} \in {{{\mathbf{R}}}^{{r \times q}}}\); \({\mathbf{\lambda }}{{1}_{1}}\,\,{\text{and}}\,\,{\mathbf{\lambda }}{{2}_{1}}{{\;}}\) are zero matrix. \({\mathbf{L}} = {{{\mathbf{G}}}^{{{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0em} 2}}}}{\mathbf{S}}{{{\mathbf{W}}}_{{\text{k}}}} - {{{\mathbf{\lambda }}{{2}_{k}}} \mathord{\left/ {\vphantom {{{\mathbf{\lambda }}{{2}_{k}}} \rho }} \right. \kern-0em} \rho }\); \({\mathbf{v}}\) is the row vector of \({{{\mathbf{W}}}_{k}} - {{{\mathbf{\lambda }}{{1}_{k}}} \mathord{\left/ {\vphantom {{{\mathbf{\lambda }}{{1}_{k}}} \rho }} \right. \kern-0em} \rho }\).

Finally, we use the alternating direction multiplier method [45] to solve the variables in iterative equations (12) to receive the optimal weight \({\mathbf{W}}\) of the feature layer, and then the mapping feature \({\mathbf{Z}}\) could be obtained by bringing \({\mathbf{W}}\) into the mapped feature (1).

After extracting \({\mathbf{Z}}\) from the feature layer, supposing there are \(p\) enhancement nodes, and we could receive the enhancement information \({\mathbf{H}}\) by bringing \({\mathbf{Z}}\) into the enhanced information (2).

The augmented input \(\left[ {\left. {\mathbf{Z}} \right|{\kern 1pt} {\mathbf{H}}} \right]\) of the feature layer and the enhancement layer is obtained, and then the output of RMWAD model is \(\left[ {\left. {\mathbf{Z}} \right|{\kern 1pt} {\mathbf{H}}} \right]{{{\mathbf{W}}}_{{{\text{out}}}}}\), and then we could receive the relationship between the predicted value and the true value

$$s\left( {t + 1} \right) \approx \left[ {\left. {\mathbf{Z}} \right|{\kern 1pt} {\mathbf{H}}} \right]{{{\mathbf{W}}}_{{{\text{out}}}}},$$
(13)

where \({{{\mathbf{W}}}_{{{\text{out}}}}}\) is obtained by ridge regression algorithm [35], \(s\left( {t + 1} \right)\) is real value of prediction value.

Then, we could establish the approximate mapping of reconstructed observation signal

$$s\left( {t + 1} \right) \approx \hat {h}\left( {{\mathbf{S}}\left( t \right)} \right)$$
(14)

and we could receive the one-step prediction error by the relationship between the predicted value and the true value (13).

$$\hat {e}\left( {t + 1} \right) = \hat {h}\left( {{\mathbf{S}}\left( t \right)} \right) - s\left( {t + 1} \right).$$
(15)

Supposing \(\hat {e}\left( {n + 1} \right) \sim N\left( {0,{{\sigma }^{2}}} \right)\), \(n\) is the number of samples, and the hypothesis test is as the follows:

\({{H}_{0}}\): there is no \({{u}_{t}}\) in the observed signal.

\({{H}_{1}}\): there is \({{u}_{t}}\) in the observed signal,

where ut represents the weak pulse signal. Then, we could establish \(T\) statistics

$$T = \frac{{\hat {e}\left( {n + 1} \right) - E\left( {\hat {e}\left( {n + 1} \right)} \right)}}{{{{S}_{{\hat {e}\left( {n + 1} \right)}}}}},$$
(16)

if \({{H}_{0}}\) is true, then \(T \sim t\left( {n - 1} \right)\), and \(\forall {{\;}}\alpha \in \left( {0,1} \right)\), we could receive \(P = \left\{ {\left| T \right| \geqslant \left| {{{t}_{0}}} \right|} \right\}\), where \(\alpha \) is confidence coefficient, and

$${{t}_{0}} = \frac{{\hat {e}\left( {n + 1} \right) - E\left( {\hat {e}\left( {n + 1} \right)} \right)}}{{{{S}_{{\hat {e}\left( {n + 1} \right)}}}}}~.$$
(17)

When \(P < {{\alpha }}\), we could reject the original hypothesis and consider that there is a weak pulse signal in the observation signal, and the hypothesis threshold \({{t}_{{{\alpha \mathord{\left/ {\vphantom {\alpha 2}} \right. \kern-0em} 2}}}}\left( {n - 1} \right)\) could be obtained from the \(t\) distribution table. According to the dynamic threshold, we could realize adaptive detection to make the signal detection result more objective.

In summary, the flow chart of RMWAD model detecting weak pulse signal is shown in Fig. 2.

Fig. 2.
figure 2

The flow chart of the RMWAD model.

3 SIMULATION EXPERIMENTS

In this section, we perform three simulation experiments to verify the feasibility and effectiveness of the RMWAD model. In the experiment, we would use the chaotic background signals generated by the Lorenz system and Rosser system, the signal-to-noise ratio (SNR) is used to evaluate the detection threshold, the mean absolute error (MAE) and root mean square error (RMSE) are used to evaluate the prediction error of the model, and the ROC curve and its area under (AUC), accuracy rate (PR), recall rate (RE), accuracy rate (ACC) and F1 score (F1) are used as the evaluation index of detection results:

$$1)\,\,{\text{SNR}} = 10~{\text{log}}\left( {\frac{{\sigma _{u}^{2}}}{{\sigma _{n}^{2} + \sigma _{e}^{2}}}} \right)~,$$
$$2)\,\,{\text{MAE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^n \left| {\hat {u}\left( t \right) - u\left( t \right)} \right|~,$$
$$3)\,\,{\text{RMSE}} = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^n {{{(\hat {u}(t) - u(t))}}^{2}}} {{\;}},$$
$$4)\,\,{\text{PR}} = \frac{{{\text{TP}}}}{{{\text{TP}} + {\text{FP}}}},$$
$$5)\,\,{\text{RE}} = \frac{{{\text{TP}}}}{{{\text{TP}} + {\text{FN}}}},$$
$${\text{6)}}\,\,{\text{ACC}} = \frac{{{\text{TP}} + {\text{TN}}}}{{{\text{TP}} + {\text{TN}} + {\text{FP}} + {\text{FN}}}},$$
$$7)\,\,{\text{F}}1 = \frac{{2{\text{PR}}\,{{ \times }}\,{\text{RE}}}}{{{\text{PR}} + {\text{RE}}}}$$

where

$$\sigma _{u}^{2} = \frac{1}{n}\mathop \sum \limits_{i = 1}^n \left( {u\left( t \right) - \bar {u}\left( t \right)} \right)$$
(18)

and

$$\sigma _{n}^{2} = \frac{1}{n}\mathop \sum \limits_{i = 1}^n \left( {n\left( t \right) - \bar {n}\left( t \right)} \right),$$
(19)

\(\sigma _{e}^{2}\) is the variance of white noise, \(\sigma _{n}^{2}\) is the variance of chaotic signal, \(\sigma _{u}^{2}\) is the the variance of weak pulse signal, \(u\left( t \right)~\) is wake pulse signal and \(\hat {u}\left( t \right)\) is the prediction value of the RMWAD model to \(u\left( t \right)\). TP refers to the number of times when there is a pulse signal in the observation signal and it is judged to have a pulse signal; TN refers to the number of times when there is no pulse signal and it is judged to have no pulse signal; FP refers to the number of times when there is no pulse signal but it is judged to have a pulse signal; FN refers to the number of times when there is a pulse signal but it is judged to have no pulse signal.

The iterative equation of the Lorenz system is as follows:

$${\text{Lorenz}} = \left\{ \begin{gathered} \dot {x} = \alpha \left( {y - x} \right), \hfill \\ \dot {y} = x\left( {\gamma - z} \right) - y, \hfill \\ \dot {z} = xy - \beta z. \hfill \\ \end{gathered} \right.$$
(20)

The iterative equation of the Rosser system is as follows:

$${\text{Rosser}} = \left\{ \begin{gathered} \dot {x} = - \left( {y + z} \right), \hfill \\ \dot {y} = x + ay, \hfill \\ \dot {z} = b + z\left( {x - c} \right), \hfill \\ \end{gathered} \right.$$
(21)

where \(x,~y,z\) is the function of time. Supposing the initial condition \(x\left( 0 \right) = 1,\,\,y\left( 0 \right) = 1,\,\,z\left( 0 \right) = 1\), the sampling time is \(t = 0.01\;{\text{s}}\), and we could generate 10 000 data points by the fourth-order method, and the first component is taken as the chaotic background. In addition, in order to ensure the complete chaotic state of the system, we could abandon the previous 3000 points and select 4000 continuous sequences as the chaotic background denoted by \(n\left( t \right)\).

The white noise uses the wgn function in Matlab to generate a one-dimensional sequence of 4000 samples that obey the normal distribution denoted by \(e\left( t \right)\), and then, we could use the sum of \(n\left( t \right)\) and \(e\left( t \right)\) as the chaotic noise background, where the mean of the white noise is equal to zero, and the values of \({{\sigma }_{e}}\) are shown in Tables 2 to 5 respectively.

The parameters used in the simulation experiments are shown in Table 1.

Table 1. Experimental parameters

3.1 Experiment 1: Detecting the Existence of Weak Pulse Signal

Suppose that the pulse signal is a single pulse signal, that is

$$\begin{gathered} u\left( t \right) = {{a}_{1}}{{u}_{1}}\left( t \right), \\ u\left( t \right) = \left\{ \begin{gathered} 1,\,\,\,\,t = \left\{ {400,800,1200,2600, \ldots } \right\}, \hfill \\ 0,\,\,\,\,{\text{otherwise}}{\text{,}} \hfill \\ \end{gathered} \right. \\ \end{gathered} $$

we generate a time series with length of 4000, denoted by \(\left\{ {u\left( t \right),t = 1,2, \ldots ,4000} \right\}\), and detect weak pulse signals by using the RMWAD model.

For the Lorenz system, \({{a}_{1}} = 2.5\), \({{\sigma }_{u}} = 0.1248\), \({{\sigma }_{n}} = 7.8640\), \({{\sigma }_{e}} = 0.0096\), \({\text{SNR}} = - 82.8589\;{\text{dB}}\), and then the detection results of are shown in Fig. 3. For the Rosser system, \({{a}_{1}} = 0.4\), \({{\sigma }_{u}} = 0.0200\), \({{\sigma }_{n}} = 5.1025\), \({{\sigma }_{e}} = 8.7768 \times {{10}^{{ - 4}}}\), \({\text{SNR}} = - 110.8599\;{\text{dB}}\), and then the detection results are shown in Fig. 4.

Fig. 3.
figure 3

Detection results of the RMWAD model: (a) Lorenz noise signal; (b) observation signal; (c) the one-step prediction error of observed signal with Lorenz; (d) \(P\) value.

Fig. 4.
figure 4

Detection results of the RMWAD model under the Rosser noise background: (a) chaotic noise signal; (b) received signal; (c) the one-step prediction error; (d) \(P\) value.

Figures 3a, 3b respectively show the chaotic noise signal and observation signal of Lorenz system: according to Fig. 3b, the influence of weak pulse signal on chaotic noise background signal is very small, therefore, the embedding dimension and delay time of the chaotic noise background signal obtained by the same method are the same. Figures 3c, 3d are the one-step prediction error and dynamic threshold \(P\) value obtained by RMWAD model respectively: according to Fig. 3c, it shows that the error of 10 positions is obviously larger, and there may be a weak pulse signal. Combined with Fig. 3d, the \(P\) value of these 10 positions is less than 0.5, it shows that there are weak pulse signals in the observation signal under the Lorenz chaotic noise background based on subsection 1.2 hypothesis test.

The chaotic noise signal and observation signal of the Rosser system respectively are shown in Figs. 4a, 4b. The one-step prediction error and dynamic threshold \(P\) value of RMWAD model respectively is shown in Figs. 4c, 4d: the 10 positions are relatively large and there may be weak pulse signal (see Fig. 4c), but it is not show that the weak pulse signal exist objectively. Therefore, the dynamic threshold obtained by the RMWAD model is further used to judge, the \(P\) value of hypothesis test is less than 0.5 (see Fig. 4d), and according to subsection 1.2 hypothesis test, it objectively shows that the observation signal under the Rosser chaotic noise background also exits weak pulse signal.

3.2 Experiment 2: Detection of Weak Pulse Signal under the Different SNR Condition

In the subsection, we would detect weak pulse signal under different SNR to receive the range of detection of SNR by the RMWAD model. Suppose that the pulse signal is a single pulse signal, that is

$$\begin{gathered} u\left( t \right) = {{a}_{1}}{{u}_{1}}\left( t \right), \hfill \\ u\left( t \right) = \left\{ \begin{gathered} 1,\,\,\,\,t = \left\{ {500,1000,1500,2000, \ldots } \right\}, \hfill \\ 0,\,\,\,\,{\text{otherwise}}{\text{,}} \hfill \\ \end{gathered} \right. \hfill \\ \end{gathered} $$

and we generate a time series with length of 4000, denoted by \(\left\{ {u\left( t \right),t = 1,2, \ldots ,4000} \right\}\). We assume that the period of the weak pulse signal is constant, and the intensity of weak pulse signal is controlled by changing the value of \({{a}_{1}}\), the detailed values of \({{a}_{1}}\) and detection results are shown in Tables 2 and 3.

Table 2 shows that the detailed values of \({{a}_{1}}\) and detection results of the RMWAD model under the Lorenz chaotic noise background. It shows that with the decrease of the SNR, the value of MAE and RMSE become smaller and smaller (see Fig. 5a). According to Table 2, with the decrease of the SNR, the PR is always equal to 1.0, and the ACC remains above 0.99. The RE decreases, but the lowest F1 for the comprehensive evaluation of PR and RE is 0.8, it shows that the RMWAD model could effectively detect the weak pulse signal in \({\text{SNR}} \in \left( { - 103.4120\,\,{\text{dB}},\,\, - 18.4453\,\,{\text{dB}}} \right)\).

Fig. 5.
figure 5

The trend chart of error evaluation index under different SNR: (a) MAE and RMSE under the Lorenz noise background; (b) MAE and RMSE under the Rosser noise background.

Table 3 shows that the detailed values of \({{a}_{1}}\) and detection results of the RMWAD model under the Rosser noise chaotic background. It also shows that the value of MAE and RMSE of the model are getting smaller and smaller with the decrease of the SNR (see Fig. 5b). According to Table 3, the value of PR is always 1.0, the value of ACC remains above 0.99, and the value of F1 is also not less than 0.8, it shows that the model could also detect weak pulse signal well in \({\text{SNR}} \in \left( { - 126.9493\;{\text{dB}},\,\, - 16.5239\;{\text{dB}}} \right).\)

Table 2. Detection results for the Lorenz system
Table 3. Detection results for the Rosser system

3.3 Experiment 3: Comparison of Different Models for Weak Pulse Signal Detection

In the subsection, we compare the RMWAD model with extreme gradient boosting (XGB) model, back propagation (BP) neural network model, broad learning system (BLS) model and support vector regression (SVR) model. It is assumed that the pulse signal is the same single pulse signal as the experiment 2, we use above different models to detect the weak pulse signal under the chaotic noise background of the Lorenz system and Rosser system respectively, use MAE, RMSE to evaluate the prediction error of the model, and use ROC curve, PR, RE, ACC, F1 to evaluate the detection performance.

Table 4. Comparison of different models under different SNR for the Lorenz system
Table 5. Comparison of different models under different SNR for the Rosser system

The detection results of models under the Lorenz system and Rosser system chaotic background are shown in Tables 4 and 5 respectively.

In the Table 4, \({\text{SNR}} = - 103.4120\;{\text{dB}}\), the PR and ACC values of the RMWAD model are above 0.99, and the value of F1 of the comprehensive evaluation of PR and RE is 0.8, AUC (see Fig. 6a) value is over than other models. In addition, the \({\text{SNR}}\) of other models are higher than that of the RMWAD model, but their detection performance is not as good as the RMWAD model.

Fig. 6.
figure 6

ROC curves of different models in the same SNR: (a) \({\text{SNR}} = - 103.4120\,\,{\text{dB}}\) in the Lorenz noise background; (b) \({\text{SNR}} = - 126.9493\,\,{\text{dB}}\) in the Rosser noise background.

In the Table 5, \({\text{SNR}} = - 126.9493\;{\text{dB}}\), the prediction error of the RMWAD model is relatively small than other models, and the ACC and PR values are also above 0.99. The evaluation detection index values of other models are not as good as the RMWAD model. In addition, the AUC values of other models are smaller than the RMWAD model (see Fig. 6b), Therefore, it shows that the RMWAD model has better detection performance than other models under the Rosser chaotic noise background.

CONCLUSIONS

We have established a new RMWAD model to detect weak pulse signal under the chaotic noise background. Considering the short-term predictability and sensitivity to small disturbance of chaotic time series, we construct the RMWAD model combined with phase space reconstruction. We detect weak pulse signal under Lorenz and Rosser chaotic noise background respectively by using the RMWAD model. The Results of the simulation experiments are as follows:

(1) According to experiment 1, there is a wake pulse signal in the observation signal, and the RMWAD model could detect weak pulse signal from the Lorenz and Rosser chaotic noise background respectively.

(2) According to experiment 2, for Lorenz chaotic noise background, it shows that the RMWAD model could effectively detect weak pulse signal in \({\text{SNR}} \in \left( { - 103.4120\,\,{\text{dB}}, - 18.4453\,\,{\text{dB}}} \right)\), and the value of AUC, PR, ACC and F1 are relatively high; for the Rosser noise chaotic background, the RMWAD model could also detect weak pulse signal in \({\text{SNR}} \in \left( { - 126.9493\,\,{\text{dB}}, - 16.5239\,\,{\text{dB}}} \right)\), and the values of AUC, PR, ACC and F1 are relatively high.

(3) According to experiment 3, the RMWAD model has better detection performance for weak pulse signals under chaotic noise background than BLS, BP, XGBOOST and SVR models under the same SNR or different SNR, and the prediction error of this model is relatively small.

In summary, the RMWAD model proposed by this paper is effective and simple for weak pulse signal detection under the chaotic noise background without additional prior knowledge. In the future, we will continue to improve the prediction error and detection performance of the RMWAD model to achieve lower SNR operating threshold and extend it to the detection of other signals.