1 Introduction

Non-contact detections for vital sign feature (VSF) of a living person have attracted extensive attentions from researchers worldwide in past few years (Liang 2016; Liang et al. 2018a, b; Wang et al. 2014; Singh et al. 2013). Among all these numerous detection techniques, electromagnetic detection is considered the most promising as it can extract VSF using a 2-D matrix acquired from human beings. Ultra-wide band (UWB) radars have achieved promising results like indoor target localization, human fall detection, etc. based on the continuous wave (CW) (Wang et al. 2014; Mercuri 2013; Mercuri et al. 2013; Wang 2015). As a better alternative method to the CW radars, UWB radar has been widely applied to many non-contact applications including moving targets identification (Wang et al. 2012, 2013; Koo 2013; Nijsure et al. 2013), through-wall imaging (Li et al. 2013, 2014; Hu et al. 2014), and post-earthquake search and rescue (Gu and Li 2015; Wang et al. 2018a, b; Ren 2015) due to its strong permeability and excellent range resolution. Although, many schemes have been discussed to detect VSF (Huang et al. 2010, 2016; JalaliBidgoli et al. 2016; Gennarelli et al. 2016; Le et al. 2009), these methods cannot achieve better performance especially in a complex environment and the ability to detect VSF is even unique. As a result, extensive efforts are required to apply UWB radar on detecting VSF (Vu et al. 2010; Zhuge and Yarovoy 2011; Ascione et al. 2013).

Several algorithms have been developed to detect VSF in recent years (Liu et al. 2011, 2014; Baldi et al. 2015; Li et al. 2013; Lazaro et al. 2014; Conte et al. 2010; Nezirovíc et al. 2010; Lv et al. 2016; Li 2013; Zhang 2013; Liang et al. 2018c, d, e, f; Wu et al. 2016; Hu and Jin 2016; Li and Lin 2008; Park et al. 2007; Naishadham and Piou 2008; Ren et al. 2016). However, most approaches are unsuitable for detecting VSF of living persons behind obstacles as they can only deal with one or some aspects like clutter removal, analysis of breath characteristics and period estimation of VSF. The fast Fourier transform (FFT) and Hilbert–Huang transform (HHT) were applied to analyse the time–frequency characteristics of VSF (Liu et al. 2011, 2014). A low complexity maximum likelihood period estimator was used to estimate the period of VSF in the additive white Gaussian noise (AWGN) conditions (Conte et al. 2010). Singular value decomposition (SVD) was employed in Nezirovíc et al. (2010) to detect breath signal of human beings by improving the signal to noise and clutter ratio (SNCR). The linear trend was eliminated based on linear trend subtraction (LTS) (Liang et al. 2018e). Li and Lin (2008) the complex signal demodulation was discussed to remove the unwanted noises.

In this paper, a new method is proposed to detect VSF in a complex environment. The method can better remove clutter; suppress the linear trend term and other unwanted electromagnetic noises. A novel way is proposed to acquire range estimate by employing the discrete windowed Fourier transform (DWFT) technique. We provide a whole new analytical framework to detect VSF. An improved accumulation method in frequency domain is discussed to eliminate harmonics, which makes VSF frequency accurate to obtain.

The remainder of this paper is organized as follows. In Sect. 2, the model for VSF detection is introduced. A new detection method is analyzed in Sect. 3. Section 4 shows the performance of the proposed method compared with several well-known algorithms. Section 5 summarizes the whole paper.

2 UWB time domain life sign model

In VSF detection, the most important is to acquire the breathing frequency of a living person. To analyze the spectrums of VSF, the time domain model is developed (Liang et al. 2018f), where slow time stands for the collected pulse number and fast time stands for the range bin. In this paper, t is the slow time and the corresponding frequency components are given by f, τ is fast time and the corresponding frequency components are given by v. Figure 1 provides the time of arrival (TOA) of the collected pulses, which can be given by (Liang et al. 2018f)

$$ \widehat{\tau }{ = }\frac{d\left( t \right)}{c} = \frac{{d_{0} + A_{r} \sin \left( {2\pi f_{r} t} \right)}}{c} $$
(1)

where d0 is an constant, which is used to denote the range between the human being and radar; Ar is VSF amplitude; and fr is VSF rate.

Fig. 1
figure 1

The sketch map of the received pulse reflected only with breath motion

The received pulses are

$$ \begin{aligned} R\left( \tau \right) & = \sum\limits_{n = 0}^{N - 1} {u\left( {\tau - nT - \tau_{r} } \right)} * h_{r} \left( \tau \right) \\ & \quad + \sum\limits_{n = 0}^{N - 1} {\sum\limits_{p = 1,p \ne r}^{P} {u\left( {\tau - nT - \tau_{p} } \right)} } * h_{p} \left( \tau \right) + a\left( \tau \right) + q\left( \tau \right) + g\left( \tau \right) + \omega \left( \tau \right) \\ \end{aligned} $$
(2)

where * denotes the convolution; u(t) is the transmitted pulse; T is the effective pulse repetition time; t is discrete with \( nT,\;n = 0, \ldots ,N - 1 \); N represents the sampling points of t; hr(τ) is the pulse response of the breathing movement; hp(τ) is the joint pulse responses of the transmitting and receiving antenna and other static objects; a(τ) is linear trend; w(τ) is AWGN; q(τ) is non-static clutter; and g(τ) is some unwanted clutter.

Under normal breath condition, TOA can be given by

$$ \tau_{r} = \tau_{0} + \tau_{r} \sin \left( {2\pi f_{r} nT} \right). $$
(3)

where \( \tau_{0} = {{2d_{0} } \mathord{\left/ {\vphantom {{2d_{0} } v}} \right. \kern-0pt} v} \), and \( \tau_{r} = {{2A_{r} } \mathord{\left/ {\vphantom {{2A_{r} } v}} \right. \kern-0pt} v} \).

Assuming that τ is discrete with δT, \( {{\delta_{R} = v\delta_{T} } \mathord{\left/ {\vphantom {{\delta_{R} = v\delta_{T} } 2}} \right. \kern-0pt} 2} \) is the sampling interval in range. The discrete signal can be expressed as a 2-D matrix RM×N

$$ {\mathbf{R}} = {\mathbf{h}} + {\mathbf{c}} + {\mathbf{a}} + {\mathbf{w}} + {\mathbf{q}} + {\mathbf{g}} $$
(4)

where \( m = 0, \ldots ,M - 1 \) is the sampling points of τ.

As shown in (2), there are various clutters in a complex environment, which make it challenging to extract VSF. Figure 2a shows the ideal signal only with breath motion, while the signal under AWGN is given in Fig. 2b.

Fig. 2
figure 2

The results a without clutter, and b under AWGN

3 VSF detection algorithm

In this section, the implementation steps of VSF detection are discussed as shown in Fig. 3.

Fig. 3
figure 3

The proposed detection method

3.1 Pre-processing step

VSF of a living person is usually covered by c with strong amplitudes, which is estimated as (Liang et al. 2018a)

$$ \Im = \frac{1}{M \times N}\sum\limits_{m = 1}^{M} {\sum\limits_{n = 1}^{N} {R\left[ {m,n} \right]} } . $$
(5)

The results via suppressing c is given by

$$ {\varvec{\Omega}} = {\mathbf{R}} - \Im . $$
(6)

LTS method is used to remove a (Liang et al. 2018b)

$$ {\mathbf{W}} = {\varvec{\Omega}}^{\rm T} - {\mathbf{X}}\left( {{\mathbf{X}}^{\rm T} {\mathbf{X}}} \right)^{ - 1} {\mathbf{X}}^{\rm T} {\varvec{\Omega}}^{\text{T}} $$
(7)

where \( {\mathbf{X}} = \left[ {{\mathbf{x}}_{1} ,\;{\mathbf{x}}_{2} } \right] \), \( {\mathbf{x}}_{1} = \left[ {0,\;1,\; \ldots ,\;N - 1} \right]^{\rm T} \), and \( {\mathbf{x}}_{2} = \left[ {1,\;1,\; \ldots ,1} \right]_{N \times 1}^{\rm T} \).

An infinite impulse response band-pass filter is performed on (7) to enhance SNR, which is given by (Liang et al. 2018e)

$$ {\varvec{\Lambda}} = {{\varvec{\upalpha}} \mathbf{W}} - {\varvec{\upbeta} \mathbf{W}} $$
(8)

where the coefficients α and β can be acquired automatically (Liang et al. 2018d).

And another filter with seven values averaged is developed to enhance VSF amplitude, which is given by (Liang et al. 2018f)

$$ {\mho }\left[ {k,n} \right] = {{\sum\limits_{m = 7 \times k}^{7 \times k + 6} {\Lambda \left[ {m,n} \right]} } \mathord{\left/ {\vphantom {{\sum\limits_{m = 7 \times k}^{7 \times k + 6} {\Lambda \left[ {m,n} \right]} } 7}} \right. \kern-0pt} 7} $$
(9)

where \( k = 1, \ldots ,\left\lfloor {{M \mathord{\left/ {\vphantom {M 7}} \right. \kern-0pt} 7}} \right\rfloor \), \( \left\lfloor {{M \mathord{\left/ {\vphantom {M 7}} \right. \kern-0pt} 7}} \right\rfloor \) is the maximum integer less than \( {M \mathord{\left/ {\vphantom {M 7}} \right. \kern-0pt} 7} \).

3.2 TOA estimation

This section provides a novel scheme for TOA estimate via analyzing the characteristics including the skewness (S) and standard deviation (SD) of VSF in (9). S is (Liang et al. 2017)

$$ S_{m} = {\text{E}}\left[ {\left( {{{\left( {\mho_{m} - \mu } \right)} \mathord{\left/ {\vphantom {{\left( {\mho_{m} - \mu } \right)} 6}} \right. \kern-0pt} 6}} \right)^{ 3} } \right] $$
(10)

where E[·] is the expectation, μ and σ are mean value and deviation, respectively.

SD is given by

$$ SD_{m} = \sqrt {{{\left( {\sum\limits_{n = 1}^{N} {\left( {\mho_{m} - \mu } \right)^{2} } } \right)} \mathord{\left/ {\vphantom {{\left( {\sum\limits_{n = 1}^{N} {\left( {\mho_{m} - \mu } \right)^{2} } } \right)} {N - 1}}} \right. \kern-0pt} {N - 1}}} . $$
(11)

To achieve TOA estimate, the SD-based S spectral i.e. SDS are calculated using (10) and (11). Figure 4 provides SDS values Z using a dataset acquired from one volunteer at 600 cm outdoors, which follow periodicity in target area approximately. DWFT is applied to obtain TOA estimate via analyzing the time–frequency characteristics as shown in Fig. 5 (Allen 1977; Wójcicki et al. 2008), which is expressed as

$$ K\left[ {o,p} \right] = \sum\limits_{m = 1}^{M} {{\mathbf{Z}}_{m}\Xi \left[ {o - m} \right]e^{{{{ - j2p\pi m} \mathord{\left/ {\vphantom {{ - j2p\pi m} P}} \right. \kern-0pt} P}}} } . $$
(12)

where p is the pth frequency component in P discrete components, and \( \Xi \) is used Hamming function with the length O = 512 (Mak et al. 2016) given by

$$ \Xi \left( o \right) = 0.54 - 0.46 \times \cos \left( {\frac{2\pi o}{512}} \right),\quad o = 0,1, \ldots ,512. $$
(13)
Fig. 4
figure 4

a SDS values with the normalized amplitude, and b SDS in target area

Fig. 5
figure 5

The time–frequency matrix using DWFT

The range is estimated as

$$ \widehat{L} = v\widehat{\tau } $$
(14)

where TOA estimate \( \widehat{\tau } \) corresponds to the maximal K.

3.3 Frequency estimation

The \( \widehat{\tau } \) index in \( \mho \) is

$$ \Im = {{\widehat{\tau }} \mathord{\left/ {\vphantom {{\widehat{\tau }} {\delta_{T} }}} \right. \kern-0pt} {\delta_{T} }}. $$
(15)

To estimate VSF frequency, the accumulation method in time domain is first applied to reduce harmonics, which is given by

$$ \widetilde{\text{O}}\left[ i \right] = \sum\limits_{i = \Im - 50}^{\Im } {\left| {\mho_{i} } \right|^{2} } . $$
(16)

Usually, VSF frequency is within 0.2–0.4 Hz (Marple 1999). To improve SNR, the frequency components outside these frequencies are removed by employing a rectangular window \( {\varvec{\theta}} \) within 0.1–0.8 Hz, which is given by

$$ {\varvec{\uppi}} = {\varvec{\uptheta}} \odot \left\{ {{\text{FT}}\left\{ {\widetilde{{\mathbf{O}}}} \right\}} \right\}. $$
(17)

VSF frequency can be acquired as

$$ \widetilde{f} = {\mathbf{w}}\left( \gamma \right) $$
(18)

where γ corresponds to the index of the maximal (17).

And the accumulation method in frequency domain is used to cancel out harmonics (Marple 1999), which is given by

$$ {\rm H}\left( \omega \right) = \left\{ {\begin{array}{*{20}l} {2\pi \left( \omega \right),} \hfill & {\omega > 0} \hfill \\ {\pi \left( \omega \right),} \hfill & {\omega = 0} \hfill \\ {0,} \hfill & {\omega < 0} \hfill \\ \end{array} } \right.. $$
(19)

4 Data acquisition

4.1 UWB radar

The UWB radar prototype used to acquire the pulses is operated by employing a wireless personal digital assistant. Table 1 provides the key parameters. Using the radar system, M = 4096 sampling points are obtained in range direction. The SNR is improved by averaging every 128 collected pulses. The pulses are saved every 0.1092 s, which is acquired from 128 × 512/600,000. A joint sampling technique is developed in the receiver based on the equivalent-time and real-time sampling methods, which outperforms the analogue receiver with only one method. Figure 6 shows the pulses with the normalized amplitude.

Table 1 The key parameters for the used radar
Fig. 6
figure 6

The normalized pulse using UWB radar

4.2 Measurement setup

To survey the performance of the presented method, two experiments on human beings are carried out indoors and outdoors, respectively. Figure 7a shows a setup considered to acquire data. In these experiments, the radar is on a desk 1.5 m in height, and these volunteers breathed normally, kept stationary, and faced radar. In the first experiment, one girl served as the volunteer as given in Fig. 7b. This experiment is carried out outdoors. The range between the receiver and the volunteer is 300 cm, 600 cm, 900 cm, and 1100 cm, respectively. The wall is composed of three materials like brick (30 cm), reinforced concrete (35 cm) and plank (35 cm). The second experiment is conducted indoors with one boy served as the volunteer as shown in Fig. 7c. The range between the receiver and the volunteer is 400 cm, 700 cm, 1000 cm, and 1200 cm, respectively. Four algorithms including the FFT, HOC, constant false alarm ratio (CFAR), and advance method (AM) are used for data analysis.

Fig. 7
figure 7

a The setup for data acquisition, b outdoors environment, and c indoors environment

5 Results and discussions

In this section, the progress of the presented method is clarified in different aspects.

5.1 Intuitive detection performance

The results of pre-processing steps are given in Fig. 8 based on the pulses acquired outdoors at 900 cm. Figure 8a shows the received VSF signals. Results indicate VSF is invisible due to the influences of various clutters. The range-slow time matrix by removing clutter is given in Fig. 8b. Figure 8c shows the results by employing LTS method. Results show VSF is too weak to extract. Figure 8d shows the acquired matrix by employing a band-pass filter, and Fig. 8e shows the matrix based on the averaging filter. After the pre-processing steps mentioned above, the clutters are removed and VSF is improved comparing with the pulses as shown in Fig. 8a.

Fig. 8
figure 8

The range-slow time matrix of a received signal, b clutter removal, c LTS, d band-pass filter, and e filtering in slow time

5.2 SNR improvement capability

Usually, the Cramer–Rao low bound of the unbiased VSF frequency increases as SNR increase. VSF can be extracted accurately by improving SNR, which is given by (Liang et al. 2018f)

$$ {\text{SNR}} = 20 \cdot \log_{10} \left( {\frac{{\sum\nolimits_{{n = \mu_{r} - 1}}^{{\mu_{r} + 1}} {\left| {{\varvec{\updelta}}\left[ n \right]} \right|} }}{{\sum\nolimits_{{n = \gamma_{1} }}^{{\mu_{r} - 2}} {\left| {{\varvec{\updelta}}\left[ n \right]} \right|} + \sum\nolimits_{{n = \mu_{r} + 2}}^{{\gamma_{2} }} {\left| {{\varvec{\updelta}}\left[ n \right]} \right|} }}} \right), $$
(20)

where μr is VSF frequency estimate, γ1 and γ2 are the boundaries of the adopted window.

The ability to improve SNR can be evaluated qualitatively using the data acquired at different distances. Figure 9 shows Z with normalized amplitudes based on the collected pulses outdoors. As shown in Fig. 10, the range estimates are 3.079 m, 6.113 m, 9.084 m and 11.24 m with the errors 0.079 m, 0.113 m, 0.084 m and 0.24 m, respectively. VSF frequency estimations are respectively 0.2308 Hz, 0.2307 Hz, 0.2932 Hz and 0.3329 Hz as shown in Fig. 11. The range and frequency estimations based on AM are given in Fig. 12. The ranges are 3.942 m, 6.86 m, 3.837 m and 18.58 m, and VSF frequencies are 0.087 Hz, 0.2038 Hz, 0.1456 Hz and 0.7862 Hz, respectively. AM can only detect VSF in short-range. Table 2 gives comparison results based on different methods. It can be seen that the presented algorithm outperform AM in accuracy and SNR improvement.

Fig. 9
figure 9

The SDS values at a 3 m, b 6 m, c 9 m, and d 11 m from the receiver

Fig. 10
figure 10

Range estimations at a 3 m, b 6 m, c 9 m, and d 11 m from the receiver

Fig. 11
figure 11

Breath frequency estimations at a 3 m, b 6 m, c 9 m, and d 11 m from the receiver

Fig. 12
figure 12

Life signal estimations using AM at a 3 m, b 6 m, c 9 m, and d 11 m from the receiver

Table 2 Performance with different methods

5.3 Indoors performance

In this section, the data acquired indoors are used to evaluate the detection performance at different distances. Figure 13 shows Z with the normalized amplitudes. The calculated ranges based on the presented algorithm are given in Fig. 14. They are respectively 4.122 m, 7.078 m, 10.071 m and 12.25 m. Figure 15 shows VSF frequencies 0.2469 Hz, 0.238 Hz, 0.2221 Hz and 0.2701 Hz, respectively.

Fig. 13
figure 13

SDS values at a 4 m, b 7 m, c 10 m, and d 12 m from the receiver

Fig. 14
figure 14

Range estimations at a 4 m, b 7 m, c 10 m, and d 12 m from the receiver

Fig. 15
figure 15

Frequency estimations at a 4 m, b 7 m, c 10 m, and d 12 m from the receiver

5.4 Clutter removing capability

In this section, the capability of clutter removal is analyzed based on the data acquired at 900 cm outdoors. Figure 16 shows the calculated spectrums by employing different detection algorithms. The results using FFT are given in Fig. 16a. Various clutters exist in the effective band of breath motion, which makes it challenging to extract VSF. The results with the accumulation method in frequency domain performed different times are shown in Fig. 16b–e. Results indicate the harmonics and mixed products of life signal can be cancelled out effectively when HHT is performed four times known as four FA compared with other times as given in Table 3.

Fig. 16
figure 16

The spectrums using a FFT, b one FA, c two FA, d three FA, and e four FA

Table 3 Performance on improving SNR

6 Conclusion

In this paper, a new method for vital sign feature (VSF) detection is presented using the UWB pulse radar. The time of arrival (TOA) of the UWB pulse reflected by human beings can be estimated by employing the discrete windowed Fourier transform (DWFT) on the skewness and standard deviation of the received signal. The accumulation method in frequency domain is developed to cancel out the mixed products and harmonics of life signal. The presented algorithm provides a preliminary signature for the search and rescue of victims in nature disaster. Experimental results in different conditions show the better capability of removing clutter and improving SNR.