
1 Introduction

Complex systems usually comprise multiple subsystems that exhibit both highly nonlinear deterministic and stochastic characteristics and are regulated hierarchically. These systems generate signals that exhibit complex characteristics such as sensitive dependence on small disturbances, long memory, extreme variations, and nonstationarity [1]. Examples of such signals are abundant, including biological data such as heart rate variability (HRV) and EEG data [2], highly bursty traffic on the Internet [3–5], and highly varying stock prices and foreign exchange rates in financial markets [6, 7]. For illustration, Fig. 9.1 shows an example of HRV data for a normal young subject [8]. Evidently, the signal is highly nonstationary and multiscaled, appearing oscillatory for some periods of time (Fig. 9.1b, d) and varying as a 1/f process for others (Fig. 9.1c, e).

Fig. 9.1 (a) The HRV data for a normal subject; (b, c) the segments of signals indicated as A and B in (a); (d, e) power spectral density for the signals shown in (b, c)

While the multiscale nature of signals such as that shown in Fig. 9.1 cannot be fully characterized by existing methods, the nonstationarity of the data is even more troublesome, because it prevents direct application of spectral analysis or of methods based on chaos theory and random fractal theory. For example, in order to reveal that HRV data are of 1/f nature [9, 10] with anti-persistent long-range correlations [11, 12] and multifractality [13], time series such as that shown in Fig. 9.1a have to be preprocessed to remove components (such as oscillatory ones) that do not conform to fractal scaling analysis. However, automated segmentation of complex biological signals to remove undesired components remains a significant open problem, since it is closely related to the challenging task of accurately detecting transitions from normal to abnormal states in physiological data.

Rapid accumulation of complex data in the life sciences, systems biology, nanosciences, information systems, and physical sciences has made it increasingly important to be able to analyze multiscale and nonstationary data. Since multiscale signals behave differently depending on the scale at which the data are examined, it is of fundamental importance to develop measures that explicitly incorporate the concept of scale, so that different behaviors of the data on varying scales can be simultaneously characterized by the same scale-dependent measure. Here, we discuss such a measure, the scale-dependent Lyapunov exponent (SDLE), and develop a unified multiscale analysis theory of complex data.

This chapter is organized as follows. We first define the SDLE, then apply it to characterize low-dimensional chaos, noisy chaos, and random processes with power-law decaying power spectral density (so-called 1/f^α processes). We then show how it can readily detect intermittent chaos and deal with nonstationarity, and apply it to characterize EEG and HRV data. Finally, we make a few concluding remarks, including a discussion of best practices for experimental data analysis.

2 SDLE: Definitions and Fundamental Properties

Chaos and random fractal theories have been used extensively in the analysis of complex data [2, 3, 6, 11–18]. Chaos theory shows that apparently irregular behaviors in a complex system may be generated by nonlinear deterministic interactions of only a small number of degrees of freedom, with noise or intrinsic randomness playing no role. Random fractal theory, on the other hand, assumes that the dynamics of the system are inherently random. One of the most important classes of random fractals is the set of 1/f^α processes, which display long-range correlations. Since the foundations of chaos theory and random fractal theory are entirely different, different conclusions may be drawn depending on which theory is used to analyze the data. In fact, much of the research in the past has been devoted to determining whether a complex time series is generated by a chaotic or a random system [19–30]. In this effort, 1/f^α processes have distinguished themselves as the key counterexamples invalidating commonly used tests for chaos [30–32]. Thus, successful classification of chaos and 1/f^α processes based on scales may fundamentally change the practice of time series analysis: the two theories can then be used synergistically, instead of individually, to characterize the behaviors of signals on a wide range of scales.

The SDLE is a generalization of two important concepts, the time-dependent exponent curves [24] and the finite-size Lyapunov exponent (FSLE) [33]. It was first introduced by the authors in [34, 35], and has been further extended in [36, 37] and applied to study EEG [38], HRV [39, 40], and the Earth's geodynamo [41].

We assume that all that is known is a scalar time series x(1), x(2), …, x(n). Regardless of whether the dynamics are chaotic or random, we use time-delay embedding [42–44] to form vectors of the form \(V_i = [x(i), x(i+L), \ldots, x(i+(m-1)L)]\), where the embedding dimension m and the delay time L are chosen according to optimization criteria [24, 45]. When the time series is random, such a procedure transforms the self-affine stochastic process into a self-similar process in phase space. In this case, however, the specific values of m and L are not important, so long as m > 1.
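For concreteness, here is a minimal sketch of this embedding step in Python (NumPy assumed; the helper name delay_embed is ours, not from the chapter):

```python
import numpy as np

def delay_embed(x, m, L):
    """Form delay vectors V_i = [x(i), x(i+L), ..., x(i+(m-1)L)] from a
    scalar series x; returns an (n_vectors x m) array."""
    x = np.asarray(x, dtype=float)
    n_vectors = len(x) - (m - 1) * L
    if n_vectors <= 0:
        raise ValueError("series too short for this (m, L)")
    return np.column_stack([x[k * L : k * L + n_vectors] for k in range(m)])
```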

After a proper phase space is reconstructed, we consider an ensemble of trajectories. We denote the initial separation between two nearby trajectories by ε_0 and their average separation at times t and t + Δt by ε_t and ε_{t+Δt}, respectively. We then examine the relation between ε_t and ε_{t+Δt}, where Δt is small. When Δt → 0, we have

$${\epsilon }_{t+\mathrm{\Delta }t} = {\epsilon }_{t}{\mathrm{e}}^{\lambda ({\epsilon }_{t})\mathrm{\Delta }t},$$
(9.1)

where λ(ε_t) is the SDLE. It is given by

$$\lambda ({\epsilon }_{t}) = \frac{\ln {\epsilon }_{t+\mathrm{\Delta }t} -\ln {\epsilon }_{t}} {\mathrm{\Delta }t}.$$
(9.2)

Equivalently, we have a differential equation for ε t ,

$$\frac{\mathrm{d}{\epsilon }_{t}} {\mathrm{d}t} = \lambda ({\epsilon }_{t}){\epsilon }_{t}.$$
(9.3)

Given time series data, the smallest possible Δt is the sampling time τ.

To compute SDLE, we can start from an arbitrary number of shells,

$${\epsilon }_{k} \leq \| {V }_{i} - {V }_{j}\| \leq {\epsilon }_{k} + \mathrm{\Delta }{\epsilon }_{k},\ \ \ k = 1,2,3,\ldots ,$$
(9.4)

where V_i, V_j are reconstructed vectors, and ε_k (the radius of the shell) and Δε_k (the width of the shell) are arbitrarily chosen small distances (Δε_k is not necessarily constant). We then monitor the evolution of all pairs of vectors (V_i, V_j) within a shell and take the average. As we will see shortly, as far as estimation of the parameters corresponding to exponential or power-law divergence is concerned, taking the logarithm and averaging can be exchanged; (9.2) can now be written as

$$\lambda ({\epsilon }_{t}) = \frac{\left < \ln \|{V }_{i+t+\mathrm{\Delta }t} - {V }_{j+t+\mathrm{\Delta }t}\| -\ln \| {V }_{i+t} - {V }_{j+t}\|\right >} {\mathrm{\Delta }t}$$
(9.5)

where t and Δt are integers in units of the sampling time and the angle brackets denote the average within a shell.
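This procedure can be sketched as follows (a clarity-first implementation, not the authors' code; it builds the full pairwise distance matrix and is therefore practical only for a few thousand points, beyond which one would locate shell members with a neighbor-search tree; the exclusion of pairs close in time anticipates condition (9.6) below):

```python
import numpy as np

def sdle(V, eps_k, d_eps, t_max, exclude=1, dt=1):
    """Estimate lambda(eps_t) per (9.5): for all vector pairs whose initial
    separation lies in the shell [eps_k, eps_k + d_eps], follow the pairs
    in time and average the log-divergence within the shell."""
    n = len(V)
    dists = np.linalg.norm(V[:, None, :] - V[None, :, :], axis=-1)
    i0, j0 = np.where((dists >= eps_k) & (dists <= eps_k + d_eps))
    keep = (j0 - i0) >= max(exclude, 1)   # j > i, and pairs apart in time
    i0, j0 = i0[keep], j0[keep]
    eps, lam = [], []
    for t in range(t_max):
        ok = j0 + t + dt < n              # pairs still inside the record
        i, j = i0[ok], j0[ok]
        if len(i) == 0:
            break
        d_now = np.linalg.norm(V[i + t] - V[j + t], axis=1)
        d_next = np.linalg.norm(V[i + t + dt] - V[j + t + dt], axis=1)
        lam.append(np.mean(np.log(d_next) - np.log(d_now)) / dt)
        eps.append(np.exp(np.mean(np.log(d_now))))   # geometric-mean scale
    return np.array(eps), np.array(lam)
```

Repeating this over a set of shells ε_k and plotting λ against the geometric-mean scale traces out the full λ(ε) curve.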

To see why taking the logarithm and averaging can be exchanged for the purpose of computing λ(ε_t), consider a case involving \({\epsilon }_{1}(t) = {\epsilon }_{1}(0){\mathrm{e}}^{{\lambda }_{1}t},\ {\epsilon }_{2}(t) = {\epsilon }_{2}(0){\mathrm{e}}^{{\lambda }_{2}t}\), where λ_1 = λ_2 = λ is a positive constant. Let ε(t) be the average of ε_1(t) and ε_2(t). Then it is clear that the SDLE is simply λ, whether one averages first or takes the logarithm first. In fact, for large t, if λ_1 is slightly larger than λ_2, then taking the logarithm first is beneficial, since otherwise the term \({\mathrm{e}}^{{\lambda }_{1}t}\) will dominate and the presence of λ_2 will not be captured. Clearly, a similar argument applies to the case of power-law divergence.

Note that in the above computational procedure, the initial shells for computing the SDLE serve as initial values of the scales; through evolution of the dynamics, they will automatically converge to the range of inherent scales, which are the scales that define (9.2) and (9.3). Also note that when analyzing chaotic time series, the condition

$$\vert j - i\vert \geq (m - 1)L$$
(9.6)

needs to be imposed when finding pairs of vectors within a shell, to eliminate the effects of tangential motions [24] and for an initial scale to converge to the inherent scales [35].

To better understand the notion of "inherent scales," it is beneficial to discuss the notion of a "characteristic scale" (or "limiting scale"), \(\overline{\epsilon}\), defined as the scale where the SDLE is close to 0. If one starts from ε_0 ≪ \(\overline{\epsilon}\), then, regardless of whether the data are deterministically chaotic or simply random, ε_t will initially increase with time and gradually settle around \(\overline{\epsilon}\). Consequently, λ(ε_t) will be positive before ε_t reaches \(\overline{\epsilon}\). On the other hand, if one starts from ε_0 ≫ \(\overline{\epsilon}\), then ε_t will simply decrease, yielding negative λ(ε_t), again regardless of whether the data are chaotic or random. When ε_0 ∼ \(\overline{\epsilon}\), λ(ε_t) will stay around 0; note, however, that \(\overline{\epsilon}\) may not be a single point but a function of time, such as a periodic function of time. These considerations make it clear that chaos can only be observed on scales much smaller than \(\overline{\epsilon}\).

To better understand the SDLE, we now point out a relation between the SDLE and the largest positive Lyapunov exponent (LE) λ_1 estimated for a truly chaotic signal using, say, Wolf et al.'s algorithm [21]. It is given by [35]

$${\lambda }_{1} ={ \int }_{0}^{{\epsilon }^{{_\ast}} }\lambda (\epsilon )p(\epsilon )\mathrm{d}\epsilon ,$$
(9.7)

where ε* is a scale parameter (e.g., the renormalization scale used in Wolf et al.'s algorithm [21]) and p(ε) is the probability density function for the scale ε, given by

$$p(\epsilon ) = Z\frac{\mathrm{d}C(\epsilon )} {\mathrm{d}\epsilon } ,$$
(9.8)

where Z is a normalization constant satisfying \({\int }_{0}^{{\epsilon }^{{_\ast}} }p(\epsilon )\mathrm{d}\epsilon = 1\) and C(ε) is the well-known Grassberger–Procaccia correlation integral [19]. Note that the lower bound of the integration is set to zero here. In practice, p(ε) is zero on scales smaller than the smallest resolvable scale ε_min; one could therefore equivalently replace the lower bound of the integration by ε_min.
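As a worked illustration of (9.7) and (9.8), given sampled curves λ(ε) and C(ε) on a common grid of scales, λ_1 can be estimated by straightforward numerical quadrature (a sketch; the function names and the trapezoidal rule are our choices):

```python
import numpy as np

def _trapezoid(y, x):
    """Trapezoidal rule for samples y on a (possibly nonuniform) grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def lyapunov_from_sdle(eps, lam, C, eps_star):
    """Evaluate (9.7), lambda_1 = int lambda(eps) p(eps) d(eps), over
    [eps_min, eps*], with p(eps) proportional to dC/d(eps) per (9.8)."""
    mask = eps <= eps_star
    e, l, c = eps[mask], lam[mask], C[mask]
    p = np.gradient(c, e)        # unnormalized density dC/d(eps)
    Z = _trapezoid(p, e)         # normalization constant
    return _trapezoid(l * p / Z, e)
```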

To understand the SDLE further, it is instructive to apply it to characterize chaotic signals and 1/f^α processes. First, we analyze the chaotic Lorenz system with stochastic forcing

$$\begin{array}{rcl} \mathrm{d}x/\mathrm{d}t& =& -16(x - y) + D{\eta }_{1}(t), \\ \mathrm{d}y/\mathrm{d}t& =& -xz + 45.92x - y + D{\eta }_{2}(t), \\ \mathrm{d}z/\mathrm{d}t& =& xy - 4z + D{\eta }_{3}(t). \end{array}$$
(9.9)

where η_i(t), i = 1, 2, 3, are independent Gaussian white-noise forcing terms with zero mean and unit variance. When D = 0, the system is clean. Figure 9.2 shows five λ(ε) curves, for D = 0, 1, 2, 3, 4. The computations are done with 10,000 points and m = 4, L = 2 (a simulation sketch is given after Fig. 9.2). We observe the following interesting features:

  • For the clean chaotic signal, λ(ε) fluctuates slightly around a constant. As expected, this constant is precisely the largest positive LE λ_1,

    $$\lambda (\epsilon ) = {\lambda }_{1}.$$
    (9.10)

    The small fluctuation in λ(ε) is due to the fact that the divergence rate on the Lorenz attractor varies from one region to another.

  • When there is stochastic forcing, λ(ε) is no longer constant for small ε but diverges to infinity as ε → 0 according to the following scaling law,

    $$\lambda (\epsilon ) \sim -\gamma \ln \epsilon ,$$
    (9.11)

    where γ is a coefficient controlling the speed of loss of information. This feature suggests that entropy generation is infinite when the scale ε approaches zero.

  • When the noise is increased, the part of the curve with λ(ε) ∼ −γ ln ε shifts to the right. In fact, little chaotic signature can be identified when D is increased beyond 3.

Fig. 9.2 λ(ε) curves for clean and noisy Lorenz systems
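The noisy Lorenz system (9.9) underlying Fig. 9.2 can be simulated with a simple Euler–Maruyama scheme, sketched below (the step size, transient length, and initial condition are our choices; a smaller step or a higher-order scheme may be needed for quantitative work):

```python
import numpy as np

def noisy_lorenz_x(D=1.0, n=10000, dt=0.01, skip=2000, seed=0):
    """Integrate (9.9) by Euler-Maruyama and return the x component;
    the sampling time tau equals the integration step dt here."""
    rng = np.random.default_rng(seed)
    x, y, z = 1.0, 1.0, 1.0
    out = np.empty(n)
    sq = np.sqrt(dt)                  # noise increments scale as sqrt(dt)
    for i in range(n + skip):
        eta = rng.standard_normal(3)
        dx = -16.0 * (x - y) * dt + D * sq * eta[0]
        dy = (-x * z + 45.92 * x - y) * dt + D * sq * eta[1]
        dz = (x * y - 4.0 * z) * dt + D * sq * eta[2]
        x, y, z = x + dx, y + dy, z + dz
        if i >= skip:
            out[i - skip] = x
    return out
```

Embedding noisy_lorenz_x(D) with m = 4, L = 2 and running the sdle sketch above over a range of shells should qualitatively reproduce the family of curves in Fig. 9.2.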

Note that results similar to those shown in Fig. 9.2 have been observed in other model chaotic systems, including the Mackey–Glass delay differential equation with multiple positive Lyapunov exponents [46]. To prepare for our discussion of HRV data analysis, we note that in order to resolve the behavior of λ(ε) on ever smaller scales, longer and longer time series have to be used. More precisely, for a given dataset, if the smallest resolvable scale is ε_0, then to resolve a smaller scale ε_0/r, where r > 1, a larger dataset has to be used; the larger the dimension of the attractor, the longer the time series has to be.

Next we consider 1/f^α processes. Such processes are ubiquitous in science and engineering (see [47] and references therein). Two important prototypical models for such processes are the fractional Brownian motion (fBm) process [48] and ON/OFF intermittency with power-law distributed ON and OFF periods [47]. For convenience, we introduce the Hurst parameter 0 < H < 1 through the simple equation

$$\alpha = 2H + 1.$$
(9.12)

Depending on whether H is smaller than, equal to, or larger than 1/2, the process is said to have anti-persistent correlations, short-range correlations, or persistent long-range correlations [47]. Note that D = 1/H is the fractal dimension of such processes, and Kolmogorov's 5/3 law for the energy spectrum of fully developed turbulence [49] corresponds to H = 1/3.

It is well known that the variance of such stochastic processes increases with t as t^{2H}. Translating this into the average distance between nearby trajectories, we immediately have

$${\epsilon }_{t} = {\epsilon }_{0}{t}^{H}.$$
(9.13)

Using (9.2), we then have λ(ε_t) ∼ H/t. Expressing t in terms of ε_t via (9.13), we obtain

$$\lambda ({\epsilon }_{t}) \sim H{\epsilon }_{t}^{-1/H}.$$
(9.14)

Equation (9.14) can be readily verified by calculating λ(ε_t) for such processes; a minimal numerical check is sketched below.
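A numerical check of (9.14) requires samples of fBm. A standard way to generate them is circulant embedding of fractional Gaussian noise; the sketch below follows that construction (clipping tiny negative eigenvalues arising from round-off is a pragmatic choice of ours):

```python
import numpy as np

def fbm(n, H, seed=0):
    """Sample fractional Brownian motion: synthesize fractional Gaussian
    noise by circulant embedding of its autocovariance, then integrate."""
    rng = np.random.default_rng(seed)
    k = np.arange(n + 1)
    # autocovariance of fGn: 0.5(|k+1|^{2H} - 2|k|^{2H} + |k-1|^{2H})
    g = 0.5 * ((k + 1.0) ** (2 * H) - 2 * k ** (2 * H)
               + np.abs(k - 1.0) ** (2 * H))
    row = np.concatenate([g, g[-2:0:-1]])        # circulant first row, length 2n
    lam = np.maximum(np.fft.fft(row).real, 0.0)  # eigenvalues; clip round-off
    w = rng.standard_normal(len(row))
    fgn = np.fft.ifft(np.sqrt(lam) * np.fft.fft(w)).real[:n]
    return np.cumsum(fgn)
```

Embedding fbm(n, H) with m > 1 and running the sdle sketch should then trace λ(ε) ≈ H ε^{−1/H} on intermediate scales, as (9.14) predicts.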

The SDLE also has distinct scaling laws for random Lévy processes, stochastic oscillations, and complex motions with multiple scaling laws on different scale ranges. For the details, we refer to [34, 35].

Finally, we emphasize that λ_1 > 0 (say, computed by Wolf et al.'s algorithm [21]) is not a sufficient condition for chaos. This is evident from (9.7): non-chaotic scalings of the SDLE, such as (9.11) and (9.14), will also yield λ_1 > 0.

2.1 Detecting Intermittent Chaos by SDLE

Intermittent chaos is a type of complex motion in which regular (i.e., periodic) and chaotic motions alternate. Note that the stretches of time spent in regular motion can be considerably longer than those spent in chaotic motion. Precisely because of this, standard methods are unable to detect chaos in such motions. For the SDLE, however, this is a simple task. To illustrate the idea, we examine the logistic map

$${x}_{n+1} = a{x}_{n}(1 - {x}_{n}).$$
(9.15)

When a = 3.8284, we have intermittent chaos. An example of the time series is shown in Fig. 9.3a. We observe that the time intervals exhibiting chaos are very short compared with those exhibiting periodic motions. Traditional methods for computing the LE, being based on a global average, are unable to quantify chaos in such an intermittent situation, since the laminar phase dominates; neither can the FSLE, since it requires that divergence dominate most of the time. Interestingly, the SDLE curve shown in Fig. 9.3b clearly indicates the existence of chaotic motions, since the plateau region extends over almost one order of magnitude. A minimal sketch for generating such a series follows Fig. 9.3.

Fig. 9.3 (a) An intermittent time series generated by the logistic map with a = 3.8284 and σ = 0. (b) The SDLE curve for a time series of 10,000 points, with m = 4, L = 1, and a shell size of (2^{−13.5}, 2^{−13}). A plateau is clearly visible
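The intermittent series itself is easy to generate (the transient length and initial condition below are our choices; a = 3.8284 lies just below the tangent bifurcation into the period-3 window at a = 1 + √8 ≈ 3.82843):

```python
import numpy as np

def logistic_series(a=3.8284, n=10000, x0=0.3, skip=1000):
    """Iterate the logistic map (9.15), discarding an initial transient."""
    x = x0
    for _ in range(skip):
        x = a * x * (1 - x)
    out = np.empty(n)
    for i in range(n):
        x = a * x * (1 - x)
        out[i] = x
    return out
```

Feeding this series to delay_embed(x, m=4, L=1) and the sdle sketch with a shell of (2^{−13.5}, 2^{−13}) should reproduce the plateau of Fig. 9.3b.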

Why can the SDLE detect chaos even in such a situation? The reason is that the oscillatory part of the data only affects the scale range where λ(ε) ∼ 0; it cannot affect the positive portion of λ(ε). This means the SDLE has a useful scale-separation property that automatically separates regular from chaotic motions.

2.2 Dealing with Nonstationarity

To facilitate our discussion of HRV data below, we now consider complicated processes generated by the following two scenarios. One is to randomly concatenate 1/f^{2H+1} and oscillatory components. The other is to superimpose oscillatory components on a 1/f^{2H+1} process at randomly chosen time intervals. Either scenario generates signals that appear quite similar to the one shown in Fig. 9.1a. The λ(ε) curves for such processes are shown in Fig. 9.4 for a wide range of the H parameter (a sketch for constructing such test signals follows Fig. 9.4). We observe well-defined power-law relations, consistent with (9.14), when λ(ε) > 0.02. This result clearly shows that oscillatory components in the signals can only affect the SDLE where λ(ε) is close to 0. This is an illustration of the scale-isolation property of the SDLE.

Fig. 9.4 λ(ε) vs. ε curves for the simulated data. Eight different values of H are considered. To put all the curves on one plot, the curves for the different H (except the smallest one considered here) are arbitrarily shifted rightward
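The first scenario can be sketched as follows, reusing the fbm helper above (the segment lengths, the number of segments, and the oscillation frequencies are arbitrary choices of ours; the chapter does not specify them):

```python
import numpy as np

def concat_scenario(H, n_seg=10, seg_len=3000, seed=0):
    """Scenario 1: randomly concatenate normalized 1/f^{2H+1} (fBm)
    segments with purely oscillatory segments."""
    rng = np.random.default_rng(seed)
    pieces = []
    for _ in range(n_seg):
        if rng.random() < 0.5:
            seg = fbm(seg_len, H, seed=int(rng.integers(1 << 31)))
            seg = (seg - seg.mean()) / (seg.std() + 1e-12)
        else:
            t = np.arange(seg_len)
            seg = np.sin(2 * np.pi * rng.uniform(0.01, 0.05) * t)
        pieces.append(seg)
    return np.concatenate(pieces)
```

The second scenario is analogous: add, rather than concatenate, oscillatory segments to the fBm record over randomly chosen intervals.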

When we perturb chaotic data by similar procedures, will we still be able to detect chaos? The answer is yes. In fact, the intermittent chaos discussed above may be viewed as an example of such a procedure.

We are now ready to fully understand why the SDLE can deal with the types of nonstationarity discussed here. One type of nonstationarity causes shifts of the trajectory in phase space; the greater the nonstationarity, the larger the shifts. The SDLE, however, is little affected by such shifts, even large ones, since it is based on the coevolution of pairs of vectors within chosen small shells. The other type is related to oscillatory components; these only affect the SDLE where it is close to zero, and therefore do not alter the distinct scalings for chaos and for fractal processes.

3 Applications: Biological Data Analysis

To better understand the SDLE and appreciate its power, we now apply it to examine two types of physiological data, EEG and HRV.

3.1 EEG Analysis

EEG signals provide a wealth of information about brain dynamics, especially related to cognitive processes and pathologies of the brain such as epileptic seizures. To understand the nature of brain dynamics, as well as to develop novel methods for the diagnosis of brain pathologies, a number of complexity measures have been used in the analysis of EEG data. These include the Lempel–Ziv (LZ) complexity [50], the permutation entropy [51], the LE [21], the Kolmogorov entropy [20], the correlation dimension D_2 [19, 52], and the Hurst parameter [53–55]. We compare the SDLE with these complexity measures or their close relatives.

The EEG signals analyzed here were measured intracranially at Shands Hospital at the University of Florida. Such EEG data are also called depth EEG and are considered cleaner and less contaminated by artifacts than scalp (or surface) EEG. Altogether, we have analyzed seven patients' multiple-channel EEG data, each with a duration of a few hours and a sampling frequency of 200 Hz. When analyzing EEG for epileptic seizure prediction/detection, it is customary to partition a long EEG signal into short windows of length W points and calculate the measure of interest for each window. The criterion for choosing W is that the EEG signal in each window be fairly stationary, long enough to reliably estimate the measure of interest, and short enough to accurately resolve localized activities such as seizures. Since seizure activities usually last about 1–2 min, in practice one often chooses W to be about 10 s. When applying methods from random fractal theory such as detrended fluctuation analysis (DFA) [53], it is most convenient for the length of a sequence to be a power of 2. Therefore, we have chosen W = 2048 when calculating the various measures. We have found, however, that the variations of these measures with time are largely independent of the window size W. The relations among the measures studied here are the same for all seven patients' EEG data, so we illustrate the results based on only one patient's EEG signals.

We have examined the variation of λ(ε) with ε for each segment of the EEG data. Two representative examples for seizure and non-seizure segments are shown in Fig. 9.5. We observe that at a specific scale ε*, the two curves cross. Loosely, we may term any ε < ε* a small scale and any ε > ε* a large scale. Therefore, on small scales, λ(ε) is smaller for seizure than for non-seizure EEG, while on large scales the opposite is true. The variations of λ_{small-ε} and λ_{large-ε} with time for this patient's data, where small-ε and large-ε stand for (more or less arbitrarily) chosen fixed small and large scales, are shown in Fig. 9.6a, b, respectively. We observe two interesting features: (1) the pattern of variation of λ_{small-ε}(t) is reciprocal to that of λ_{large-ε}(t); this can be expected from Fig. 9.5. (2) The variations in λ_{small-ε}(t) and λ_{large-ε}(t) clearly indicate the two seizure events. Therefore, either λ_{small-ε}(t) or λ_{large-ε}(t) can be used to detect epileptic seizures accurately.

Fig. 9.5 Representative λ(ε) (per second) vs. ε curves for a seizure and a non-seizure EEG segment

Fig. 9.6 The variation of (a) λ_{small-ε}, (b) λ_{large-ε}, (c) the LE, (d) the K_2 entropy, (e) D_2, and (f) the Hurst parameter with time for the EEG signals of one patient. The vertical dashed lines in (a) indicate seizure occurrence times determined by medical experts

We now compare the SDLE with three commonly used measures from chaos theory (the largest positive LE [21], the correlation entropy [20], and the correlation dimension [19]) and one measure from random fractal theory (the Hurst parameter). We discuss the three measures from chaos theory first.

The LE is a dynamic quantity. It characterizes the exponential growth of an infinitesimal line segment, \({\epsilon }_{t} \sim {\epsilon }_{0}{\mathrm{e}}^{{\lambda }_{1}t},\ {\epsilon }_{0} \rightarrow 0\). It is often computed by the algorithm of Wolf et al. [21], which monitors the exponential divergence between a reference and a perturbed trajectory. For truly chaotic signals, 1/λ_1 gives the prediction time scale of the dynamics. It is also well known that the sum of all the positive Lyapunov exponents in a chaotic system equals the Kolmogorov–Sinai (KS) entropy. The KS entropy characterizes the rate of creation of information in a system; it is zero, positive, and infinite for regular, chaotic, and random motions, respectively. It is difficult to compute, however, so one usually computes the correlation entropy K_2, which is a tight lower bound of the KS entropy. Similarly, the box-counting dimension, a geometric quantity characterizing the minimal number of variables needed to fully describe the dynamics of a motion, is difficult to compute, and one often calculates the correlation dimension D_2 instead; again, D_2 is a tight lower bound of the box-counting dimension. Both K_2 and D_2 can be readily computed from the correlation integral through the relation [19, 20]

$$C(m,\epsilon ) \sim {\epsilon }^{{D}_{2} }{\mathrm{e}}^{-mL\tau {K}_{2} }$$
(9.16)

where m and L are the embedding dimension and the delay time, τ is the sampling time, and C(m, ε) is the correlation integral defined by

$$C(m,\epsilon ) = \frac{1} {{N}^{2}} \sum \limits_{i,j=1}^{N}\theta (\epsilon -\| {V }_{ i} - {V }_{j}\|),$$
(9.17)

where θ(y) is the Heaviside step function, taking the value 1 or 0 depending on whether y ≥ 0 or not, V_i and V_j are reconstructed vectors, N is the number of points in the time series, and ε is a prescribed small distance. Equation (9.16) means that in a plot of ln C(m, ε) vs. ln ε with m as a parameter, for truly low-dimensional chaos one observes a series of parallel straight lines, with the slope estimating D_2 and the spacing between the lines estimating K_2 (lines for larger m lie below those for smaller m). From these descriptions, one would expect λ_1(t) and K_2(t) to be similar, while D_2(t) should have little to do with either. Surprisingly, from Fig. 9.6c–e we observe that this is not the case: λ_1(t) is similar to D_2(t) but reciprocal to K_2(t). In a moment, we shall explain how these puzzling relations may be understood based on λ_{small-ε}(t) and λ_{large-ε}(t).
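For reference, a direct sketch of the correlation integral (9.17) (O(N²) in memory, so for small N only; excluding temporally close pairs is a common refinement, not part of (9.17) itself):

```python
import numpy as np

def correlation_integral(V, eps_values, exclude=1):
    """C(m, eps): the fraction of vector pairs closer than eps, estimated
    from the upper triangle of the pairwise distance matrix."""
    dists = np.linalg.norm(V[:, None, :] - V[None, :, :], axis=-1)
    i, j = np.triu_indices(len(V), k=max(exclude, 1))
    d = dists[i, j]
    return np.array([np.mean(d <= eps) for eps in eps_values])
```

Plotting ln C(m, ε) against ln ε for m = 2, 3, … then yields D_2 from the common slope and K_2 from the vertical spacing of the lines, per (9.16).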

Next we consider the calculation of the Hurst parameter H. As pointed out earlier, H characterizes the long-term correlations in a time series. There are many different ways to estimate H. We have chosen DFA [53], since it is more reliable [47] and has been used to study EEG [54, 55].

DFA works as follows: First, divide a given EEG dataset of length N, which is treated as a random-walk process, into ⌊N/l⌋ nonoverlapping segments (where ⌊x⌋ denotes the largest integer not greater than x), each containing l points; then define the local trend in each segment to be the ordinate of a linear least-squares fit of the time series in that segment; finally, compute the "detrended walk," denoted by x_l(n), as the difference between the original "walk" x(n) and the local trend. One then examines the following scaling behavior:

$${F}_{d}(l) = \left < \sum \limits_{i=1}^{l}{x}_{ l}{(i)}^{2}\right > \sim {l}^{2H}$$
(9.18)

where the angle brackets denote ensemble averages over all the segments. From our EEG data, we have found that the power-law fractal scaling breaks down around l ≈ 6. This is caused by distinct time scales defined by the α rhythm [54] or the dendritic time constants [55]. Figure 9.6f shows H(t) for our EEG data. We observe that the pattern of H(t) is very similar to that of λ_1(t) but reciprocal to K_2(t) and D_2(t). Such relations cannot be readily understood intuitively, since the foundations of chaos theory and random fractal theory are entirely different.
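A compact order-1 DFA sketch (here the input is first integrated into a profile, the usual convention for increment-like data; for data already treated as a random walk, as in the text, one would skip that step):

```python
import numpy as np

def dfa(x, scales):
    """Order-1 DFA: integrate x into a profile, linearly detrend it in
    nonoverlapping windows of length l, and return the RMS fluctuation
    F(l). For the increments of a 1/f^{2H+1} walk, F(l) ~ l^H, which
    corresponds to the l^{2H} scaling of (9.18) up to the normalization
    convention."""
    y = np.cumsum(x - np.mean(x))            # profile ("random walk")
    F = []
    for l in scales:
        n_seg = len(y) // l
        t = np.arange(l)
        ms = []
        for s in range(n_seg):
            seg = y[s * l:(s + 1) * l]
            trend = np.polyval(np.polyfit(t, seg, 1), t)
            ms.append(np.mean((seg - trend) ** 2))
        F.append(np.sqrt(np.mean(ms)))
    return np.array(F)
```

H is then estimated as the slope of log F(l) vs. log l over the scaling range.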

Let us now resolve the curious relations observed among λ_1(t), K_2(t), D_2(t), and H(t).

  • Generally, entropy measures the randomness of a dataset. This pertains to small scales. Therefore, K_2(t) should be similar to λ_{small-ε}(t), and this is indeed the case. We should point out that we have also calculated other entropy-related measures, such as the LZ complexity [50], which is closely related to the Shannon entropy, and the permutation entropy [51], and observed similar variations. Therefore, we can conclude that the variation of the entropy is represented by λ_{small-ε}(t), regardless of how the entropy is defined.

  • To understand why λ_1(t) calculated by the algorithm of Wolf et al. [21] corresponds to λ_{large-ε}(t), we note that this algorithm involves a scale parameter such that whenever the divergence between a reference and a perturbed trajectory exceeds the chosen scale, a renormalization procedure is performed. When the algorithm is applied to a time series of only a few thousand points, a fairly large scale parameter has to be chosen in order to obtain a well-defined LE. This is why the LE and λ_{large-ε} are similar. In fact, the scale we have chosen to calculate λ_1(t) is even larger than that for calculating λ_{large-ε}(t); this is why the value of λ_1(t) shown in Fig. 9.6c is smaller than that of λ_{large-ε}(t) shown in Fig. 9.6b.

  • It is easy to see that if one fits the λ(ε) curves shown in Fig. 9.5 by a straight line, then the variation of the slope with time should be similar to λ_{small-ε}(t) but reciprocal to λ_{large-ε}(t). Such a pattern is preserved even if one first takes the logarithm of λ(ε) and then does the fitting. This discussion makes it clear that even if EEG is not ideally of the 1/f^{2H+1} type, qualitatively the relation λ(ε) ∼ ε^{−1/H} holds, which in turn implies D_2 ∼ 1/H. With these arguments, it is clear that the seemingly puzzling relations among the measures considered here can be readily understood from the λ(ε) curves. Most importantly, we have established that commonly used complexity measures can be related to the values of the SDLE at specific scales.

As we have pointed out, around the characteristic scale \(\overline{\epsilon }\), λ(ε) is always close to 0. The pattern of λ(ε) around \(\overline{\epsilon }\) is governed by the structured components in the data, such as the α, γ, β, and δ waves. From Fig. 9.5, we observe that the patterns for seizure and non-seizure EEG segments are very different. Such information is certainly helpful in predicting/detecting seizures. Since the numerous measures considered here are already very effective for this purpose, we will not pursue this issue further. The issue becomes more important when distinguishing healthy subjects from patients with heart disease using HRV data, as we will soon show.

3.2 HRV Analysis

HRV is an important dynamical variable of cardiovascular function. Its most salient feature is spontaneous fluctuation, even when environmental parameters are maintained constant and no perturbing influences can be identified. Since the observation that HRV is related to various cardiovascular disorders [56], a number of methods have been proposed to analyze HRV data. They include methods based on simple statistics from time- and frequency-domain analyses (see [57] and references therein), as well as methods derived from chaos theory and random fractal theory [10, 14, 15, 58–61]. We shall now show that the SDLE can readily characterize the hidden differences in HRV under healthy and diseased conditions and shed new light on the dynamics of the cardiovascular system.

We examine two types of HRV data, one from healthy subjects and the other from subjects with congestive heart failure (CHF), a life-threatening disease. The data were downloaded from PhysioNet [8]. There are 18 healthy subjects and 15 subjects with CHF. Some of these datasets have previously been analyzed using random fractal theory; in particular, 12 of the 15 CHF datasets were analyzed by wavelet-based multifractal analysis [13] for the purpose of distinguishing healthy subjects from CHF patients. For ease of comparison, we take the first 3 × 10^4 points of both groups of HRV data for analysis. In Fig. 9.7a, b, we show two typical λ(ε) vs. ε curves, one for a healthy subject and the other for a patient with CHF. We observe that for the healthy subject, λ(ε) decreases linearly with ln ε before λ reaches about 0, i.e., before ε settles around the characteristic scale \(\overline{\epsilon }\). Recall that this is a characteristic of noisy dynamics (Fig. 9.2). For the CHF case plotted in Fig. 9.7b, we observe that λ(ε) is oscillatory, with its value always close to 0; hence, the only scale resolvable is around \(\overline{\epsilon }\). Since the length of the time series used in our analysis is the same for the healthy and the CHF subjects, the inability to resolve the behavior of λ(ε) on scales much smaller than \(\overline{\epsilon }\) for patients with CHF strongly suggests that the dimension of the dynamics of the cardiovascular system is considerably higher for CHF patients than for healthy subjects.

Fig. 9.7 λ(ε) (per beat) vs. ε (in semi-log scale) for the HRV data of (a) a healthy subject and (b) a subject with CHF

We now discuss how to distinguish between healthy subjects and patients with CHF from HRV analysis. We have devised two simple measures, or features (a sketch of both follows Fig. 9.8). The first characterizes how well the linear relation between λ(ε) and ln ε is defined; we quantify this by the error between a fitted straight line and the actual λ(ε) vs. ln ε plots of Fig. 9.7a, b. The second characterizes how well the characteristic scale \(\overline{\epsilon }\) is defined; this is quantified by the ratio between two scale ranges, one from the second to the sixth point of the λ(ε) curves and the other from the seventh to the eleventh point. Each subject's data can then be represented as a point in the feature plane, as shown in Fig. 9.8. We observe that for healthy subjects, feature 1 is generally very small while feature 2 is large, indicating that the dynamics of the cardiovascular system resemble those of a nonlinear system with stochasticity, with resolvable small-scale behavior and a well-defined characteristic scale \(\overline{\epsilon }\). The opposite is true for the patients with CHF: feature 1 is large but feature 2 is small, indicating not only that the small-scale behavior of the λ(ε) curves cannot be resolved, but also that the characteristic scale \(\overline{\epsilon }\) is not well defined. Very interestingly, these two simple features completely separate the normal subjects from the patients with CHF. In fact, each feature alone can almost perfectly separate the two groups of subjects studied here.

Fig. 9.8 Feature plane separating normal subjects from subjects with CHF
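The chapter does not spell out the exact formulas for the two features, so the sketch below makes plain choices (RMS misfit of a straight-line fit for feature 1; the ratio of the scale spans covered by points 2–6 and points 7–11 for feature 2) that should be read as one plausible rendering, not the authors' exact code:

```python
import numpy as np

def hrv_features(eps, lam):
    """Feature 1: how poorly a straight line fits lambda vs ln(eps).
    Feature 2: ratio of the scale range covered by points 2-6 of the
    lambda(eps) curve to that covered by points 7-11 (1-based)."""
    x = np.log(eps)
    coeffs = np.polyfit(x, lam, 1)
    feature1 = np.sqrt(np.mean((lam - np.polyval(coeffs, x)) ** 2))
    feature2 = (eps[5] - eps[1]) / (eps[10] - eps[6])
    return feature1, feature2
```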

It is interesting to note that for the purpose of distinguishing normal HRV from CHF HRV, the features derived from the SDLE are much more effective than other metrics, including the Hurst parameter, the sample entropy, and the multiscale entropy. For the details of these comparisons, we refer to [39].

4 Concluding Remarks

In this chapter, we have discussed a multiscale complexity measure, the SDLE. We have shown that it can readily characterize low-dimensional chaos and random 1/f^α processes, readily detect intermittent chaos, conveniently deal with nonstationarity, accurately detect epileptic seizures from EEG data, and distinguish healthy subjects from patients with CHF using HRV data. More importantly, we have established that commonly used complexity measures can be related to the value of the SDLE at specific scales, and that the pattern of the SDLE around the characteristic scale \(\overline{\epsilon }\) contains much useful information on the structured components of the data that may greatly help detect significant patterns. Because of the ubiquity of chaos-like motions and 1/f^α-type processes and the complexity of HRV and EEG data, our analyses strongly suggest that the SDLE is potentially important for clinical practice and provides a comprehensive characterization of complex data arising from a wide range of fields in science and engineering.

Our analyses have a number of important implications:

  • To comprehensively characterize the complexity of complicated data such as HRV or EEG data, a wide range of scales has to be considered, since the complexity may be different on different scales. For this purpose, the entire λ(ε) curve, where ε is such that λ(ε) is positive, provides a good solution. This point is particularly important when one wishes to compare the complexity between two signals—the complexity for one signal may be higher on some scales, but lower on other scales. The situation shown in Fig. 9.5 may be considered one of the simplest.

  • For detecting important events such as epileptic seizures, λ_{small-ε} and λ_{large-ε} appear to provide better-defined features than other commonly used complexity measures. This may be because λ_{small-ε} and λ_{large-ε} are evaluated at fixed scales, while other measures are not. In other words, scale mixing may blur the features of the events being detected, such as seizures.

  • In recent years, there has been much effort in searching for cardiac chaos [14–18, 58]. Because the largest positive LE and the correlation dimension cannot unambiguously distinguish deterministic chaos from noise, it is still unclear whether the control mechanism of the cardiovascular system is truly chaotic. Our analysis here strongly suggests that if cardiac chaos does exist, it is more likely to be identified in healthy subjects than in pathological groups, because the dimension of the dynamics of the cardiovascular system appears to be lower for healthy subjects than for pathological ones. Intuitively, such an implication makes sense: a healthy cardiovascular system is a tightly coupled system with coherent functions, while components in a malfunctioning cardiovascular system are somewhat loosely coupled and function incoherently.

As example applications, we have focused here on the analysis of HRV and EEG data. It is evident that the SDLE will be useful for other kinds of complex data, including financial time series and various other kinds of physiological data. While much past and current research has focused on determining whether experimental data are chaotic or not, the scaling laws of the SDLE suggest that it is often feasible to obtain the defining parameters of the data under study. That is, if the data are chaotic, then one should find out what kind of chaos it is; and if they are random, one can aim to find out what kind of random process they follow, including its correlation structure. While in principle the SDLE is able to do so without preprocessing of the data under study, suitable detrending and denoising may help. A particularly simple and versatile procedure is the smooth adaptive filter developed by the authors, which has been successfully applied to recover chaos in heavy-noise environments [62–64].