
1 Discrete-Time Systems

The progress in integrated circuit manufacturing (very large scale integration (VLSI)), combined with the need for high-complexity, high-accuracy signal analysis and processing algorithms without an increase in development cost or time, are the basic factors behind the continuously increasing use of digital systems in industry and technology [9.1, 9.2].

1.1 Discrete-Time Signals and Digital Systems

A digital processing system is a system where the basic mathematical operations and functions, including signal transformations and data series analysis, are implemented as an embedded program on real-time computer hardware [9.1, 9.2]. Since the computer hardware employed is digital, the signals that can be processed or generated cannot be continuous-time, i.e., defined for every instant in time. Instead, the computer processes and generates tables (i.e., vectors) of values for the input and output signals, respectively. Such tables hold the values of the corresponding signals at discrete time instants and are referred to as discrete-time signals. The time instants for which a discrete-time signal is known or defined are in most practical cases equally spaced; this scheme is known as uniform sampling, and the spacing between two successive samples of the signal is referred to as the sampling interval or period.

Systems, e.g., real-time digital hardware, that process discrete-time signals are known as discrete-time systems. The main advantages of using computers in marine vehicle or process instrumentation, telecommunications, acoustics, etc., instead of analog (continuous-time) systems are [9.1, 9.3, 9.4, 9.5]:

  1. (Re)use of general-purpose hardware for implementing a wide variety of algorithms, instead of custom-built, proprietary designs that may become obsolete, causing higher acquisition and development cost.

  2. Changes in the algorithm can be easily implemented as changes in the software through reprogramming, without any need to modify the hardware.

  3. Easy implementation of adaptive or time-varying algorithms due to programming.

  4. Very small to zero sensitivity to environmental conditions, e.g., ambient temperature, in contrast to, e.g., analog electric circuitry.

  5. Given, well-defined accuracy in computation, determined by the computer's word length.

  6. User-friendly human–machine interface (HMI).

  7. Network connectivity.

However, there are some disadvantages in using discrete-time systems, the most important of which is the need for converter or adapter circuits, or interface systems, at their input and output in order to interact with the vast majority of real-world physical or engineering systems and processes, which are most often continuous-time. Such interfaces are the analog-to-digital (A/D) and digital-to-analog (D/A) converters.

1.2 Signal Sampling

1.2.1 Modeling of Ideal Sampler

Consider a continuous-time signal x(t), which is driven through a sampling circuit (sampler) operating at a constant sampling rate (frequency), $f_s = 1/T_s$, as shown in Fig. 9.1.

Fig. 9.1 Analog signal sampler

The sampler's output is a sequence of samples of the input signal, $x(nT_s)$, where n is an integer and $T_s$ the sampling period. The sequence $x(n) = x(nT_s)$ is a discrete-time signal.

We will now represent the discrete-time signal as well as the sampling process in a continuous-time framework. Such a representation can be derived on the basis of Fig. 9.2.

Fig. 9.2 Sampling of continuous-time waveform at uniform rate

A pulsed continuous-time signal $x_\Delta(t)$, which corresponds to the sequence $x(nT_s)$, can be generated from the original continuous-time signal x(t) (dashed line) if the latter is multiplied by the following sampling pulse train $P_\Delta(t)$

$$P_\Delta(t) = \sum_{n=-\infty}^{+\infty} p_\Delta(t - nT_s)\,,$$
(9.1)

where

$$p_\Delta(t) = \begin{cases} 1/\Delta\,, & t \in [0,\Delta] \\ 0\,, & t \notin [0,\Delta] \end{cases}\,, \qquad 0 < \Delta \le T_s\,.$$
(9.2)

In effect

$$x_\Delta(t) = x(t)\,P_\Delta(t)\,.$$
(9.3)

However, the following holds

$$\lim_{\Delta\to 0} p_\Delta(t) = \delta(t) \;\Rightarrow\; \lim_{\Delta\to 0} P_\Delta(t) = \sum_{n=-\infty}^{+\infty} \delta(t - nT_s)\,.$$
(9.4)

In the above ideal case, the sampling pulse train becomes an impulse train or Dirac comb. In effect, (9.3) becomes

$$x_\delta(t) = x(t)\sum_{n=-\infty}^{+\infty}\delta(t-nT_s) = \sum_{n=-\infty}^{+\infty} x(nT_s)\,\delta(t-nT_s) = \sum_{n=-\infty}^{+\infty} x(n)\,\delta(t-nT_s)\,.$$
(9.5)

Above, the following property of Dirac's impulse delta has been used: $\chi(t)\delta(t) = \chi(0)\delta(t)$, which implies the assumption that signal $\chi(t)$ assumes finite values.

Hence, the ideal sampler operation consists of multiplying the input signal by a Dirac comb with period $T_s$. We will now determine the spectrum of signal $x_\delta(t)$ at the output of the sampler by employing the Fourier transform.

1.2.2 Fourier Series Expansion and Fourier Transform

Any periodic signal x(t) with period T can be expanded into a complex Fourier series as follows

$$x(t) = \sum_{n=-\infty}^{+\infty} c_n \exp\left(i n \frac{2\pi}{T} t\right)\,.$$
(9.6)

Coefficients $c_n$ of the series are calculated according to

$$c_n = \frac{1}{T}\int_0^T x(t)\exp\left(-i n \frac{2\pi}{T} t\right) dt\,.$$
(9.7)

The above can be extended to the case of aperiodic (non-periodic) signals by letting period T tend to infinity, whereupon the discrete frequencies $n\,2\pi/T$ merge into the continuous variable ω. That is how the Fourier transform emerges

$$X(\omega) = \mathcal{F}\{x(t)\} = \int_{-\infty}^{+\infty} x(t)\exp(-i\omega t)\, dt\,.$$
(9.8)

The inverse Fourier transform is defined by

$$x(t) = \frac{1}{2\pi}\int_{-\infty}^{+\infty} X(\omega)\exp(i\omega t)\, d\omega\,.$$
(9.9)

To keep the formalism general, we refer to the Fourier transform of any signal, periodic or not. In the case of a periodic signal, the Fourier transform is directly connected to the coefficients $c_n$ of its series expansion as follows

$$X(\omega) = 2\pi\sum_{n=-\infty}^{+\infty} c_n\, \delta\!\left(\omega - n\frac{2\pi}{T}\right)\,.$$
(9.10)

The above can be verified directly by plugging it into (9.9) and recovering (9.6). The Fourier transform, just like the Laplace transform, from which the former can be obtained by setting $s = i\omega$, is a linear integral transform. Its properties are essentially identical to those of the Laplace transform and can be found in the literature. An important property is the multiplication–convolution duality

$$x(t) = x_1(t) * x_2(t) \equiv \int_{-\infty}^{+\infty} x_1(\xi)\,x_2(t-\xi)\,d\xi \;\Rightarrow\; X(\omega) = X_1(\omega)\,X_2(\omega)\,,$$
$$x(t) = x_1(t)\,x_2(t) \;\Rightarrow\; X(\omega) = X_1(\omega) * X_2(\omega) \equiv \int_{-\infty}^{+\infty} X_1(\xi)\,X_2(\omega-\xi)\,d\xi\,.$$
(9.11)

Here, $*$ stands for the convolution operator. Note that the above can be directly applied to the Dirac comb describing the operation of an ideal sampler, exploiting its periodicity.

Indeed, the Dirac comb in (9.4) is a periodic signal with period $T_s$. Its Fourier series coefficients are given below

$$c_n = \frac{1}{T_s}\int_0^{T_s}\delta(t)\exp(-i n\omega_s t)\,dt = \frac{\exp(-i n\omega_s 0)}{T_s}\int_0^{T_s}\delta(t)\,dt = \frac{1}{T_s}\,, \qquad \omega_s = \frac{2\pi}{T_s}\,.$$
(9.12)

Hence, based on (9.10) the spectrum (Fourier transform) of the Dirac comb is given by

$$\mathcal{F}\left\{\sum_{n=-\infty}^{+\infty}\delta(t-nT_s)\right\} = 2\pi\sum_{n=-\infty}^{+\infty}\frac{1}{T_s}\,\delta\!\left(\omega - n\frac{2\pi}{T_s}\right) = \omega_s\sum_{n=-\infty}^{+\infty}\delta(\omega - n\omega_s)\,.$$
(9.13)

1.2.3 Nyquist Sampling Rate – Aliasing

Combining the above relationship for the Dirac comb spectrum with (9.5), as well as the Fourier-transform duality between multiplication and convolution, one can determine the spectrum of signal $x_\delta(t)$ at the output of an ideal sampler

$$x_\delta(t) = x(t)\sum_{n=-\infty}^{+\infty}\delta(t-nT_s) \;\Rightarrow\; X_\delta(\omega) = X(\omega) * \mathcal{F}\left\{\sum_{n=-\infty}^{+\infty}\delta(t-nT_s)\right\} = \omega_s\, X(\omega) * \sum_{n=-\infty}^{+\infty}\delta(\omega - n\omega_s)$$
$$\Rightarrow\; X_\delta(\omega) = \omega_s\sum_{n=-\infty}^{+\infty} X(\omega) * \delta(\omega - n\omega_s) = \omega_s\sum_{n=-\infty}^{+\infty}\int_{-\infty}^{+\infty} X(\xi)\,\delta(\omega - n\omega_s - \xi)\,d\xi\,.$$

The following property of the Dirac impulse delta can now be employed

$$\chi(r) * \delta(r - r_0) = \int_{-\infty}^{+\infty}\chi(\xi)\,\delta(r - r_0 - \xi)\,d\xi = \chi(r - r_0)\,.$$
(9.14)

Eventually one obtains

$$X_\delta(\omega) = \mathcal{F}\left\{\sum_{n=-\infty}^{+\infty} x(nT_s)\,\delta(t-nT_s)\right\} = \omega_s\sum_{n=-\infty}^{+\infty} X(\omega - n\omega_s)\,.$$
(9.15)

Using (9.15), one can assess the effect of sampling on the spectrum $X(\omega)$ of the analog (continuous-time) signal at the input: sampling introduces an infinite number of aliases of spectrum $X(\omega)$, centered at integer multiples of the sampling circular frequency $\omega_s$.

To visualize this, consider a real signal with all its spectral content lying within the low-frequency range (a low-pass or baseband signal), e.g., the signal with the spectrum shown in Fig. 9.3. Note that the amplitude spectrum $|X(\omega)|$ of the signal is an even function of circular frequency ω, due to the fact that the signal is assumed to be purely real in the time domain.

Fig. 9.3 Typical spectrum of real low-pass signal

The amplitude spectrum of signal $x_\delta(t)$ is shown in Fig. 9.4, as generated by sampling the low-pass signal x(t) of Fig. 9.3 at three different sampling rates: $\omega_{s,0} = 2\omega_{\max}$, $\omega_{s,1} = 4\omega_{\max}$, $\omega_{s,2} = \omega_{\max}$.

Fig. 9.4 Sampled signal spectrum at various sampling rates

Using the plot, one can readily derive the following sampling rate criterion, attributed to Nyquist (or the Shannon sampling theorem), for a low-pass signal like the one in Fig. 9.3

$$\omega_s \ge \omega_{s,0} = 2\omega_{\max}\,.$$
(9.16)

If the condition above holds, then, as seen in (9.15) (spectrum periodicity of the sampled signal) as well as in Fig. 9.5, an ideal (brick-wall) low-pass filter with the following transfer function can be used to reconstruct the original continuous-time signal

$$H_{\mathrm{LPF}}(\omega) = \begin{cases} 1\,, & |\omega| \le \omega_s/2 \\ 0\,, & |\omega| > \omega_s/2 \end{cases}\,.$$
(9.17)

The original spectrum is ideally reproduced without any distortion since it holds that

$$H_{\mathrm{LPF}}(\omega)\,X_\delta(\omega) = \omega_s X(\omega)\,.$$
(9.18)

This is made clearer next when reconstruction of a continuous-time signal based on the values of a discretized version is discussed.
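
As a quick numerical illustration of the Nyquist criterion, the following minimal Python sketch (the 100 Hz sampling rate and the tone frequencies are arbitrary, assumed values) shows that a tone above half the sampling rate is indistinguishable, after sampling, from its low-frequency alias:

```python
import numpy as np

fs = 100.0                     # assumed sampling rate (Hz)
t = np.arange(0, 1, 1 / fs)    # one second of uniformly spaced samples

# A 30 Hz tone satisfies the criterion (fs >= 2*30); a 70 Hz tone does not.
x_ok = np.cos(2 * np.pi * 30 * t)
x_hi = np.cos(2 * np.pi * 70 * t)

# cos(2*pi*70*n/100) equals cos(2*pi*(100 - 70)*n/100) at the sample instants:
# the 70 Hz tone aliases onto 30 Hz and the two sampled sequences coincide.
print(np.allclose(x_ok, x_hi))   # True
```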

Fig. 9.5 Original continuous-time signal (continuous line) and reconstructed version using a non-brick filter (dashed line)

1.3 Analog Signal Reconstruction Using a Discrete-Time Signal

The simplest process to generate a continuous-time signal, x(t), from a discretized sequence (discrete-time signal) x(n) is to employ the zero-order hold (ZOH) network. This system holds its output constant and equal to the value applied to its input for an entire sampling period $T_s$. In effect, the output of the ZOH when driven by a discretized signal is as shown in Fig. 9.5.

The mathematical description of a ZOH system is achieved through the following impulse response and transfer function in the complex frequency domain of variable s

$$h_{\mathrm{ZOH}}(t) = u_{\mathrm{step}}(t) - u_{\mathrm{step}}(t - T_s) \;\Rightarrow\; H_{\mathrm{ZOH}}(s) = \frac{1 - e^{-sT_s}}{s}\,.$$
(9.19)

To better understand the action of the ZOH system, its transfer function in the circular frequency domain ω (Fourier transform) is given below, as well

$$H_{\mathrm{ZOH}}(\omega) = \frac{1 - e^{-i\omega T_s}}{i\omega} = T_s\, e^{-i\omega T_s/2}\,\frac{\sin(\omega T_s/2)}{\omega T_s/2} = \frac{e^{-i\pi f/f_s}}{f_s}\,\mathrm{sinc}(f/f_s)\,.$$
(9.20)

In the above, $\mathrm{sinc}(\chi) = \sin(\pi\chi)/(\pi\chi)$. Figure 9.6 shows the amplitude spectra of the transfer functions of the ZOH system (normalized by setting $f_s = 1\,\mathrm{Hz}$) and of the ideal reconstruction low-pass filter introduced in (9.17).

Fig. 9.6 Amplitude spectrum of transfer function $H_{\mathrm{ZOH}}(\omega)$ (continuous line) and $H_{\mathrm{LPF}}(\omega)$ (dashed line)

Indeed, in the switched RC circuit of Fig. 9.7, if resistance R is small (theoretically zero), then the time constant τ, indicating the time needed to charge the capacitor in the hold circuit, is small compared to the sampling period $T_s$. The voltage across the terminals of the capacitor stays at the value set at the previous sampling instant, when the switch was momentarily on, until the next instant the switch is on, and so forth. The differences between this analog memory element, as the switched resistor–capacitor (RC) circuit is known, and the ideal ZOH model are:

  a) The ohmic resistance of the circuit is non-zero and, therefore, the time to charge the capacitor is finite and not zero.

  b) The duration of the conduction phase of the switch is finite and non-zero.

  c) The ohmic resistance seen at the terminals of the capacitor when the switch is off is not infinite (open circuit) but finite, eventually leading to discharge of the capacitor.

The use of the RC hold circuit is widespread due to its simplicity. However, if a more accurate reconstruction with less distortion is required without increasing the sampling rate, then a hold circuit of order higher than zero (e.g., a first-order hold (FOH)) must be employed, so that its transfer function better approximates the ideal brick-wall low-pass filter in (9.17), as shown in Fig. 9.6.

1.4 The Z-Transform

The single-sided (unilateral) Z-transform is a linear transform defined as follows for a discrete-time signal x(n)

$$X(z) = \mathcal{Z}\{x(n)\} = \sum_{n=0}^{+\infty} x(n)\, z^{-n}\,.$$
(9.21)

The Z-transform is derived from the Laplace transform of signal $x_\delta(t)$, which is generated by the ideal sampler of Fig. 9.1,

$$x_\delta(t) = \sum_{n=-\infty}^{+\infty} x(n)\,\delta(t-nT_s) \;\Rightarrow\; X_\delta(s) = \mathcal{L}\left\{\sum_{n=-\infty}^{+\infty} x(n)\,\delta(t-nT_s)\right\} = \sum_{n=0}^{+\infty} x(n)\, e^{-nsT_s}\,.$$
(9.22)

In effect,

$$X(z)\big|_{z^{-1} = \exp(-sT_s)} = X_\delta(s)\,.$$
(9.23)

Variable $z^{-1}$ is used for describing a signal or a system, as we will see later on. Its significance lies in the fact that multiplication by $z^{-1}$ in the transform domain translates to a time shift by one sampling period, $T_s$, in the discrete-time domain n. Indeed,

$$y(n) = x(n-1) \;\Rightarrow\; Y(z) = z^{-1}X(z)\,.$$
(9.24)

Furthermore, here are some more of the most important properties of the Z-transform:

  1. Linearity

$$\left.\begin{aligned} X_1(z) &= \mathcal{Z}\{x_1(n)\} \\ X_2(z) &= \mathcal{Z}\{x_2(n)\} \\ k_1, k_2 &\in \mathbb{R} \end{aligned}\right\} \;\Rightarrow\; k_1 X_1(z) + k_2 X_2(z) = \mathcal{Z}\{k_1 x_1(n) + k_2 x_2(n)\}\,.$$
    (9.25)

  2. Delay or advance by N discrete-time units

$$\mathcal{Z}\{x(n)\} = X(z) \;\Rightarrow\; \begin{cases} \mathcal{Z}\{x(n-N)\} = z^{-N}X(z) \\ \mathcal{Z}\{x(n+N)\} = z^{N}X(z) - \sum_{k=0}^{N-1} z^{N-k}\,x(k) \end{cases}\,.$$
    (9.26)

  3. Initial and final value theorems

$$\mathcal{Z}\{x(n)\} = X(z) \;\Rightarrow\; \begin{cases} x(0) = \lim_{z\to\infty} X(z) \\ \lim_{n\to\infty} x(n) = \lim_{z\to 1}\left((1-z^{-1})X(z)\right) \end{cases}\,.$$
    (9.27)

    Note: the final value theorem ($n\to\infty$) holds only if $(1-z^{-1})X(z)$ is an analytic function of complex variable z on and outside the unit circle.

  4. Discrete convolution theorem

$$\left.\begin{aligned} X_1(z) &= \mathcal{Z}\{x_1(n)\} \\ X_2(z) &= \mathcal{Z}\{x_2(n)\} \\ x(n) &= x_1(k) * x_2(k) \equiv \sum_{k=-\infty}^{+\infty} x_1(k)\,x_2(n-k) \end{aligned}\right\} \;\Rightarrow\; X(z) = \mathcal{Z}\{x(n)\} = X_1(z)\,X_2(z)\,.$$
    (9.28)

    Note: because x(n) needs to be causal, i.e., it can only depend on values of signals $x_1$ and $x_2$ prior to or at instant n, the bounds of the convolutional summation are set as follows,

$$(x_1(k) * x_2(k))(n) = \sum_{k=0}^{n} x_1(k)\,x_2(n-k)\,.$$

    It is also pointed out here that the commutative and associative properties hold for the convolution operator applied on two discrete-time signals.

Fig. 9.7 Switched RC circuit modeled by the ZOH system

Finally, it is pointed out that the change of variable, $z = \exp(sT_s)$, on the basis of which the Z-transform is obtained from the Laplace transform of signal $x_\delta(t)$, defines a single-valued mapping of the complex frequency s-plane to the plane of the, also complex, delay operator z. Remember that the stability region on the complex plane $s = a + i\omega$ is the left-hand half-plane defined by the inequality $a \le 0$. In effect, the stability region on the delay operator z-plane is defined by the inequality

$$|z| \le 1\,,$$
(9.29)

since

$$z = e^{sT_s} = e^{aT_s}\, e^{i\omega T_s} \;\Rightarrow\; |z| = e^{aT_s}\,, \quad \arg(z) = \omega T_s\,.$$
(9.30)

That is, the stability region is the unit disc. Similarly, the origin of the s-plane is mapped to point $z = 1 + i0$ on the z-plane, while the imaginary axis of the s-plane is wrapped around the unit circle on the z-plane. Finally, the real s axis is mapped to the positive real z semi-axis. The Z-transforms of several elementary discrete-time signals are shown in Table 9.1. In the interest of completeness, and to allow comparisons, a Laplace transform table is provided in Table 9.2.

Table 9.1 Some Z-transform pairs
Table 9.2 Laplace transform pairs

1.5 Discrete-Time LTI Systems

1.5.1 Difference Equations

A special class of dynamic systems, widely used in systems and signal theory, is that of linear time-invariant (LTI) systems [9.1, 9.2, 9.4, 9.5, 9.6]. Such systems are described in continuous time by linear differential equations with constant coefficients. In discrete time, such a differential equation is converted to a difference equation corresponding to a discrete-time system described by a transfer function in variable z, or equivalently $z^{-1}$. The general form of a linear difference equation with input u(n) and output y(n) is the following,

$$y(n) = \sum_{k=0}^{M} b_k\, u(n-k) - \sum_{k=1}^{N} a_k\, y(n-k)\,.$$
(9.31)

To get the iterative process started, in order to obtain the samples of the discrete-time output signal y(n) for $n \ge 0$, N initial conditions are needed: $y(-k),\ k = 1,\dots,N$. Using (9.31), we can conclude that the n-th sample of the output is determined as a weighted sum of the current plus M past input samples and N past output samples, as the sketch below illustrates.
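
A minimal Python sketch of this iteration follows; the function name and the zero initial conditions are illustrative assumptions, and a library routine such as scipy.signal.lfilter performs the same recursion:

```python
import numpy as np

def difference_equation(b, a, u):
    """Iterate y(n) = sum_k b[k] u(n-k) - sum_k a[k] y(n-k), per (9.31),
    with zero initial conditions y(-1) = ... = y(-N) = 0 (an assumption)."""
    M, N = len(b) - 1, len(a)
    y = np.zeros(len(u))
    for n in range(len(u)):
        y[n] = sum(b[k] * u[n - k] for k in range(M + 1) if n - k >= 0)
        y[n] -= sum(a[k - 1] * y[n - k] for k in range(1, N + 1) if n - k >= 0)
    return y

u = np.ones(10)                               # unit step input
print(difference_equation([1.0], [-0.5], u))  # y(n) = u(n) + 0.5 y(n-1)
```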

1.5.2 Linear Time-Invariant Discrete-Time Systems

The generic difference equation (9.31) in the discrete-time domain defines an LTI discrete-time system. In the Z-transform domain, the difference equation becomes the following transfer function

$$H(z^{-1}) = \frac{Y(z)}{U(z)} = \frac{b_0 + b_1 z^{-1} + \cdots + b_M z^{-M}}{1 + a_1 z^{-1} + \cdots + a_{N-1} z^{-N+1} + a_N z^{-N}} \;\Leftrightarrow\; H(z) = \frac{b_0 z^N + b_1 z^{N-1} + \cdots + b_M z^{N-M}}{z^N + a_1 z^{N-1} + \cdots + a_N}\,, \quad M \le N\,.$$
(9.32)

A special, yet very important, case of discrete-time systems are finite impulse response (FIR) systems, in contrast to the generic infinite impulse response (IIR) system defined in (9.31). For an FIR system it holds that $a_k = 0,\ k > 0$ in (9.31), hence

$$y(n) = \sum_{k=0}^{M} b_k\, u(n-k) \quad\text{and}\quad H(z^{-1}) = b_0 + b_1 z^{-1} + \cdots + b_M z^{-M}\,.$$
(9.33)

In effect, the n-th output sample is exclusively determined as a weighted sum of the current plus M past input samples. The difference equation thus ceases to be recursive, which is why in the literature FIR systems are also referred to as MA (moving average) systems.

In contrast, discrete-time LTI systems that have exclusively poles, i.e., $b_i = 0$ for every $i > 0$ in (9.32), are also known as AR (auto-regressive) systems, while the generic IIR system is also referred to as an auto-regressive moving average (ARMA) system in this framework.

1.6 Continuous-Time System Mapping

There are many ways (methods) to map a continuous-time system to a discrete-time equivalent [9.1, 9.2, 9.4, 9.5, 9.6]. All of these methods are based on preserving some characteristic of the continuous-time system during the procedure; this is why there is more than one method. However, stability (or instability) of the continuous-time system is preserved by any of the methods presented here.

The mapping methods considered here can be classified into the following categories:

  1. Approximate differentiation or integration:

    • Backward difference

    • Forward difference

    • Trapezoidal integration (Tustin transformation).

  2. Time-domain response matching at sampling instants:

    • Impulse response matching

    • Step response matching.

  3. Dynamic matching, i.e., matching of all poles and zeros appearing in the transfer function.

1.6.1 Approximate Differentiation or Integration

Consider a continuous-time signal x(t) being sampled with sampling interval $T_s$, so that the discrete-time signal x(n) is obtained. Then the following approximations can be derived.

Approximate derivative with respect to time using backward difference

$$\dot{x}(t) \approx \frac{x(n) - x(n-1)}{T_s} \;\Rightarrow\; s \approx \frac{1 - z^{-1}}{T_s}\,.$$
(9.34)

Approximate derivative with respect to time using forward difference

$$\dot{x}(t) \approx \frac{x(n+1) - x(n)}{T_s} \;\Rightarrow\; s \approx \frac{z - 1}{T_s} = \frac{1 - z^{-1}}{T_s\, z^{-1}}\,.$$
(9.35)

Approximate integral with respect to time using the trapezoidal rule

$$x_1(nT_s) = \int_0^{nT_s} x(\xi)\,d\xi = \int_0^{(n-1)T_s} x(\xi)\,d\xi + \int_{(n-1)T_s}^{nT_s} x(\xi)\,d\xi \approx x_1((n-1)T_s) + \Delta x_1(n)\,, \quad \Delta x_1(n) \approx T_s\,\frac{x(n) + x(n-1)}{2}\,.$$
(9.36)

However,

$$X_1(s) = \frac{1}{s}\,X(s) \quad\text{and}\quad X_1(z) = z^{-1}X_1(z) + T_s\,\frac{1 + z^{-1}}{2}\,X(z)\,.$$
(9.37)

The above in effect yields

$$\frac{1}{s} \approx \frac{T_s}{2}\,\frac{1 + z^{-1}}{1 - z^{-1}} \;\Leftrightarrow\; s \approx \frac{2}{T_s}\,\frac{1 - z^{-1}}{1 + z^{-1}}\,.$$
(9.38)

Equations (9.34), (9.35), and (9.38) mean that, when presented with a transfer function H(s) in the complex frequency (s) domain describing a continuous-time LTI system, it is possible to derive a discrete-time transfer function H(z), or $H(z^{-1})$, defining a discrete-time LTI system, by substituting variable s with the desired approximation.
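
As an illustration, the trapezoidal (Tustin) substitution of (9.38) can be applied with scipy.signal.bilinear; the first-order system H(s) = 1/(s + 1) and the 10 Hz rate below are assumed example values:

```python
from scipy import signal

# Continuous-time H(s) = 1/(s + 1) as numerator/denominator coefficient lists.
b_s, a_s = [1.0], [1.0, 1.0]

fs = 10.0   # assumed sampling rate, Ts = 0.1 s
# bilinear() performs the substitution s -> (2/Ts)(1 - z^-1)/(1 + z^-1) of (9.38).
b_z, a_z = signal.bilinear(b_s, a_s, fs=fs)
print(b_z, a_z)   # coefficients of H(z^-1) in the form of (9.32)
```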

1.6.2 Impulse Response Matching

The methods presented previously were based on approximating the integration or differentiation operator appearing in the differential (dynamic) equation of an LTI system. However, an LTI system can also be described in full by its response to a given input signal. Common choices include either Dirac's delta (impulse) or the unit step (Heaviside) signal.

In effect, if the impulse or step response of a continuous-time system is sampled, then a discrete-time description can be derived as follows.

Invariance of impulse response: the impulse delta is defined as follows in discrete time

$$\delta(n) = \begin{cases} 0\,, & n \ne 0 \\ 1\,, & n = 0 \end{cases} \;\Rightarrow\; \Delta(z) = 1\,.$$
(9.39)

Given the transfer function H(z) of a discrete-time LTI system, it is easy to prove that, just like in the continuous-time case, H(z) is the Z-transform of the system's impulse response.

Indeed, the output y(n) of a discrete-time LTI system is given by the following convolutional sum

$$y(n) = u(k) * h(k) \equiv \sum_{k=-\infty}^{+\infty} u(k)\,h(n-k) \;\Rightarrow\; Y(z) = H(z)\,U(z)\,.$$
(9.40)

In effect,

$$u(n) = \delta(n) \;\Rightarrow\; U(z) = 1 \;\Rightarrow\; Y(z) = H(z)\,.$$

Consider a continuous-time LTI system with impulse response $h(t) = \mathcal{L}^{-1}\{H(s)\}$, which is sampled at rate $f_s$. This means that continuous-time signal $h_\delta(t) = \sum_{n=-\infty}^{+\infty} h(nT_s)\,\delta(t - nT_s)$ corresponds to discrete-time signal $h_\delta(n) \equiv h(nT_s)$, i.e., a sample set of h(t).

The Z-transform of $h_\delta(n)$,

$$H_\delta(z^{-1}) = \sum_{n=0}^{+\infty} h_\delta(n)\, z^{-n}\,,$$

defines a discrete-time LTI system, the impulse response of which, by definition, coincides with that of continuous-time transfer function H(s), at all sampling instants.

Assume now that for every positive ε there is a positive integer M such that

$$\sum_{n=M+1}^{+\infty} h_\delta^2(n) < \varepsilon\,.$$
(9.41)

Given the above, one can proceed to impulse response truncation with accuracy ε. This procedure consists of approximating the original IIR system with transfer function $H_\delta(z^{-1}) = \sum_{n=0}^{+\infty} h_\delta(n)\, z^{-n}$ by the following FIR one with memory M

$$\hat{H}_{\mathrm{FIR}}(z^{-1}) = \sum_{n=0}^{M} h_\delta(n)\, z^{-n} = \sum_{n=0}^{M} b_n z^{-n}\,.$$
(9.42)

Approximating a continuous-time system with a discrete-time one, which may moreover be FIR, comes with a number of advantages, especially for real-time applications.
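
A minimal numpy sketch of this truncation is given below, assuming the example system H(s) = 1/(s + 1) with impulse response h(t) = e^(-t); the sampling period and memory M are arbitrary illustrative choices:

```python
import numpy as np

Ts = 0.1
M = 60                         # FIR memory, chosen so the discarded tail is small
n = np.arange(M + 1)
b = np.exp(-n * Ts)            # b_n = h_delta(n) = h(n*Ts), per (9.42)

# Energy of the discarded tail, i.e., the epsilon of (9.41) achieved by this M:
tail = np.exp(-2 * (M + 1) * Ts) / (1 - np.exp(-2 * Ts))
print(tail)                    # ~3e-5: the FIR approximation is accurate
```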

Invariance of step response: the unit step signal in discrete time is defined as follows

$$u_{\mathrm{step}}(n) = \begin{cases} 1\,, & n \ge 0 \\ 0\,, & n < 0 \end{cases} \;\Rightarrow\; U_{\mathrm{step}}(z) = \sum_{n=0}^{+\infty} z^{-n} = \frac{1}{1 - z^{-1}}\,.$$
(9.43)

The Z-transform of the unit step signal is derived by applying the formula for the infinite sum of a geometric progression whose ratio is less than 1 in magnitude (here $|z^{-1}| < 1$).

Consider now an LTI system with step response as follows

$$y_{\mathrm{step}}(t) = \mathcal{L}^{-1}\{H(s)\,U_{\mathrm{step}}(s)\} = \mathcal{L}^{-1}\left\{\frac{H(s)}{s}\right\}\,.$$
(9.44)

We will now determine a discrete-time transfer function H(z) such that the latter’s unit step response coincides with that of the continuous-time system above. Mathematically, this is defined as follows

$$\mathcal{Z}^{-1}\{H(z)\,U_{\mathrm{step}}(z)\} = y_{\mathrm{step}}(n) = y_{\mathrm{step}}(nT_s)\,.$$
(9.45)

Then, by employing (9.43), one obtains that

$$H(z) = (1 - z^{-1})\,\mathcal{Z}\{y_{\mathrm{step}}(nT_s)\}\,.$$
(9.46)

Since

$$z^{-1} = \exp(-sT_s) \quad\text{and}\quad Y_{\mathrm{step}}(s) = \frac{H(s)}{s}\,,$$

we finally obtain that

$$H(z) = \mathcal{Z}\{y_{\mathrm{ZOH}}(nT_s)\}\,.$$
(9.47)

In the above, it holds that

$$y_{\mathrm{ZOH}}(t) = h_{\mathrm{ZOH}}(t) * h(t) \;\Rightarrow\; Y_{\mathrm{ZOH}}(s) = H_{\mathrm{ZOH}}(s)\,H(s) = \frac{1 - e^{-sT_s}}{s}\,H(s)\,.$$
(9.48)
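
In library form, step-response matching is the 'zoh' discretization offered by scipy.signal.cont2discrete; the example system and sampling period below are assumed:

```python
from scipy import signal

# Step-invariant (ZOH) discretization of H(s) = 1/(s + 1) at Ts = 0.1 s,
# implementing (9.46)-(9.48).
b_s, a_s, Ts = [1.0], [1.0, 1.0], 0.1
b_z, a_z, _ = signal.cont2discrete((b_s, a_s), Ts, method='zoh')
print(b_z, a_z)   # H(z) whose step response matches that of H(s) at t = n*Ts
```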

1.6.3 Pole and Zero Matching of a Transfer Function

This method derives from the equation

$$z^{-1} = \exp(-sT_s)$$

connecting variable z with complex frequency s. Combining the above with the following factorized form of a continuous-time scalar transfer function

$$H(s) = K_s\,\frac{(s + z_1)(s + z_2)\cdots(s + z_m)}{(s + p_1)(s + p_2)\cdots(s + p_n)}\,, \quad m \le n\,,$$
(9.49)

one can map every pole and every zero of the continuous-time transfer function to a corresponding pole and zero of an equivalent discrete-time transfer function

$$\zeta_i = -\exp(-T_s z_i)\,,\ i = 1,\dots,m\,; \qquad \pi_i = -\exp(-T_s p_i)\,,\ i = 1,\dots,n\,.$$
(9.50)

In effect, the following discrete-time transfer function is derived in its factorized form

$$H(z) = k_z\,\frac{(z + 1)^{n-m}\,(z + \zeta_1)(z + \zeta_2)\cdots(z + \zeta_m)}{(z + \pi_1)(z + \pi_2)\cdots(z + \pi_n)}\,.$$
(9.51)

Factor $(z+1)^{n-m}$ is introduced in the discrete-time transfer function above in correspondence to factor $(\tau s + 1)^{n-m}$, with $\tau \to 0$, which is hidden in the continuous-time transfer function (9.49). The intent is to equate the numerator degree to that of the denominator. This is achieved by introducing an extra zero at imaginary infinity ($\pm i\infty$) with multiplicity $(n-m)$ using factor $(\tau s + 1)^{n-m}$. In the discrete-time transfer function, factor $(\tau s + 1)^{n-m}$ is mapped to $(z+1)^{n-m}$, since point $\pm i\infty$ on the s-plane is mapped to −1 on the z-plane.

Finally, gain $k_z$ in the discrete-time transfer function is determined by equating its value at a certain z with that of the continuous-time transfer function at the equivalent point s. For example, for z = 1 the corresponding point is s = 0, and, therefore, $k_z$ is calculated as follows

$$H(z)\big|_{z=1} = H(s)\big|_{s=0} \;\Rightarrow\; k_z\,\frac{2^{n-m}\,(1 + \zeta_1)(1 + \zeta_2)\cdots(1 + \zeta_m)}{(1 + \pi_1)(1 + \pi_2)\cdots(1 + \pi_n)} = K_s\,\frac{z_1 z_2 \cdots z_m}{p_1 p_2 \cdots p_n}\,.$$
(9.52)
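
A small Python sketch of the pole–zero matching procedure (9.50)–(9.52) follows; the function and the example transfer function H(s) = 2(s + 1)/((s + 2)(s + 3)) are illustrative assumptions:

```python
import numpy as np

def matched_z(zeros_s, poles_s, K_s, Ts):
    """Map H(s) of the form (9.49) to H(z) of the form (9.51)."""
    n, m = len(poles_s), len(zeros_s)
    zeta = -np.exp(-Ts * np.asarray(zeros_s))      # discrete zeros, per (9.50)
    pi_ = -np.exp(-Ts * np.asarray(poles_s))       # discrete poles, per (9.50)
    # Gain k_z from matching H(z) at z = 1 with H(s) at s = 0, per (9.52).
    H_dc = K_s * np.prod(zeros_s) / np.prod(poles_s)
    k_z = H_dc * np.prod(1 + pi_) / (2 ** (n - m) * np.prod(1 + zeta))
    return zeta, pi_, k_z

print(matched_z([1.0], [2.0, 3.0], 2.0, 0.1))
```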

2 Digital Filters

In the previous section we studied different ways of describing discrete-time systems that are linear and time invariant. It was verified that the Z-transform greatly simplifies the analysis of discrete-time systems, especially those initially described by difference equations.

In this section, we study in finer detail several structures used to realize a given transfer function associated with a specific difference equation through the use of the Z-transform. The transfer functions considered here will be of the polynomial form (non-recursive filters, FIR) and of the rational-polynomial form (recursive filters, IIR). In the non-recursive case we emphasize the existence of the important subclass of linear-phase filters. Then we introduce some tools to calculate the digital network transfer function, as well as to analyze its internal behavior. We also discuss some properties of generic digital filter structures associated with practical discrete-time systems.

2.1 Important FIR Filter Structures

Non-recursive filters are characterized by a difference equation in the form

$$y(n) = \sum_{l=0}^{M} b_l\, x(n-l)\,,$$
(9.53)

where the $b_l$ coefficients are directly related to the system impulse response; that is, $b_l = h(l)$. Owing to the finite length of their impulse responses, non-recursive filters are also referred to as finite-duration impulse response (FIR) filters. We can rewrite (9.53) as follows

$$y(n) = \sum_{l=0}^{M} h(l)\, x(n-l)\,.$$
(9.54)

Applying the Z-transform to the equation above, we end up with the following input–output relationship

$$H(z) = \frac{Y(z)}{X(z)} = \sum_{l=0}^{M} h(l)\, z^{-l} = \sum_{l=0}^{M} b_l\, z^{-l}\,.$$
(9.55)

In practical terms, (9.55) can be implemented in several distinct forms, using as basic elements the delay, the multiplier, and the adder blocks. These basic elements of digital filters and their corresponding standard symbols are depicted in Fig. 9.8. An alternative way of representing such elements is the so-called signal flow graph shown in Fig. 9.9.

Fig. 9.8a–c Classic representation of basic elements of digital filters: (a) delay; (b) multiplier; (c) adder

Fig. 9.9a–c Signal-flow graph representation of basic elements of digital filters: (a) delay; (b) multiplier; (c) adder

These two sets of symbols representing the delay, multiplier, and adder elements are used interchangeably throughout this text.

2.1.1 Direct Form

The simplest realization of an FIR digital filter is derived from (9.55). The resulting structure, which can be seen in Fig. 9.10, is called the direct-form realization, as the multiplier coefficients are obtained directly from the filter transfer function. Such a structure is also referred to as the canonic direct form, where we understand canonic form to mean any structure that realizes a given transfer function with the minimum number of delays, multipliers, and adders. More specifically, a structure that utilizes the minimum number of delays is said to be canonical with respect to the delay element, and so on.
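
A direct-form FIR realization is simply a tapped delay line; the minimal sketch below (with assumed example coefficients) checks the explicit loop of Fig. 9.10 against numpy's convolution:

```python
import numpy as np

h = np.array([0.25, 0.5, 0.25])      # example impulse response (assumed)
x = np.random.randn(16)              # arbitrary input signal

# Direct form: y(n) = sum_l h(l) x(n - l), per (9.54), zero initial state.
y_loop = np.array([sum(h[l] * x[n - l] for l in range(len(h)) if n - l >= 0)
                   for n in range(len(x))])
y_conv = np.convolve(x, h)[:len(x)]  # the same causal convolution

print(np.allclose(y_loop, y_conv))   # True
```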

Fig. 9.10 Direct form for FIR digital filters

An alternative canonical direct form for (9.55) can be derived by expressing H(z) as follows

$$H(z) = \sum_{l=0}^{M} h(l)\, z^{-l} = h(0) + z^{-1}\sum_{l=1}^{M} h(l)\, z^{-(l-1)}\,.$$
(9.56)

The implementation of this form is shown in Fig. 9.11.

Fig. 9.11 Alternative direct form for FIR digital filters

2.1.2 Cascade Form

Equation (9.55) can be realized through a number of equivalent structures. However, the coefficients of such distinct realizations may no longer be, explicitly, the filter impulse response or the coefficients of the corresponding transfer function. An important example of such a realization is the so-called cascade form, which consists of a series of second-order FIR filters connected in cascade, hence the name of the resulting structure, as seen in Fig. 9.12. The transfer function associated with such a realization is of the form

$$H(z) = \prod_{k=1}^{N} \left(\gamma_{0k} + \gamma_{1k}\, z^{-1} + \gamma_{2k}\, z^{-2}\right)\,.$$
(9.57)

In the above, if M is the filter order, then N = M/2 when M is even and N = (M+1)/2 when M is odd. In the latter case, one of the $\gamma_{2k}$ coefficients vanishes.

Fig. 9.12 Cascade form for FIR digital filters

2.1.3 Linear-Phase Form

An important subclass of FIR digital filters is the one comprising linear-phase filters. Such filters are characterized by a constant group delay τ; therefore, they must present a frequency response of the following form

$$H(e^{i\omega}) = B(\omega)\exp(-i\omega\tau + i\phi)\,.$$
(9.58)

In the above, $B(\omega)$ is real, and τ and φ are constants. We now proceed to show that linear-phase FIR filters present impulse responses of very particular forms. Specifically, if h(n) is to be causal and of finite duration, for $0 \le n \le M$, we must necessarily have that

$$\tau = \frac{M}{2}\,.$$
(9.59)

Therefore, one obtains that

$$h(n) = e^{2i\phi}\, h(M - n)\,.$$
(9.60)

This is the general equation that the coefficients of a linear-phase FIR filter must satisfy. In the common case where all the filter coefficients are real, one finally obtains that the filter impulse response must be either symmetric or antisymmetric. In effect, the frequency response of linear-phase FIR filters with real coefficients becomes

$$H(e^{i\omega}) = B(\omega)\exp\left(-i\omega\frac{M}{2} + i\frac{k\pi}{2}\right)\,.$$
(9.61)

As a result, only four distinct cases, described by the equations above, need be considered. Their types have been standardized and are referred to in the literature as follows [9.2] (a small classification sketch is given after the list):

  • Type I: k = 0 and M even

  • Type II: k = 0 and M odd

  • Type III: k = 1 and M even

  • Type IV: k = 1 and M odd.
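
The following minimal sketch classifies a real FIR impulse response into the four types; the helper name is an illustrative assumption:

```python
import numpy as np

def linear_phase_type(h):
    """Classify a real FIR impulse response into types I-IV, per (9.60)."""
    h = np.asarray(h, dtype=float)
    M = len(h) - 1                        # filter order
    if np.allclose(h, h[::-1]):           # symmetric: k = 0
        return 'I' if M % 2 == 0 else 'II'
    if np.allclose(h, -h[::-1]):          # antisymmetric: k = 1
        return 'III' if M % 2 == 0 else 'IV'
    return 'not linear phase'

print(linear_phase_type([1, 2, 3, 2, 1]))   # 'I'   (symmetric, M = 4 even)
print(linear_phase_type([1, 0, -1]))        # 'III' (antisymmetric, M = 2 even)
```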

Typical impulse responses of the four cases of linear-phase FIR digital filters are depicted in Fig. 9.13.

Fig. 9.13a–d Example impulse responses of linear-phase FIR digital filters: (a) type I; (b) type II; (c) type III; (d) type IV

2.2 Important IIR Filter Structures

2.2.1 Direct Form

Recursive filters have transfer functions of the following form

$$H(z) = \frac{N(z)}{D(z)} = \frac{\sum_{i=0}^{M} b_i\, z^{-i}}{1 + \sum_{i=1}^{N} a_i\, z^{-i}}\,.$$
(9.62)

Since, in most cases, such transfer functions give rise to filters with impulse responses of infinite duration, recursive filters are also referred to as infinite-duration impulse response (IIR) filters.

We can consider that H(z) as above results from the cascading of two separate filters with transfer functions N(z) and 1/D(z). The N(z) polynomial can be realized with the FIR direct form, as shown in the previous section. The realization of 1/D(z) can be performed as depicted in Fig. 9.14, where the FIR filter shown is an (N−1)-th order filter with transfer function

$$D'(z) = z\,(1 - D(z)) = -z\sum_{i=1}^{N} a_i\, z^{-i}\,.$$
(9.63)
Fig. 9.14 Block diagram realization of $1/D(z)$

The direct form of realizing 1 / D ( z ) is shown in Fig. 9.15.

Fig. 9.15 Detailed realization of $1/D(z)$

The complete realization of H(z), as a cascade of N(z) and $1/D(z)$, is shown in Fig. 9.16. Such a structure is not canonic with respect to the delays, since for an (M, N)-th order filter this realization requires (N + M) delays.

Fig. 9.16 Non-canonic IIR direct form realization

Clearly, in the general case we can change the order in which we cascade the two separate filters; that is, H(z) can be realized as

$$N(z)\times\frac{1}{D(z)} \quad\text{or}\quad \frac{1}{D(z)}\times N(z)\,.$$

In the second option, all delays employed start from the same node, which allows us to eliminate the consequent redundant delays. In that manner, the resulting structure, usually referred to as the Type 1 canonic direct form, is the one depicted in Fig. 9.17, for the special case N = M.

Fig. 9.17 Type 1 canonic direct form for IIR filters

An alternative structure, the so-called Type 2 canonic direct form, is shown in Fig. 9.18. Such a realization is generated from the corresponding non-recursive form.

Fig. 9.18 Type 2 canonic direct form for IIR filters

The majority of IIR filter transfer functions used in practice present a numerator degree M smaller than or equal to the denominator degree N. In general, one can consider, without much loss of generality, that M = N. In the case where M < N, we just set the coefficients $b_{M+1}, b_{M+2}, \dots, b_N$ in Figs. 9.17 and 9.18 equal to zero.

2.2.2 Cascade Form

In the same way as their FIR counterparts, IIR digital filters present a large variety of possible alternative realizations. An important one, referred to as cascade realization, is depicted in Fig. 9.19a, where the basic blocks represent simple transfer functions of orders 2 or 1. In fact, the cascade form, based on second-order blocks, is associated with the following transfer function decomposition

$$H(z) = \prod_{k=1}^{m} \frac{\gamma_{0k} + \gamma_{1k}\, z^{-1} + \gamma_{2k}\, z^{-2}}{1 + m_{1k}\, z^{-1} + m_{2k}\, z^{-2}} = \prod_{k=1}^{m} \frac{\gamma_{0k}\, z^2 + \gamma_{1k}\, z + \gamma_{2k}}{z^2 + m_{1k}\, z + m_{2k}} = H_0 \prod_{k=1}^{m} \frac{z^2 + \gamma'_{1k}\, z + \gamma'_{2k}}{z^2 + m_{1k}\, z + m_{2k}}\,.$$
(9.64)
Fig. 9.19a,b Block diagrams of: (a) cascade form; (b) parallel form

2.2.3 Parallel Form

Another important realization for recursive digital filters is the parallel form represented in Fig. 9.19b. Using second-order blocks, which are the most commonly used in practice, the parallel realization corresponds to the following transfer function decomposition

$$H(z) = \sum_{k=1}^{m} \frac{\gamma^p_{0k}\, z^2 + \gamma^p_{1k}\, z + \gamma^p_{2k}}{z^2 + m_{1k}\, z + m_{2k}} = h_0 + \sum_{k=1}^{m} \frac{\gamma^p_{1k}\, z + \gamma^p_{2k}}{z^2 + m_{1k}\, z + m_{2k}} = h_0 + \sum_{k=1}^{m} \frac{\gamma^p_{0k}\, z^2 + \gamma^p_{1k}\, z}{z^2 + m_{1k}\, z + m_{2k}}\,.$$
(9.65)

The above is also known as the partial-fraction decomposition. This equation indicates three alternative forms of the parallel realization, where the last two are canonic with respect to the number of multiplier elements.
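
In practice, the decomposition (9.65) can be obtained with scipy.signal.residuez, which expands H(z) into residue/pole pairs plus a direct term; the coefficients below are assumed example values:

```python
from scipy import signal

b = [1.0, 0.5]                    # example numerator (assumed)
a = [1.0, -0.9, 0.2]              # example denominator (assumed)

# H(z) = sum_i r[i] / (1 - p[i] z^-1) + direct term k: partial fractions.
r, p, k = signal.residuez(b, a)
print(r, p, k)
# Complex-conjugate pole pairs would then be recombined into the real
# second-order parallel sections of Fig. 9.19b.
```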

It should be mentioned that each second-order block in the cascade and parallel forms can be realized by any of the existing distinct structures, as, for instance, one of the direct forms shown in Fig. 9.20.

Fig. 9.20a,b Block diagrams of: (a) cascade form; (b) parallel form

All these digital filter realizations present different properties when one considers practical finite-precision implementations; that is, the quantization of the coefficients and the finite precision of the arithmetic operations, such as additions and multiplications. In fact, the analysis of the finite-precision effects in the distinct realizations is a fundamental step in the overall process of designing any digital filter [9.2].

3 The Fast Fourier Transform (FFT)

The FFT (fast Fourier transform) algorithm is a faster version of the discrete Fourier transform (DFT). The FFT utilizes some clever algorithms to do the same thing as the DFT, but in much less time [9.2].

The DFT is extremely important in the area of frequency (spectral) analysis because it takes a discrete signal in the time domain and transforms it into its discrete frequency-domain representation. Without a discrete-time to discrete-frequency transform we would not be able to compute the Fourier transform with a microprocessor or DSP (digital signal processor)-based system [9.1, 9.2, 9.4, 9.5, 9.6].

It is the speed and discrete nature of the FFT that allows us to analyze a signal’s spectrum, as will soon become evident.

3.1 Review of Integral Transforms

We first give a review of the integral transforms that have been used in the text, possibly in their unilateral (single-side) version.

The bilateral Laplace transform

$$X(s) = \mathcal{L}\{x(t)\} = \int_{-\infty}^{+\infty} x(t)\, e^{-st}\, dt\,, \qquad x(t) \xleftrightarrow{\ \mathcal{L}\ } X(s)\,.$$
(9.66)

The continuous-time Fourier transform

$$X(i\omega) = \mathcal{F}\{x(t)\} = \int_{-\infty}^{+\infty} x(t)\, e^{-i\omega t}\, dt\,, \qquad x(t) \xleftrightarrow{\ \mathcal{F}\ } X(i\omega)\,.$$
(9.67)

The bilateral Z-transform

$$X(z) = \mathcal{Z}\{x(n)\} = \sum_{n=-\infty}^{+\infty} x(n)\, z^{-n}\,, \qquad x(n) \xleftrightarrow{\ \mathcal{Z}\ } X(z)\,.$$
(9.68)

The Laplace transform is used to obtain a pole-zero representation of a continuous-time signal or system, x(t), in the s-plane. Similarly, the Z-transform is used to find a pole-zero representation of a discrete-time signal or system, x(n), in the z-plane.

The continuous-time Fourier transform can be found by evaluating the Laplace transform at $s = i\omega$. The picture can be extended by introducing the discrete-time Fourier transform (DTFT). The DTFT can be found by evaluating the Z-transform at $z = \exp(i\Omega)$, as follows

$$X(e^{i\Omega}) = \sum_{n=-\infty}^{+\infty} x(n)\, e^{-i\Omega n}\,, \qquad x(n) \xleftrightarrow{\ \mathrm{DTFT}\ } X(e^{i\Omega})\,.$$
(9.69)

One needs to point out here that the frequency variable Ω is in normalized units of radians per sample, rather than the absolute units of rad/s that apply to ω in the standard Fourier transform. This can be justified by recalling that sequences x(n) are generated by sampling a continuous-time signal x(t) at a certain sampling rate $f_s$. In this respect, the DTFT can be viewed as a discrete approximation of the Fourier transform, as explained below

$$\mathcal{F}\{x(t)\} = \int_{-\infty}^{+\infty} x(t)\, e^{-i\omega t}\, dt \approx T_s \sum_{n=-\infty}^{+\infty} x(nT_s)\exp(-i\omega n T_s) \;\Rightarrow\; \mathcal{F}\{x(t)\} \approx T_s\, X(e^{i\Omega})\,, \qquad \Omega = \omega T_s = 2\pi\frac{f}{f_s}\,.$$
(9.70)

3.2 The Discrete Fourier Transform (DFT)

First of all, the discrete Fourier transform (DFT) is not the same as the DTFT. Both start with a discrete-time signal, but the DFT produces a discrete frequency-domain representation, while the DTFT is continuous in the frequency domain. The two transforms nevertheless have much in common, and it is, therefore, helpful to have a basic understanding of the properties of the DTFT.

3.2.1 Periodicity

The DTFT is periodic as a consequence of the signal being discrete-time. Indeed,

$$X(e^{i(\Omega + 2\pi k)}) = \sum_{n=-\infty}^{+\infty} x(n)\, e^{-i(\Omega + 2\pi k)n} = \sum_{n=-\infty}^{+\infty} x(n)\, e^{-i\Omega n}\, e^{-i2\pi k n} = X(e^{i\Omega})\,.$$
(9.71)

One fundamental period, therefore, is 2π, i.e., it extends from f = 0 to $f_s$, where $f_s$ is the sampling frequency. Taking advantage of this redundancy, the DFT is only defined in the region between 0 and $f_s$ in terms of f, or between 0 and 2π in terms of Ω.

3.2.2 Symmetry

When the region between 0 and $f_s$ is examined for a real signal x(n), it can be seen that there is even symmetry of the amplitude spectrum around the center point, i.e., point $f_s/2$ (half the sampling rate). Indeed, for $\Omega_{1,2} = \pi \mp \varepsilon$, $0 < \varepsilon < \pi$:

$$X(e^{i\Omega_2}) = \sum_{n=-\infty}^{+\infty} x(n)\, e^{-i(\pi+\varepsilon)n} = \sum_{n=-\infty}^{+\infty} x(n)\, e^{-i\varepsilon n}\, e^{-i\pi n} = \sum_{n=-\infty}^{+\infty} x(n)\, e^{-i\varepsilon n}\, e^{+i\pi n} = \sum_{n=-\infty}^{+\infty} x(n)\, e^{i(\pi-\varepsilon)n} = X^*(e^{i\Omega_1})\,.$$
(9.72)

This symmetry means half the information is redundant. Figure 9.21 shows the DFT (implemented with Matlab's FFT function) of a cosine with a frequency one tenth of the sampling frequency. Note that the data between $0.5 f_s$ and $f_s$ is a mirror image of the data between 0 and $0.5 f_s$.

Fig. 9.21 Plot showing the symmetry of the DFT

Therefore, the discrete Fourier transform (DFT) can be introduced as follows

$$X_k = \sum_{n=0}^{N-1} x_n \exp\left(-i\,\frac{2\pi n k}{N}\right)\,, \qquad k = 0, 1, 2, \dots, N-1\,.$$
(9.73)

Note that the above is actually a transformation between a finite-length real or complex sequence $x_n$, corresponding to a segment of a signal sampled at rate $f_s = 1/T_s$ with actual duration $NT_s$, and a generally complex sequence (discrete spectrum) $X_k$ of equal finite length N, corresponding to frequency values in the range between 0 and $(N-1)f_s/N$ with a step equal to $f_s/N$; the corresponding range in terms of Ω is 0 to $(N-1)2\pi/N$, with step $2\pi/N$.
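
The definition (9.73) can be evaluated directly as a matrix–vector product, which is exactly what the FFT computes more efficiently; a minimal numpy check:

```python
import numpy as np

def dft(x):
    """Direct evaluation of (9.73): O(N^2) complex multiplications."""
    N = len(x)
    n = np.arange(N)
    W = np.exp(-2j * np.pi * np.outer(n, n) / N)   # twiddle-factor matrix
    return W @ x

x = np.random.randn(8)
print(np.allclose(dft(x), np.fft.fft(x)))          # True: the FFT computes the DFT
```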

3.2.3 Fast Fourier Transform (FFT)

The FFT is simply an algorithm to speed up the DFT calculation by reducing the number of multiplications and additions required. It was popularized by J.W. Cooley and J.W. Tukey in the 1960s, and was actually a rediscovery of an idea of Runge (1903) and of Danielson and Lanczos (1942), first conceived prior to the availability of computers and calculators, when numerical calculation could take many man-hours. In addition, the German mathematician Carl Friedrich Gauss (1777–1855) had used the method more than a century earlier.

In order to understand the basic concepts of the FFT and its derivation, note that the DFT expansion shown in Table 9.3 can be greatly simplified by taking advantage of the symmetry and periodicity of the twiddle factors, as shown in Table 9.4. If the equations are rearranged and factored, the result is the fast Fourier transform (FFT), which requires only $(N/2)\log_2(N)$ complex multiplications. The computational efficiency of the FFT versus the DFT becomes highly significant when the FFT point size increases to several thousand, as shown in Table 9.5. However, notice that the FFT computes all the output frequency components (either all or none). If only a few spectral points need to be calculated, the DFT may actually be more efficient: calculation of a single spectral output using the DFT requires only N complex multiplications.

Table 9.3 8-point DFT (N = 8)
Table 9.4 8-point DFT. Applying the properties of symmetry and periodicity to $W_N^r$ for N = 8
Table 9.5 FFT versus DFT. The FFT is simply an algorithm for efficiently calculating the DFT. Computational cost of an N-point transform: 1) DFT: $N^2$ complex multiplications; 2) FFT: $(N/2)\log_2(N)$ complex multiplications

The radix-2 FFT algorithm breaks the entire DFT calculation down into a number of two-point DFTs. Each two-point DFT consists of a multiply-and-accumulate operation called a butterfly, as shown in Fig. 9.22. Two representations of the butterfly are shown in the diagram: the top diagram is the actual functional representation of the butterfly showing the digital multipliers and adders. In the simplified bottom diagram, the multiplications are indicated by placing the multiplier over an arrow, and addition is indicated whenever two arrows converge at a dot.

Fig. 9.22 The basic butterfly computation in the DIT-FFT algorithm

The eight-point decimation-in-time (DIT) FFT algorithm computes the final output in three stages, as shown in Fig. 9.23. The eight input time samples are first divided (or decimated) into four groups of two-point DFTs. The four two-point DFTs are then combined into two four-point DFTs. The two four-point DFTs are then combined to produce the final output X(k). The detailed process is shown in Fig. 9.24, where all the multiplications and additions are shown. Note that the basic two-point DFT butterfly operation forms the basis for all computations. The computation is done in three stages. After the first-stage computation is complete, there is no need to store any previous results: the first-stage outputs can be stored in the same registers that originally held the time samples x(n). Similarly, when the second-stage computation is completed, the results of the first-stage computation can be deleted.

Fig. 9.23 Computation of eight-point FFT in three stages using DIT

Fig. 9.24 Eight-point DIT-FFT algorithm

In this way, in-place computation proceeds to the final stage. Note that, in order for the algorithm to work properly, the input time samples, x(n), must first be re-ordered using a bit reversal algorithm.

The bit reversal algorithm used to perform this re-ordering is shown in Table 9.6. The decimal index, n, is converted to its binary equivalent. The binary bits are then placed in reverse order and converted back to a decimal number. Bit reversing is often performed in DSP hardware in the data address generator, thereby simplifying the software, reducing overhead, and speeding up the computations.

Table 9.6 Bit reversal example for N = 8
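
A minimal Python sketch of the bit reversal re-ordering (the function name is an illustrative assumption):

```python
def bit_reversed_indices(N):
    """Input re-ordering for an N-point radix-2 FFT (N a power of 2)."""
    bits = N.bit_length() - 1                       # number of index bits
    return [int(format(n, f'0{bits}b')[::-1], 2) for n in range(N)]

print(bit_reversed_indices(8))   # [0, 4, 2, 6, 1, 5, 3, 7], matching Table 9.6
```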

The computation of the FFT using decimation-in-frequency (DIF) is shown in Figs. 9.25 and 9.26. This method requires that the bit reversal algorithm be applied to the output X(k). Note that the butterfly for the DIF algorithm differs slightly from the decimation-in-time butterfly, as shown in Fig. 9.27.

Fig. 9.25 Computation of eight-point FFT in three stages using DIF

Fig. 9.26 Eight-point DIF-FFT algorithm

Fig. 9.27 The basic butterfly computation in the DIF-FFT algorithm

The use of decimation-in-time versus decimation-in-frequency algorithms is largely a matter of preference, as either yields the same result. System constraints may make one of the two the more optimal solution. It should be noted that the algorithms required to compute the inverse FFT are nearly identical to those required to compute the FFT, assuming complex FFTs are used. In fact, a useful method for verifying a complex FFT algorithm consists of first taking the FFT of the x(n) time samples and then taking the inverse FFT of the X(k). At the end of this process, the original time samples, $\mathrm{Re}(x(n))$, should be obtained, and the imaginary part, $\mathrm{Im}(x(n))$, should be zero (within the limits of the mathematical round-off errors).
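
This round-trip verification is easy to reproduce with numpy's FFT routines:

```python
import numpy as np

x = np.random.randn(1024)                  # arbitrary real time samples
x_rt = np.fft.ifft(np.fft.fft(x))          # FFT followed by inverse FFT

print(np.allclose(x_rt.real, x))           # True: real part recovers x(n)
print(np.max(np.abs(x_rt.imag)) < 1e-12)   # True: imaginary part is round-off
```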

The FFTs discussed up to this point are radix-2 FFTs, i.e., the computations are based on two-point butterflies. This implies that the number of points in the FFT must be a power of 2. However, non-radix-2 FFT algorithms have also been developed and are available in modern computational software packages like Matlab, Mathematica, Maple, etc., to be used in a variety of applications in ocean engineering and beyond [9.1, 9.2, 9.4, 9.5, 9.6].

4 Waveform Analysis

A waveform is one recording of a deterministic signal or of an instance of a random signal (also referred to in the literature as a stochastic process). In practice we can only make recordings of finite duration, which means that a waveform is a set of signal recordings over possibly non-contiguous time intervals. Typical examples of (lumped) waveforms in ocean engineering are significant wave height at a point or its average, oceanic temperature or salinity, acoustic recordings of marine life, sonar or radar signals, etc. [9.3].

4.1 Definitions for Waveforms and Random Signals

The time mean (or average) of a waveform is defined as follows in the continuous time domain

$$\mu_x \triangleq \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} x(t)\, dt\,.$$
(9.74)

The time-wise cross-correlation of two waveforms is defined as follows

$$R_{y,x}(\tau) \triangleq \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} y(t+\tau)\, x(t)\, dt = [y * x^-](\tau)\,.$$
(9.75)

In the above, $*$ stands for the convolution operation applied between signal y and signal $x^-$, the mirrored version of signal x about the origin of time (t = 0), i.e.,

$$x^-(t) = x(-t)\,.$$

Also, note that cross-correlation as defined above can be used to introduce (time-wise) autocorrelation of a single waveform if we set y ≡ x.

The limiting process to infinity for the time variable T introduced in the equations above is required when the waveform has infinite duration, so that the definitions make sense and do not give rise to indeterminate or infinite values.

On the other hand, if the waveform (or at least one of them, in the cross-correlation case) is of finite duration, then the time interval (2T) needs to be set equal to the full, finite duration of the signal. In this case, the definitions take the following form

$$\mu_x \triangleq \frac{1}{2T}\int_{-\infty}^{+\infty} x(t)\, dt\,,$$
(9.76)
$$R_{y,x}(\tau) \triangleq \frac{1}{2T}\int_{-\infty}^{+\infty} y(t+\tau)\, x(t)\, dt\,.$$
(9.77)

The above equations are well defined in the case of deterministic signals and waveforms that occur as instances of random signals. Specifically, a random signal, denoted with a capital letter, e.g., X(t), in contrast to the lowercase letters used for deterministic signals, is associated with a probability density function (PDF), $f_X(x)$, quantifying the probability of the signal value at any time instant t lying within an arbitrary real interval [a, b] as follows

$$\Pr[a \le X(t) \le b] = \int_a^b f_X(x)\, dx\,.$$
(9.78)

As well established by probability theory, the probability density function is, in turn, associated with a cumulative distribution function (CDF) for random signal X(t) as follows

$$F_X(x) = \Pr[X(t) \le x] = \int_{-\infty}^{x} f_X(x')\, dx'\,.$$
(9.79)

One can easily verify by combining (9.78) with (9.79) that

$$\Pr[a \le X(t) \le b] = F_X(b) - F_X(a)$$
(9.80)

holds. Finally, and assuming certain smoothness conditions hold, the PDF can be calculated as the derivative of the CDF of the same random variable, i.e.,

$$f_X(x) = \frac{d}{dx}F_X(x)\,.$$
(9.81)

For a random signal one can, in effect, define a probabilistic mean value as follows

$$\langle X(t)\rangle = \mu_{X(t)} \triangleq \int_{-\infty}^{+\infty} x\, f_{X(t)}(x)\, dx\,.$$
(9.82)

Along the same lines a probabilistic correlation (cross and/or autocorrelation) can be introduced

$$\langle X(t_1), Y(t_2)\rangle = R_{X,Y}(t_1, t_2) \triangleq \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} x\, y\, f_{X(t_1),Y(t_2)}(x, y)\, dy\, dx\,.$$
(9.83)

To properly define it, though, the joint probability density function needs to be introduced; as can be seen below, it quantifies the probability of random signal X assuming a value in an infinitesimal vicinity of x at instant t1 and, jointly, random signal Y assuming a value in an infinitesimal vicinity of y at instant t2

$$f_{X(t_1),Y(t_2)}(x, y)\, dx\, dy = \Pr[(x \le X(t_1) \le x + dx)\ \text{and}\ (y \le Y(t_2) \le y + dy)]\,.$$
(9.84)

In effect, the probability on the right-hand side in the above is that of the joint event to find the values of signals X and Y at certain time instants within an arbitrary infinitesimal rectangle on the XY plane. Applying Bayes’ theorem on the joint probability of the right-hand side of (9.84) one can derive the following equations involving conditional probabilities

$$\Pr[x \le X(t_1) \le x + dx \mid y \le Y(t_2) \le y + dy]\,\Pr[y \le Y(t_2) \le y + dy] = f_{X(t_1),Y(t_2)}(x, y)\, dx\, dy = \Pr[y \le Y(t_2) \le y + dy \mid x \le X(t_1) \le x + dx]\,\Pr[x \le X(t_1) \le x + dx]\,.$$
(9.85)

In the case that random variables X and Y are independent (not to be confused with mutually exclusive variables or events), then using (9.85) for the conditional probabilities one can see that the following holds

$$f_{X(t_1),Y(t_2)}(x, y) = f_{X(t_1)}(x)\, f_{Y(t_2)}(y)\,.$$
(9.86)

A very important category of random signals is that of wide-sense stationary (WSS) signals. A WSS signal is one that has: (a) constant, i.e., time-invariant, probabilistic mean, and (b) autocorrelation depending solely on the time shift (delay) $\tau = t_2 - t_1$ and not on the individual time instants. In effect, for a random signal to be WSS it must hold that

$$\langle X(t)\rangle = \langle X\rangle = \mu_X\,,$$
(9.87)
$$\langle X(t), X(t+\tau)\rangle = R_{X,X}(\tau)\,.$$
(9.88)

In part, WSS signals are important because both the probabilistic mean value and the autocorrelation can be obtained as the time-wise mean and autocorrelation of any one of their instances, x(t), as introduced for waveforms earlier. For this to be possible, though, the random signal (stochastic process) X needs to be ergodic on top of WSS. Indeed, for an ergodic process, statistical (probabilistic) properties (such as the mean, correlation, and variance) can be deduced from a single, sufficiently long instance (sample realization) of the process. Therefore, for an ergodic WSS process the following hold

$$\langle X(t)\rangle = \mu_x\,,$$
(9.89)
$$\langle X(t), X(t+\tau)\rangle = R_{x,x}(\tau)\,.$$
(9.90)

It should be noted, however, that for the above equations to hold exactly, the integrals over time in (9.74) and (9.75) need to span the entire real line from minus to plus infinity. If not, then (9.89) and (9.90) provide only an estimate of the mean and autocorrelation of the random signal; the accuracy of the estimate (probabilistically) increases as the time parameter T in the time-wise integrals of (9.74) and (9.75) grows larger.

A final note is made regarding strictly or strongly stationary processes versus WSS ones. A strictly stationary process is a stochastic process whose joint probability distribution does not change when shifted in time. This requirement is much stronger than just mean and autocorrelation and, therefore, much harder to meet or assume in practical situations; that is why in the remaining text we will only employ WSS as well as ergodicity.

4.2 Signal Power and Power Spectral Density

For a waveform x(t), i. e., a deterministic signal or an instance of a random one, the following definitions are given:

  1. The time mean $\mu_x$ is the direct current (DC) offset (component) of the waveform, and $|\mu_x|^2$ is its DC power.

  2. The quantity

$$R_{x,x}(0) = \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} x^2(t)\, dt \ge 0$$
    (9.91)

    is the (total averaged) power of the waveform.

  3. The quantity

$$\mathrm{cov}(x) = R_{x,x}(0) - |\mu_x|^2 \ge 0$$
    (9.92)

    is the covariance or alternating current (AC) power of the waveform. Also, $[\mathrm{cov}(x)]^{1/2}$ is the standard deviation or RMS (root-mean-square) value of the waveform.

The use of the term power here refers to electric power, i.e., the waveform is considered as a voltage across, or electric current through, a reference resistance equal to 1 ohm.

Finally, the following notation is introduced for the cross-correlation (of two waveforms) or autocorrelation (of a single waveform) at zero shift

$$\langle y, x\rangle = R_{y,x}(0) \triangleq \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} y(t)\, x(t)\, dt\,.$$
(9.93)

In the case of two distinct waveforms, $\langle y, x\rangle$ is also identified as their dot product. In the case of a single waveform, $\langle x, x\rangle$ is its power.

In the frequency domain, and using the two-sided Fourier transform as introduced in (9.8), the (cross) power spectral density (PSD) can be defined for a pair of waveforms as follows

$$S_{y,x}(\omega) \triangleq \mathcal{F}\{R_{y,x}(\tau)\} = Y(\omega)\, X^*(\omega)\,.$$
(9.94)

Note that for a real waveform it holds that

$$X(\omega) = \mathcal{F}\{x(t)\} = \int_{-\infty}^{+\infty} x(t)\exp(-i\omega t)\, dt \;\Rightarrow\; X^*(\omega) = \int_{-\infty}^{+\infty} x(t)\exp(+i\omega t)\, dt = X(-\omega)\,.$$

Furthermore

$$\mathcal{F}\{x(-t)\} = \int_{-\infty}^{+\infty} x(-t)\exp(-i\omega t)\, dt = \int_{-\infty}^{+\infty} x(t)\exp(i\omega t)\, dt = X(-\omega)\,.$$

Therefore, for a real waveform it holds that

$$X^*(\omega) = \mathcal{F}\{x(-t)\} = X(-\omega)\,.$$
(9.95)

Then, using (9.94) one can directly derive the following

$$S_{x,y}(\omega) = S_{y,x}(-\omega) = S^*_{y,x}(\omega) \;\Leftrightarrow\; R_{x,y}(\tau) = R_{y,x}(-\tau)\,.$$
(9.96)

Moreover, in the case of autocorrelation, the PSD takes the following form

$$S_{x,x}(\omega) \triangleq \mathcal{F}\{R_{x,x}(\tau)\} = |X(\omega)|^2 \ge 0 \quad\text{and}\quad S_{x,x}(\omega) = S_{x,x}(-\omega)\,.$$
(9.97)

In effect, the PSD of a single signal is an even, non-negative, real function of frequency. Also, the autocorrelation is an even function of time shift τ, i.e., $R_{x,x}(\tau) = R_{x,x}(-\tau)$.

A final note has to do with the extension of the PSD concept to random signals (stochastic processes in mathematical terminology). Assuming ergodicity and WSS, the Wiener–Khinchin theorem can be employed to introduce a PSD for a random signal X, or at least a power spectral distribution function $F_X(f)$, as follows

$$\langle X(t), X(t+\tau)\rangle = R_{X,X}(\tau) = \int_{-\infty}^{+\infty} e^{i\omega\tau}\, dF_X(f)\,, \qquad \omega = 2\pi f\,.$$
(9.98)

For the above to hold, the autocorrelation of X needs to exist and be finite for every value of the time-shift variable τ; its Fourier transform, however, may not be well defined. Indeed, the Fourier transform of a random signal may not exist in general, because stationary stochastic processes are not necessarily square or absolutely integrable. Nor does their autocorrelation need to be absolutely integrable, so it need not have a Fourier transform either. However, in most practical applications the autocorrelation is absolutely integrable and satisfies the conditions for a Fourier transform to exist. In this case, the PSD can be introduced through the Fourier transform as follows

$S_{X,X}(\omega) = \int_{-\infty}^{+\infty} R_{X,X}(\tau)\,\mathrm{e}^{-\mathrm{i}\omega\tau}\,\mathrm{d}\tau \;\Leftrightarrow\; R_{X,X}(\tau) = \int_{-\infty}^{+\infty} S_{X,X}(f)\,\mathrm{e}^{\mathrm{i}\omega\tau}\,\mathrm{d}f\,,\quad \omega = 2\pi f\,.$
(9.99)

Finally, in this case the PSD is the derivative of the power spectral distribution function introduced previously. This is why the latter is also referred to as the integrated spectrum of the stochastic process.
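In sampled-data practice, the PSD in (9.99) is commonly estimated by averaging periodograms (Welch's method), a direct numerical counterpart of the Wiener–Khinchin route. A minimal sketch follows; the sampling rate and the sine-plus-noise test signal are illustrative assumptions.

```python
import numpy as np
from scipy import signal

rng = np.random.default_rng(2)
fs = 1000.0                                   # sampling rate in Hz (assumed)
t = np.arange(100_000) / fs
x = np.sin(2 * np.pi * 100 * t) + rng.standard_normal(t.size)

# Welch's method: averaged periodograms, the practical counterpart of (9.99)
f, Sxx = signal.welch(x, fs=fs, nperseg=1024)

# Consistency check: integrating the one-sided PSD recovers the total power
# R_xx(0) = mean(x^2) ~= 0.5 (sine) + 1.0 (noise)
print(np.sum(Sxx) * (f[1] - f[0]), np.mean(x ** 2))
```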

4.3 Waveform Propagation Through a Linear, Time-Invariant System

Consider a linear, time-invariant (LTI) system with a single input and a single output (SISO), as shown in Fig. 9.28, driven by input waveform x(t) and generating as response the output waveform y(t).

Fig. 9.28 LTI-SISO system with waveforms at its input and its output

The system, as well as the waveforms at its input and its output, is assumed to be in continuous time. Then, the output can be determined as the convolution ($*$) of the LTI system's scalar impulse response with the input waveform

$y(t) = h(t) * x(t) = \int_{-\infty}^{+\infty} h(t')\,x(t - t')\,\mathrm{d}t' \;\Leftrightarrow\; Y(\omega) = H(\omega)\,X(\omega)\,.$
(9.100)

In the above, h(t) stands for the system’s impulse response that can be determined on the basis of the system’s transfer function H(s) in the complex frequency (Laplace transform) domain

$h(t) = \mathcal{L}^{-1}\{H(s)\}\,.$
(9.101)

It is noted here that for a causal system the convolution integral reduces to one with finite bounds as follows

$y(t) = h(t) * x(t) = \int_{0}^{t} h(t')\,x(t - t')\,\mathrm{d}t' \quad\text{and}\quad h(t) = 0\,,\; t < 0\,.$
(9.102)

For the means of the input and output waveforms the following can be derived

$\mu_y \equiv \lim_{T\to\infty} \frac{1}{2T} \int_{-T}^{+T} y(t)\,\mathrm{d}t = \mu_x \int_{-\infty}^{+\infty} h(t)\,\mathrm{d}t = H(0)\,\mu_x\,.$
(9.103)

For the various correlations, as well as the PSD of the input and output, the following hold

$R_{y,y}(t) = R_{h,h} * R_{x,x} \;\Leftrightarrow\; S_{y,y}(\omega) = |H(\omega)|^{2}\,S_{x,x}(\omega)\,,$
(9.104)
$R_{y,x}(t) = h * R_{x,x} \;\Leftrightarrow\; S_{y,x}(\omega) = H(\omega)\,S_{x,x}(\omega)\,,$
(9.105)
$R_{x,y}(t) = \bar{h}(t) * R_{x,x} \;\Leftrightarrow\; S_{x,y}(\omega) = H^{*}(\omega)\,S_{x,x}(\omega)\,.$
(9.106)

All of the above can be derived on the basis of the definitions and properties given previously, as well as the commutativity and associativity of scalar convolution. For example, in the case of (9.104)

$R_{y,y}(t) = y * \bar{y}(t) = [h * x] * [\bar{h}(t) * \bar{x}(t)] = R_{h,h} * R_{x,x}\,.$
(9.107)

It is noted here that:

  1. $S_{h,h}(\omega) = H(\omega)\,H^{*}(\omega) = |H(\omega)|^{2}$,

  2. $R_{h,h}(\tau) = \int_{-\infty}^{+\infty} S_{h,h}(f)\,\mathrm{e}^{\mathrm{i}\omega\tau}\,\mathrm{d}f = \int_{-\infty}^{+\infty} |H(\omega)|^{2}\,\mathrm{e}^{\mathrm{i}\omega\tau}\,\mathrm{d}f\,,\;\omega = 2\pi f$, and

  3. $\bar{x}(t),\,\bar{y}(t),\,\bar{h}(t) \equiv x^{*}(-t),\,y^{*}(-t),\,h^{*}(-t)\,,\; t \in \mathbb{R}$, with $\mathcal{F}\{\bar{x}(t)\},\,\mathcal{F}\{\bar{y}(t)\},\,\mathcal{F}\{\bar{h}(t)\} = X^{*},\,Y^{*},\,H^{*}$ (of $\omega$ or $f$).
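Relation (9.104) can be checked numerically by driving a discrete-time LTI filter with white noise and comparing PSD estimates of the input and the output; in the sketch below, the fourth-order Butterworth filter is merely an illustrative stand-in for H.

```python
import numpy as np
from scipy import signal

rng = np.random.default_rng(3)
b, a = signal.butter(4, 0.2)                  # stand-in for the LTI system H
x = rng.standard_normal(500_000)              # white input: S_xx is flat
y = signal.lfilter(b, a, x)                   # output waveform y = h * x

f, Sxx = signal.welch(x, nperseg=2048)
_, Syy = signal.welch(y, nperseg=2048)
_, H = signal.freqz(b, a, worN=f, fs=1.0)     # H on the same frequency grid

ratio = Syy / (np.abs(H) ** 2 * Sxx)          # (9.104): ratio should be ~1
print(np.median(ratio))
```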

5 Optimal Signal Estimation

5.1 System Identification

One of the most fundamental problems in science and engineering is the determination of a system’s dynamics, preferably in the form of a mathematical model, when presented with the system response(s) to given deterministic or random driving inputs. This is the fundamental system identification problem [9.7, 9.8].

In the case of an LTI system, especially in the case of discrete-time or sampled-data systems, system identification is essentially equivalent to a problem of minimum square error approximation. Consider the problem statement as shown in Fig. 9.29.

Fig. 9.29 LTI-SISO system identification problem

A fundamental prerequisite for the system identification problem to have a solution in its basic form is that the additive measurement noise superimposed on the system output is of zero mean and uncorrelated with the driving (forcing) signal applied as input to the system. Therefore, the following two conditions must hold

$\mu_v \equiv \lim_{T\to\infty} \frac{1}{2T} \int_{-T}^{+T} v(t)\,\mathrm{d}t = 0\,,$
(9.108)
$R_{vx}(\tau) = R_{xv}(\tau) = 0\,,\;\forall\tau \;\Leftrightarrow\; S_{vx}(\omega) = S_{xv}(\omega) = 0\,,\;\forall\omega\,.$
(9.109)

At this point it is important to emphasize the role of the appropriate time window T in (9.75). Indeed, it is possible that for some (commonly small) value of T condition (9.109) may not be satisfied with sufficient accuracy. However, assuming that condition (9.109) is satisfied for an infinite time window, and under appropriate stationarity and ergodicity assumptions, it is possible to determine a finite value of T such that condition (9.109) is met to arbitrarily small (epsilon) accuracy.

Given the property in (9.109) and by use of the linearity property of both cross-correlation and PSD, as well as (9.105), one can derive the following

$S_{yx}(\omega) = S_{ux}(\omega) + S_{vx}(\omega) = H(\omega)\,S_{xx}(\omega)\,.$
(9.110)

The above holds since $y(t) = u(t) + v(t)$ and $u(t) = h(t) * x(t) \Leftrightarrow U(\omega) = H(\omega)\,X(\omega)$. In effect, the unknown transfer function of the LTI system undergoing identification is

$H(\omega) = \frac{S_{yx}(\omega)}{S_{xx}(\omega)}\,.$
(9.111)

In the time domain the equivalent equation to (9.111) is

$R_{yx}(t) = h(t) * R_{xx}(t) = \int_{-\infty}^{+\infty} h(t')\,R_{xx}(t - t')\,\mathrm{d}t'\,.$
(9.112)

In the above equation, the unknown function is the impulse response, h(t). The integral equation is known as the Wiener–Hopf equation with infinite time horizon [9.1, 9.2, 9.4, 9.5, 9.6].

What is most important in both (9.111) and (9.112) is that system identification using them can be implemented with either deterministic or random signals. In effect, the signals x(t), v(t) and y(t) may be known only probabilistically (statistically); this means that the individual instances (realizations) of the signals do not need to be known as long as their statistics are, e. g., means, correlations, or covariance coefficients.

Typically in practice, statistical characterization of signals can be effectively, yet approximately, performed by taking advantage of ergodicity and stationarity, if they apply. Assuming that these hold, then a sufficiently long, yet finite, recording of a single realization of a random signal allows for its statistical characterization in terms of its autocorrelation and cross-correlation with other signals. Such a recording, e. g., for v(t) or x(t) in the case of our system identification framework, can be performed once and be used along with measurements for y(t) to identify the system; all due to ergodicity and stationarity.
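In sampled-data terms, (9.111) suggests estimating $H(\omega)$ as the ratio of a cross-PSD estimate to an input-PSD estimate. The following sketch illustrates this under the conditions stated above (zero-mean noise uncorrelated with the input); the second-order Butterworth "unknown" system and the noise level are assumptions of the demonstration, and note that scipy's cross-PSD convention conjugates its first argument.

```python
import numpy as np
from scipy import signal

rng = np.random.default_rng(4)
b, a = signal.butter(2, 0.3)                  # the "unknown" system (for the demo)
x = rng.standard_normal(200_000)              # forcing input
y = signal.lfilter(b, a, x) + 0.1 * rng.standard_normal(x.size)  # noisy response

f, Sxx = signal.welch(x, nperseg=1024)
_, Sxy = signal.csd(x, y, nperseg=1024)       # scipy convention: conj(X) * Y
H_est = Sxy / Sxx                             # empirical version of (9.111)

_, H_true = signal.freqz(b, a, worN=f, fs=1.0)
print(np.max(np.abs(H_est - H_true)))         # small; shrinks with record length
```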

5.2 Discrete-Time Wiener–Hopf Equation over a Finite-Duration Window

The integral equation (9.112), despite its immense theoretical value, does not by itself indicate how it can be applied in practice for system identification. In this section, we introduce a methodology allowing identification of an LTI-SISO system when presented only with sampled-data recordings of x(t) and y(t), with a sampling rate meeting the Nyquist requirement for both. Furthermore, it will be assumed that the observation window (time horizon) of the recordings of the forcing and the noisy response is finite in duration instead of infinite.

The outcome of the processing will be the determination of a given, finite number of impulse response samples for the unknown system undergoing identification. In effect, it is possible to define a discrete-time FIR (finite impulse response) system either in the time domain, by its very impulse response, or as a transfer function without any poles in the Z-transform domain. In this way, the FIR system approximates the behavior of the unknown LTI system undergoing identification in the sense of (truncated) impulse response matching.

We start from the integral equation for y(t) given previously

$y(t) = h(t) * x(t) + v(t) = \int_{-\infty}^{+\infty} h(t')\,x(t - t')\,\mathrm{d}t' + v(t)\,.$
(9.113)

However, to satisfy causality for the unknown system the following needs to be introduced

$y(t) = \int_{0}^{t} h(t')\,x(t - t')\,\mathrm{d}t' + v(t) = \int_{0}^{t} h(t - t')\,x(t')\,\mathrm{d}t' + v(t)\,.$
(9.114)

Converting the above to discrete time with sampling interval Ts, compliant with the Nyquist rate criterion, one can derive the following

$y(nT_s) = T_s \sum_{k=0}^{n} h(kT_s)\,x((n-k)T_s) + v(nT_s)\,.$
(9.115)

Without loss of generality we can assume that $T_s = 1$, since any other sampling interval value can be absorbed multiplicatively into the impulse response sample values. Then (9.115) becomes

$y(n) = \sum_{k=0}^{n} h(k)\,x(n-k) + v(n)\,.$
(9.116)

An important prerequisite for our identification method to be successful is that the unknown LTI system is stable. If this holds, then for an arbitrarily small ε one can find a positive integer M, such that

$\sum_{n=M+1}^{+\infty} h^{2}(n) < \varepsilon\,.$
(9.117)

In effect, it is possible to truncate the impulse response of the unknown system with epsilon accuracy. We note here that impulse response truncation amounts to approximating the system undergoing identification by the following FIR system with memory M

$\hat{H}_{\mathrm{FIR}}(z^{-1}) = \sum_{n=0}^{M} h(n)\,z^{-n}\,.$
(9.118)

Using the approximation above, (9.116) yields

$y(n) = \sum_{k=0}^{M} h(k)\,x(n-k) + v(n)\,.$
(9.119)

Furthermore, assuming that the recording of waveform y(t) spans a finite interval $[0, T]$, the identification problem can be reduced to determining the $(M+1)$ samples of the impulse response, $h(n)$, $0 \le n \le M$, of the unknown system so that the following $(L+1)$ equations are satisfied

$y(n) = \sum_{k=0}^{M} h(k)\,x(n-k) + v(n)\,,\quad 0 \le n \le L \quad\text{and}\quad L = \frac{T}{T_s}\,.$
(9.120)

It is noted here that confining the recording of signal y(t) to the interval $[0, T]$ limits the required recording of signal x(t) to the finite interval $[-MT_s, +LT_s]$.

In effect, the $((L+1) \times (M+1))$ set of algebraic equations (9.120) can be put in the following matrix form

$\mathbf{y} = \mathbf{X}\,\mathbf{h} + \mathbf{v}\,.$
(9.121)

In the above, vectors y and v are defined as follows

$\mathbf{y} = [\,y(0)\;\; y(1)\;\cdots\; y(L)\,]^{\mathrm{T}}\,,\quad \mathbf{v} = [\,v(0)\;\; v(1)\;\cdots\; v(L)\,]^{\mathrm{T}}\,.$
(9.122)

Vector h is defined as follows

$\mathbf{h} = [\,h(0)\;\; h(1)\;\cdots\; h(M)\,]^{\mathrm{T}}\,.$
(9.123)

Matrix X, known as the observation matrix, is defined as follows

$\mathbf{X} = \begin{bmatrix} x(0) & x(-1) & \cdots & x(-M) \\ x(1) & x(0) & \cdots & x(-M+1) \\ \vdots & \vdots & & \vdots \\ x(L) & x(L-1) & \cdots & x(-M+L) \end{bmatrix}\,.$
(9.124)

The equation set in (9.121) is not square; therefore, an algebraically exact solution is not feasible. In effect, only a solution minimizing the following mean square error (MSE) over the observation window can be sought

$\|\mathbf{e}\|^{2} = \|\mathbf{y} - \mathbf{X}\,\mathbf{h}\|^{2}\,.$
(9.125)

If the above MSE is minimized, then the effect of the noise vector v, i. e., of signal v(t), on the measurement vector y, i. e., signal y(t), is suppressed in the least-squares sense. The Euclidean norm of vector e is defined as follows

$\|\mathbf{e}\|^{2} = \mathbf{e}^{\mathrm{T}}\mathbf{e} = \sum_{i=0}^{L} e_{i}^{2}\,.$
(9.126)

The solution minimizing $\|\mathbf{e}\|^{2}$ can be proven to be that of the following square $((M+1) \times (M+1))$ set of algebraic equations, known as the normal (canonical) equations

$(\mathbf{X}^{\mathrm{T}}\mathbf{X})\,\mathbf{h} = \mathbf{X}^{\mathrm{T}}\mathbf{y} \;\Leftrightarrow\; \mathbf{h} = (\mathbf{X}^{\mathrm{T}}\mathbf{X})^{-1}\mathbf{X}^{\mathrm{T}}\mathbf{y}\,.$
(9.127)

It is interesting to observe the equivalence between the time-domain (9.127) and (9.112); the equivalence can be straightforwardly derived if one observes that (9.112) needs to hold for any time instant t. This means that, if it were converted to discrete time, a (countably infinite) set of algebraic equations like (9.121) would be derived. Furthermore, the solution of the normal equations (9.127) is fully equivalent to (9.111), since there exists a one-to-one correspondence between: (a) the square matrix $(\mathbf{X}^{\mathrm{T}}\mathbf{X})$ and the autocorrelation $R_{xx}(t)$, and thereby the PSD $S_{xx}(\omega)$; (b) the vector $(\mathbf{X}^{\mathrm{T}}\mathbf{y})$ and the cross-correlation $R_{yx}(t)$, and thereby the PSD $S_{yx}(\omega)$.

On a technical note, it is pointed out that for (9.127) to hold, matrix $(\mathbf{X}^{\mathrm{T}}\mathbf{X})$ has to be invertible. In the case of zero initial conditions for x, i. e., $x(n) = 0$, $-M \le n < 0$, it must hold that $x(0) \ne 0$ so that $(\mathbf{X}^{\mathrm{T}}\mathbf{X})$ is, indeed, invertible.
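The sketch below assembles the observation matrix (9.124) and solves the least-squares problem of (9.125)–(9.127) for a short synthetic record; the "true" impulse response and the noise level are illustrative assumptions, and a numerically robust least-squares solver is used in place of the explicit inverse in (9.127).

```python
import numpy as np

rng = np.random.default_rng(5)
h_true = np.array([0.5, 1.0, -0.3, 0.1])      # unknown impulse response, M = 3
M, L = len(h_true) - 1, 2000

x = rng.standard_normal(L + 1)                # x(n) = 0 for n < 0 (zero ICs)
y = np.convolve(x, h_true)[: L + 1] + 0.05 * rng.standard_normal(L + 1)

# Observation matrix (9.124): row n holds x(n), x(n-1), ..., x(n-M)
X = np.zeros((L + 1, M + 1))
for k in range(M + 1):
    X[k:, k] = x[: L + 1 - k]

# Least-squares solution of (9.121), i.e., of the normal equations (9.127)
h_hat = np.linalg.lstsq(X, y, rcond=None)[0]
print(h_hat)                                  # ~ h_true
```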

5.3 Signal Estimation and the Wiener Filter

We will now look into a closely related problem: that of the estimation of a signal or waveform which is outlined in the block diagram of Fig. 9.30.

Fig. 9.30 Signal estimation framework

As can be seen, a known (in this case) LTI-SISO system is driven by an unknown, to-be-estimated waveform x(t). For the known LTI system, the transfer function G(s) in the complex frequency (Laplace) domain, or $G(\omega)$ in the Fourier (frequency) domain, or equivalently its impulse response g(t) in the time domain, is assumed known. Such an LTI system may model a measurement instrument, a sensor, a transducer, or a telecommunications receiver.

Then, a linear filter with transfer function H(s) is sought that will receive as input the noisy instrument response w(t) and output waveform y(t), which is expected to be the optimum, in the mean square sense, estimate of the unknown waveform x(t).

The objective is to determine transfer function H(s), or equivalently H ( ω ) , of the estimation filter that minimizes the following objective cost function

$\langle e, e \rangle \equiv R_{ee}(0) = \int_{-\infty}^{+\infty} e^{2}(t)\,\mathrm{d}t\,.$
(9.128)

In the above, the instantaneous error signal is defined as follows

$e(t) = x(t) - y(t)\,.$
(9.129)

For waveform y(t) it holds that

$y(t) = h(t) * w(t) = \int_{-\infty}^{+\infty} h(\xi)\,w(t - \xi)\,\mathrm{d}\xi \approx T_s \sum_{k=-\infty}^{+\infty} h(kT_s)\,w(t - kT_s)\,.$
(9.130)

In the above, the convolution integral has been approximated by the convolution sum, assuming that the sampling interval Ts satisfies the Nyquist criterion. Consequently, (9.128) becomes

$\langle e, e \rangle = \int_{-\infty}^{+\infty} \Big[ x(t) - T_s \sum_{k=-\infty}^{+\infty} h(kT_s)\,w(t - kT_s) \Big]^{2}\,\mathrm{d}t\,.$
(9.131)

In effect, the impulse response of the least-squares estimator has to fulfill the following condition so that the cost $\langle e, e \rangle$ becomes minimal

$\frac{\partial \langle e, e \rangle}{\partial [h(nT_s)]} = 0\,,\quad \forall n \in \mathbb{Z}\,.$
(9.132)

However,

$\frac{\partial \langle e, e \rangle}{\partial [h(nT_s)]} = -2T_s \int_{-\infty}^{+\infty} \Big[ x(t) - T_s \sum_{k=-\infty}^{+\infty} h(kT_s)\,w(t - kT_s) \Big]\, w(t - nT_s)\,\mathrm{d}t\,.$
(9.133)

Therefore, condition (9.132) finally yields

$\int_{-\infty}^{+\infty} e(t)\,w(t - nT_s)\,\mathrm{d}t = 0\,,\quad \forall n \in \mathbb{Z}\,.$
(9.134)

Setting $\tau = nT_s$ in the above and letting Ts gradually diminish, we end up back in the continuous-time domain, and (9.134) yields the following orthogonality condition

$\int_{-\infty}^{+\infty} e(t)\,w(t - \tau)\,\mathrm{d}t = 0\,,\;\forall \tau \in \mathbb{R} \;\Leftrightarrow\; R_{we}(\tau) = 0\,,\;\forall \tau \in \mathbb{R}\,.$
(9.135)

Using the above orthogonality condition

$R_{we}(\tau) = 0 \;\Leftrightarrow\; S_{we}(\omega) = 0 \;\Rightarrow\; S_{xw}(\omega) = S_{yw}(\omega)\,.$
(9.136)

Furthermore, since $S_{yw}(\omega) = H(\omega)\,S_{ww}(\omega)$, the following equation can be derived for the transfer function of the estimation filter

$H(\omega) = \frac{S_{xw}(\omega)}{S_{ww}(\omega)}\,.$
(9.137)

The above expression for the estimator is commonly known as the Wiener filter and is in widespread use across many fields of science and engineering.

The signal power of the minimum square error, achieved by the Wiener filter in (9.137), can be determined as follows

$S_{ee}(\omega) = S_{xe}(\omega) - \underbrace{S_{ye}(\omega)}_{0} = S_{xx}(\omega) - S_{xy}(\omega) = S_{xx}(\omega) - \big[ S_{yy}(\omega) + \underbrace{S_{ey}(\omega)}_{0} \big] \;\Rightarrow\; S_{ee}(\omega) = S_{xx}(\omega) - S_{yy}(\omega)\,.$
(9.138)

For the derivation of the above, it is recalled that

$e = x - y\,.$

Also, using the orthogonality condition (9.135), the following important result can be derived

$S_{ye}(\omega) = H(\omega)\,S_{we}(\omega) \overset{S_{we} = 0}{=} 0 \;\Rightarrow\; S_{ye}(\omega) = 0 = S_{ey}(\omega)\,.$

It is noted here that

$R_{ee}(\tau) = \mathcal{F}^{-1}\{S_{ee}(\omega)\} = \int_{-\infty}^{+\infty} S_{ee}(f)\,\mathrm{e}^{\mathrm{i}\omega\tau}\,\mathrm{d}f \;\Rightarrow\; \langle e, e \rangle = R_{ee}(0) = \int_{-\infty}^{+\infty} S_{ee}(f)\,\mathrm{d}f\,,\quad \omega = 2\pi f\,.$

An alternative expression for the minimum error PSD can be derived on the basis of the following

$S_{yy}(\omega) = |H(\omega)|^{2}\,S_{ww}(\omega) = \frac{|S_{xw}(\omega)|^{2}}{S_{ww}^{2}(\omega)}\,S_{ww}(\omega) = \frac{|S_{xw}(\omega)|^{2}}{S_{ww}(\omega)}\,.$

Then

$S_{ee}(\omega) = S_{xx}(\omega) - S_{yy}(\omega) = \frac{S_{xx}(\omega)\,S_{ww}(\omega) - |S_{xw}(\omega)|^{2}}{S_{ww}(\omega)}\,.$
(9.139)

Further treatment of (9.137) is possible since the additive measurement noise is uncorrelated with the unknown signal x(t) at the instrument input. In this case,

$S_{xw}(\omega) = S_{xu}(\omega) + S_{xv}(\omega) = G^{*}(\omega)\,S_{xx}(\omega) \quad\text{because}\quad S_{xv}(\omega) = 0\,.$
(9.140)

Also

$S_{ww}(\omega) = S_{uu}(\omega) + S_{vv}(\omega) = |G(\omega)|^{2}\,S_{xx}(\omega) + S_{vv}(\omega) \quad\text{since}\quad w = u + v\,.$
(9.141)

Therefore, the Wiener filter in (9.137) becomes as follows

$H(\omega) = \frac{G^{*}(\omega)\,S_{xx}(\omega)}{|G(\omega)|^{2}\,S_{xx}(\omega) + S_{vv}(\omega)}\,.$
(9.142)

The above allows us to obtain an estimate of the waveform x(t) appearing at the instrument input port. As can be seen, to achieve this the signal's (waveform's) statistics need to be known in advance, i. e., the signal PSD $S_{xx}(\omega)$ or autocorrelation $R_{xx}(\tau)$, as well as the dynamics of the measurement instrument, i. e., its transfer function $G(\omega)$ or its impulse response g(t); last, but not least, prior information on the additive noise's statistics, i. e., its PSD $S_{vv}(\omega)$ or autocorrelation $R_{vv}(\tau)$, is required.
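A frequency-domain sketch of (9.142) follows. The AR(1) input signal, the smoothing-kernel instrument response, and the noise level are all illustrative assumptions; circular convolution is used so that the discrete frequency-domain model is exact, and the realization periodogram stands in for the a priori known PSD $S_{xx}(\omega)$.

```python
import numpy as np
from scipy import signal

rng = np.random.default_rng(6)
N = 4096
# Unknown input signal x(t): an AR(1) process (illustrative choice)
x = signal.lfilter([1.0], [1.0, -0.9], rng.standard_normal(N))
# Known instrument impulse response g(t): a smoothing kernel (illustrative)
g = signal.windows.hann(31)
g /= g.sum()
G = np.fft.fft(g, N)

v = 0.3 * rng.standard_normal(N)               # additive measurement noise
w = np.fft.ifft(np.fft.fft(x) * G).real + v    # w = g*x + v (circular, for simplicity)

Sxx = np.abs(np.fft.fft(x)) ** 2 / N           # stand-in for the known PSD S_xx
Svv = np.full(N, 0.3 ** 2)                     # known (white) noise PSD S_vv

H = np.conj(G) * Sxx / (np.abs(G) ** 2 * Sxx + Svv)   # Wiener filter (9.142)
x_hat = np.fft.ifft(H * np.fft.fft(w)).real

# The Wiener estimate should improve on the raw instrument output w
print(np.mean((x - x_hat) ** 2), np.mean((x - w) ** 2))
```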

Two important specific subcases of (9.137) are given below:

  1. Zero-forcing equalization [9.1, 9.5]:

     If $S_{vv}(\omega) \ll |G(\omega)|^{2}\,S_{xx}(\omega)$, then

     $H(\omega) = \frac{1}{G(\omega)}\,.$
     (9.143)

     Evidently, in this subcase neither the statistics of the unknown input signal nor those of the additive noise need to be known in advance. Furthermore, one can easily verify that, if the inverse of the instrument's transfer function $G(\omega)$ does not introduce unstable poles, the estimation error e vanishes.

  2. Matched filter [9.1, 9.5]:

     If $|G(\omega)|^{2}\,S_{xx}(\omega) \ll S_{vv}(\omega)$, then

     $H(\omega) = \frac{G^{*}(\omega)\,S_{xx}(\omega)}{S_{vv}(\omega)}\,.$
     (9.144)

     In this subcase the major impairment toward obtaining an estimate of x(t) is the additive noise, rather than the instrument distortion as was the case previously. Moreover, note that even if $S_{xx}(\omega) < S_{vv}(\omega)$, so that the instrument's output w(t) is noise-like, it is still possible to extract the unknown information signal x(t) by use of the Wiener filter.

Matched-filter estimator design is widely used in communications engineering. At least as a first approximation, the transfer function $G(\omega)$ models the telecommunication channel, which introduces attenuation monotonically increasing with distance from the transmitter source. In a typical system, both the information signal x(t) and the disturbance noise v(t) can be considered white noise, i. e., signals with the following property

$S_{nn}(\omega) = N_0\,,\;\forall\omega \;\Leftrightarrow\; R_{nn}(\tau) = N_0\,\delta(\tau)\,,\;\forall\tau\,.$
(9.145)

As can be seen, a white noise signal demonstrates practically no predictability, since its value at any time instant is entirely uncorrelated with its value at any other time instant in the past or in the future. Therefore, if both the information signal and the additive noise are modeled as white noise, with PSDs $X_0$ and $V_0$, respectively, (9.144) becomes

$H(\omega) = \frac{X_0}{V_0}\,G^{*}(\omega)\,.$
(9.146)

The ratio $(X_0 / V_0)$, with the PSD of the information signal x(t) in the numerator and that of the additive noise in the denominator, is known as the signal-to-noise ratio (SNR) and is a very important parameter characterizing a system's sensitivity to external noise, as well as to exogenous disturbance and crosstalk interference [9.1, 9.5].
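As a closing illustration of the white-signal, white-noise subcase, the sketch below implements the matched filter of (9.146), up to the scalar gain $X_0/V_0$, which does not affect the location of the correlation peak, to locate a known pulse buried in noise; the pulse shape, its amplitude, and the noise level are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(7)
n = np.arange(50)
g = np.sin(2 * np.pi * 0.1 * n) * np.hanning(50)   # known pulse/channel response g(t)

w = 1.5 * rng.standard_normal(1000)                # white noise; in-band S_xx < S_vv
w[400:450] += 3.0 * g                              # pulse hidden at sample 400

# (9.146): H is proportional to G*, i.e., time-domain correlation with the pulse
detector = np.correlate(w, g, mode="valid")
print(int(np.argmax(np.abs(detector))))            # ~ 400: the pulse is recovered
```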

6 Concluding Remarks

In this chapter, an account is given of digital signal processing concepts, methods, and techniques employed in ocean engineering. After looking into the fundamental processes of continuous signal sampling and reconstruction, the Z-transform is introduced as the tool of convenience for analyzing difference equations, just as the Laplace transform is suited to analyzing differential equations. Digital filters are presented next, along with the fast Fourier transform algorithm. Fundamentals of waveform analysis and stochastic processes, as employed for system identification and signal estimation, are presented last.