Abstract
The process of combining signals acquired by a microphone array in order to ‘focus’ on a signal in a specific direction is known as beamforming or spatial filtering. This chapter considers signal-independent (fixed) beamformers, controlled by weights only dependent on the direction of arrival of the source to be extracted, and which do not otherwise depend on the desired signal. Because the weights of these beamformers are given by simple expressions, they present the advantages of being straightforward to implement and of having low computational complexity.
Access provided by CONRICYT-eBooks. Download chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
The process of combining signals acquired by a microphone array in order to ‘focus’ on a signal in a specific direction is known as beamforming or spatial filtering. We present in this chapter a number of such beamforming methods that are specifically controlled by weights dependent only on the direction of arrival (DOA) of the desired source. They are otherwise signal-independent such that they do not depend on the statistics of the desired or noise signals. We derive maximum directivity and maximum white noise gain beamformers that establish performance bounds for spherical harmonic domain (SHD) beamformers. Because the weights of these beamformers are given by simple expressions, they present the advantages of being straightforward to implement and of having low computational complexity.
6.1 Signal Model
The sound pressure P captured at a position \(\mathbf {r} = (r,\varOmega ) = (r,\theta ,\phi )\) (in spherical coordinates, where \(\theta \) denotes the inclination and \(\phi \) denotes the azimuth) on a spherical microphone array of radius r is commonly expressed as the sum of a desired signal X and a noise signal V [12, 15]. In the spatial domain, the signal model is expressed as
where k denotes the wavenumber.Footnote 1 The desired signal X is assumed to be spatially coherent, while the noise signal V models background noise or sensor noise, for example, and may be spatially incoherent, coherent or partially coherent .
When using spherical microphone arrays, it is convenient to work in the SHD [1, 17]. In this chapter, we assume error-free spatial sampling by Q microphones at positions \(\mathbf {r}_q = (r,\varOmega _q), q \in \left\{ 1, \ldots , Q \right\} \), and refer the reader to Chap. 3 for information on spatial sampling and aliasing. By applying the complex spherical harmonic transform (SHT) to the signal model in (6.1), we obtain the SHD signal model
where \(P_{lm}(k)\), \(X_{lm}(k)\) and \(V_{lm}(k)\) are respectively the spherical harmonic transforms of the spatial domain signals \(P(k,\mathbf {r}_q)\), \(X(k,\mathbf {r}_q)\) and \(V(k,\mathbf {r}_q)\), as defined in (3.6), and are referred to as eigenbeams to reflect the fact that the spherical harmonics are eigensolutions of the wave equation in spherical coordinates [26]. The order and degree of the spherical harmonics are respectively denoted as l and m.
By combining the eigenbeams \(P_{lm}(k)\) in a particular way, the noise V can be suppressed and the desired signal X can be extracted from the noisy mixture P. This is accomplished using a spatio-temporal filter or beamformer. In the spatial domain, the output of a beamformer is obtained as the weighted sum of the pressure signals at each of the microphones [3, 4]; in the SHD, the beamformer output is given by a weighted sum of the eigenbeams \(P_{lm}(k)\) [14, 21]. The output of an Lth-order SHD beamformer can thus be expressed as [21, Eq. 12]Footnote 2
where \(W_{lm}(k)\) denotes the beamformer weights and \((\cdot )^{*}\) denotes the complex conjugate.
Beamformers can either be signal-independent (fixed) or signal-dependent; their weights are chosen in order to achieve specific performance objectives. Signal-independent beamformers apply a constraint to a specific steering direction and optimize the beamformer weights with respect to array performance measures such as the white noise gain (WNG) and directivity. They can also, more generally, attempt to achieve a specific spatial response in all directions by minimizing the difference between the beamformer’s spatial response and the desired spatial response, according to some distance measure (see [6, Sects. 8.3 and 8.4] for examples). Signal-dependent beamformers optimize the weights taking into account characteristics of the desired signal and noise. In this chapter, we will discuss signal-independent beamformers and later address signal-dependent beamformers in Chap. 7 .
A block diagram of a signal-independent beamformer is shown in Fig. 6.1. We begin by capturing the sound pressure signals \(P(k,\mathbf {r}_q)\) at microphones \(q \in \{ 1, \ldots , Q \}\), and applying the SHT to obtain the SHD sound pressure signals, the eigenbeams \(P_{lm}(k)\), gathered together to form a vector \(\mathbf {p}(k)\). The output Z(k) of the beamformer is obtained by taking the weighted sum of these eigenbeams, where the weights \(W_{lm}(k,\varOmega _{\text {u}})\) depend only on the steering direction \(\varOmega _{\text {u}}\) and do not otherwise depend on the sound pressure signals P.
The signal-independent beamformers presented in this chapter are designed assuming anechoic conditions with a single active sound source, though these assumptions are unlikely to be valid in practical use scenarios. Depending on the distance between this source and the array, the desired signal is either assumed to consist of a plane wave or a spherical wave. Under farfield conditions, the eigenbeams of a unit amplitude plane wave incident from a direction \(\varOmega _{\text {s}}\) are given by (3.22a). The SHD sound pressure \(X_{lm}(k,\varOmega _{\text {s}})\) related to a plane wave with power \(P_{\text {pw}}(k)\) can then be written as [18, 20, 26 ]
where \(Y_{lm}(\varOmega _{\text {s}})\) denotes the complex spherical harmonicFootnote 3 of order l and degree m evaluated at an angle \(\varOmega _{\text {s}}\), as defined in (2.14), and the mode strength \(b_l(k)\) captures the eigenbeams’ dependence on the array properties, such as microphone type or array configuration, and is discussed in more detail in Sect. 3.4.2.
All the beamformers designed in this chapter seek to suppress the noise while maintaining a distortionless constraint on the signal originating from the steering direction \(\varOmega _{\text {u}}\). This constraint is expressed as
It is important to note that this distortionless constraint depends only on the steering direction \(\varOmega _{\text {u}}\). It is different from the distortionless constraint imposed in Chap. 7, which takes into account the complex multipath propagation effects of a reverberant environment. Using the constraint in (6.5) can be appealing, as it does not require the estimation of the acoustic transfer functions (ATFs) or relative transfer functions, however this comes at the expense of sensitivity to errors in the steering direction and reduced robustness to reverberation .
For convenience, the SHD signal model in (6.2) can also be expressed in vector form as
where the SHD signal vector \(\mathbf {p}(k)\) of length \((L+1)^2\) is defined as
and \(\mathbf {x}(k)\) and \(\mathbf {v}(k)\) are defined similarly to \(\mathbf {p}(k)\). The beamformer output signal Z(k) can be expressed as
where the filter weights vector is defined as
In matrix form the desired signal is written as
where the vector of spherical harmonics \(\mathbf {y}(\varOmega _{\text {s}})\) of length \((L+1)^2\) is defined as
and the \((L+1)^2 \times (L+1)^2\) matrix of mode strengths \(\mathbf {B}(k)\) is defined as
therefore \(\mathbf {B}(k)\) consists of \(2l+1\) repetitions of \(b_l(k)\) for \(l \in \left\{ 0, \ldots , L \right\} \) along its diagonal. Finally, the distortionless constraint is given by
6.2 Design Criteria
In this section, we introduce a number of measures that can be used to design optimal beamformers as in Sect. 6.3. It should be noted that these measures are defined with respect to the signals with physical significance, namely the spatial domain signals, and not with respect to the eigenbeams. Nevertheless, these measures will still depend on the eigenbeams as they form a part of the spherical harmonic expansion (SHE) of the spatial domain signals.
6.2.1 Directivity
Directivity is a measure of a beamformer’s spatial selectivity and quantifies its ability to suppress sound waves that do not originate from a specifically chosen steering direction. It is defined as the ratio of the power of the beamformer output due to a plane wave arriving from the steering direction \(\varOmega _{\text {u}}\) to the power of the beamformer output averaged over all directions [28]. The directivity \(\mathcal {D}(k)\) is therefore written as
where the notation \(\int _{\varOmega \in \mathcal {S}^2} \text {d}\varOmega \) is used to denote compactly the solid angle \(\int _{\phi = 0}^{2\pi } \int _{\theta = 0}^{\pi } \sin \theta \text {d}\theta \text {d}\phi \). Applying the distortionless constraint (6.5), and by substituting the expression for a plane wave (6.4) into (6.12), we find
Using the orthonormality of the spherical harmonics (2.18), this can be simplified toFootnote 4
or in vector form
where \(\left| \left| \cdot \right| \right| \) denotes the 2-norm. The directivity is therefore a function of the array properties, such as radius or microphone type, and the beamformer weights \(W_{lm}(k)\).
The directivity is frequently expressed in dB and is then referred to as the directivity index (DI),
6.2.2 Front-to-Back Ratio
The front-to-back ratio is another alternative measure of a beamformer’s spatial selectivity and quantifies its ability to differentiate between sound waves that originate from the front and the back. It is defined as the ratio of the average power of the beamformer output due to a plane waves arriving from the front to the average power of the beamformer output due to plane waves arriving from the back. The front-to-back ratio \(\mathcal {F}(k)\) is therefore written as [7 ]
where for a beamformer steered to \((\pi /2,\pi /2)\) we have
and
6.2.3 White Noise Gain
White noise gain (WNG) is a measure of a beamformer’s robustness against sensor noise and errors in microphone placement and steering direction [10], and is defined as the array gain in the presence of spatially incoherent noise [28], i.e., the ratio of the signal-to-noise ratio (SNR) at the beamformer output (\(\text {oSNR}\)) to the SNR at the beamformer input (\(\text {iSNR}\) ) .
We now derive the WNG for a spherical microphone array employing a set of microphones uniformly distributed on the sphere. The desired signal power is different at each microphone, particularly for a rigid sphere where the scattering effects depend on the angle of incidence [16]. When calculating the \(\text {iSNR}\), the desired signal power is therefore averaged over the sphere .
Let us assume that the noise at each microphone has equal power \(\sigma _{v}^2(k)\). The input SNR is then given by
where (6.21b) is obtained using the spherical harmonic decomposition of \(X(k,\mathbf {r})\). Assuming plane-wave incidence from a direction \(\varOmega _{\text {s}}\), by substituting (6.4) into (6.21), we find
Using Unsöld’s theorem [29], a special case of the spherical harmonic addition theorem (2.23), and the orthonormality of the spherical harmonics, we simplify (6.22) to
The input SNR is therefore a function of the plane wave power \(P_{\text {pw}}(k)\), the array properties, via the mode strength \(b_l(k)\), and the noise power \(\sigma _{v}^2(k)\).
The output SNR is given by
Applying the distortionless constraint (6.5), this reduces to
With Q microphones uniformly distributed on the sphere, the cross power spectral density of the noise is given by [31, Eq. 7.31]
where \(\delta \) denotes the Kronecker delta, and \(\text {oSNR}\) simplifies to
The output SNR is a function of the beamformer weights \(W_{lm}(k)\), the plane wave power \(P_{\text {pw}}(k)\), the noise power \(\sigma _{v}^2(k)\), and the beamformer order L. The beamformer order can be increased by adding microphones, as discussed in Sect. 3.4.
Finally, the WNG can be expressed as
The WNG is a function of the beamformer weights \(W_{lm}(k)\), array order L and the array properties. As expected, it is also an increasing function of the number of microphones Q. In the case of an open sphere, \(b_l(k) = i^l j_l(kr)\), and since \(\sum _{l=0}^{\infty } \left| j_l(kr)\right| ^2 (2l+1) = 1\) [2, 13], the WNG is given by the simple expression
6.2.4 Spatial Response
The output of the beamformer in the presence of a single unit amplitude plane wave originating from a DOA \(\varOmega \) is given by
and is known as the spatial response of the beamformer. The square magnitude of the spatial response \(\mathcal {B}(k,\varOmega )\) is referred to as the beam pattern [4].Footnote 5 The beam pattern describes the beamformer’s ability to select signals originating from a direction of interest, while suppressing signals that do not. Beam patterns typically exhibit multiple peaks or lobes; the largest lobe, in the direction of interest, is referred to as the main lobe, while the other lobes are referred to as sidelobes. Due to the effects of spatial aliasing, some sidelobes may have an amplitude equal to that of the main lobe, and they are then referred to as grating lobes [27].
Due to the spherical symmetry of the SHD, the beam pattern can also be expressed as a function of the angle between the DOA \(\varOmega \) and the beamformer’s steering direction \(\varOmega _{\text {u}}\), denoted as \(\varTheta \). Ideally, the response in the steering direction, \(\mathcal {B}(k,\varTheta = 0)\), should be as large as possible compared to the response in other directions, i.e., the sidelobe levels should be minimized. We refer to the width of the region that has a higher response than the maximum sidelobe level as the main lobe width,Footnote 6 as illustrated in Fig. 6.2 .
6.3 Signal-Independent Beamformers
Having established our signal model in Sect. 6.1, we now develop a number of signal-independent beamformers based on the design criteria introduced in Sect. 6.2. The beam patterns of all the beamformers presented in this section are rotationally symmetric about the steering direction.
6.3.1 Farfield Beamformers
In this section, we derive three beamformers suitable for use in farfield conditions: a maximum directivity beamformer, a maximum WNG beamformer, and a multiply constrained beamformer.
6.3.1.1 Maximum Directivity Beamformer
The beamformer that maximizes the directivity while imposing a distortionless constraint in the steering direction satisfies
or equivalently,
where \(\mathbf {y}(\varOmega _{\text {u}})\) is the vector of spherical harmonics defined in (6.9).
Following the approach proposed by Brandwood [5], if we use a Lagrange multiplier to adjoin the constraint to the cost function, the weights of the maximum directivity beamformer are then given by
where \(\mathcal {L}\) is the complex Lagrangian given by
and \(\lambda \) is the Lagrange multiplier. Setting the gradient of \(\mathcal {L}(\mathbf {w}_{\mathrm {maxDI}}(k), \lambda )\) with respect to \(\mathbf {w}^{*}_{\mathrm {maxDI}}\) to zero yields
where \(\mathbf {0}_N\) is a column vector of N zeros. Using the constraint in (6.31), we then find
Using Unsöld’s theorem [29], this simplifies to
or in scalar form
A well-known farfield SHD beamformer is the plane-wave decomposition (PWD) beamformer, also sometimes known as a regular beamformer [24], whose weights are given by [22]
As the (frequency-independent) scaling factor does not affect the directivity, the PWD beamformer is also a maximum directivity beamformer. The reason for the name PWD will become clear in the next paragraph.
Assuming a single unit amplitude plane wave is incident upon the array from a direction \(\varOmega _{\text {s}}\), the output Z(k) of the PWD beamformer is given by
where \(\varTheta \) is the angle between \(\varOmega _{\text {s}}\) and \(\varOmega _{\text {u}}\) and is the Legendre polynomial of order L. The Christoffel summation formula [11, Sect. 8.915] is used to obtain (6.38d) [20]. The beamformer output Z(k) reaches its maximum when \(\varTheta = 0\), such that the steering direction \(\varOmega _{\text {u}}\) is equal to the arrival direction \(\varOmega _{\text {s}}\), as desired. We normalize the beamformer output with respect to its value for \(\varTheta = 0\), and plot it as a function of \(\varTheta \) in Fig. 6.3. We see that as L increases, the distribution of Z(k) narrows around \(\varTheta = 0\), tending towards a delta function for \(L \rightarrow \infty \) [31, Eq. 6.47].
The directivity of the maximum directivity beamformer is given by substituting (6.35) into (6.16)Footnote 7
The directivity of the maximum directivity beamformer is therefore frequency-independent and only depends on the beamformer order L.
Since at least \((L+1)^2\) microphones are required to sample a sound field up to order L without spatial aliasing, the directivity is upper bounded by the number of microphones Q. This is also the maximum directivity of a spatial domain beamformer based on a standard linear array [28, Eq. 2.160].
The WNG of the maximum directivity beamformer is given by substituting (6.35) into (6.28)
In the open sphere case, this simplifies toFootnote 8
or in matrix form
In Fig. 6.4, we plot the WNG of the maximum directivity beamformer of order \(L = 4\) as a function of the product of the wavenumber k and array radius r, kr, for an array of \(Q = 32\) microphones. Assuming a speed of sound of 343 \(\text {m}\cdot \text {s}^{-1}\), a kr value of 1 corresponds to a frequency of 1.1 kHz for an array radius of \(r = 10\) cm, for example. It can be seen that the beamformer’s WNG is low except at high frequencies or large array radii. When an open sphere is used, the maximum directivity beamformer has particularly poor robustness at certain values of kr; this is due to the presence of zeros in the open sphere mode strength (see Sect. 3.4.2). The rigid sphere does not present this issue, and in addition provides an increase in WNG of approximately 3.7 dB over the open sphere at low values of kr .
6.3.1.2 Maximum White Noise Gain Beamformer
The beamformer that maximizes the WNG while imposing a distortionless constraint in the steering direction satisfies
or equivalently,
Proceeding in a similar way as for the analysis of the maximum directivity beamformer, if we use a Lagrange multiplier to adjoin the constraint to the cost function, the weights of the maximum directivity beamformer are then given by
where \(\mathcal {L}\) is the complex Lagrangian given by
and \(\lambda \) is the Lagrange multiplier. Setting the gradient of \(\mathcal {L}(\mathbf {w}_{\mathrm {maxWNG}}(k), \lambda )\) with respect to \(\mathbf {w}^{*}_{\mathrm {maxWNG}}\) to zero yields
where \(\mathbf {0}_N\) is a column vector of N zeros. Using the constraint in (6.43), we then find
Using Unsöld’s theorem [29], this simplifies to
or in scalar form
A well-known farfield SHD beamformer is the delay-and-sum beamformer, whose weights are given by [22]
In the case of an open sphere, \(b_l(k) = i^l j_l(kr)\), and since \(\sum _{l=0}^{\infty } \left| j_l(kr)\right| ^2 (2l+1) = 1\) [2], the following relationship between the maximum WNG and delay-and-sum beamformers is obtained:
When an open sphere is used, the delay-and-sum beamformer therefore approaches a maximum WNG beamformer as \(L \rightarrow \infty \) (ignoring the \(4 \pi \) scaling factor, which does not affect the WNG). For a finite L and/or if another microphone type or array configuration is used (such as a rigid sphere), the delay-and-sum beamformer is slightly suboptimal.
The delay-and-sum beamformer owes its name to the fact that for an open sphere as \(L \rightarrow \infty \), its output converges to the output of the widely known spatial domain delay-and-sum beamformer [22].
The directivity of the maximum WNG beamformer is given by substituting (6.47) into (6.16)
The WNG of the maximum WNG beamformer is given by substituting (6.47) into (6.28)
Using Unsöld’s theorem [29], this simplifies to
In the open sphere case, the WNG approaches Q as \(L \rightarrow \infty \) (as in [22]), so it can be seen that the maximum WNG beamformer achieves a constant WNG of Q that is independent of frequency. This is also the highest achievable WNG for a distortionless beamformer in the spatial domain [28].
In Fig. 6.5, we plot the DI of the maximum directivity and maximum WNG beamformers of order \(L = 4\) as a function of kr for an array of \(Q = 32\) microphones. As expected, the maximum directivity beamformer provides the highest directivity; while the maximum WNG beamformer has poor directivity at low values of kr (i.e., low frequencies or small array radii). Due to the effects of scattering introduced by the rigid sphere (see Sect. 3.4.1), the maximum WNG beamformer has better directivity with a rigid array than with an open array. The directivity of the maximum directivity beamformer is independent of kr, while for the maximum WNG beamformer the directivity decays as kr decreases, tending towards 0 dB (i.e., no directivity) .
The WNG of the maximum WNG beamformer of order \(L = 4\) is shown in Fig. 6.4; as expected, it provides the highest WNG. Using Figs. 6.4 and 6.5, it can be observed that there is a tradeoff between WNG and directivity. The maximum directivity and WNG beamformers provide performance bounds for SHD beamformers in terms of directivity and WNG, and are attractive due to their low computational complexity. However, in practice a compromise solution is desirable, such as the multiply constrained beamformer presented in Sect. 6.3.1.3, or the signal-dependent beamformers in Chap. 7, which adaptively control the tradeoff between these two objectives depending on the nature of the noise to be suppressed.
6.3.1.3 Multiply Constrained Beamformer
Another approach to the design of a signal-independent beamformer is to minimize its sidelobe levels for a given main lobe width, to ensure that interfering signals that do not originate from the steering direction are effectively suppressed. However, in order to obtain a beamformer that is robust to errors in sensor position and steering direction, and to sensor noise, it is desirable to introduce a constraint on the beamformer’s WNG.
In [25], the authors propose a robust minimum sidelobe beamformer, which minimizes the maximum sidelobe level, subject to a distortionless constraint in the steering direction and a minimum WNG constraint. The objective can therefore be expressed in the form of a minimax criterion as
where \(\mathcal {B}(k,\varTheta )\) is the spatial response of the beamformer, \(\varTheta \) denotes the angle between the steering direction and the DOA, \(\varDelta \) denotes the main lobe width (as defined in Sect. 6.2.4), and \(\zeta \) is the minimum WNG. The sidelobe region is defined as \(\varTheta _{\text {SL}} = \left\{ \varTheta | \varTheta > \varDelta /2 \right\} \).
As shown in [25], the problem in (6.54) can be reformulated as a convex optimization problem, solvable using second-order cone programming. The sidelobe region is approximated using a finite grid \(\varTheta _{n_\text {g}} \in \varTheta _{\text {SL}}, n_\text {g} \in \{ 1, \ldots , N_\text {g}\}\); the approximation then improves as \(N_\text {g}\) increases.
Finding a solution to (6.54) can be computationally intensive. However, a significant advantage of SHD beamforming is that if the desired beam pattern is rotationally symmetric about the steering direction \(\varOmega _{\text {u}}\), the process of computing the beamformer weights and steering of the beamformer can be decoupled. In this case, the beamformer weights are expressed as \(W_{lm}(k) = C_{l}(k) Y_{lm}^{*}(\varOmega _{\text {u}})\), and the weights \(C_l(k)\) then become the quantities to be optimized. If the desired beam pattern is not rotationally symmetric about the steering direction, the beam pattern can be rotated by multiplying the SHD beamformer weights by Wigner-D functions that depend on the rotation angles, as proposed in [23 ] .
6.3.2 Nearfield Beamformers
In this chapter, we have until now assumed that the desired signal was due to a single plane wave, i.e., farfield conditions. However, under nearfield conditions, the plane wave assumptions cannot be considered valid. The SHD sound pressure due to a spherical wave originating from a source at a position \(\mathbf {r}_{\text {s}} = (r_{\text {s}},\varOmega _{\text {s}})\) is given by
where \(X_{\text {sw}}(k)\) denotes the spherical wave amplitude and the nearfield mode strength \(b_l^{\text {nf}}(k,r_{\text {s}})\) is given by
and \(h_l^{(2)}\) is the spherical Hankel function of the second kind and of order l.
Beamformers suitable for nearfield conditions [8, 9, 19] can be designed by replacing the farfield mode strength expression \(b_l(k)\) with the nearfield mode strength \(b_l^{\text {nf}}(k,r_{\text {s}})\) in the beamformer weights. For example, the weights of a nearfield plane-wave decomposition beamformer are given by
instead of (6.37). While this process is straightforward, it does require knowledge of the source-array distance \(r_{\text {s}}\). If the source-array knowledge is not known, the source-array distance \(r_{\text {s}}\) becomes a controllable parameter, which is effectively a look distance and enables radial discrimination [9].
An appropriate boundary between the farfield and nearfield regions can be determined by comparing the magnitudes of the farfield mode strength \(b_l(k)\) and the nearfield mode strength \(b_l^{\text {nf}}(k,r_{\text {s}})\), as proposed in [8]. Using this criterion, the cut-off distance \(r_{\text {nf}}\) is determined as
The extent of the nearfield region therefore decreases with frequency. An array with good radial discrimination, i.e., a large nearfield region, can be realized either at low frequencies (small k), or by oversampling the array (large N) [9].
Example: At a frequency of 100 Hz, assuming a speed of sound of 343 \(\text {m}\cdot \text {s}^{-1}\) and an array order \(L = 4\), the cut-off distance is \(r_{\text {nf}}(k) = 2.2\) m, while at a frequency of 4 kHz it is 5.5 cm.
6.4 Chapter Summary
An overview of beamforming in the SHD using signal-independent beamformers has been presented. We introduced a number of performance measures, which were then used to derive beamformers weights that are optimal with respect to these measures. We also showed the relationship between these optimal beamformers and two well-known SHD beamformers: the PWD and delay-and-sum beamformers. Finally, where similarities existed, the performance bounds for SHD beamformers were related to previously derived bounds for spatial domain beamformers.
Notes
- 1.
The dependency on time is omitted for brevity. In practice, the signals acquired using a spherical microphone array are usually processed in the short-time Fourier transform domain, as explained in Sect. 3.1, where the discrete frequency index is denoted by \(\nu \).
- 2.
We use the complex conjugate weights \(W^*_{lm}\) rather than the weights \(W_{lm}\); this notational convention originates in the spatial domain [30].
- 3.
If the real SHT is applied instead of the complex SHT, the complex spherical harmonics \(Y_{lm}\) used throughout this chapter should be replaced with the real spherical harmonics \(R_{lm}\), as defined in Sect. 3.3.
- 4.
- 5.
Note that in some publications, such as [28], \(\mathcal {B}(k,\varOmega )\) is referred to as the beam pattern, and its square magnitude is referred to as the power pattern .
- 6.
The main lobe width is sometimes also defined as the width of the region where the beam pattern is no less than half of its maximum value, or equivalently, no more than 3 dB below its maximum value.
- 7.
This expression is identical to (12) in [22] if we substitute \(d_n = 1\).
- 8.
References
Abhayapala, T.D., Ward, D.B.: Theory and design of high order sound field microphones using spherical microphone array. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, pp. 1949–1952 (2002). doi:10.1109/ICASSP.2002.1006151
Abramowitz, M., Stegun, I.A. (eds.): Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. Dover Publications, New York (1972)
Benesty, J., Chen, J., Huang, Y.: Microphone Array Signal Processing. Springer, Berlin (2008)
Brandstein, M.S., Ward, D.B. (eds.): Microphone Arrays: Signal Processing Techniques and Applications. Springer, Berlin (2001)
Brandwood, D.H.: A complex gradient operator and its application in adaptive array theory. Proc. IEEE 130(1, Parts F and H), 11–16 (1983)
Doclo, S.: Multi-microphone noise reduction and dereverberation techniques for speech applications. Ph.D. thesis, Katholieke Universiteit Leuven, Belgium (2003)
Elko, G.W.: Differential microphone arrays. In: Huang, Y., Benesty, J. (eds.) Audio Signal Processing for Next-Generation Multimedia Communication Systems, pp. 2–65. Kluwer (2004)
Fisher, E., Rafaely, B.: The nearfield spherical microphone array. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5272–5275 (2008). doi:10.1109/ICASSP.2008.4518849
Fisher, E., Rafaely, B.: Near-field spherical microphone array processing with radial filtering. IEEE Trans. Audio, Speech, Lang. Process. 19(2), 256–265 (2011). doi:10.1109/TASL.2010.2047421
Gilbert, E., Morgan, S.: Optimum design of directive antenna arrays subject to random variations. Bell Syst. Tech. J. 34, 637–663 (1955)
Gradshteyn, I.S., Ryzhik, I.M.: Table of Integrals, Series, and Products, seventh edn. Academic Press, Cambridge (2007)
Habets, E.A.P., Benesty, J., Cohen, I., Gannot, S., Dmochowski, J.: New insights into the MVDR beamformer in room acoustics. IEEE Trans. Audio, Speech, Lang. Process. 18, 158–170 (2010)
Jarrett, D.P., Habets, E.A.P.: On the noise reduction performance of a spherical harmonic domain tradeoff beamformer. IEEE Signal Process. Lett. 19(11), 773–776 (2012)
Jarrett, D.P., Habets, E.A.P., Benesty, J., Naylor, P.A.: A tradeoff beamformer for noise reduction in the spherical harmonic domain. In: Proceedings of the International Workshop Acoustics Signal Enhancement (IWAENC). Aachen, Germany (2012)
Jarrett, D.P., Habets, E.A.P., Naylor, P.A.: Spherical harmonic domain noise reduction using an MVDR beamformer and DOA-based second-order statistics estimation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 654–658. Vancouver, Canada (2013)
Jarrett, D.P., Thiergart, O., Habets, E.A.P., Naylor, P.A.: Coherence-based diffuseness estimation in the spherical harmonic domain. In: Proceedings of the IEEE Convention of Electrical and Electronics Engineers in Israel (IEEEI). Eilat, Israel (2012)
Meyer, J., Agnello, T.: Spherical microphone array for spatial sound recording. In: Proceedings of the Audio Engineering Society Convention, pp. 1–9. New York (2003)
Meyer, J., Elko, G.: A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), vol. 2, pp. 1781–1784 (2002)
Meyer, J., Elko, G.W.: Position independent close-talking microphone. Signal Processing 86(6), 1254–1259 (2006). doi:10.1016/j.sigpro.2005.05.036
Rafaely, B.: Plane-wave decomposition of the pressure on a sphere by spherical convolution. J. Acoust. Soc. Am. 116(4), 2149–2157 (2004)
Rafaely, B.: Analysis and design of spherical microphone arrays. IEEE Trans. Speech Audio Process. 13(1), 135–143 (2005). doi:10.1109/TSA.2004.839244
Rafaely, B.: Phase-mode versus delay-and-sum spherical microphone array processing. IEEE Signal Process. Lett. 12(10), 713–716 (2005). doi:10.1109/LSP.2005.855542
Rafaely, B., Kleider, M.: Spherical microphone array beam steering using Wigner-D weighting. IEEE Signal Process. Lett. 15, 417–420 (2008). doi:10.1109/LSP.2008.922288
Rafaely, B., Peled, Y., Agmon, M., Khaykin, D., Fisher, E.: Spherical microphone array beamforming. In: Cohen, I., Benesty, J., Gannot, S. (eds.) Speech Processing in Modern Communication: Challenges and Perspectives, Chap. 11. Springer, Heidelberg (2010)
Sun, H., Yan, S., Svensson, U.P.: Robust minimum sidelobe beamforming for spherical microphone arrays. IEEE Trans. Audio, Speech, Lang. Process. 19(4), 1045–1051 (2011). doi:10.1109/TASL.2010.2076393
Teutsch, H.: Wavefield decomposition using microphone arrays and its application to acoustic scene analysis. Ph.D. thesis, Friedrich-Alexander Universität Erlangen-Nürnberg (2005)
van Trees, H.L.: Detection, Estimation, and Modulation Theory, Optimum Array Processing, vol. IV. Wiley, New York (2002)
van Trees, H.L.: Optimum Array Processing. Detection Estimation and Modulation Theory. Wiley, New York (2002)
Unsöld, A.: Beiträge zur Quantenmechanik der Atome. Annalen der Physik 387(3), 355–393 (1927). doi:10.1002/andp.19273870304
van Veen, B.D., Buckley, K.M.: Beamforming: a versatile approach to spatial filtering. IEEE Acoust. Speech Signal Mag. 5(2), 4–24 (1988)
Williams, E.G.: Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography, 1st edn. Academic Press, London (1999)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Jarrett, D.P., Habets, E.A.P., Naylor, P.A. (2017). Signal-Independent Array Processing. In: Theory and Applications of Spherical Microphone Array Processing. Springer Topics in Signal Processing, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-319-42211-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-42211-4_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42209-1
Online ISBN: 978-3-319-42211-4
eBook Packages: EngineeringEngineering (R0)