Keywords

1 Introduction

As psychoacoustics theory puts the fact that human ears are more sensitive to sounds from horizontal direction at the height of ears, it is more valuable to reproduce desired sound field on two-dimension field. Early studies regard it as 2D reproduction problem where secondary sources are modelled as vertical line sources [1]. However, due to the acoustic characteristics of the loudspeaker, modelling it as 3D point source is more reasonable in practice. Due to the intrinsic dimensionality mismatch, modelling secondary sources as 3D point sources to reproduce 2D sound field is named as 2.5D reproduction [2,3,4,5]. Most existing methods for 2.5D reproduction can make good performance in addressing dimensionality mismatch problem [2, 6]. However, these approaches focus on free-field condition while the reverberation caused by time-varying room responses will impair the generated sound field [1]. Thus, room compensation (or room equalization) has been proposed to reduce or eliminate the reverberant effects.

As for room compensation, current proposed methods can be divided into two ways: passive compensation manners and active manners. Passive compensation methods reduce wall reflections by acoustic absorption materials. Nevertheless, it gets costly and impractical especially at low frequencies in many real-world application scenarios. Thus, researchers put forward compensation methods by active manners. In order to equalize the effects of reverberation, adding appropriate compensation signals on the loudspeaker arrays is the core. Currently, most proposed room compensation techniques are based on Multiple Input Multiple Output (MIMO) system [7, 8], which can only reduce the effect of reverberation at discrete points and its adjacent region. Designing compensation signals using modal domain processing [9] achieves compensation within a continuous region. However, this has only been considered for 2D reproduction.

In this paper, we propose an active room compensation approach in 2.5D reproduction through modal domain processing. Section 2 reviews the 2.5D reproduction using sectorial mode matching algorithm, and we propose an active room compensation algorithm in Sect. 3. In Sect. 4, the proposed algorithm is simulated and evaluated by reproduction of narrowband and broadband signals under reverberant environment.

2 2.5D Reproduction

2.1 Problem Formulation

In modal-domain, we express the 2D desired sound field at any point \(\varvec{x}=\left\{ r_x,\phi _x \right\} \) through the interior solution of wave equation [10]

$$\begin{aligned} P_d(\varvec{x},k)\approx \sum _{m = -M}^{M}\alpha _m(k)J_m(kr_x)e^{im\phi _x}, \end{aligned}$$
(1)

 

where k represents the wave number, \(\alpha _m(k)\) represents sound field coefficients, \(J_m(\cdot )\) represents cylindrical Bessel function. The truncation order is determined as \(M=\lceil {ekR/2}\rceil \) [11] where R is the radius of control region.

Consider adopting circular loudspeaker array to generate the desired sound field exactly as (1) shows, where the array is implemented on horizontal plane, the number of loudspeakers has to satisfy \(L \ge 2M+1\) [12].

Thus, reproduced sound field in free-field is formulated as

$$\begin{aligned} P(\varvec{x},k)=\sum _{l = 1}^{L}d_l(k)H_l(\varvec{x},k), \end{aligned}$$
(2)

where \(H_l(\varvec{x},k)\) is acoustic transfer function (ATF) of the lth loudspeaker and the observation spot x, \(d_l(k)\) is the driving signal of lth loudspeaker.

The expression of \(H_l(\varvec{x},k)\) is as follows

$$\begin{aligned} H_l(\varvec{x},k)=\frac{e^{-ik \Vert \varvec{y}_l-\varvec{x}\Vert }}{4\pi \Vert \varvec{y}_l-\varvec{x}\Vert }, \end{aligned}$$
(3)

where \(\varvec{y}_l=\left\{ r_l,\phi _l \right\} \) represents the loudspeaker location and \(\Vert \cdot \Vert \) denotes the Euclidean distance of the vectors.

In modal domain, \(H_l(\varvec{x},k)\) is formulated as

$$\begin{aligned} H_l(\varvec{x},k)\approx \sum _{m = -M}^{M}\sum _{n = \mid m \mid }^{M}\gamma _n^m(l,k)j_n(kr_x)Y_n^m(\frac{\pi }{2},\phi _x), \end{aligned}$$
(4)

where \(\gamma _n^m(l,k)\) is the ATF coefficient of lth source. In free field condition,

$$ \gamma _n^m(l,k)=-ikh_n^{(2)}(kr_l)\overline{Y_n^m(\frac{\pi }{2},\phi _l}), $$

Substituting (4) into (2) gives the expression

$$\begin{aligned} P(\varvec{x},k)\approx \sum _{m = -M}^{M}\sum _{l = 1}^{L}d_l(k)\sum _{n = \mid m \mid }^{M}\gamma _n^m(l,k)j_n(kr_x)Y_n^me^{im\phi _x}, \end{aligned}$$
(5)

where the spherical harmonic function at elevation \(\theta =\pi /2\) is defined as \(Y_n^m(\frac{\pi }{2},\phi _x)=Y_n^m e^{im\phi _x}\), with \(Y_n^m\triangleq A_n^m P_n^m(0)\), \(A_n^m= \sqrt{\frac{2n+1}{4\pi }\frac{(n-\mid m \mid )!}{(n+\mid m \mid )!}}\) and \(P_n^m(\cdot )\) represents associated Legendre function.

The loudspeaker driving signals are designed by equating (1) with (5). Using the orthogonality property of complex exponentials, it gives the following equation

$$\begin{aligned} \alpha _m(k)J_m(kr_x)=\sum _{l = 1}^{L}d_l(k)\underbrace{\sum _{n = \mid m \mid }^{M}\gamma _n^m(l,k)j_n(kr_x)Y_n^m}_{h_m(l,k,r_x)}, \end{aligned}$$
(6)

for \(m=-M, ... ,M\). From (6), it shows that the expansion is only over mode m and the Bessel function \(J_m(kr_x)\) denotes the radial propagation in 2D sound field, and the modal expansion is over both n and m and the spherical Bessel function \(j_n(kr_x)\) denotes the radial propagation in 3D sound field, which reflects the dimensionality mismatching. The solution of this problem is finding appropriate matching distance \(r_x\).

2.2 Sectorial Mode Matching

It has been proved [2] that adopting the center of reproduction region as the matching spot (\(r_x=0\)) can reach a relatively less distortion due to dimensionality mismatch. Note that when \(kr_x \rightarrow 0\), the summation over order n in (4) reduces to the single term \(n = |m |\) [2, 13] where \(Y_{\mid m \mid }^m(\frac{\pi }{2},\phi _x)\) is termed as the sectorial harmonics, and \(h_m(l,k,r_x)\) in (6) is expressed as

$$\begin{aligned} h_m(l,k,r_x)=\gamma _{\mid m \mid }^m(l,k)j_{\mid m \mid }(kr_x)Y_{\mid m \mid }^m, \end{aligned}$$
(7)

Thus, the driving signals \(d_l(k)\) are derived by matrix

$$\begin{aligned} \mathbf{d} = \mathbf{H} ^{\dagger }{} \mathbf{b} , \end{aligned}$$
(8)

where

$$\begin{aligned} \mathbf{d} =\begin{bmatrix} d_{l_1}(k)&\cdots&d_{l_L}(k) \end{bmatrix}^T, \end{aligned}$$
$$\begin{aligned} \mathbf{H} ^{\dagger }=\begin{bmatrix} h_{-M}(l_1,k,r_x) &{} \dots &{} h_{-M}(l_L,k,r_x)\\ \vdots &{} \ddots &{} \vdots \\ h_{M}(l_1,k,r_x) &{} \dots &{} h_M(l_L,k,r_x) \end{bmatrix}^{\dagger }, \end{aligned}$$
$$\begin{aligned} \mathbf{b} =\begin{bmatrix} \alpha _{-M}(k)J_{-M}(kr_x)&\cdots&\alpha _{M}(k)J_{M}(kr_x) \end{bmatrix}^T. \end{aligned}$$

The dependence on wavenumber k is omitted for notation simplicity. Then, driving signals can be expressed simply as

$$\begin{aligned} \mathbf{d} =(\mathbf{H} ^\mathbf{d} )^{\dagger }{} \mathbf{b} , \end{aligned}$$
(9)

where \(\mathbf{H} ^\mathbf{d} \) denotes the direct-path loudspeaker array ATF in modal domain.

3 Active Room Compensation

The previous section reviews sectorial mode matching algorithm for 2.5D reproduction. This section introduces the active room compensation algorithm for 2.5D reproduction in modal domain.

We use a circular microphone array of Q microphones to encircle the control region with the microphones uniformly placed. Figure 1 shows the system setup.

Fig. 1.
figure 1

Active room compensation system setup: the loudspeaker array and a plane control region circled by equalizer microphone array.

Note that the aim here is to compensate the reverberation caused by the loudspeaker signals. Based on Eq. (7) and given the fact that only the sectorial modes are controlled, derive the sound field coefficients measured by equalizer microphone array

$$\begin{aligned} \beta _{m}(k)=\frac{1}{Qj_{|m|}(kr_M)Y_{|m|}^m}\sum _{q=1}^Q P(\varvec{x}_q,k)e^{-im\phi _q}, m=-M,\ldots ,M, \end{aligned}$$
(10)

where \(r_M\) denotes the microphone array radius. In vector form, the measured sound field coefficients are constituted by direct-path modes and reverberant-path modes

$$\begin{aligned} \varvec{\beta }=\varvec{\beta }^\mathbf{d } +\varvec{\beta }^\mathbf{r }, \end{aligned}$$
(11)

The modal domain sound field coefficients can also be expressed as the loudspeaker driving signals and the corresponding ATF coefficients

$$\begin{aligned} \varvec{\beta }=\mathbf {\Gamma }{} \mathbf{d} =\mathbf {\Gamma }^\mathbf{d }{} \mathbf{d} + \mathbf {\Gamma }^\mathbf{r }{} \mathbf{d} , \end{aligned}$$
(12)

where \(\mathbf {\Gamma }=[\varvec{\gamma }_1,\ldots ,\varvec{\gamma }_L]\), and \(\varvec{\gamma }_l=[\gamma _{|M|}^{-M}(l,k),\ldots ,\gamma _{|M|}^{M}(l,k)]^T\) which represent the ATF coefficients matrix. The ATF coefficients matrix is also decomposed into the direct path and reverberant path, that is \(\mathbf {\Gamma }=\mathbf {\Gamma }^\mathbf{d }+\mathbf {\Gamma }^\mathbf{r }\).

Then, the compensation problem is formulated as introducing the compensation signals \(\varvec{\delta }{} \mathbf{d} \) at the loudspeakers to minimize the reverberant-path modes, i.e.,

$$\begin{aligned} \min \Vert \varvec{\beta }^\mathbf{r }\Vert&= \Vert \varvec{\beta }-\varvec{\beta }^\mathbf{d }\Vert \nonumber \\&= \Vert \mathbf {\Gamma }(\mathbf{d} +\varvec{\delta }{} \mathbf{d} ) -\mathbf {\Gamma }^\mathbf{d }{} \mathbf{d} \Vert \nonumber \\&=\Vert (\mathbf {\Gamma }^\mathbf{d }+\mathbf {\Gamma }^\mathbf{r }) \varvec{\delta }{} \mathbf{d} +\mathbf {\Gamma }^\mathbf{r }{} \mathbf{d} \Vert \end{aligned}$$
(13)

Thus, by solving (13) in the least-squares approach, the compensation signals at the loudspeakers for eliminating reverberation is given by

$$\begin{aligned} \varvec{\delta }{} \mathbf{d} =- (\mathbf {\Gamma }^\mathbf{d }+\mathbf {\Gamma }^\mathbf{r })^{\dagger } (\varvec{\beta }- \mathbf {\Gamma }^\mathbf{d }{} \mathbf{d} ), \end{aligned}$$
(14)

where the measured sound field coefficients \(\varvec{\beta }\), the loudspeaker driving signals \(\mathbf{d} \) for direct path (or free-field) propagation \(\mathbf {\Gamma }^\mathbf{d }\) can be accessed straightforwardly. The reverberant path ATF coefficients are obtained in an adaptive manner as addressed in [9].

4 Evaluation

This section describes the simulation based experimental setup and performance of the proposed room compensation algorithm.

4.1 Simulation Results

Consider a 2D reverberant room of size 2 m \(\times \) 2 m. Simulate reverberant environment by image source method [14]: the wall reflection is 0.8 (the ceiling and floor are perfectly-absorbing) and the image depth of 7. Control region circled by equalizer microphone array is centred at (1 m, 1 m) with the radius of 0.25 m. The loudspeaker array is uniformly placed on the circle of 1 m radius. Note that the number of loudspeakers and microphones are both 39, which satisfies the mode truncation requirement \(M=\lceil {ekR/2}\rceil +1\) [13].

We simulate reproduction of two kinds of narrowband sources, i.e., plane wave and cylindrical wave, and a broadband source. For the narrowband cases, the plane wave source is of 3 kHz frequency, incident from \(\phi _v = \pi /3\) and the cylindrical wave source is of 3 kHz frequency, locating at \(r_v\) = 1.5 m, \(\phi _v = \pi /3\) away from the system center. For the broadband cases, the cylindrical wave source is of 3 kHz bandwidth: frequencies 100 Hz to 3 kHz, and the location is as same as narrowband cylindrical wave source. Figure 2 and Fig. 3 show the compensation performance of narrowband sources, where the desired, reverberant and compensated sound fields are displayed in (a), (b), (c) respectively. Compared with the reverberant sound fields in Fig. 2(b) and Fig. 3(b), compensation is achieved within the whole reproduction region.

Fig. 2.
figure 2

Reproduction of a plane wave (frequency at 3 kHz, incident from \(\phi _v = \pi /3\)) in a 0.25 m control region circled by the asterisk

Fig. 3.
figure 3

Reproduction of a cylindrical wave (frequency at 3 kHz, locating at \(r_v\) = 1.5 m, \(\phi _v = \pi /3\) away from the system center) in a 0.25 m control region circled by the asterisk

4.2 Evaluation of Results

The compensation performance is measured by the normalized equalization error \(\varepsilon \) over the whole control region. The normalized equalization error is defined as follows,

$$\begin{aligned} \varepsilon (w)= 10 \log _{10} \frac{{\int _R {{{\left| {P(\varvec{x},k) - {P_d}(\varvec{x},k)} \right| }^2}{d\varvec{x}}}}}{{\int _R {{{\left| {{P_d}(\varvec{x},k)} \right| }^2}{d\varvec{x}}}}} \end{aligned}$$
(15)

In this simulation, 40000 observation points are uniformly selected within the reproduction region to approximate the integral.

In broadband case, set the number of loudspeakers as 45, which is slightly more than the number of loudspeakers designed at the maximum frequency 3 kHz, other simulation settings are the same as in the examples of Fig. 2 and Fig. 3. In Fig. 4, it demonstrates the normalized equalization error over 3 kHz bandwidth, which shows that setting the number of the loudspeakers more than the minimum requirement of certain frequency can achieve satisfying results. As the frequency increases, the performance will gradually degrade.

Fig. 4.
figure 4

The normalized equalization error of cylindrical wave reproduction over 3 kHz bandwidth frequency range.

5 Conclusion

In this paper, we propose an active compensation algorithm for 2.5D sound field reproduction. Firstly, the sectorial mode matching method is used to derive the loudspeaker driving signals assuming direct-path propagation between the loudspeaker array and reproduction region. Then, based on sectorial mode, an active compensation algorithm is introduced to compensate room reverberation. From simulation results, it can proved that the proposed algorithm can achieve effective room compensation over the whole reproduction region.