Active Room Compensation for 2.5D Sound Field Reproduction

Chen, Yitong; Zhang, Wen

doi:10.1007/978-981-16-1649-5_9

Yitong Chen⁴⁰ &
Wen Zhang⁴⁰

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 761))

Included in the following conference series:

National Conference on Sound and Music Technology

441 Accesses

Abstract

Modelling secondary sources as 3D point sources to reproduce 2D desired sound field is named as 2.5D sound field reproduction, which has the intrinsic dimensionality mismatch problem. Existing methods for 2.5D reproduction have focused on solving the dimensionality mismatch problem and mostly considered free-field condition. However, in most cases, the reverberation caused by the listening room will degrade the reproduction performance. In this work, we propose an active room compensation strategy for 2.5D reproduction. Firstly, adopt sectorial mode matching algorithm to achieve 2.5D reproduction, the desired sound field and generated sound field are matched at the reproduction center. Secondly, the modal-domain algorithm is developed to estimate reverberant sound field and design the compensation signals of loudspeakers to compensate reverberant sound field. The proposed method is validated through simulation experiments to demonstrate its effectiveness against room reverberations.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Multizone Sound Field Reproduction Based on Equivalent Source Method

Article 28 March 2021

Sound Field Reproduction in Reverberant Room Using the Alternating Direction Method of Multipliers Based Lasso and Regularized Least-Square

Simplification of 3D Multichannel Sound System Based on Multizone Soundfield Reproduction

Keywords

1 Introduction

As psychoacoustics theory puts the fact that human ears are more sensitive to sounds from horizontal direction at the height of ears, it is more valuable to reproduce desired sound field on two-dimension field. Early studies regard it as 2D reproduction problem where secondary sources are modelled as vertical line sources [1]. However, due to the acoustic characteristics of the loudspeaker, modelling it as 3D point source is more reasonable in practice. Due to the intrinsic dimensionality mismatch, modelling secondary sources as 3D point sources to reproduce 2D sound field is named as 2.5D reproduction [2,3,4,5]. Most existing methods for 2.5D reproduction can make good performance in addressing dimensionality mismatch problem [2, 6]. However, these approaches focus on free-field condition while the reverberation caused by time-varying room responses will impair the generated sound field [1]. Thus, room compensation (or room equalization) has been proposed to reduce or eliminate the reverberant effects.

As for room compensation, current proposed methods can be divided into two ways: passive compensation manners and active manners. Passive compensation methods reduce wall reflections by acoustic absorption materials. Nevertheless, it gets costly and impractical especially at low frequencies in many real-world application scenarios. Thus, researchers put forward compensation methods by active manners. In order to equalize the effects of reverberation, adding appropriate compensation signals on the loudspeaker arrays is the core. Currently, most proposed room compensation techniques are based on Multiple Input Multiple Output (MIMO) system [7, 8], which can only reduce the effect of reverberation at discrete points and its adjacent region. Designing compensation signals using modal domain processing [9] achieves compensation within a continuous region. However, this has only been considered for 2D reproduction.

In this paper, we propose an active room compensation approach in 2.5D reproduction through modal domain processing. Section 2 reviews the 2.5D reproduction using sectorial mode matching algorithm, and we propose an active room compensation algorithm in Sect. 3. In Sect. 4, the proposed algorithm is simulated and evaluated by reproduction of narrowband and broadband signals under reverberant environment.

2 2.5D Reproduction

2.1 Problem Formulation

In modal-domain, we express the 2D desired sound field at any point $\varvec{x}=\left\{ r_x,\phi _x \right\} $ through the interior solution of wave equation [10]

$$\begin{aligned} P_d(\varvec{x},k)\approx \sum _{m = -M}^{M}\alpha _m(k)J_m(kr_x)e^{im\phi _x}, \end{aligned}$$

(1)

where k represents the wave number, $\alpha _m(k)$ represents sound field coefficients, $J_m(\cdot )$ represents cylindrical Bessel function. The truncation order is determined as $M=\lceil {ekR/2}\rceil $ [11] where R is the radius of control region.

Consider adopting circular loudspeaker array to generate the desired sound field exactly as (1) shows, where the array is implemented on horizontal plane, the number of loudspeakers has to satisfy $L \ge 2M+1$ [12].

Thus, reproduced sound field in free-field is formulated as

$$\begin{aligned} P(\varvec{x},k)=\sum _{l = 1}^{L}d_l(k)H_l(\varvec{x},k), \end{aligned}$$

(2)

where $H_l(\varvec{x},k)$ is acoustic transfer function (ATF) of the lth loudspeaker and the observation spot x, $d_l(k)$ is the driving signal of lth loudspeaker.

The expression of $H_l(\varvec{x},k)$ is as follows

$$\begin{aligned} H_l(\varvec{x},k)=\frac{e^{-ik \Vert \varvec{y}_l-\varvec{x}\Vert }}{4\pi \Vert \varvec{y}_l-\varvec{x}\Vert }, \end{aligned}$$

(3)

where $\varvec{y}_l=\left\{ r_l,\phi _l \right\} $ represents the loudspeaker location and $\Vert \cdot \Vert $ denotes the Euclidean distance of the vectors.

In modal domain, $H_l(\varvec{x},k)$ is formulated as

$$\begin{aligned} H_l(\varvec{x},k)\approx \sum _{m = -M}^{M}\sum _{n = \mid m \mid }^{M}\gamma _n^m(l,k)j_n(kr_x)Y_n^m(\frac{\pi }{2},\phi _x), \end{aligned}$$

(4)

where $\gamma _n^m(l,k)$ is the ATF coefficient of lth source. In free field condition,

$$ \gamma _n^m(l,k)=-ikh_n^{(2)}(kr_l)\overline{Y_n^m(\frac{\pi }{2},\phi _l}), $$

Substituting (4) into (2) gives the expression

$$\begin{aligned} P(\varvec{x},k)\approx \sum _{m = -M}^{M}\sum _{l = 1}^{L}d_l(k)\sum _{n = \mid m \mid }^{M}\gamma _n^m(l,k)j_n(kr_x)Y_n^me^{im\phi _x}, \end{aligned}$$

(5)

where the spherical harmonic function at elevation $\theta =\pi /2$ is defined as $Y_n^m(\frac{\pi }{2},\phi _x)=Y_n^m e^{im\phi _x}$, with $Y_n^m\triangleq A_n^m P_n^m(0)$, $A_n^m= \sqrt{\frac{2n+1}{4\pi }\frac{(n-\mid m \mid )!}{(n+\mid m \mid )!}}$ and $P_n^m(\cdot )$ represents associated Legendre function.

The loudspeaker driving signals are designed by equating (1) with (5). Using the orthogonality property of complex exponentials, it gives the following equation

$$\begin{aligned} \alpha _m(k)J_m(kr_x)=\sum _{l = 1}^{L}d_l(k)\underbrace{\sum _{n = \mid m \mid }^{M}\gamma _n^m(l,k)j_n(kr_x)Y_n^m}_{h_m(l,k,r_x)}, \end{aligned}$$

(6)

for $m=-M, ... ,M$. From (6), it shows that the expansion is only over mode m and the Bessel function $J_m(kr_x)$ denotes the radial propagation in 2D sound field, and the modal expansion is over both n and m and the spherical Bessel function $j_n(kr_x)$ denotes the radial propagation in 3D sound field, which reflects the dimensionality mismatching. The solution of this problem is finding appropriate matching distance $r_x$.

2.2 Sectorial Mode Matching

It has been proved [2] that adopting the center of reproduction region as the matching spot ($r_x=0$) can reach a relatively less distortion due to dimensionality mismatch. Note that when $kr_x \rightarrow 0$, the summation over order n in (4) reduces to the single term $n = |m |$ [2, 13] where $Y_{\mid m \mid }^m(\frac{\pi }{2},\phi _x)$ is termed as the sectorial harmonics, and $h_m(l,k,r_x)$ in (6) is expressed as

$$\begin{aligned} h_m(l,k,r_x)=\gamma _{\mid m \mid }^m(l,k)j_{\mid m \mid }(kr_x)Y_{\mid m \mid }^m, \end{aligned}$$

(7)

Thus, the driving signals $d_l(k)$ are derived by matrix

$$\begin{aligned} \mathbf{d} = \mathbf{H} ^{\dagger }{} \mathbf{b} , \end{aligned}$$

(8)

where

$$\begin{aligned} \mathbf{d} =\begin{bmatrix} d_{l_1}(k)&\cdots&d_{l_L}(k) \end{bmatrix}^T, \end{aligned}$$

$$\begin{aligned} \mathbf{H} ^{\dagger }=\begin{bmatrix} h_{-M}(l_1,k,r_x) &{} \dots &{} h_{-M}(l_L,k,r_x)\\ \vdots &{} \ddots &{} \vdots \\ h_{M}(l_1,k,r_x) &{} \dots &{} h_M(l_L,k,r_x) \end{bmatrix}^{\dagger }, \end{aligned}$$

$$\begin{aligned} \mathbf{b} =\begin{bmatrix} \alpha _{-M}(k)J_{-M}(kr_x)&\cdots&\alpha _{M}(k)J_{M}(kr_x) \end{bmatrix}^T. \end{aligned}$$

The dependence on wavenumber k is omitted for notation simplicity. Then, driving signals can be expressed simply as

$$\begin{aligned} \mathbf{d} =(\mathbf{H} ^\mathbf{d} )^{\dagger }{} \mathbf{b} , \end{aligned}$$

(9)

where $\mathbf{H} ^\mathbf{d} $ denotes the direct-path loudspeaker array ATF in modal domain.

3 Active Room Compensation

The previous section reviews sectorial mode matching algorithm for 2.5D reproduction. This section introduces the active room compensation algorithm for 2.5D reproduction in modal domain.

We use a circular microphone array of Q microphones to encircle the control region with the microphones uniformly placed. Figure 1 shows the system setup.

Note that the aim here is to compensate the reverberation caused by the loudspeaker signals. Based on Eq. (7) and given the fact that only the sectorial modes are controlled, derive the sound field coefficients measured by equalizer microphone array

$$\begin{aligned} \beta _{m}(k)=\frac{1}{Qj_{|m|}(kr_M)Y_{|m|}^m}\sum _{q=1}^Q P(\varvec{x}_q,k)e^{-im\phi _q}, m=-M,\ldots ,M, \end{aligned}$$

(10)

where $r_M$ denotes the microphone array radius. In vector form, the measured sound field coefficients are constituted by direct-path modes and reverberant-path modes

$$\begin{aligned} \varvec{\beta }=\varvec{\beta }^\mathbf{d } +\varvec{\beta }^\mathbf{r }, \end{aligned}$$

(11)

The modal domain sound field coefficients can also be expressed as the loudspeaker driving signals and the corresponding ATF coefficients

$$\begin{aligned} \varvec{\beta }=\mathbf {\Gamma }{} \mathbf{d} =\mathbf {\Gamma }^\mathbf{d }{} \mathbf{d} + \mathbf {\Gamma }^\mathbf{r }{} \mathbf{d} , \end{aligned}$$

(12)

where $\mathbf {\Gamma }=[\varvec{\gamma }_1,\ldots ,\varvec{\gamma }_L]$, and $\varvec{\gamma }_l=[\gamma _{|M|}^{-M}(l,k),\ldots ,\gamma _{|M|}^{M}(l,k)]^T$ which represent the ATF coefficients matrix. The ATF coefficients matrix is also decomposed into the direct path and reverberant path, that is $\mathbf {\Gamma }=\mathbf {\Gamma }^\mathbf{d }+\mathbf {\Gamma }^\mathbf{r }$.

Then, the compensation problem is formulated as introducing the compensation signals $\varvec{\delta }{} \mathbf{d} $ at the loudspeakers to minimize the reverberant-path modes, i.e.,

$$\begin{aligned} \min \Vert \varvec{\beta }^\mathbf{r }\Vert&= \Vert \varvec{\beta }-\varvec{\beta }^\mathbf{d }\Vert \nonumber \\&= \Vert \mathbf {\Gamma }(\mathbf{d} +\varvec{\delta }{} \mathbf{d} ) -\mathbf {\Gamma }^\mathbf{d }{} \mathbf{d} \Vert \nonumber \\&=\Vert (\mathbf {\Gamma }^\mathbf{d }+\mathbf {\Gamma }^\mathbf{r }) \varvec{\delta }{} \mathbf{d} +\mathbf {\Gamma }^\mathbf{r }{} \mathbf{d} \Vert \end{aligned}$$

(13)

Thus, by solving (13) in the least-squares approach, the compensation signals at the loudspeakers for eliminating reverberation is given by

$$\begin{aligned} \varvec{\delta }{} \mathbf{d} =- (\mathbf {\Gamma }^\mathbf{d }+\mathbf {\Gamma }^\mathbf{r })^{\dagger } (\varvec{\beta }- \mathbf {\Gamma }^\mathbf{d }{} \mathbf{d} ), \end{aligned}$$

(14)

where the measured sound field coefficients $\varvec{\beta }$, the loudspeaker driving signals $\mathbf{d} $ for direct path (or free-field) propagation $\mathbf {\Gamma }^\mathbf{d }$ can be accessed straightforwardly. The reverberant path ATF coefficients are obtained in an adaptive manner as addressed in [9].

4 Evaluation

This section describes the simulation based experimental setup and performance of the proposed room compensation algorithm.

4.1 Simulation Results

Consider a 2D reverberant room of size 2 m $\times $ 2 m. Simulate reverberant environment by image source method [14]: the wall reflection is 0.8 (the ceiling and floor are perfectly-absorbing) and the image depth of 7. Control region circled by equalizer microphone array is centred at (1 m, 1 m) with the radius of 0.25 m. The loudspeaker array is uniformly placed on the circle of 1 m radius. Note that the number of loudspeakers and microphones are both 39, which satisfies the mode truncation requirement $M=\lceil {ekR/2}\rceil +1$ [13].

We simulate reproduction of two kinds of narrowband sources, i.e., plane wave and cylindrical wave, and a broadband source. For the narrowband cases, the plane wave source is of 3 kHz frequency, incident from $\phi _v = \pi /3$ and the cylindrical wave source is of 3 kHz frequency, locating at $r_v$ = 1.5 m, $\phi _v = \pi /3$ away from the system center. For the broadband cases, the cylindrical wave source is of 3 kHz bandwidth: frequencies 100 Hz to 3 kHz, and the location is as same as narrowband cylindrical wave source. Figure 2 and Fig. 3 show the compensation performance of narrowband sources, where the desired, reverberant and compensated sound fields are displayed in (a), (b), (c) respectively. Compared with the reverberant sound fields in Fig. 2(b) and Fig. 3(b), compensation is achieved within the whole reproduction region.

4.2 Evaluation of Results

The compensation performance is measured by the normalized equalization error $\varepsilon $ over the whole control region. The normalized equalization error is defined as follows,

$$\begin{aligned} \varepsilon (w)= 10 \log _{10} \frac{{\int _R {{{\left| {P(\varvec{x},k) - {P_d}(\varvec{x},k)} \right| }^2}{d\varvec{x}}}}}{{\int _R {{{\left| {{P_d}(\varvec{x},k)} \right| }^2}{d\varvec{x}}}}} \end{aligned}$$

(15)

In this simulation, 40000 observation points are uniformly selected within the reproduction region to approximate the integral.

In broadband case, set the number of loudspeakers as 45, which is slightly more than the number of loudspeakers designed at the maximum frequency 3 kHz, other simulation settings are the same as in the examples of Fig. 2 and Fig. 3. In Fig. 4, it demonstrates the normalized equalization error over 3 kHz bandwidth, which shows that setting the number of the loudspeakers more than the minimum requirement of certain frequency can achieve satisfying results. As the frequency increases, the performance will gradually degrade.

5 Conclusion

In this paper, we propose an active compensation algorithm for 2.5D sound field reproduction. Firstly, the sectorial mode matching method is used to derive the loudspeaker driving signals assuming direct-path propagation between the loudspeaker array and reproduction region. Then, based on sectorial mode, an active compensation algorithm is introduced to compensate room reverberation. From simulation results, it can proved that the proposed algorithm can achieve effective room compensation over the whole reproduction region.

References

Betlehem, T., Abhayapala, T.D.: Theory and design of sound field reproduction in reverberant rooms. J. Acoust. Soc. Am. 117(4), 2100–2111 (2005). https://doi.org/10.1121/1.1863032
Article Google Scholar
Zhang, W., Abhayapala, T.D.: 2.5D sound field reproduction in higher order Ambisonics. In: 14th IEEE International Workshop on Acoustic Signal Enhancement, pp. 342-346. IEEE Press, Juan-les-Pins (2014). https://doi.org/10.1109/iwaenc.2014.6954315
Zhang, W., Zhang, J., Abhayapala, T.D., Zhang, L.: 2.5D multizone reproduction using weighted mode matching. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 476-480. IEEE Press, Calgary (2018). https://doi.org/10.1109/ICASSP.2018.8462511
Okamoto, T.: 2.5D higher order ambisonics for a sound field described by angular spectrum coefficients. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 326-330. IEEE Press, Shanghai (2016). https://doi.org/10.1109/ICASSP.2016.7471690
Okamoto, T.: Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays. In: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 960-964. IEEE Press, Brighton (2019). https://doi.org/10.1109/ICASSP.2019.8683009
Wang, W., Jia, M., Bao, C., Zhang, J.: 2.5D interior/exterior sound field reproduction and its extension to narrowband speech signals. In: 2016 IEEE International Conference on Audio, Language and Image Processing, pp. 7-12. IEEE Press, Shanghai (2016). https://doi.org/10.1109/ICALIP.2016.7846548
Brännmark, L.: Robust audio precompensation with probabilistic modeling of transfer function variability. In: 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 193-196. IEEE Press, New Paltz (2009). https://doi.org/10.1109/ASPAA.2009.5346533
Brännmark, L., Bahne, A., Ahlén, A.: Compensation of loudspeaker-room responses in a robust MIMO control framework. IEEE-ACM Trans. Audio Speech Lang. 21(6), 1201–1216 (2013). https://doi.org/10.1109/TASL.2013.2245650
Article MATH Google Scholar
Talagala, D.S., Zhang, W., Abhayapala, T.D.: Efficient multi-channel adaptive room compensation for spatial soundfield reproduction using a modal decomposition. IEEE-ACM Trans. Audio Speech Lang. 22(10), 1522–1532 (2014). https://doi.org/10.1109/TASLP.2014.2339195
Article Google Scholar
Williams, E.G.: Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography. Academic, New York (1999)
Google Scholar
Kennedy, R.A., Sadeghi, P., Abhayapala, T.D., Jones, H.M.: Intrinsic limits of dimensionality and richness in random multipath fields. IEEE Trans. Signal Process. 55(6), 2542–2556 (2007). https://doi.org/10.1109/TSP.2007.893738
Article MathSciNet MATH Google Scholar
Ward, D.B., Abhayapala, T.D.: Reproduction of a plane-wave sound field using an array of loudspeakers. IEEE-ACM Trans. Audio Speech Lang. 9(6), 697–707 (2001). https://doi.org/10.1109/89.943347
Article Google Scholar
Poletti, M.A., Betlehem, T., Abhayapala, T.D.: Analysis of 2D sound reproduction with fixed-directivity loudspeakers. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 377-380. IEEE Press, Kyoto (2012). https://doi.org/10.1109/ICASSP.2012.6287895
Allen, J.B., Berkley, D.A.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943–950 (1979). https://doi.org/10.1121/1.2003643
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, Xi’an, 710072, China
Yitong Chen & Wen Zhang

Authors

Yitong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Wen Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Xi Shao
The University of Tokyo, Bunkyo-ku, Tokyo, Japan
Kun Qian
China University of Geosciences, Wuhan, Hubei, China
Li Zhou
Communication University of China, Beijing, China
Xin Wang
Tianjin Normal University, Tianjin, China
Ziping Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, Y., Zhang, W. (2021). Active Room Compensation for 2.5D Sound Field Reproduction. In: Shao, X., Qian, K., Zhou, L., Wang, X., Zhao, Z. (eds) Proceedings of the 8th Conference on Sound and Music Technology . CSMT 2020. Lecture Notes in Electrical Engineering, vol 761. Springer, Singapore. https://doi.org/10.1007/978-981-16-1649-5_9

Download citation

DOI: https://doi.org/10.1007/978-981-16-1649-5_9
Published: 25 April 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-1648-8
Online ISBN: 978-981-16-1649-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics