Efficient Computational Techniques for Evaluating Distance-Dependent Head-Related Transfer Functions

Kailas, Ganesh; Tiwari, Nachiketa

doi:10.1007/s40857-022-00263-8

Efficient Computational Techniques for Evaluating Distance-Dependent Head-Related Transfer Functions

Original Paper
Published: 23 February 2022

Volume 50, pages 231–245, (2022)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Acoustics Australia Aims and scope Submit manuscript

Efficient Computational Techniques for Evaluating Distance-Dependent Head-Related Transfer Functions

Download PDF

1910 Accesses
Explore all metrics

Abstract

This work proposes and validates two computational tools for synthesizing distance-dependent head-related transfer function (HRTF), which is vital in spatial sound reproduction. HRTF is an anthropometric feature-dependent function that yields the direction-dependent gain of the auditory system. Even though it is subject to the distance of the auditory source, distance-dependent HRTF measurement is rare due to its high experimental cost. Numerical simulation tools can provide viable alternatives. The required computational resources and time increase exponentially with the frequencies and degree of freedom (DoF) of the simulations; still, it is faster than experimental procedures. This work proposes finite element computational solutions to measure distance-dependent HRTFs using domain truncation methods in association with frequency-dependent adaptive meshing. Two hybrid techniques to find HRTF in the entire region, employing infinite elements (IEs) and non-reflective boundary conditions (NRBCs) with near-field to far-field transformation techniques, have been implemented and analyzed. The proposed methods calculate distance-dependent HRTF in 0.2–20 kHz frequency band, with reduced computational cost and time. Additionally, the spatial resolution of the HRTF measurement has increased a 100-fold. Since locally connected finite elements are used, the near-field effects of HRTF are well incorporated, and the obtained HRTF matches well with the experimental results. The proposed tools can also calculate sufficiently accurate HRTFs even when the surface meshes are of reduced quality. The tools also possess the versatility in effortlessly integrating appropriate bioacoustic attributes (e.g., internal reflection of the middle ear walls) into HRTF numerical models, which is noteworthy.

Loudness stability of binaural sound with spherical harmonic representation of sparse head-related transfer functions

Article Open access 15 March 2019

Interaural time difference individualization in HRTF by scaling through anthropometric parameters

Article Open access 12 May 2022

Auralization

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The extended reality (XR) industry has seen a tremendous boom in the past 8 years, with its frontiers stretching into various domains and applications. Spatial audio is a fundamental ingredient of immersive virtual reality scenes, which provide spatial information of auditory events [1, 2]. Virtual spatial audio can be synthesized using perceptual or physical methods. The three-dimensional auditory display has been recreated through earphones in perceptual spatial audio, while physical methods utilize loudspeakers placed at multiplanar locations. Most commonly, perceptual spatial audio is generated using head-related transfer functions (HRTFs).

HRTFs define how human anthropometric features transform the sound waves from different spatial locations while reaching the ear. Generally, it is measured as the alteration in sound pressure when audio waves travel from a certain point to the ear canal. In most of the studies, the directional dependence of HRTF was given more emphasis, while the distance dependence was overlooked [3,4,5,6]. Thus, predominantly HRTF was measured at different points on an imaginary spherical surface with the listener at the center. However, due to the nonlinear distance characteristics of hearing, the HRTF in the entire spherical volume has attracted much attention from the research community. A series of works on close-range sound perception and associated nonlinear complexities was reported in [7, 8]. Incorporating proximity region auditory effects might improve the plausibility of virtual auditory scenes while recreating moving and close-range auditory sources [9, 10]. The synthesis of the distance-dependent HRTF can play a vital role here. In this article, the following definitions have been used to denote the distance-dependent spherical regions with the listener as the center: (i) proximity region: radius within one meter, (ii) distal region: radius more than one meter, (iii) near-field region: radius within one wavelength, and (iv) far-field region: radius more than one wavelength. Please note that proximity and distal regions do not depend on sound source frequency, while near and far-fields depend.

Conventionally, experimental methods are employed to measure the HRTF [4], and the procedures are very tedious and require considerable human endeavor. Most of the available HRTF databases are distance-independent, owing to the substantial experimental cost of measuring high-resolution HRTF in the entire spherical volume around the listener [3,4,5,6, 11, 12]. However, it must be measured in proximity regions and distal regions to understand and incorporate the distance dependency of HRTF [13]. While measuring distance-dependent HRTF, the massive number of measuring positions and poor directivity of loudspeakers in close ranges aggravate the intricacy of the experimental procedure. Some experimental calculations of distance-dependent HRTF have been reported using tiny sound sources like micro-dodecahedral loudspeakers, spark noise, or spark gap [14,15,16,17]. However, in all these works, the process was very demanding, expensive, and required hours of human endurance. The computational solutions are feasible substitutes for measuring high-resolution distance-dependent HRTF. The numerical tools can be convenient considering the advancement in data processing power and solving methods in past decades. An early computational attempt to solve the HRTF problem at a fixed distance was reported using the boundary element method (BEM) on scanned geometric models [18, 19]. Poor speed and insufficient accuracy at very high frequencies were the main disadvantages of these simulations. Irregular mathematical errors were also reported in BEM solutions, and additional algorithms were incorporated to eliminate them [20]. It gets trickier with larger geometrical models at higher frequencies. Moreover, the implementation of BEM for assessing the HRTF in whole spherical volume can be even more complicated due to the absence of local interconnectivity of elements in BEM [21].

This study proposes two computational solutions incorporating finite element tools and exterior acoustic techniques to measure high-quality distance-dependent HRTF. The major challenge associated with the finite element method (FEM) is the high computational resource for meshing the whole acoustic domain at high frequencies, maintaining discretization requirements [22]. While measuring distance-dependent HRTF, the massive volume of the finite domain must be meshed to accommodate the entire space around the listener. In the proposed simulation tools, finite domain volume has been limited and combined with exterior acoustic domains employing techniques of infinite elements (IFE) or non-reflective boundary conditions (NRBCs) using absorbing layers. The truncated bounded region is meshed adaptively with frequency for optimal computational performance. The evaluated high-resolution HRTFs have been compared with experimental data and BEM solutions. The proposed methods also enable the effortless incorporation of bioacoustic properties into the computational models of HRTF. Appropriate middle ear attributes like ear canal absorption coefficients make computed HRTF more congruent with the experimental data.

2 Hybrid Computational Methods: Background, Theory, and Formulation

As discussed in the introduction, HRTF has generally been assessed at a fixed boundary surface from the listener. Hence, the boundary element method (BEM) is the conventional simulation technique to evaluate HRTF as the formulation usually involves discretization of nothing other than domain boundaries [23]. Thus, fewer equations are involved in BEM due to the reduction of the problem’s dimensionality. But the inherent nonlocal connectivity of elements in BEM formulations usually gives less structured and fully populated matrices that reduce the expected efficiency [21]. In the beginning, regular BEM was employed in HRTF measurement with massive computational time, even up to 50 days for narrow bandwidths [18]. Later, the speed of BEM was increased to a certain extent by accommodating fast multipole methods [24,25,26]. However, its limitation in addressing the acoustic problems that require the evaluation of volumetric fields, such as distance-dependent HRTF, has not been tackled well. BEM’s computational cost and storage requirements for sizeable exterior domain problems are enormous and sporadically provide non-unique solutions at some frequencies [23]. Moreover, incorporating appropriate acoustic attributes of the ear canal, hair, skin, and cloths of the listener into HRTF is complicated using BEM owing to the lack of local interconnectivity. Hence the extension of the integral boundary solution to the whole acoustic volume, which is required for evaluating distance-dependent HRTF, is not straightforward.

On the other hand, the finite element method (FEM) can be effectively implemented for large volumetric fields as a common algebraic eigenvalue problem. The numerical advantage of having sparse matrices substantially accelerates the computations and reduces the memory requirement in FEM. Due to this computational edge, FEM is definitively competitive with BEM even with its higher-order formulations. Additionally, FEM can provide more accurate solutions due to inherent local connectivity [27]. The advantages of FEM over BEM in these predicaments have been well described in the literature [21, 27].

However, efficiently modeling a large acoustic volume in an unbounded space is a critical challenge in FEM. Various exterior acoustic techniques should be incorporated with FEM to handle this. It is vital to prevent spurious reflections at boundaries when transforming FEM formulation from unbounded space to bounded domain. Otherwise, it may pollute the whole solution. Infinite elements and non-reflective absorbing layers can be employed proficiently at the finite region (bounded domain) boundary for this purpose. In the infinite element method, the solutions in the exterior domain have been directly given by the infinite nodes at the finite region boundary using its shape functions. The finite meshes can also be truncated by non-reflective absorbing layers satisfying Sommerfeld conditions. Consequently, far-field estimation techniques such as Ffowcs Williams Hawkings (FWH) method are applied to estimate HRTF in the far-field. Brief theoretical formulations of the proposed techniques are described in the coming sections.

2.1 Domains for Finite Element Formulation

Consider a scattering object of arbitrary shape $\mathcal{H}$ with surface ${S}_{\mathcal{H}}$ in an unbounded domain $\mathcal{U}$ as shown in Fig. 1.a. The problem is governed by the Helmholtz differential equations [27, 28]. Additionally, Sommerfeld radiation condition must be satisfied, which means there are only outgoing waves at infinity. The problem can be formulated as,

$$ \nabla^{{2}} p + k^{2} p = 0\;{\text{in}}\;{\mathcal{U}} = {\text{R}}^{3} {\backslash \mathcal{H}} $$

(1)

$$ \frac{\partial p}{{\partial n}} + \beta p = g\;{\text{on}}\;S_{{\mathcal{H}}} $$

(2)

$$ \mathop {\lim }\limits_{r \to \infty } r\left( {\frac{\partial p}{{\partial r}} - ikp} \right) = 0 $$

(3)

where $k$ is the wavenumber; $r= \| x\| $ where $x$ is the radial distance from the sound source; $p = {\mathrm{e}}^{-i\omega t}$ is the acoustic pressure with $\omega $ as natural frequency;$\frac{\partial p}{{\partial n}}: = \nabla p$ where $n$ is the gradient in the outward direction perpendicular to ${S}_{\mathcal{H}}$; $\beta \left(x;k\right), g\left(x;k\right)$ are frequency-dependent complex boundary information functions [27].

The FEM formulation of a huge unbounded finite volume requires impractical computational capacity. To reduce the unbounded region's volume and consequently curtail the meshing load and computational cost, it is necessary to divide the unbounded regions into bounded ($\mathcal{F}$) and external regions ($\mathcal{E}$).

$${\mathcal{U}} = {\mathcal{F}} + \mathcal{E} .$$

(4)

The bounded region is modeled with finite elements, and the bounded and external regions are divided by an artificial boundary ${S}_{\mathcal{F}}$ as shown in Fig. 1.b. The solutions at finite region ($\mathcal{F}$) can be evaluated using FEM, and the solution at different points in the external region ($\mathcal{E}$) is estimated through the far-field expansion of the solution at the surface ${S}_{\mathcal{F}}$.

2.2 Boundary Formulation

There are different approaches to model the artificial boundary ${S}_{\mathcal{F}}$ and exterior regions ($\mathcal{E}$). Finite region truncating tools like infinite elements and non-reflecting boundary conditions using absorbing layers are illustrated in [27]. These techniques in combination with finite elements for bounded volume can be formulated as described in the next section.

2.2.1 Finite Elements with Infinite Elements Method (FIEM)

In the infinite element method, a single convex surface ${S}_{\mathcal{F}}$ is placed at the boundary of the finite region ($\mathcal{F}$) with outer layers extended till infinity, as shown in Fig. 2. The infinite element method for wave problems was established in [29]. The external domain ($\mathcal{E}$) is discretized using a collection of infinite elements. Each node in the boundary is attached to an infinite element in $\mathcal{E}$. The methods for matching the regions $\mathcal{F}$ and $\mathcal{E}$ are well explained in [30]. The finite region field has been evaluated using the finite element method and induced on the surface ${S}_{\mathcal{F}}$. Assume the sound pressure $p\left(x,k\right)$ follows Helmholtz equation (1), with boundary conditions (5) and (6).

$$ \nabla p \cdot {\text{n}}_{{\mathcal{F}}} = - \rho a\left( {\theta ,\phi } \right). $$

(5)

Equation (5) is the kinematic condition on $\mathcal{F}$ for a steady time-harmonic normal acceleration, $a\left(\theta ,\phi \right){e}^{-i\omega t}$.

$$ \nabla p \cdot n_{E} = ikp + \eta $$

(6)

where $\eta = \mathrm{O}\left(1/{X}^{2}\right)$ as $X$ approaches infinity with Sommerfeld radiation condition.

A trial solution can be developed using variational formulation and discretization in the $\mathcal{E}$ domain, as given by (7),

$$ p\left( {x,k} \right) = \mathop \sum \limits_{\mu = 1}^{m} \mathop \sum \limits_{\nu = 1}^{n} q_{\mu \nu } f_{\nu } \left( {r,k} \right)g_{\mu } \left( {\theta ,\phi } \right) $$

(7)

where ${g}_{\mu }\left(\theta ,\phi \right)$ is global shape function of finite region FEM solution on the surface ${S}_{\mathcal{F}}$; ${f}_{\nu }\left(r,k\right)$ is the radial interpolation function, ${q}_{\mu \nu }$ gives the nodal coefficient values of the pressure at corresponding nodes, i.e., $\upnu $ ^th node on a radial path extended from $\mu $ ^th node on the surface ${S}_{\mathcal{F}}$ as shown in Fig. 2 [29].

The infinite element formulation is a function of material properties, interpolation order, and the coordinate system. The sufficient convergence condition for IFE to work adequately is that the finite domain $\mathcal{F}$ and sound sources should be enclosed within the surface ${S}_{\mathcal{F}}$. Interpolation order is also an important criterion for accurate simulations.

2.2.2 Finite Elements with Absorbing Layers Method (FALM)

Another technique for transforming the unbounded problem into a bounded problem is implementing non-reflective boundary conditions (NRBCs) through absorbing layers. Perfectly matched layers (PMLs) belong to the family of absorbing layers with NRBCs. The PML was first developed in the domain of electromagnetics and later widely modified for acoustic problems [31,32,33,34,35]. PMLs do not reflect any wave regardless of its angle of incidence, which gives it an extra edge over infinite elements. The reflectionless characteristics may provide better accuracy in PML based methods. It has been reported in earlier works that PML may provide satisfactory results even if the truncation of the finite region is in the near-field region and can accommodate non-homogeneous situations [34].

In PML, an exterior layer ($\mathcal{L}$) of finite thickness has been introduced at an external boundary ${S}_{\mathcal{F}}$ of the finite domain ($\mathcal{F}$), as shown in Fig. 3. Consequently, the waves are truncated by the finite-absorbing layer using complex variable change, also known as the stretching process. The distance and direction of the stretching should be computed for modeling the stretching function. The PML can be defined in all coordinate systems. The wave equation has to be modified with absorbing material properties to implement PML. A detailed formulation can be found in the literature [27, 32, 35], and the wave equation can be revised as,

$$ \nabla \cdot \left( {D\nabla p} \right) + k^{2} sp = 0\; {\text{in}}\; {\mathcal{L}} $$

(8)

$$ s = s_{1} s_{2} s_{3} $$

(9)

$$ D = \left( {\begin{array}{*{20}c} {s_{2} s_{3} /s_{1} } & 0 & 0 \\ 0 & {s_{1} s_{3} /s_{2} } & 0 \\ 0 & 0 & {s_{1} s_{2} /s_{3} } \\ \end{array} } \right) $$

(10)

where $D$ is a complex-valued material tensor with coefficients ${s}_{i}\left({x}_{i}\right)=1-\left(i{\sigma }_{i}\right)/k$ whereas ${\sigma }_{i}\left({x}_{i}\right), i = \mathrm{1,2},3\dots $ are absorption functions.

Conventionally, the value of absorption function gradually increases inside the layer $\mathcal{L}$ toward the outward direction. The PML equation and absorption functions are fully compatible with finite element data structures and can be easily incorporated into the FEM tools. In addition to PML, external region field values should be measured using near-to-far field estimation techniques; in this work, Ffowcs Williams Hawkings (FWH) technique is employed.

FWH technique can be applied to predict far-field pressure generated by the distributed volume source induced by the finite element domain. It can be considered as an extension of the Kirchhoff problem. The essential character of the Kirchhoff problem is finding an expression for wavefield from the given surface boundary conditions, and FWH is an advanced version of this. The theoretical concept of FWH is well described in the literature [36]. The fundamental FWH formulation has the impermeability condition that the waves should not pass through the surface on which FWH equation is applied. However, if the equation is extended by relaxing the impermeability of the surface, FWH equations could be used to measure the far-field degree of freedom (DoF) from the near-field estimations [37, 38]. For that case, FWH formulation can be applied to an arbitrary and imaginary mathematical surface that divides the domain into the near-field and the far-field. Then the field value on the imaginary surface is the only required parameter for far-field calculations. A complete derivation of the FWH equation used for near-field to far-field estimation can be found in [37,38,39,40], and formulation can be given as,

$$ { \boxdot }^{2} p^{\prime} = \left( {\partial /\partial t} \right)\left\{ {\left[ {\rho u_{n} - \left( {\rho - \rho_{0} } \right)v_{n} } \right]\delta \left( f \right)} \right\} $$

(11)

$$ - \left( {\partial /\partial x_{i} } \right)\left\{ {\left[ {\rho \left( {u_{n} - v_{n} } \right)u_{i} + pn_{i} } \right]\delta \left( f \right)} \right\} + \left( {\overline{\partial }^{2} /\partial x_{i} \partial x_{j} } \right)\left[ {T_{ij} {\text{H}}\left( f \right)} \right] $$

(12)

where wave operator ${ \boxdot }^{2} = \left[ {\left( {{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 {c^{2} }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${c^{2} }$}}} \right)\left( {{\raise0.7ex\hbox{${\partial^{2} }$} \!\mathord{\left/ {\vphantom {{\partial^{2} } {\partial t^{2} }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${\partial t^{2} }$}}} \right)} \right] - \nabla^{2} ,\;p^{\prime} = \left( {\rho - \rho_{0} } \right)c^{2}$ $\rho$ is the density, $c$ is the speed of sound, ${u}_{n},{v}_{n}$ are velocities of normal fluid and surface, ${T}_{ij}$ is Lighthill stress tensor, $p$ is surface pressure on $f$= 0, $H(f)$ is Heaviside function and $\overline{\partial }/\partial $ is generalized differentiation with ${p}^{\mathrm{^{\prime}}} = {p}^{\mathrm{^{\prime}}}$ outside $f=0$ and ${p}^{\mathrm{^{\prime}}}=0$ inside $f=0$. The acoustic pressure at different locations in the entire external region ($\mathcal{E}$) can be evaluated from sound pressure fields on the boundary surface (${S}_{\mathcal{F}}$) using the FWH formulation.

3 Implementation Using Adaptive Frequency Meshing

The two FEM-based methods (FIEM & FALM) are simulated to evaluate HRTF and validate the accuracy and performance compared to other HRTF data. For validating the proposed methods in the distal region, FIEM and FALM are implemented using surface meshes of the human upper body provided by the SYMARE database (Sydney-York Morphological And Recording of Ears Database) [4, 41]. The measured and BEM synthesized HRTFs along with corresponding scanned surface meshes of the subjects are available in the SYMARE depository. However, due to the dearth of proximity region HRTF data with corresponding surface meshes, proximity region experimental HRTF were measured using manikin created for an Indian subject. These experimental measurements were then compared to HRTFs computed by FIEM and FALM.

3.1 3D Modeling of Human Head and Experiment Preparations

The computational analysis of HRTF requires high-quality surface meshes of the human upper body. The surface meshes of the subject’s head are generated using a handheld Artec 3D space spider scanner. A similar capturing process was reported in [42]. The maximum scanning rate of the spider scanner is 15 frames per second, and it produces a high-grade scan. The Artec scanner’s software tool uses a 'global registration algorithm,’ which automatically aligns multiple 3D scan data to create an initial mesh [43]. Later, Meshlab [44], an open-source tool, is employed to improve the mesh without missing any essential details. Some inner parts of the pinna are very arduous to scan, and they are approximated. Mesh resolution and quality matrices are assessed using Meshlab tools [44]. The quality and resolution mapping tool, Per face quality, is used to visualize two standard mesh quality criteria [45] (i) ratio of the triangles’ area and length of the largest side (as shown in Fig. 4a) and (ii) ratio of inscribed and circumscribed ball radii (as shown in Fig. 4b). Both parameters were of high quality, as illustrated in Fig. 4 and, indicate that the mesh has the high resolution required for computational modeling. The scanned head model is first attached to a generalized torso mesh, as shown in Fig. 5.a. Then, the combined model is used to measure distance-dependent HRTF using FIEM and FALM.

For measuring HRTF through experiments, a physical dummy model is 3D printed from the scanned surface mesh. The dummy is printed as two different parts—the ears with acrylonitrile butadiene styrene (ABS) and the remaining head with polystyrene foam. The two ears and head are then attached to a generalized torso made of fiberglass, as shown in Fig. 5b.

A low-cost HRTF experiment setup was constructed, as shown in Fig. 5.b. The measurements were carried out in a fully anechoic chamber (5 m × 5 m × 3 m) with a cutoff frequency of 200 Hz and noise rejection ratio (with respect to outside) of 65 dB. The reverberation time of the room (T₆₀) at 1600 Hz was evaluated to be 250 ms. The speakers (Sony SRS-XB10/BC) were fixed equiangularly (18°) on a rotatable arc. The distance from the listener to the audio source can be adjusted using arcs of different radii. The pre-polarized microphones (PCB Piezotronics 130F20 Preamplifiers) were placed in the ear canal opening of the dummy. The center of the head was aligned to the center of the arc, viz., the origin. HRTF was measured using logarithmic sine sweep signals. Sine sweep signals with frequencies ranging from 0.2 kHz to 20 kHz were used for measurements. A longer source signal (32 k) was chosen to improve the signal-to-noise ratio. Signal acquisition was executed with NI 9234 sound and vibration input module at a sampling rate of 51,200. The signal acquisition was repeated twelve times to reduce the noise level and increase the consistency. A rectangular window of 32 k was applied to the measured signal. HRTF calculation was carried out using fast Fourier transform with an nfft length of 32,768. HRTFs were calculated using (13). Sound pressure levels (SPLs) were taken at four different distances ($r$ = 25 cm, 50 cm, 75 cm, and 100 cm) from the origin in various directions. The post-processing was performed using Matlab™. The SPL measured at the center when the dummy was absent has been used as the reference measurement.

$$ {\text{HRTF}}\left( {\theta ,\phi ,r,\omega } \right) = \frac{{{\text{Sound Pressure}}_{ear} \left( {\theta ,\phi ,r,\omega } \right){ }}}{{{\text{Sound Pressure}}_{{{\text{r}}e{\text{f}}}} \left( {\theta ,\phi ,r,\omega } \right)}}. $$

(13)

3.2 Numerical Calculations

The acoustic principle of reciprocity is used for the computational process. It asserts that acoustic source and microphone locations in HRTF measurement can be swapped, and the proof of the theorem is already established in the literature [46]. Hence, the acoustic source is placed near the ear canal, and sound pressures are measured at different locations in the spherical volume around the listener. This technique facilitates numerical computation by reducing the number of acoustic sources into one.

The element size is a significant concern in any numerical computation, and six elements have been chosen per wavelength in the FEM simulations. Without truncating finite domains, the implementation of FEM is almost impossible for audible range simulations. Above 5 kHz, FEM simulations require enormous computational resources and impractical time, as seen in Table 1. Therefore, the viable simulation methods, (i) finite elements with infinite elements method (FIEM) and (ii) finite elements with absorbing layers method (FALM), were performed for the 3D scanned surface mesh of Indian subject and SYMARE depository meshes [4, 41]. Although we could reduce the mesh volume of the finite domain using FIEM and FALM, the number of elements can still be huge at higher frequencies owing to the minimum volume of the meshes required to avoid truncation within the near-field.

Table 1 Computation time of different finite element simulations

Full size table

3.3 Adaptive Frequency Meshing

It is always better to place truncating boundary surface just outside the near-field of the sound source for accurate results by capturing all near-field effects and reflections from different anthropometric features of the listener. At higher frequencies, mesh volume should be reduced as much as possible for faster simulation. In FALM, the presence of the PML mesh slightly raises the computational cost. A frequency-based adaptive meshing is an optimal approach to reduce the volume of the finite meshes without trading off the accuracy. For tackling the challenge of evaluating the distance-dependent HRTF in the entire audible range with limited computational resources, the mesh volume and element size must be optimized for each frequency band. Hence, 0.2–20 kHz frequency spectra are divided into different bands. The lowest frequency of each band decides the volume of the meshes. The optimal element size is determined by the highest frequency in each band. The mesh thickness, defined as the minimum radial distance between exterior and interior boundaries of the mesh, is the major component in determining the volume of adaptive finite and PML domains. One wavelength thickness has been employed as a rule of thumb for creating each frequency band. Adaptive frequency meshing of PML and finite region in each frequency band is visualized in Fig. 6. As described above, the mesh volume of the finite region and PML decreases, and element size increases when frequency increases (from band 1 to band $n$). FIEM is employed with an interpolation factor of five to evaluate radial functions, as discussed in Sect. 2.2.1.

In the initial acoustic simulations, the human body is considered an acoustically rigid model (absorption factor is zero). The monopole source is placed at the ear canal opening, and sound pressure at different locations in proximity and distal regions is evaluated using FIEM and FALM. The reference HRTF simulations are carried out by placing the source at the origin without a 3D model. Simulations are performed using Actran™ [47], and later, results are post-processed using Matlab™. For understanding the mesh quality dependence of the proposed methods, a reduced quality (doubled the maximum element size) surface mesh was created in Meshlab, and simulations were repeated.

4 Results and Discussion

The HRTF spectral information like peaks and notches are significant and generally considered as the cues for median plane localization [48]. The relative position of frequency components is important in comparing various HRTF data [49]. For effectively analyzing the spectral distributions of different HRTFs, three analytical expressions—frequency scaling difference (FSD), spatial correlation metric (SCM), and spatial magnitude difference (SMD)—were determined during the post-processing of the results. SCM is measured as the mean of the correlation of frequency responses of two different functions over the whole spatial region. SMD is the difference in magnitude of the frequency responses of two different functions over the spatial domain. Consider two HRTF data, ${HRTF}^{(1)}$ and ${HRTF}^{(2)}$. SCM and SMD of these data sets can be calculated using Eqs. (14) and (15), respectively.

$$ \begin{aligned} {\text{SCM}} & = \frac{1}{N}\sum\limits_{{j = 1}}^{N} {\frac{1}{M}} \\ & \quad \sum\limits_{{i = 1}}^{M} {\frac{{\left( {{\text{HRTF}}_{i}^{{\left( 1 \right)}} - \overline{{{\text{HRTF}}^{{\left( 1 \right)}} }} } \right)\left( {({\text{HRTF}}_{i}^{{\left( 2 \right)}} - \overline{{{\text{HRTF}}^{{\left( 2 \right)}} }} } \right)}}{{\sigma _{{{\text{HRTF}}^{{\left( 1 \right)}} }} \sigma _{{{\text{HRTF}}^{{\left( 2 \right)}} }} }}} \\ \end{aligned} $$

(14)

$$ {\text{SMD}} = \left| {{\text{HRTF}}^{\left( 1 \right)} } \right| - \left| {{\text{HRTF}}^{\left( 2 \right)} } \right| $$

(15)

where $N$ is the number of spatial data points, $M$ is the number of spectral data points, and $\sigma $ is the standard deviation.

Frequency scaling difference is a measure of similarity between two HRTF spectra, and its measurement is well described in [49]. FSD value provides the amount of frequency scaling that must be applied to an HRTF to give the best spatial and spectral correlation with another HRTF. FSD value closer to 1.0 gives better correlation [49]. For example, an FSD value of 0.99 or 1.01 between two functions indicates the same level of correlation and would mean that they are better correlated than two other functions with an FSD value of 0.9 or 1.1.

The azimuthal angles of 0°, − 90°, 180°, and 90° represent the acoustic sources in the front, to the right, in the back, and to the left, of the listener, respectively. The elevation angles of 0°, 90°, and − 90° represent the sources on the line-of-sight plane, at the top and the bottom of the listener, respectively. The HRTF was not measured in the region below − 45° elevation angle, and it was linearly interpolated for visualizations.

4.1 Comparison with the SYMARE Database Measurements with Proposed Methods in the Distal Region

BEM-based HRTF and experimental HRTF from the SYMARE database [4, 41] were compared with HRTFs simulated by FIEM and FALM. In most spatial directions, results were congruent and matched well with BEM results, as shown in Fig. 7. The spectral shape of the experimental data also matches well with computational results, but a difference in amplitude is observed. The computational methods adequately captured the distribution pattern of peaks and notches in the HRTF spectrum. The FSD was evaluated as 1.023, 1.026, and 1.032 for FALM, FIEM, and BEM, respectively, for subject-01 of the SYMARE database. It indicates slightly better congruity between experimental data and calculated HRTFs by proposed methods than BEM, mainly owing to the advantage of FEM in capturing near-field effects. Additionally, FALM and FIEM provided slightly better SCM of 0.8768 and 0.8667 than BEM’s 0.8528. Similar trends were also observed for other subjects in the SYMARE depository. Good frequency correlation (0.85–0.9) between experimental results and numerical methods has also been achieved, as seen in Fig. 8.

The numerical results showed more peaks at higher frequencies and higher amplitude in whole spectra compared to the experimental results. The overestimation of magnitude in all numerical measurements compared to SYMARE’s experimental HRTF (measured directly with human subjects) might be due to the acoustic impedance effects of the subject's skin, hair, and clothes in experiments, as suggested in earlier studies [50]. The authors presume that the acoustic reflections from the ear canal walls in the simulation are significant contributors to this difference.

For inferring the contribution of middle ear reflections to the overestimation of magnitude of simulated data, computations are repeated incorporating specific ear canal properties. The simulations can exempt most of the sound reflections occurring inside the middle ear based on the following assumptions: (i) the middle ear absorbs the sound pressure waves as vibrations during the hearing, and (ii) experimental HRTF is measured at the ear canal opening; hence it does not include middle ear reflections. Based on this rationale, the inner part of the ear canal wall is modeled with a high absorption coefficient in simulations. Hence, a frequency-independent absorption factor of 0.8 has been included for the ear canal walls, and other parts remained acoustically rigid in the simulations.

The integration of the middle ear model improved the congruity of the spectral components of simulated results with the experimental results, as shown in Fig. 9. Now, the amplitude of synthesized HRTF at mid-high frequencies is in the same magnitude levels as experimental HRTF. It has also minimized the extra peaks in higher frequencies which did not appear in experimental results. FSD and SCM are also improved to 1.011 and 1.018 for FALM and FIEM with ear canal modeling. The SMD between hybrid methods and experimental HRTF shows a better match than the SMD between BEM and experiment results, as shown in Fig. 10. The SMD greater than 10 dB is only present at certain locations compared to almost the entire spatial region for BEM.

Spatial frequency response surfaces (SFRS) [51] have been created for computational and experimental HRTFs, as shown in Fig. 11. FALM and FIEM show good agreement with experimental results, especially in the frequency regions below 12 kHz. At higher frequencies, especially in contralateral locations, even experimental data have higher noise levels and might contribute to the differences in SFRS. Most of the peaks and notches are well captured by FALM and FIEM. The magnitude level is also in good agreement with BEM, owing to middle ear modeling. It implies that these methods can be used as accurate substitutions for HRTF experiments. Still, there is more scope for improvement in FIEM and FALM. Accounting for the bioacoustic properties of skin, hair, etc., can help reduce the differences between the experimental and computational models. The magnitude difference at lower frequencies, as shown in Figs. 9 and 11, is mainly due to the computational models with limited bioacoustic attributes [50]. A percentage of sound waves passes through the skull and mouth in a real-life scenario; it is also not considered in the numerical analysis here. Additionally, middle ears and inner ears are highly sophisticated biological parts. They require meticulous computational models with attributes like eardrum inclination, frequency-dependent absorption factor, ear wax presence, etc., compared to the simple model used in this work, based on a single attribute of frequency-independent absorption factor. Thus, integrating these factors would make the computational methods more accurate and perfect replacements for experimental methods. BEM tools usually do not assure local connectivity of elements and also require complex modeling procedures to add intermediate bioacoustic parts between the sound source and the surface on which boundary integrals are evaluated [21, 50]. In contrast, the proposed methods have higher scope and convenience of bioacoustics modeling because the modeling of complex connected parts is much easier using locally interconnected computational elements employed in FEM [21].

For comparing the spatial hearing accuracy of different HRTFs, subjective listening analysis is more appropriate. But the audio perception studies are out of the scope of present work, and the HRTF comparisons are limited to analytical comparisons of the spectral features of different HRTFs as in [49]. From the results of various psychophysical experiments [48, 52], it is clear that the frequency distribution of peaks and sharp notches in the HRTF spectra plays a significant role as localization cues. Hence, it is meaningful to examine different HRTFs analytically and compare the different spectral elements. In the next phase of validation, experimental HRTF in proximity regions are compared to corresponding simulation results.

4.2 Comparison with the Experimental Measurements with Proposed Methods in Proximity Region

The simulated proximity region HRTF of the Indian subject using FALM and FIEM are compared with experimental results. As mentioned earlier, there are limitations in conducting accurate proximity region acoustic experiments. Different factors like the directivity of the speakers, poor microphone responses at higher frequencies influence the experimental results. Nevertheless, as shown in Fig. 12, both spectral features of the experimental and computational HRTFs agree satisfactorily in the proximity regions, especially in below 12 kHz. The first peak and notch are perfectly aligned in the proximity region, especially for the ipsilateral sides. It is also noted that FALM results are in better agreement with experimental results than FIEM results, especially at higher frequencies. As mentioned earlier, PML shows better performance in eliminating unwanted reflections and better in accommodating near-field effects of the sound source. These reasons contribute to the better accuracy of the FALM, especially in proximity regions.

4.3 Computational Time

The massive advantage of FIEM and FALM over the BEM methods is the high-resolution volumetric HRTF measurement with reduced computational cost. Initially, BEM methods reported a computational time of 50 h per frequency [18]. Later, employing the fast multipoles, the simulation time was reduced to 5 h [24]. On the other hand, the simulation of FIEM took nearly 2 h, and FALM took 3 h with 68 frequencies. Without any adaptive meshing, it perhaps could take many weeks for the simulations to converge. However, the authors believe that comparing computational speeds on different systems and surface meshes is not meaningful. Nevertheless, both of these proposed methods could calculate the whole region HRTF in a time period that is comparable to the time BEM took for calculating HRTF on a spherical surface alone. In the future, with cloud computing and parallel processing capabilities, this may reduce to even shorter simulation run times, in the order of several minutes.

4.4 Mesh Criteria

In the BEM-based HRTF approach, the field integrals are calculated on the head surface mesh, requiring high-resolution surface mesh for accurate results. But in FALM and FIEM methods, the head surface is not used for integral measurements, which provides a slight advantage to these methods as they could give good results with reduced quality head surface meshes. Only contralateral regions have significant spatial magnitude differences while simulating with reduced quality mesh (maximum element size has doubled), as shown in Fig. 13. The errors are more in the contralateral region than the ipsilateral sides, mainly because of the head shadowing and directivity of the ear pinna [4]. It has enormous benefits because capturing the high-quality head surface mesh is challenging for producing individualized HRTF. This method can be applied with low-resolution meshes to produce acceptable results without massive accuracy degradation. This is one of the remarkable capabilities of this method.

At times, BEM produces critical frequency errors; hence it may require a unique approach and collocation process for each head surface mesh [19, 20]. The proposed methods have not shown similar compatible issues at mesh-dependent critical frequencies, owing to the mathematical superiority of finite element methods. Hence, they can also be implemented much more facilely for automated simulation because it does not require extensive pre-processing.

4.5 Spatial Resolution

The generation of high-resolution distance-dependent field measurements is another advantage of the proposed methods. HRTF at 91,801 points within a 1.25-m radius spherical region was calculated within four hours. It is almost a hundred times the number of the calculation points in regular HRTF measurements. It means that 3D audio rendering tools using this HRTF do not need additional algorithms for the smooth rendering of moving and proximity region objects as in [9]. The methods can contribute to the studies on distance-dependent human hearing and proximity region effects of anthropometric features, owing to the high-resolution whole region measurement [53, 54].

5 Conclusion

This work demonstrates the successful implementation of two finite element-based numerical methods, viz FALM and FIEM, for measuring distance-dependent HRTF with good accuracy and low computational resource requirement. The proposed methods also showed computational convenience in incorporating the absorption factor for the middle ear. Likewise, it can accommodate bone conduction into HRTF computation, provided that magnetic resonance imaging (MRI) is utilized to create internal models of the head. Additionally, the integration of acoustic properties of hair, skin, and clothes is also feasible in the proposed finite element-based methods.

FALM and FIEM can be constructively utilized to understand various diffraction and reflection patterns of the human body in proximity regions, hence convenient in hearing perception studies. The distance dependence of sound perception is a comparatively less explored region of research, and appropriate computational tools can be beneficial. Evaluating personalized distance-dependent HRTF using these methods with less human effort and time can be very beneficial in perceptual spatial audio rendering technologies, especially for moving and proximity region sound sources. Another advantage of the proposed methods is their better performance with lower-quality mesh surfaces. As a result, the photogrammetry tools can be easily incorporated into these simulation techniques to create surface meshes from 2D images of the subjects and measure personalized HRTFs with less effort. Besides the application of FALM and FIEM in understanding HRTFs of human adults, they could be employed in the studies pertaining to children or even other members of the animal kingdom, especially mammals and birds, where the experimental approach is impractical.

As the virtual auditory applications expand their wings, the proposed simulation tools for evaluating distance-dependent HRTFs can be vital for the faster generation of personalized spatial audio. Along with fast-growing computational speed, cloud computing technologies, and advances in machine learning, simulation-based production of spatial audio can play a significant part in the future extended reality applications.

References

Rajguru, C., Obrist, M., Memoli, G.: Spatial soundscapes and virtual worlds: challenges and opportunities. Front. Psychol. 11, 2714 (2020). https://doi.org/10.3389/fpsyg.2020.569056
Article Google Scholar
Kailas G., Tiwari N. Design for immersive experience: Role of spatial audio in extended reality applications. In: Chakrabarti A., Poovaiah R., Bokil P., Kant V. (eds) Design for Tomorrow—Volume 2. Smart Innovation, Systems and Technologies, Springer, Singapore. (2021). https://doi.org/10.1007/978-981-16-0119-4_69
Algazi, V.R., Duda, R.O., Thompson, D.M., Avendano, C.: The CIPIC HRTF database, pp. 99–102. Proc. IEEE Work. Appl. Signal Process, Audio, Acoust (2001)
Google Scholar
Jin, C.T., et al.: Creating the sydney york morphological and acoustic recordings of ears database. IEEE Trans. Multimed. 16, 37–46 (2014). https://doi.org/10.1109/tmm.2013.2282134
Article Google Scholar
Armstrong, C., Thresh, L., Murphy, D., Kearney, G.: A perceptual evaluation of individual and non-individual HRTFs: a case study of the SADIE II database. Appl. Sci. 8, 2029 (2018). https://doi.org/10.3390/app8112029
Article Google Scholar
Li, S., Peissig, J.: Measurement of head-related transfer functions: a review. Appl. Sci. 10, 5014 (2020). https://doi.org/10.3390/app10145014
Article Google Scholar
Zahorik, P., Brungart, D.S., Bronkhorst, A.W.: Auditory distance perception in humans: a summary of past and present research. Acta Acust United Acust. 91, 409–420 (2005)
Google Scholar
Brungart, D.S., Rabinowitz, W.M.: Auditory localization of nearby sources. Head-related transfer functions. J. Acoust. Soc. Am. 106, 1465–1479 (1999). https://doi.org/10.1121/1.427180
Article Google Scholar
Cuevas-Rodríguez, M., et al.: 3D tune-in toolkit: An open-source library for real-time binaural spatialisation. Plos One 14, e0211899 (2019). https://doi.org/10.1371/journal.pone.0211899
Article Google Scholar
Kailas, G., Tiwari, N.: A finite element solution for spatial audio rendering of nearby or moving sound sources. In: Proc. 27th Int. Congr. Sound Vib. ICSV 2021. Silesian University Press (2021)
Gupta, N., Barreto, A., Joshi, M., Agudelo, J.C.: HRTF database at FIU DSP Lab. In: Proc. - ICASSP IEEE Int. Conf. Acoust. Speech Signal Process. 169–172 (2010). https://doi.org/10.1109/icassp.2010.5496084
Carpentier, T., Bahu, H., Noisternig, M., Warusfel, O.: Measurement of a head-related transfer function database with high spatial resolution. Forum Acousticum. (2015)
Chen, Z., Mao, D.: Near-field variation of loudness with distance. Acoust. Aust. 47, 175–184 (2019). https://doi.org/10.1007/s40857-019-00158-1
Article Google Scholar
Yu, G., Wu, R., Liu, Y., Xie, B.: Near-field head-related transfer-function measurement and database of human subjects. J. Acoust. Soc. Am. 143, 194–198 (2018). https://doi.org/10.1121/1.5027019
Article Google Scholar
Hosoe, S., Nishino, T., Itou, K., Takeda, K.: Development of micro-dodecahedral loudspeaker for measuring head-related transfer functions in the proximal region. Proc ICASSP IEEE Int. Conf. Acoust Speech Signal Process. 5, 329–332 (2006). https://doi.org/10.1109/icassp.2006.1661279
Article Google Scholar
Qu, T., Xiao, Z., Gong, M., Huang, Y., Li, X., Wu, X.: Distance-dependent head-related transfer functions measured with high spatial resolution using a spark gap. IEEE Trans Audio Speech Lang. Process. 17, 1124–1132 (2009). https://doi.org/10.1109/tasl.2009.2020532
Article Google Scholar
Araki, J., Nishino, T., Takeda, K., Itakura, F.: Measurement of the head related transfer function using the spark noise. J. Acoust. Soc. Jpn. 60, 314–318 (2004). https://doi.org/10.20697/jasj.60.6_314
Article Google Scholar
Katz, B.F.G.: Boundary element method calculation of individual head-related transfer function. I. Rigid model calculation. J. Acoust. Soc. Am. 110, 2440–2448 (2001). https://doi.org/10.1121/1.1412440
Article Google Scholar
Kahana, Y., Nelson, P.A.: Boundary element simulations of the transfer function of human heads and baffled pinnae using accurate geometric models. J. Sound Vib. 300, 552–579 (2007). https://doi.org/10.1016/j.jsv.2006.06.079
Article Google Scholar
Kahana, Y.: Numerical Modelling of the Head-Related Transfer Function, Doctoral dissertation, University of Southampton (2000)
Harari, I., Hughes, T.J.R.: A cost comparison of boundary element and finite element methods for problems of time-harmonic acoustics. Comput. Methods Appl. Mech. Eng. 97, 77–102 (1992). https://doi.org/10.1016/0045-7825(92)90108-V
Article MathSciNet MATH Google Scholar
Huttunen, T., Seppälä, E.T., Kirkeby, O., Kärkkäinen, A., Kärkkäinen, L.: Simulation of the transfer function for a head-and-torso model over the entire audible frequency range. J. Comput. Acoust. 15, 429–448 (2007). https://doi.org/10.1142/s0218396x07003469
Article Google Scholar
Kirkup, S.: The boundary element method in acoustics: a survey. Appl. Sci. 9, 1642 (2019). https://doi.org/10.3390/app9081642
Article Google Scholar
Kreuzer, W., Majdak, P., Chen, Z.: Fast multipole boundary element method to calculate head-related transfer functions for a wide frequency range. J. Acoust. Soc. Am. 126, 1280–1290 (2009). https://doi.org/10.1121/1.3177264
Article Google Scholar
Darrigrand, E.: Coupling of fast multipole method and microlocal discretization for the 3-D Helmholtz equation. J. Comput. Phys. 181, 126–154 (2002). https://doi.org/10.1006/jcph.2002.7091
Article MathSciNet MATH Google Scholar
Chen, Z.S., Waubke, H., Kreuzer, W.: A formulation of the fast multiple boundary element method (FMBEM) for acoustic radiation and scattering from three-dimensional structures. J. Comput. Acoust. 16, 303–320 (2008). https://doi.org/10.1142/s0218396x08003725
Article MathSciNet MATH Google Scholar
Thompson, L.L.: A review of finite-element methods for time-harmonic acoustics. J. Acoust. Soc. Am. 119, 1315–1330 (2006). https://doi.org/10.1121/1.2164987
Article Google Scholar
Harari, I., Hughes, T.J.R.: Finite element methods for the Helmholtz equation in an exterior domain: Model problems. Comput. Methods Appl. Mech. Eng. 87, 59–96 (1991). https://doi.org/10.1016/0045-7825(91)90146-w
Article MathSciNet MATH Google Scholar
Astley, R.J.: Infinite elements for wave problems: a review of current formulations and an assessment of accuracy. Int. J. Numer. Methods Eng. 49, 951–976 (2000). https://doi.org/10.1002/1097-0207(20001110)49:7%3c951::aid-nme989%3e3.0.co;2-t
Article MATH Google Scholar
Astley, R.J., Macaulay, G.J., Coyette, J.P.: Mapped wave envelope elements for acoustical radiation and scattering. J. Sound Vib. 170, 97–118 (1994). https://doi.org/10.1006/jsvi.1994.1048
Article MATH Google Scholar
Turkel, E., Yefet, A.: Absorbing PML boundary layers for wave-like equations. Appl. Numer. Math. 27, 533–557 (1998). https://doi.org/10.1016/s0168-9274(98)00026-9
Article MathSciNet MATH Google Scholar
Jean-Pierre, B., Berenger, J.P.: A perfectly matched layer for the absorption of electromagnetic waves. J. Comput. Phys. 114, 185–200 (1994). https://doi.org/10.1006/jcph.1994.1159
Article MathSciNet MATH Google Scholar
Liu, Q.-H., Tao, J.: The perfectly matched layer for acoustic waves in absorptive media. J. Acoust. Soc. Am. 102, 2072–2082 (1997). https://doi.org/10.1121/1.419657
Article Google Scholar
Abarbanel, S., Gottlieb, D., Hesthaven, J.S.: Well-posed perfectly matched layers for advective acoustics. J. Comput. Phys. 154, 266–283 (1999). https://doi.org/10.1006/jcph.1999.6313
Article MathSciNet MATH Google Scholar
Harari, I., Slavutin, M., Turkel, E.: Analytical and numerical studies of a finite element PML for the Helmholtz equation. J. Comput. Acoust. 8, 121–137 (2000). https://doi.org/10.1142/S0218396X0000008X
Article MathSciNet MATH Google Scholar
Williams, J.E.F., Hawkings, D.L.: Sound generation by turbulence and surfaces in arbitrary motion. Philos. Trans. R. Soc. Lond., Series A, Mathematical and Physical Sciences. 321–342 (1969)
Crighton, D.G., Dowling, A.P., Ffowcs‐Williams, J.E., Heckl, M., Leppington, F.G., Bartram, J.F.: Modern Methods in Analytical Acoustics Lecture Notes. (1992)
di Francescantonio, P.: A new boundary integral formulation for the prediction of sound radiation. J. Sound Vib. 202, 491–509 (1997). https://doi.org/10.1006/jsvi.1996.0843
Article Google Scholar
Farassat, F.: Acoustic radiation from rotating blades - The Kirchhoff method in aeroacoustics. J. Sound Vib. 239, 785–800 (2001). https://doi.org/10.1006/jsvi.2000.3221
Article Google Scholar
Brentner, K.S., Farassat, F.: Analytical comparison of the acoustic analogy and Kirchhoff formulation for moving surfaces. AIAA J. 36, 1379–1386 (1998). https://doi.org/10.2514/2.558
Article Google Scholar
SYMARE database - Morphoacoustics, https://www.morphoacoustics.org/symare-database.html Accessed 22 December 2021
Sandeep Reddy, C., Hegde, R.M.: Design and development of bionic ears for rendering binaural audio. Int. Conf. Signal Process. Commun. SPCOM 2016, 1–5 (2016). https://doi.org/10.1109/SPCOM.2016.7746678
Article Google Scholar
3D Scanning at a Glance - Artec Studio 12 documentation, http://docs.artec-group.com/as/12/en/qsg.html#cha-glance Accessed 22 December 2021
Cignoni, P., et al.: Meshlab: an open-source mesh processing tool. In: Eurographics Italian chapter conference. 129–136 (2008)
Brandts, J., Korotov, S., Křížek, M.: On the equivalence of regularity criteria for triangular and tetrahedral finite element partitions. Comput. Math. with Appl. 55, 2227–2233 (2008). https://doi.org/10.1016/j.camwa.2007.11.010
Article MathSciNet MATH Google Scholar
Zotkin, D.N., Duraiswami, R., Grassi, E., Gumerov, N.A.: Fast head-related transfer function measurement via reciprocity. J. Acoust. Soc. Am. 120, 2202–2215 (2006). https://doi.org/10.1121/1.2207578
Article Google Scholar
Actran, https://www.fft.be/products Accessed 22 December 2021
Iida, K., Itoh, M., Itagaki, A., Morimoto, M.: Median plane localization using a parametric model of the head-related transfer function based on spectral cues. Appl. Acoust. 68, 835–850 (2007). https://doi.org/10.1016/j.apacoust.2006.07.016
Article Google Scholar
Middlebrooks, J.C.: Individual differences in external-ear transfer functions reduced by scaling in frequency. J. Acoust. Soc. Am. 106, 1480–1492 (1999). https://doi.org/10.1121/1.427176
Article Google Scholar
Katz, B.F.G.: Acoustic absorption measurement of human hair and skin within the audible frequency range. Citat J. Acoust. Soc. Am. 108, 2238 (2000). https://doi.org/10.1121/1.1314319
Article Google Scholar
Cheng, C.I., Wakefield, G.H.: Spatial frequency response surfaces: An alternative visualization tool for head-related transfer functions (HRTF’s). I Proc. ICASSP IEEE Int. Conf. Acoust. Speech Signal Process. 2, 961–964 (1999). https://doi.org/10.1109/icassp.1999.759854
Article Google Scholar
Takemoto, H., Mokhtari, P., Kato, H., Nishimura, R., Iida, K.: Mechanism for generating peaks and notches of head-related transfer functions in the median plane. J. Acoust. Soc. Am. 132, 3832–3841 (2012). https://doi.org/10.1121/1.4765083
Article Google Scholar
Lu, D., Zeng, X., Guo, X., Wang, H.: Head-related transfer function personalization based on modified sparse representation with matching in a database of Chinese pilots. Acoust. Aust. 48, 463–471 (2020). https://doi.org/10.1007/S40857-020-00202-5
Article Google Scholar
Lu, D., Zeng, X., Guo, X., Wang, H.: Head-related transfer function reconstruction with anthropometric parameters and the direction of the sound source. Acoust. Aust. 49, 125–132 (2020). https://doi.org/10.1007/s40857-020-00209-y
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Design, Indian Institute of Technology Kanpur, Kanpur, Uttar Pradesh, 208016, India
Ganesh Kailas & Nachiketa Tiwari

Authors

Ganesh Kailas
View author publications
You can also search for this author in PubMed Google Scholar
Nachiketa Tiwari
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ganesh Kailas.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kailas, G., Tiwari, N. Efficient Computational Techniques for Evaluating Distance-Dependent Head-Related Transfer Functions. Acoust Aust 50, 231–245 (2022). https://doi.org/10.1007/s40857-022-00263-8

Download citation

Received: 29 September 2021
Accepted: 12 January 2022
Published: 23 February 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s40857-022-00263-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Efficient Computational Techniques for Evaluating Distance-Dependent Head-Related Transfer Functions

Abstract

Similar content being viewed by others

Loudness stability of binaural sound with spherical harmonic representation of sparse head-related transfer functions