Volume 2021, issue 1, December 2021

45 articles in this issue

Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit

Authors (first, second and last of 4)
- Jiacheng Yao
- Jing Zhang
- Li Zhuo
- Content type: Research
- Open Access
- Published: 20 December 2021
- Article: 45
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation

Authors (first, second and last of 7)
- Yuki Takashima
- Ryoichi Takashima
- Nobuaki Motoyama
- Content type: Research
- Open Access
- Published: 11 December 2021
- Article: 44
A recursive expectation-maximization algorithm for speaker tracking and separation

Authors
- Ofer Schwartz
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 04 December 2021
- Article: 43
Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation

Authors (first, second and last of 5)
- Zolzaya Byambadorj
- Ryota Nishimura
- Norihide Kitaoka
- Content type: Research
- Open Access
- Published: 04 December 2021
- Article: 42
Spherical harmonic covariance and magnitude function encodings for beamformer design

Authors
- Yuancheng Luo
- Content type: Research
- Open Access
- Published: 03 December 2021
- Article: 41
U²-VC: one-shot voice conversion using two-level nested U-structure

Authors (first, second and last of 5)
- Fangkun Liu
- Hui Wang
- Xiaodong Li
- Content type: Research
- Open Access
- Published: 24 November 2021
- Article: 40
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing

Authors (first, second and last of 6)
- Diego Di Carlo
- Pinchas Tandeitnik
- Sharon Gannot
- Content type: Empirical Research
- Open Access
- Published: 23 November 2021
- Article: 39
This is part of 1 collection:
Data-Based Spatial Audio Processing
A multichannel learning-based approach for sound source separation in reverberant environments

Authors
- You-Siang Chen
- Zi-Jie Lin
- Mingsian R. Bai
- Content type: Research
- Open Access
- Published: 20 November 2021
- Article: 38
This is part of 1 collection:
Data-Based Spatial Audio Processing
Efficient binaural rendering of spherical microphone array data by linear filtering

Authors
- Johannes M. Arend
- Tim Lübeck
- Christoph Pörschmann
- Content type: Research
- Open Access
- Published: 06 November 2021
- Article: 37
This is part of 1 collection:
Data-Based Spatial Audio Processing
Comparative evaluation of interpolation methods for the directivity of musical instruments

Authors (first, second and last of 5)
- David Ackermann
- Fabian Brinkmann
- Stefan Weinzierl
- Content type: Research
- Open Access
- Published: 30 October 2021
- Article: 36
This is part of 1 collection:
Data-Based Spatial Audio Processing
Nonlinear residual echo suppression based on dual-stream DPRNN

Authors (first, second and last of 4)
- Hongsheng Chen
- Guoliang Chen
- Jing Lu
- Content type: Research
- Open Access
- Published: 07 September 2021
- Article: 35
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Pronunciation augmentation for Mandarin-English code-switching speech recognition

Authors (first, second and last of 4)
- Yanhua Long
- Shuang Wei
- Yijie Li
- Content type: Research
- Open Access
- Published: 30 August 2021
- Article: 34
An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method

Authors (first, second and last of 4)
- Nili Cohen
- Gershon Hazan
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 28 August 2021
- Article: 33
A noise PSD estimation algorithm using derivative-based high-pass filter in non-stationary noise conditions

Authors
- Sujan Kumar Roy
- Kuldip K. Paliwal
- Content type: Research
- Open Access
- Published: 14 August 2021
- Article: 32
Feature compensation based on the normalization of vocal tract length for the improvement of emotion-affected speech recognition

Authors
- Masoud Geravanchizadeh
- Elnaz Forouhandeh
- Meysam Bashirpour
- Content type: Research
- Open Access
- Published: 04 August 2021
- Article: 31
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Musical note onset detection based on a spectral sparsity measure

Authors
- Mina Mounir
- Peter Karsmakers
- Toon van Waterschoot
- Content type: Research
- Open Access
- Published: 28 July 2021
- Article: 30
Single-channel speech enhancement based on joint constrained dictionary learning

Authors (first, second and last of 4)
- Linhui Sun
- Yunyi Bu
- Zihao Wu
- Content type: Research
- Open Access
- Published: 27 July 2021
- Article: 29
Performance vs. hardware requirements in state-of-the-art automatic speech recognition

Authors (first, second and last of 4)
- Alexandru-Lucian Georgescu
- Alessandro Pappalardo
- Michaela Blott
- Content type: Review
- Open Access
- Published: 21 July 2021
- Article: 28
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system

Authors (first, second and last of 6)
- Gui-Xin Shi
- Wei-Qiang Zhang
- Ze-Yu Zhao
- Content type: Research
- Open Access
- Published: 08 July 2021
- Article: 27
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition

Authors (first, second and last of 6)
- Lujun Li
- Yikai Kang
- Gerhard Rigoll
- Content type: Research
- Open Access
- Published: 05 July 2021
- Article: 26
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information

Authors
- Tobias Gburrek
- Joerg Schmalenstroeer
- Reinhold Haeb-Umbach
- Content type: Methodology
- Open Access
- Published: 02 July 2021
- Article: 25
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Components loss for neural networks in mask-based speech enhancement

Authors (first, second and last of 4)
- Ziyi Xu
- Samy Elshamy
- Tim Fingscheidt
- Content type: Research
- Open Access
- Published: 02 July 2021
- Article: 24
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Multi-source localization by using offset residual weight

Authors
- Maoshen Jia
- Shang Gao
- Changchun Bao
- Content type: Research
- Open Access
- Published: 24 June 2021
- Article: 23
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Feature compensation based on independent noise estimation for robust speech recognition

Authors (first, second and last of 4)
- Yong Lü
- Han Lin
- Yitao Chen
- Content type: Research
- Open Access
- Published: 16 June 2021
- Article: 22
Residual feedback suppression with extended model-based postfilters

Authors
- Marco Gimm
- Philipp Bulling
- Gerhard Schmidt
- Content type: Research
- Open Access
- Published: 28 May 2021
- Article: 21
Neural network-based non-intrusive speech quality assessment using attention pooling function

Authors (first, second and last of 4)
- Miao Liu
- Jing Wang
- Fang Liu
- Content type: Research
- Open Access
- Published: 17 May 2021
- Article: 20
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Frequency-dependent auto-pooling function for weakly supervised sound event detection

Authors (first, second and last of 4)
- Sichen Liu
- Feiran Yang
- Jun Yang
- Content type: Research
- Open Access
- Published: 17 May 2021
- Article: 19
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network

Authors (first, second and last of 4)
- Duowei Tang
- Peter Kuppens
- Toon van Waterschoot
- Content type: Research
- Open Access
- Published: 12 May 2021
- Article: 18
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms

Authors (first, second and last of 5)
- Yuxuan Ke
- Andong Li
- Xiaodong Li
- Content type: Research
- Open Access
- Published: 12 April 2021
- Article: 17
Dynamically localizing multiple speakers based on the time-frequency domain

Authors (first, second and last of 4)
- Hodaya Hammer
- Shlomo E. Chazan
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 08 April 2021
- Article: 16
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones

Authors
- Randall Ali
- Toon van Waterschoot
- Marc Moonen
- Content type: Correction
- Open Access
- Published: 06 April 2021
- Article: 15
Acoustic DOA estimation using space alternating sparse Bayesian learning

Authors (first, second and last of 5)
- Zonglong Bai
- Liming Shi
- Mads Græsbøll Christensen
- Content type: Research
- Open Access
- Published: 06 April 2021
- Article: 14
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain

Authors
- Sushmita Thakallapalli
- Suryakanth V. Gangashetty
- Nilesh Madhu
- Content type: Research
- Open Access
- Published: 03 March 2021
- Article: 13
Analysis of transition cost and model parameters in speaker diarization for meetings

Authors (first, second and last of 5)
- Beatriz Martínez-González
- José M. Pardo
- Javier Ferreiros
- Content type: Research
- Open Access
- Published: 24 February 2021
- Article: 12
Accent modification for speech recognition of non-native speakers using neural style transfer

Authors (first, second and last of 4)
- Kacper Radzikowski
- Le Wang
- Robert Nowak
- Content type: Research
- Open Access
- Published: 18 February 2021
- Article: 11
An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones

Authors
- Randall Ali
- Toon van Waterschoot
- Marc Moonen
- Content type: Research
- Open Access
- Published: 10 February 2021
- Article: 10
A CNN-based approach to identification of degradations in speech signals

Authors
- Yuki Saishu
- Amir Hossein Poorjam
- Mads Græsbøll Christensen
- Content type: Research
- Open Access
- Published: 05 February 2021
- Article: 9
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
A review of infant cry analysis and classification

Authors (first, second and last of 4)
- Chunyan Ji
- Thosini Bamunu Mudiyanselage
- Yi Pan
- Content type: Review
- Open Access
- Published: 05 February 2021
- Article: 8
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices

Authors (first, second and last of 9)
- Rajat Hebbar
- Pavlos Papadopoulos
- Shrikanth Narayanan
- Content type: Research
- Open Access
- Published: 03 February 2021
- Article: 7
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Sparse pursuit and dictionary learning for blind source separation in polyphonic music recordings

Authors
- Sören Schulze
- Emily J. King
- Content type: Research
- Open Access
- Published: 28 January 2021
- Article: 6
Audio source separation by activity probability detection with maximum correlation and simplex geometry

Authors
- Bracha Laufer-Goldshtein
- Ronen Talmon
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 28 January 2021
- Article: 5
This is part of 1 collection:
Data-driven Approaches in Acoustic Signal Processing: Methods and Applications
Dynamic out-of-vocabulary word registration to language model for speech recognition

Authors
- Norihide Kitaoka
- Bohan Chen
- Yuya Obashi
- Content type: Research
- Open Access
- Published: 25 January 2021
- Article: 4
Time–frequency scattering accurately models auditory similarities between instrumental playing techniques

Authors (first, second and last of 6)
- Vincent Lostanlen
- Christian El-Hajj
- Mathieu Lagrange
- Content type: Research
- Open Access
- Published: 11 January 2021
- Article: 3
Forward-backward recursive expectation-maximization for concurrent speaker tracking

Authors
- Yuval Dorfan
- Boaz Schwartz
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 09 January 2021
- Article: 2
Progressive loss functions for speech enhancement with deep neural networks

Authors (first, second and last of 6)
- Jorge Llombart
- Dayana Ribas
- Eduardo Lleida
- Content type: Research
- Open Access
- Published: 07 January 2021
- Article: 1

