Volume 2021, issue 1, December 2021
45 articles in this issue
-
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation
Authors (first, second and last of 7)
- Yuki Takashima
- Ryoichi Takashima
- Nobuaki Motoyama
- Content type: Research
- Open Access
- Published: 11 December 2021
- Article: 44
-
A recursive expectation-maximization algorithm for speaker tracking and separation
Authors
- Ofer Schwartz
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 04 December 2021
- Article: 43
-
Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation
Authors (first, second and last of 5)
- Zolzaya Byambadorj
- Ryota Nishimura
- Norihide Kitaoka
- Content type: Research
- Open Access
- Published: 04 December 2021
- Article: 42
-
Spherical harmonic covariance and magnitude function encodings for beamformer design
Authors
- Yuancheng Luo
- Content type: Research
- Open Access
- Published: 03 December 2021
- Article: 41
-
U2-VC: one-shot voice conversion using two-level nested U-structure
Authors (first, second and last of 5)
- Fangkun Liu
- Hui Wang
- Xiaodong Li
- Content type: Research
- Open Access
- Published: 24 November 2021
- Article: 40
-
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing
Authors (first, second and last of 6)
- Diego Di Carlo
- Pinchas Tandeitnik
- Sharon Gannot
- Content type: Empirical Research
- Open Access
- Published: 23 November 2021
- Article: 39
This is part of 1 collection
-
A multichannel learning-based approach for sound source separation in reverberant environments
Authors
- You-Siang Chen
- Zi-Jie Lin
- Mingsian R. Bai
- Content type: Research
- Open Access
- Published: 20 November 2021
- Article: 38
This is part of 1 collection
-
Efficient binaural rendering of spherical microphone array data by linear filtering
Authors
- Johannes M. Arend
- Tim Lübeck
- Christoph Pörschmann
- Content type: Research
- Open Access
- Published: 06 November 2021
- Article: 37
This is part of 1 collection
-
Comparative evaluation of interpolation methods for the directivity of musical instruments
Authors (first, second and last of 5)
- David Ackermann
- Fabian Brinkmann
- Stefan Weinzierl
- Content type: Research
- Open Access
- Published: 30 October 2021
- Article: 36
This is part of 1 collection
-
Nonlinear residual echo suppression based on dual-stream DPRNN
Authors (first, second and last of 4)
- Hongsheng Chen
- Guoliang Chen
- Jing Lu
- Content type: Research
- Open Access
- Published: 07 September 2021
- Article: 35
This is part of 1 collection
-
Pronunciation augmentation for Mandarin-English code-switching speech recognition
Authors (first, second and last of 4)
- Yanhua Long
- Shuang Wei
- Yijie Li
- Content type: Research
- Open Access
- Published: 30 August 2021
- Article: 34
-
An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method
Authors (first, second and last of 4)
- Nili Cohen
- Gershon Hazan
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 28 August 2021
- Article: 33
-
A noise PSD estimation algorithm using derivative-based high-pass filter in non-stationary noise conditions
Authors
- Sujan Kumar Roy
- Kuldip K. Paliwal
- Content type: Research
- Open Access
- Published: 14 August 2021
- Article: 32
-
Feature compensation based on the normalization of vocal tract length for the improvement of emotion-affected speech recognition
Authors
- Masoud Geravanchizadeh
- Elnaz Forouhandeh
- Meysam Bashirpour
- Content type: Research
- Open Access
- Published: 04 August 2021
- Article: 31
This is part of 1 collection
-
Musical note onset detection based on a spectral sparsity measure
Authors
- Mina Mounir
- Peter Karsmakers
- Toon van Waterschoot
- Content type: Research
- Open Access
- Published: 28 July 2021
- Article: 30
-
Single-channel speech enhancement based on joint constrained dictionary learning
Authors (first, second and last of 4)
- Linhui Sun
- Yunyi Bu
- Zihao Wu
- Content type: Research
- Open Access
- Published: 27 July 2021
- Article: 29
-
Performance vs. hardware requirements in state-of-the-art automatic speech recognition
Authors (first, second and last of 4)
- Alexandru-Lucian Georgescu
- Alessandro Pappalardo
- Michaela Blott
- Content type: Review
- Open Access
- Published: 21 July 2021
- Article: 28
-
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system
Authors (first, second and last of 6)
- Gui-Xin Shi
- Wei-Qiang Zhang
- Ze-Yu Zhao
- Content type: Research
- Open Access
- Published: 08 July 2021
- Article: 27
This is part of 1 collection
-
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition
Authors (first, second and last of 6)
- Lujun Li
- Yikai Kang
- Gerhard Rigoll
- Content type: Research
- Open Access
- Published: 05 July 2021
- Article: 26
-
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information
Authors
- Tobias Gburrek
- Joerg Schmalenstroeer
- Reinhold Haeb-Umbach
- Content type: Methodology
- Open Access
- Published: 02 July 2021
- Article: 25
This is part of 1 collection
-
Components loss for neural networks in mask-based speech enhancement
Authors (first, second and last of 4)
- Ziyi Xu
- Samy Elshamy
- Tim Fingscheidt
- Content type: Research
- Open Access
- Published: 02 July 2021
- Article: 24
This is part of 1 collection
-
Multi-source localization by using offset residual weight
Authors
- Maoshen Jia
- Shang Gao
- Changchun Bao
- Content type: Research
- Open Access
- Published: 24 June 2021
- Article: 23
This is part of 1 collection
-
Feature compensation based on independent noise estimation for robust speech recognition
Authors (first, second and last of 4)
- Yong Lü
- Han Lin
- Yitao Chen
- Content type: Research
- Open Access
- Published: 16 June 2021
- Article: 22
-
Residual feedback suppression with extended model-based postfilters
Authors
- Marco Gimm
- Philipp Bulling
- Gerhard Schmidt
- Content type: Research
- Open Access
- Published: 28 May 2021
- Article: 21
-
Neural network-based non-intrusive speech quality assessment using attention pooling function
Authors (first, second and last of 4)
- Miao Liu
- Jing Wang
- Fang Liu
- Content type: Research
- Open Access
- Published: 17 May 2021
- Article: 20
This is part of 1 collection
-
Frequency-dependent auto-pooling function for weakly supervised sound event detection
Authors (first, second and last of 4)
- Sichen Liu
- Feiran Yang
- Jun Yang
- Content type: Research
- Open Access
- Published: 17 May 2021
- Article: 19
This is part of 1 collection
-
End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network
Authors (first, second and last of 4)
- Duowei Tang
- Peter Kuppens
- Toon van Waterschoot
- Content type: Research
- Open Access
- Published: 12 May 2021
- Article: 18
This is part of 1 collection
-
Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms
Authors (first, second and last of 5)
- Yuxuan Ke
- Andong Li
- Xiaodong Li
- Content type: Research
- Open Access
- Published: 12 April 2021
- Article: 17
-
Dynamically localizing multiple speakers based on the time-frequency domain
Authors (first, second and last of 4)
- Hodaya Hammer
- Shlomo E. Chazan
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 08 April 2021
- Article: 16
This is part of 1 collection
-
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones
Authors
- Randall Ali
- Toon van Waterschoot
- Marc Moonen
- Content type: Correction
- Open Access
- Published: 06 April 2021
- Article: 15
-
Acoustic DOA estimation using space alternating sparse Bayesian learning
Authors (first, second and last of 5)
- Zonglong Bai
- Liming Shi
- Mads Græsbøll Christensen
- Content type: Research
- Open Access
- Published: 06 April 2021
- Article: 14
-
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain
Authors
- Sushmita Thakallapalli
- Suryakanth V. Gangashetty
- Nilesh Madhu
- Content type: Research
- Open Access
- Published: 03 March 2021
- Article: 13
-
Analysis of transition cost and model parameters in speaker diarization for meetings
Authors (first, second and last of 5)
- Beatriz Martínez-González
- José M. Pardo
- Javier Ferreiros
- Content type: Research
- Open Access
- Published: 24 February 2021
- Article: 12
-
Accent modification for speech recognition of non-native speakers using neural style transfer
Authors (first, second and last of 4)
- Kacper Radzikowski
- Le Wang
- Robert Nowak
- Content type: Research
- Open Access
- Published: 18 February 2021
- Article: 11
-
An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones
Authors
- Randall Ali
- Toon van Waterschoot
- Marc Moonen
- Content type: Research
- Open Access
- Published: 10 February 2021
- Article: 10
-
A CNN-based approach to identification of degradations in speech signals
Authors
- Yuki Saishu
- Amir Hossein Poorjam
- Mads Græsbøll Christensen
- Content type: Research
- Open Access
- Published: 05 February 2021
- Article: 9
This is part of 1 collection
-
A review of infant cry analysis and classification
Authors (first, second and last of 4)
- Chunyan Ji
- Thosini Bamunu Mudiyanselage
- Yi Pan
- Content type: Review
- Open Access
- Published: 05 February 2021
- Article: 8
-
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices
Authors (first, second and last of 9)
- Rajat Hebbar
- Pavlos Papadopoulos
- Shrikanth Narayanan
- Content type: Research
- Open Access
- Published: 03 February 2021
- Article: 7
This is part of 1 collection
-
Sparse pursuit and dictionary learning for blind source separation in polyphonic music recordings
Authors
- Sören Schulze
- Emily J. King
- Content type: Research
- Open Access
- Published: 28 January 2021
- Article: 6
-
Audio source separation by activity probability detection with maximum correlation and simplex geometry
Authors
- Bracha Laufer-Goldshtein
- Ronen Talmon
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 28 January 2021
- Article: 5
This is part of 1 collection
-
Dynamic out-of-vocabulary word registration to language model for speech recognition
Authors
- Norihide Kitaoka
- Bohan Chen
- Yuya Obashi
- Content type: Research
- Open Access
- Published: 25 January 2021
- Article: 4
-
Time–frequency scattering accurately models auditory similarities between instrumental playing techniques
Authors (first, second and last of 6)
- Vincent Lostanlen
- Christian El-Hajj
- Mathieu Lagrange
- Content type: Research
- Open Access
- Published: 11 January 2021
- Article: 3
-
Forward-backward recursive expectation-maximization for concurrent speaker tracking
Authors
- Yuval Dorfan
- Boaz Schwartz
- Sharon Gannot
- Content type: Research
- Open Access
- Published: 09 January 2021
- Article: 2
-
Progressive loss functions for speech enhancement with deep neural networks
Authors (first, second and last of 6)
- Jorge Llombart
- Dayana Ribas
- Eduardo Lleida
- Content type: Research
- Open Access
- Published: 07 January 2021
- Article: 1