Abstract
In this paper, we propose a new front-end for Acoustic Event Classification (AEC) tasks. First, we study the spectral content of different acoustic events by applying Non-Negative Matrix Factorization (NMF) to their magnitude spectra and compare it with the structure of speech spectra. Second, based on the findings of this study, we propose a new parameterization for AEC, which extends the conventional Mel Frequency Cepstral Coefficients (MFCC) and is based on high-pass filtering of the acoustic event spectra. We also study the influence of different frequency scales on the classification rate of the whole system. The evaluation of the proposed features for AEC shows relative error reductions of about 12% at the segment level and about 11% at the target event level with respect to the conventional MFCC.
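As an illustration of the two steps named in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes NMF is computed with the well-known multiplicative updates for the generalized Kullback-Leibler divergence, and it assumes that the high-pass filtering can be approximated by discarding the lowest filterbank bands before the log and DCT stages. The abstract specifies neither choice; the function names `nmf_kl` and `highpass_cepstra` and all parameter values are hypothetical.

```python
import numpy as np


def nmf_kl(V, n_components, n_iter=200, eps=1e-10, seed=0):
    """Factorize a non-negative matrix V (freq x frames) as W @ H.

    Uses multiplicative updates for the generalized KL divergence.
    Columns of W act as spectral basis vectors; H holds their activations.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_components)) + eps
    H = rng.random((n_components, n_frames)) + eps
    for _ in range(n_iter):
        WH = W @ H + eps
        # H <- H * (W^T (V / WH)) / (W^T 1)
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        WH = W @ H + eps
        # W <- W * ((V / WH) H^T) / (1 H^T)
        W *= ((V / WH) @ H.T) / (H.sum(axis=1)[None, :] + eps)
    return W, H


def kl_divergence(V, WH, eps=1e-10):
    """Generalized KL divergence D(V || WH)."""
    return float(np.sum(V * np.log((V + eps) / (WH + eps)) - V + WH))


def highpass_cepstra(band_energies, n_drop, n_ceps, eps=1e-10):
    """Cepstra from filterbank energies with the lowest bands removed.

    band_energies: (n_bands, n_frames), ordered from low to high frequency.
    n_drop: number of low-frequency bands to discard (assumed stand-in
            for the high-pass filtering mentioned in the abstract).
    Returns an (n_ceps, n_frames) array of unnormalized DCT-II coefficients.
    """
    kept = band_energies[n_drop:]
    log_e = np.log(kept + eps)
    n = kept.shape[0]
    k = np.arange(n_ceps)[:, None]
    m = np.arange(n)[None, :]
    dct = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return dct @ log_e


if __name__ == "__main__":
    # Synthetic stand-in for a magnitude spectrogram (no real audio here).
    rng = np.random.default_rng(1)
    V = rng.random((64, 5)) @ rng.random((5, 100))
    W, H = nmf_kl(V, n_components=5)
    print("KL divergence:", kl_divergence(V, W @ H))

    # Synthetic filterbank energies: 40 bands, 100 frames.
    E = rng.random((40, 100)) + 0.1
    C = highpass_cepstra(E, n_drop=3, n_ceps=12)
    print("Cepstra shape:", C.shape)
```

In practice `V` would be the magnitude spectrogram of an acoustic event and `band_energies` the output of a mel (or other frequency-scale) filterbank; both are replaced by random data here so that the sketch runs without audio files.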
© 2013 Springer-Verlag Berlin Heidelberg
Cite this paper
Ludeña-Choez, J., Gallardo-Antolín, A. (2013). NMF-Based Spectral Analysis for Acoustic Event Classification Tasks. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science, vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_2
DOI: https://doi.org/10.1007/978-3-642-38847-7_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer Science (R0)