Abstract
Blind Source Separation (BSS) techniques aim at recovering unobserved source signals from observed mixtures (typically, the outputs of an array of sensors). Practically all classical BSS techniques do not work properly under reverberant conditions and therefore, it still remains an open problem. In this sense, we propose in this document the use of synchronization of speech mixtures in order to improve the results of classical BSS techniques. Specifically, we have applied the synchronization of mixtures combined with one of the most well-known and robust BSS algorithms that works under non-reverberant conditions, the Degenerate Unmixing Estimation Technique (DUET). In the aim of synchronizing speech mixtures prior to the speech source separation, the suitability of working with seven Time Delay Estimation (TDE) techniques has been analyzed. Results show the feasibility of using synchronization since the results of DUET are improved and additionally, it has been observed what is the most useful TDE algorithm in this framework.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Cao, X.R., Liu, R.: General approach to blind source separation. IEEE Transactions on Signal Processing, 562–571 (1996)
Hérault, J., Jutten, C., Ans, B.: Détection de grandeurs primitives dans un message composite par une architecture de calcul neuromimétique en apprentissage non supervisé. In: 10 Colloque sur le Traitement du Signal et Des Images, France (1985)
Diggavi, S.N., Al-Dhahir, N., Stamoulis, A., Calderbank, A.R.: Great expectations: The value of spatial diversity in wireless networks. Proceedings of the IEEE, 219–270 (2004)
Cichocki, A., Georgiev, P.: Blind source separation algorithms with matrix constraints. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 522–531 (2003)
Te-Won, L.: Independent component analysis: theory and applications. Kluwer Academic Publishers, Boston (1998)
Hurley, N., Rickard, S.: Comparing measures of sparsity. IEEE Transactions on Information Theory, 4723–4741 (2009)
Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 1830–1847 (2004)
Zicheng, L.: Sound source separation with distributed microphone arrays in the presence of clock synchronization errors. In: Proc. Int. Workshop Acoustic Echo and Noise Control, IWAENC (2008)
Lienhart, R., Kozintsev, I., Wehr, S., Yeung, M.: On the importance of exact synchronization for distributed audio signal processing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, vol. 4, pp. IV-840–IV-843. IEEE (2003)
Brandstein, M.S., Adcock, J.E., Silverman, H.F.: A practical time-delay estimator for localizing speech sources with a microphone array. Computer Speech and Language, 153–170 (1995)
Yegnanarayana, B., Prasanna, S.R.M., Duraiswami, R., Zotkin, D.: Processing of reverberant speech for time-delay estimation. IEEE Transactions on Speech and Audio Processing, 1110–1118 (2005)
Carter, G.C.: Coherence and time delay estimation. Proceedings of the IEEE, 236–255 (1987)
Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech and Signal Processing, 320–327 (1976)
Emile, B., Comon, P., Le Roux, J.: Estimation of time delays with fewer sensors than sources. IEEE Transactions on Signal Processing, 2012–2015 (1998)
Wehr, S., Kozintsev, I., Lienhart, R., Kellermann, W.: Synchronization of acoustic sensors for distributed ad-hoc audio networks and its use for blind source separation. In: Proceedings of the IEEE Sixth International Symposium on Multimedia Software Engineering, pp. 18–25. IEEE (2004)
Francourt, C., Parra, L.: The coherence function in blind source separation of convolutive mixtures of non-stationary signals. In: IEEE Workshop on Neural Networks for Signal Processing, pp. 303–312 (2001)
Donohue, K.D., Agrinsoni, A., Hannemann, J.: Audio signal delay estimation using partial whitening. In: Proceedings of the IEEE SoutheastCon, pp. 466–471. IEEE (2007)
Saarnisaari, H.: ML time delay estimation in a multipath channel. In: Proceedings of the IEEE 4th International Symposium on Spread Spectrum Techniques and Applications, pp. 1007–1011. IEEE (1996)
Roth, P.R.: Effective measurements using digital signal analysis. IEEE Spectrum 8, 62–70 (1971)
Carter, G.C., Nuttall, A.H., Cable, P.G.: The smoothed coherence transform. Proceedings of the IEEE, 1497–1498 (1973)
Jacovitti, G., Scarano, G.: Discrete time techniques for time delay estimation. IEEE Transactions on Signal Processing, 525–533 (1993)
Seneff, S., Zue, V.: Transcription and alignment of the timit database, TIMIT CD-ROM Documentation (1998)
McGovern, S.G: A model for room acoustics, http://www.2pi.us/rir.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Llerena, C., Gil-Pita, R., Álvarez, L., Rosa-Zurera, M. (2013). Synchronizing Speech Mixtures in Speech Separation Problems under Reverberant Conditions. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science(), vol 7894. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38658-9_52
Download citation
DOI: https://doi.org/10.1007/978-3-642-38658-9_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38657-2
Online ISBN: 978-3-642-38658-9
eBook Packages: Computer ScienceComputer Science (R0)