Abstract
The paper proposes an approach to stationary patterns of auditory neural activity analysis from the point of semi-supervised learning in self-organizing maps (SOM). The suggested approach has allowed to classify and identify complex auditory stimuli, such as vowels, given limited prior information about the data. A computational model of the auditory periphery has been used to obtain auditory nerve fiber responses. Label propagation through Delaunay triangulation proximity graph, derived by SOM algorithm, is implemented to classify unlabeled units. In order to avoid the “dead” unit problem in Emergent SOM and to improve method effectiveness, an adaptive conscience mechanism has been realized. The study has considered the influence of AWGN on the robustness of auditory stimuli identification under various SNRs. The representation of acoustic signals in the form of neural activity in the auditory nerve fibers has proven more noise-robust compared to that in the form of the most common acoustic features, such as MFCC and PLP. The approach has produced high accuracy, both in case of similar sounds and with high SNR.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Meyer, B., Wächter, M., Brand, T., Kollmeier, B.: Phoneme confusions in human and automatic speech recognition. In: Proceedings of Interspeech, pp. 1485–1488 (2007)
Yousafzai, J., Ager, M., Cvetkovic, Z., Sollich, P.: Discriminative and generative machine learning approaches towards robust phoneme classification. In: Proceedings of IEEE Workshop on Information Theory and Application, pp. 471–475 (2008)
Miller, G.A., Nicely, P.E.: An analysis of perceptual confusions among some English consonants. J. Acoust. Soc. Am. 27(2), 338–352 (1955)
Chapelle, O., Schölkopf, B., Zien, A.: Semi-Supervised Learning. MIT Press, Cambridge (2006)
Yakovenko, A., Malykhina, G.: Bio-inspired approach for automatic speaker clustering using auditory modeling and self-organizing maps. Procedia Comput. Sci. 123, 547–552 (2018)
Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall, Upper Saddle River (2001)
Hermansky, H.: Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am. 87(4), 1738–1752 (1990)
Imai, T.: Positional information in neural map development: lessons from the olfactory system. Dev. Growth. Differ 54(3), 358–365 (2012)
Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2001)
Ultsch, A., Mörchen, F.: ESOM-maps: tools for clustering, visualization, and classification with Emergent SOM. Technical Report, Department of Mathematics and Computer Science, University of Marburg, Germany, p. 46 (2005)
Ultsch, A., Lötsch, J.: Machine-learned cluster identification in high-dimensional data. J. Biomed. Inform. 66, 95–104 (2017)
DeSieno, D.: Adding a Conscience to Competitive Learning. In: Proceedings of the Second Annual IEEE International Conference on Neural Networks, pp. 117–124 (1988)
Zhu, X.: Semi-supervised learning with graphs. Doctoral dissertation, Carnegie Mellon University. CMU-LTI-05-192 (2005)
Herrmann, L., Ultsch, A.: Label propagation for semi-supervised learning in self-organizing maps. In: Proceedings of the 6th International Workshop on Self-Organizing Maps (WSOM). Bielefeld University, Germany (2007)
Hawkins, S., Midgley, J.: Formant frequencies of RP monophthongs in four age groups of speakers. J. Int. Phon. Assoc. 35(2), 183–199 (2005)
Meddis, R., et al.: A computer model of the auditory periphery and its application to the study of hearing. In: Proceedings of the 16th International Symposium on Hearing, Cambridge, UK, pp. 23–27 (2012)
Acknowledgments
The reported study was funded by the Russian Foundation for Basic Research according to the research project 18-31-00304.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Yakovenko, A., Sidorenko, E., Malykhina, G. (2019). Semi-supervised Classifying of Modelled Auditory Nerve Patterns for Vowel Stimuli with Additive Noise. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research II. NEUROINFORMATICS 2018. Studies in Computational Intelligence, vol 799. Springer, Cham. https://doi.org/10.1007/978-3-030-01328-8_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-01328-8_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01327-1
Online ISBN: 978-3-030-01328-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)