Abstract
We have implemented a speech command system which can understand simple command sentences like “Bot lift ball” or “Bot go table” using hidden Markov models (HMMs) and associative memories with sparse distributed representations. The system is composed of three modules: (1) A set of HMMs is used on phoneme level to get a phonetic transcription of the spoken sentence, (2) a network of associative memories is used to determine the word belonging to the phonetic transcription and (3) a neural network is used on the sentence level to determine the meaning of the sentence. The system is also able to learn new object words during performance.
Chapter PDF
Similar content being viewed by others
Keywords
References
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice-Hall, Inc, Englewood Cliffs (1993)
Young, S., et al.: The HTK Book (for HTK Version 3.2.1). Cambridge University Engineering Department, Cambridge (2002)
Hebb, D.O.: The Organization of Behaviour. John Wiley, New York (1949)
Schwenker, F., Sommer, F., Palm, G.: Iterative Retrieval of Sparsely Coded Associative Memory Patterns. Neural Networks 9, 445–455 (1996)
Willshaw, D., Buneman, O., Longuet-Higgins, H.: Non-holographic Associative Memory. Nature 222, 960–962 (1969)
Palm, G.: On Associative Memory. Biological Cybernetics 36, 19–31 (1980)
Palm, G.: Memory Capacities of Local Rules for Synaptic Modification. A Comparative Review, Concepts in Neuroscience 2, 97–128 (1991)
Knoblauch, A., Palm, G.: Pattern Separation and Synchronization in Spiking Associative Memories and Visual Areas. Neural Networks 14, 763–780 (2001)
TIMIT Acoustic-Phonetic Continuous Speech Corpus. National Institute of Standards and Technology, Speech Disc 1-1.1, NTIS Order No. PB91-505065 (1990)
Markert, H., Knoblauch, A., Palm, G.: Modelling of syntactical processing in the cortex. Biosystems 89, 300–315 (2007)
Fay, R., Kaufmann, U., Knoblauch, A., Markert, H., Palm, G.: Combining Visual Attention, Object Recognition and Associative Information Processing in a Neurobotic System. In: Wermter, S., Palm, G., Elshaw, M. (eds.) Biomimetic Neural Learning for Intelligent Robots. LNCS (LNAI), vol. 3575, pp. 118–143. Springer, Heidelberg (2005)
Arbib, M.A., Billard, A., Iacoboni, M., Oztop, E.: Synthetic brain imaging: grasping, mirror neurons and imitation. Neural Networks 13(8/9), 931–997 (2000)
Roy, D.: Learning visually grounded words and syntax for a scene description task. Comput. Speech Lang. 16(3), 353–385 (2002)
Kirchmar, J.L., Edelman, G.: Machine psychology: autonomous behavior, perceptual categorization and conditioning in a brain-based device. Cereb. Cortex 12(8), 818–830 (2002)
Billard, A., Hayes, G.: DRAMA, a connectionist architecture for control and learning in autonomous robots. Adapt. Behav. J. 7(1), 35–64 (1999)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Markert, H., Kayikci, Z.K., Palm, G. (2008). Sentence Understanding and Learning of New Words with Large-Scale Neural Networks. In: Prevost, L., Marinai, S., Schwenker, F. (eds) Artificial Neural Networks in Pattern Recognition. ANNPR 2008. Lecture Notes in Computer Science(), vol 5064. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69939-2_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-69939-2_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69938-5
Online ISBN: 978-3-540-69939-2
eBook Packages: Computer ScienceComputer Science (R0)