Abstract
This chapter presents a generic internal reward system that drives an agent to increase the complexity of its behavior. This reward system does not reinforce a predefined task. Its purpose is to drive the agent to progress in learning given its embodiment and the environment in which it is placed. The dynamics created by such a system are studied first in a simple environment and then in the context of active vision.
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kaplan, F., Oudeyer, PY. (2004). Maximizing Learning Progress: An Internal Reward System for Development. In: Iida, F., Pfeifer, R., Steels, L., Kuniyoshi, Y. (eds) Embodied Artificial Intelligence. Lecture Notes in Computer Science, vol 3139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27833-7_19
DOI: https://doi.org/10.1007/978-3-540-27833-7_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22484-6
Online ISBN: 978-3-540-27833-7
eBook Packages: Springer Book Archive