Maximizing Learning Progress: An Internal Reward System for Development

  • Chapter in the book Embodied Artificial Intelligence

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3139)

Abstract

This chapter presents a generic internal reward system that drives an agent to increase the complexity of its behavior. This reward system does not reinforce a predefined task. Its purpose is to drive the agent to progress in learning given its embodiment and the environment in which it is placed. The dynamics created by such a system are studied first in a simple environment and then in the context of active vision.
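The mechanism sketched in the abstract — rewarding the agent for *progress in learning* rather than for completing a predefined task — can be illustrated with a short sketch. This is not the authors' implementation; the sliding-window size and the use of the decrease in average prediction error as the reward signal are assumptions made here for illustration only.

```python
class LearningProgressReward:
    """Intrinsic reward equal to the recent decrease in prediction error.

    The agent is rewarded not for succeeding at an external task but for
    getting better at predicting the consequences of its actions: the
    reward is the drop in mean prediction error between two consecutive
    windows of experience.
    """

    def __init__(self, window: int = 5):
        self.window = window
        self.errors: list[float] = []

    def reward(self, prediction_error: float) -> float:
        """Record the latest prediction error and return the intrinsic reward."""
        self.errors.append(prediction_error)
        if len(self.errors) < 2 * self.window:
            return 0.0  # not enough history yet to estimate progress
        recent = self.errors[-self.window:]
        older = self.errors[-2 * self.window:-self.window]
        # Positive when errors are shrinking (learning progress),
        # negative when they are growing, zero when nothing changes.
        return sum(older) / self.window - sum(recent) / self.window
```

An agent maximizing such a reward is pushed toward situations that are neither already predictable (error near zero, no progress left) nor unlearnable (error stays high, again no progress) — the regime in which the dynamics studied in the chapter arise.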


Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kaplan, F., Oudeyer, P.-Y. (2004). Maximizing Learning Progress: An Internal Reward System for Development. In: Iida, F., Pfeifer, R., Steels, L., Kuniyoshi, Y. (eds.) Embodied Artificial Intelligence. Lecture Notes in Computer Science, vol. 3139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27833-7_19

  • DOI: https://doi.org/10.1007/978-3-540-27833-7_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22484-6

  • Online ISBN: 978-3-540-27833-7
