Integrating Reinforcement Learning and Declarative Programming to Learn Causal Laws in Dynamic Domains

Sridharan, Mohan; Rainge, Sarah

doi:10.1007/978-3-319-11973-1_33

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8755)

Included in the following conference series:

International Conference on Social Robotics

Abstract

Robots deployed to assist and collaborate with humans in complex domains need the ability to represent and reason with incomplete domain knowledge, and to learn from minimal feedback obtained from non-expert human participants. This paper presents an architecture that combines the complementary strengths of Reinforcement Learning (RL) and declarative programming to support such commonsense reasoning and incremental learning of the rules governing the domain dynamics. Answer Set Prolog (ASP), a declarative language, is used to represent domain knowledge. The robot’s current beliefs, obtained by inference in the ASP program, are used to formulate the task of learning previously unknown domain rules as an RL problem. The learned rules are, in turn, encoded in the ASP program and used to plan action sequences for subsequent tasks. The architecture is illustrated and evaluated in the context of a simulated robot that plans action sequences to arrange tabletop objects in desired configurations.
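
The learning step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the RL formulation, so the sketch assumes plain tabular Q-learning, and the toy domain (the states, the action names `put_on_table` and `put_on_ball`, the rewards, and the helper `candidate_rules`) is hypothetical. The robot does not know that a block placed on a ball rolls off; state-action pairs whose learned value ends up negative are reported as candidates for a new domain rule.

```python
import random

# Hypothetical toy domain (an assumption, not taken from the chapter):
# the robot holds a block and can place it on the table or on a ball.
# Placing it on the ball always fails; the robot does not know this rule.
ACTIONS = ["put_on_table", "put_on_ball"]
START, DONE = "holding_block", "done"


def step(state, action):
    """Simulated environment: return (next_state, reward)."""
    if action == "put_on_table":
        return DONE, 1.0   # task accomplished
    return START, -1.0     # block rolls off; the robot picks it up again


def q_learning(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(START, a): 0.0 for a in ACTIONS}
    for _ in range(episodes):
        state = START
        for _ in range(20):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            next_state, reward = step(state, action)
            # The terminal state has value zero.
            if next_state == DONE:
                best_next = 0.0
            else:
                best_next = max(q[(next_state, a)] for a in ACTIONS)
            target = reward + gamma * best_next
            q[(state, action)] += alpha * (target - q[(state, action)])
            if next_state == DONE:
                break
            state = next_state
    return q


def candidate_rules(q, threshold=0.0):
    """State-action pairs with value below the threshold (hypothetical helper)."""
    return sorted((s, a) for (s, a), v in q.items() if v < threshold)


if __name__ == "__main__":
    q_table = q_learning()
    for pair in candidate_rules(q_table):
        print("candidate rule: action discouraged in state", pair)
```

In the architecture the abstract describes, candidates like these would be encoded in the ASP program and used for subsequent planning; how that translation is done is not stated in the abstract and is left out of the sketch.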

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, The University of Auckland, NZ
Mohan Sridharan
Department of Computer Science, Texas Tech University, USA
Sarah Rainge

Editor information

Editors and Affiliations

Technische Universität München, Boltzmannstr. 3, 85748, München, Germany
Michael Beetz
Faculty of Engineering and IT, University of Technology, Sydney, 2007, Ultimo, NSW, Australia
Benjamin Johnston & Mary-Anne Williams

About this paper

Cite this paper

Sridharan, M., Rainge, S. (2014). Integrating Reinforcement Learning and Declarative Programming to Learn Causal Laws in Dynamic Domains. In: Beetz, M., Johnston, B., Williams, MA. (eds) Social Robotics. ICSR 2014. Lecture Notes in Computer Science, vol 8755. Springer, Cham. https://doi.org/10.1007/978-3-319-11973-1_33

DOI: https://doi.org/10.1007/978-3-319-11973-1_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11972-4
Online ISBN: 978-3-319-11973-1
eBook Packages: Computer Science, Computer Science (R0)
