Improving Reinforcement Learning by Using Case Based Heuristics

Reinaldo A. C. Bianchi^21,22,
Raquel Ros²² &
Ramon Lopez de Mantaras²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5650))

Included in the following conference series:

International Conference on Case-Based Reasoning

931 Accesses
17 Citations

Abstract

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and Reinforcement Learning (RL) techniques. This approach, called Case Based Heuristically Accelerated Reinforcement Learning (CB-HARL), builds upon an emerging technique, the Heuristic Accelerated Reinforcement Learning (HARL), in which RL methods are accelerated by making use of heuristic information. CB-HARL is a subset of RL that makes use of a heuristic function derived from a case base, in a Case Based Reasoning manner. An algorithm that incorporates CBR techniques into the Heuristically Accelerated Q–Learning is also proposed. Empirical evaluations were conducted in a simulator for the RoboCup Four-Legged Soccer Competition, and results obtained shows that using CB-HARL, the agents learn faster than using either RL or HARL methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning

Article 30 October 2017

Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

A fast hybrid reinforcement learning framework with human corrective feedback

Article Open access 09 August 2018

References

Aamodt, A., Plaza, E.: Case-based reasoning: foundational issues, methodological variations, and system approaches. AI Commun. 7(1), 39–59 (1994)
Google Scholar
de Mántaras, R.L., McSherry, D., Bridge, D., Leake, D., Smyth, B., Craw, S., Faltings, B., Maher, M.L., Cox, M.T., Forbus, K., Keane, M., Aamodt, A., Watson, I.: Retrieval, reuse, revision and retention in case-based reasoning. Knowl. Eng. Rev. 20(3), 215–240 (2005)
Article Google Scholar
Hennessy, D., Hinkle, D.: Applying case-based reasoning to autoclave loading. IEEE Expert: Intelligent Systems and Their Applications 7(5), 21–26 (1992)
Article Google Scholar
Althoff, K.D., Bergmann, R., Wess, S., Manago, M., Auriol, E., Larichev, O.I., Bolotov, A., Zhuravlev, Y.I., Gurov, S.I.: Case-based reasoning for medical decision support tasks: The inreca approach. In: Artificial Intelligence in Medicine, January 1998, pp. 25–41 (1998)
Google Scholar
López de Mántaras, R., Cunningham, P., Perner, P.: Emergent case-based reasoning applications. Knowl. Eng. Rev. 20(3), 325–328 (2005)
Article Google Scholar
Szepesvári, C., Littman, M.L.: Generalized markov decision processes: Dynamic-programming and reinforcement-learning algorithms. Technical report, Brown University, CS-96-11 (1996)
Google Scholar
Littman, M.L., Szepesvári, C.: A generalized reinforcement learning model: convergence and applications. In: Proceedings of the 13th International Conference on Machine Learning (ICML 1996), pp. 310–318 (1996)
Google Scholar
Bianchi, R.A.C., Ribeiro, C.H.C., Costa, A.H.R.: Accelerating autonomous learning by using heuristic selection of actions. Journal of Heuristics 14(2), 135–168 (2008)
Article MATH Google Scholar
Bianchi, R.A.C., Ribeiro, C.H.C., Costa, A.H.R.: Heuristic selection of actions in multiagent reinforcement learning. In: Veloso, M.M. (ed.) IJCAI 2007, Proceedings of the 20th International Joint Conference on Artificial Intelligence, Hyderabad, India, January 6-12, pp. 690–695 (2007)
Google Scholar
RoboCup Technical Committee: Standard platform league homepage (2009), http://www.tzi.de/spl
Watkins, C.J.C.H.: Learning from Delayed Rewards. PhD thesis, University of Cambridge (1989)
Google Scholar
Celiberto, L.A., Ribeiro, C.H.C., Costa, A.H.R., Bianchi, R.A.C.: Heuristic reinforcement learning applied to robocup simulation agents. In: Visser, U., Ribeiro, F., Ohashi, T., Dellaert, F. (eds.) RoboCup 2007: Robot Soccer World Cup XI. LNCS, vol. 5001, pp. 220–227. Springer, Heidelberg (2008)
Chapter Google Scholar
Ros, R., Arcos, J.L., de Mantaras, R.L., Veloso, M.: A case-based approach for coordinated action selection in robot soccer. Artificial Intelligence 173(9-10), 1014–1039 (2009)
Article Google Scholar
Ros, R., de Mántaras, R.L., Arcos, J.L., Veloso, M.: Team playing behavior in robot soccer: A case-based approach. In: Weber, R.O., Richter, M.M. (eds.) ICCBR 2007. LNCS (LNAI), vol. 4626, pp. 46–60. Springer, Heidelberg (2007)
Chapter Google Scholar
Ros, R., Arcos, J.L.: Acquiring a robust case base for the robot soccer domain. In: Veloso, M. (ed.) Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007), pp. 1029–1034. AAAI Press, Menlo Park (2007)
Google Scholar
Ros, R.: Action Selection in Cooperative Robot Soccer using Case-Based Reasoning. PhD thesis, Universitat Autònoma de Barcelona, Barcelona (2008)
Google Scholar
Lin, Y., Liu, A., Chen, K.: A hybrid architecture of case-based reasoning and fuzzy behavioral control applied to robot soccer. In: Workshop on Artificial Intelligence, International Computer Symposium (ICS 2002), Hualien, Taiwan, National Dong Hwa University, National Dong Hwa University (2002)
Google Scholar
Ahmadi, M., Lamjiri, A.K., Nevisi, M.M., Habibi, J., Badie, K.: Using a two-layered case-based reasoning for prediction in soccer coach. In: Arabnia, H.R., Kozerenko, E.B. (eds.) MLMTA, pp. 181–185. CSREA Press (2003)
Google Scholar
Karol, A., Nebel, B., Stanton, C., Williams, M.A.: Case based game play in the robocup four-legged league part i the theoretical model. In: Polani, D., Browning, B., Bonarini, A., Yoshida, K. (eds.) RoboCup 2003. LNCS, vol. 3020, pp. 739–747. Springer, Heidelberg (2004)
Chapter Google Scholar
Drummond, C.: Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research 16, 59–104 (2002)
MATH Google Scholar
Sharma, M., Holmes, M., Santamaría, J.C., Irani, A., Isbell Jr., C.L., Ram, A.: Transfer learning in real-time strategy games using hybrid cbr/rl. In: Veloso, M.M. (ed.) IJCAI 2007, Proceedings of the 20th International Joint Conference on Artificial Intelligence, Hyderabad, India, January 6-12, pp. 1041–1046 (2007)
Google Scholar
Gabel, T., Riedmiller, M.A.: CBR for state value function approximation in reinforcement learning. In: Muñoz-Avila, H., Ricci, F. (eds.) ICCBR 2005. LNCS, vol. 3620, pp. 206–221. Springer, Heidelberg (2005)
Chapter Google Scholar
Juell, P., Paulson, P.: Using reinforcement learning for similarity assessment in case-based systems. IEEE Intelligent Systems 18(4), 60–67 (2003)
Article Google Scholar
Auslander, B., Lee-Urban, S., Hogg, C., Muñoz-Avila, H.: Recognizing the enemy: Combining reinforcement learning with strategy selection using case-based reasoning. In: Althoff, K.D., Bergmann, R., Minor, M., Hanft, A. (eds.) ECCBR 2008. LNCS, vol. 5239, pp. 59–73. Springer, Heidelberg (2008)
Chapter Google Scholar
Li, Y., Zonghai, C., Feng, C.: A case-based reinforcement learning for probe robot path planning. In: 4th World Congress on Intelligent Control and Automation, Shanghai, China, pp. 1161–1165 (2002)
Google Scholar
von Hessling, A., Goel, A.K.: Abstracting reusable cases from reinforcement learning. In: Brüninghaus, S. (ed.) ICCBR Workshops, pp. 227–236 (2005)
Google Scholar
Veloso, M., Rybski, P.E., Chernova, S., McMillen, C., Fasola, J., von Hundelshausen, F., Vail, D., Trevor, A., Hauert, S., Ros, R.: Cmdash 2005: Team report. Technical report, School of Computer Science, Carnegie Mellon University (2005)
Google Scholar
RoboCup Technical Committee: RoboCup Four-Legged League Rule Book (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Centro Universitário da FEI, São Bernardo do Campo, Brazil
Reinaldo A. C. Bianchi
Artificial Intelligence Research Institute (IIIA-CSIC), Bellaterra, Spain
Reinaldo A. C. Bianchi, Raquel Ros & Ramon Lopez de Mantaras

Authors

Reinaldo A. C. Bianchi
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Ros
View author publications
You can also search for this author in PubMed Google Scholar
Ramon Lopez de Mantaras
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Adaptive Information Cluster, School of Computer Science & Informatics, University College Dublin,, Computer Science Building, Belfield, Dublin 4, Ireland
Lorraine McGinty
Department of Software and Information Systems, College of Computing and Informatics, University of North Carolina at Charlotte, 9201 University City Boulevard, NC 28223-0001, Charlotte, USA
David C. Wilson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bianchi, R.A.C., Ros, R., Lopez de Mantaras, R. (2009). Improving Reinforcement Learning by Using Case Based Heuristics. In: McGinty, L., Wilson, D.C. (eds) Case-Based Reasoning Research and Development. ICCBR 2009. Lecture Notes in Computer Science(), vol 5650. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02998-1_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-02998-1_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02997-4
Online ISBN: 978-3-642-02998-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improving Reinforcement Learning by Using Case Based Heuristics

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning

Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

A fast hybrid reinforcement learning framework with human corrective feedback

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Improving Reinforcement Learning by Using Case Based Heuristics

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Heuristically Accelerated Reinforcement Learning by Means of Case-Based Reasoning and Transfer Learning

Heuristically-Accelerated Reinforcement Learning: A Comparative Analysis of Performance

A fast hybrid reinforcement learning framework with human corrective feedback

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation