Computer Science > Machine Learning

arXiv:2210.11579 (cs)

[Submitted on 20 Oct 2022]

Title: Model-based Lifelong Reinforcement Learning with Bayesian Exploration

Authors: Haotian Fu, Shangqun Yu, Michael Littman, George Konidaris

Abstract: We propose a model-based lifelong reinforcement-learning approach that estimates a hierarchical Bayesian posterior distilling the common structure shared across different tasks. The learned posterior combined with a sample-based Bayesian exploration procedure increases the sample efficiency of learning across a family of related tasks. We first derive an analysis of the relationship between the sample complexity and the initialization quality of the posterior in the finite MDP setting. We next scale the approach to continuous-state domains by introducing a Variational Bayesian Lifelong Reinforcement Learning algorithm that can be combined with recent model-based deep RL methods, and that exhibits backward transfer. Experimental results on several challenging domains show that our algorithms achieve both better forward and backward transfer performance than state-of-the-art lifelong RL methods.

Comments:	Accepted to NeurIPS 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2210.11579 [cs.LG]
	(or arXiv:2210.11579v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.11579

Submission history

From: Haotian Fu
[v1] Thu, 20 Oct 2022 20:40:47 UTC (4,204 KB)
