Optimistic Sampling Strategy for Data-Efficient Reinforcement Learning.

scholar.google.com › citations

… model-based reinforcement learning through optimistic …
Curi · Cited by 94

… for data-efficient reinforcement learning
Schwarzer · Cited by 139

… optimistic perspective on offline reinforcement learning
Agarwal · Cited by 586

Optimistic Sampling Strategy for Data-Efficient Reinforcement ...

Apr 24, 2019 · The agent selects the most informative samples using an optimization method. This way, the initial sample is more informative than in random and ...

(PDF) Optimistic Sampling Strategy for Data-Efficient Reinforcement ...

www.researchgate.net › publication › 33...

The agent selects the most informative samples using an optimization method. This way, the initial sample is more informative than in random and fixed strategy.

Optimistic Sampling Strategy for Data-Efficient Reinforcement ...

ieeexplore.ieee.org › iel7

ABSTRACT A high required number of interactions with the environment is one of the most important problems in reinforcement learning.

[PDF] Efficient Model-Based Reinforcement Learning through Optimistic Policy ...

proceedings.neurips.cc › paper › file

Model-based reinforcement learning algorithms with probabilistic dynamical models are amongst the most data-efficient learning methods. This is often.

Bigger, Regularized, Optimistic: scaling for compute and sample ...

arxiv.org › html

May 25, 2024 · Sample efficiency in Reinforcement Learning (RL) has traditionally been driven by algorithmic enhancements. In this work, we demonstrate that ...

On Sample-Efficient Offline Reinforcement Learning: Data ...

openreview.net › forum

Sep 21, 2023 · The paper considers a new notion of diversity in offline RL, and shows the efficiency of regularization optimization and posterior sampling ...

Efficient and scalable reinforcement learning via hypermodel

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Sample-Efficient Reinforcement Learning by Breaking the Replay ...

Robust On-Policy Sampling for Data-Efficient Policy Evaluation in ...

More results from openreview.net

[PDF] Efficient Model-Based Reinforcement Learning through Optimistic Policy ...

las.inf.ethz.ch › files › curi20_hucrl

Model-based reinforcement learning algorithms with probabilistic dynamical models are amongst the most data-efficient learning methods. This is often.

Off-policy Reinforcement Learning with Optimistic Exploration and ...

arxiv.org › cs

Oct 22, 2021 · Improving the sample efficiency of reinforcement learning algorithms requires effective exploration. Following the principle of \textit{optimism ...

[PDF] Sample Efficient Reinforcement Learning with Gaussian Processes

proceedings.mlr.press › grande14

Abstract. This paper derives sample complexity results for using Gaussian Processes (GPs) in both model- based and model-free reinforcement learning.

Towards Sample Efficient Reinforcement... | ERA

era.library.ualberta.ca › items

To attain optimistic value function estimation without resorting to a UCB-style bonus, we introduce a reward sampling procedure that guarantees optimism in the ...

Scholarly articles for Optimistic Sampling Strategy for Data-Efficient Reinforcement Learning.

Optimistic Sampling Strategy for Data-Efficient Reinforcement ...

(PDF) Optimistic Sampling Strategy for Data-Efficient Reinforcement ...

Optimistic Sampling Strategy for Data-Efficient Reinforcement ...

[PDF] Efficient Model-Based Reinforcement Learning through Optimistic Policy ...

Bigger, Regularized, Optimistic: scaling for compute and sample ...

On Sample-Efficient Offline Reinforcement Learning: Data ...

[PDF] Efficient Model-Based Reinforcement Learning through Optimistic Policy ...

Off-policy Reinforcement Learning with Optimistic Exploration and ...

[PDF] Sample Efficient Reinforcement Learning with Gaussian Processes

Towards Sample Efficient Reinforcement... | ERA