High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
-
Updated
Aug 3, 2023 - Python
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
A collection of robotics simulation environments for reinforcement learning
Clean single-file implementation of offline RL algorithms in JAX
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
PyTorch Implementation of Offline Reinforcement Learning algorithms
Non-modular implementation of common RL algorithms
Learning from Sparse Offline Datasets via Conservative Density Estimation (ICLR 2024)
Add a description, image, and links to the d4rl topic page so that developers can more easily learn about it.
To associate your repository with the d4rl topic, visit your repo's landing page and select "manage topics."