Keywords: Deep Reinforcement Learning, UC Berkeley
This repository contains some personal Deep Reinforcement Learning class notes in .pdf format.
- Class notes content
1.1 CS285 (UC Berkeley) - Deep Reinforcement Learning by Sergey Levine - Resources
These notes summarize the main Reinforcement Learning algorithms, both in theory and in practice with some tips & hacks for efficient implementation.
WIP
- Goal
- Algorithms
2.1 DAgger: Dataset Aggregation - Tips & hacks
- Goal
- Algorithms
2.1 Global structure
2.2 Exemples
- Algorithms
1.1. REINFORCE - Tips & hacks
- Algorithms
1.1. Batch Actor-Critic
1.2. Online Actor-Critic - Tips & hacks
- Algorithms
1.1. Policy iteration
1.2. Policy iteration with Dynamic programming
1.3. Value iteration
1.4. Fitted Value iteration
1.5. Fitted Q-iteration
1.6. Online Q-iteration - Tips & hacks
- Algorithms
1.1. Q-learning with replay buffer
1.2. Q-learning with replay buffer and target network
1.3. DQN: classic Deep Q-learning
1.4. DDPG: Q-learning for continuous actions - Tips & hacks
WIP
WIP
- Algorithms
1.1. Model-based Reinforcement Learning version 0.5
1.2. Model-based Reinforcement Learning version 1.0
1.3. Model-based Reinforcement Learning version 1.5
1.4. Model-based Reinforcement Learning with latent space models - Tips & hacks
- Algorithms
1.1. Model-based Reinforcement Learning version 2.0
1.2. DYNA: online Q-learning model-free Reinforcement Learning with a model
1.3. General DYNA-style model-based Reinforcement Learning
1.4. MBA: Model-based Acceleration – MVE: Model-based Value Ex- pansion – MBPO: Model-based Policy Optimization
1.5. Divide and Conquer Reinforcement Learning - Tips & hacks
WIP
- Algorithms
1.1 Soft Q-learning
- Algorithms
1.1. Maximum Entropy Inverse Reinforcement Learning - Tips & hacks
- Tips & hacks
WIP
- Algorithms
1.1. Pre-train & finetune - Tips & hacks
WIP
WIP
These notes were widely inpspired by: