ML Assignment 2
ML Assignment 2
ML Assignment 2
Reinforcement Learning
Sources:
AI
Reinforcement learning (RL) can be broadly categorized into two main types:
Model-Based and Model-Free. Each type has its own approach to learning
and decision-making.
Q3. Q- learning
1.Introduction
Q-learning is a popular model-free reinforcement learning algorithm that aims
to learn the optimal action-value function, often referred to as the Q-function.
This function estimates the expected future reward for taking a particular
action in a given state. By maximizing the Q-function, the agent can learn to
make decisions that lead to the highest cumulative reward.
s: current state
a: current action
r: reward received
s’: next state
a’: next action
α: learning rate
γ: discount factor
4.Advantages of Q-Learning:
Simple and effective for small action spaces.
Doesn’t require a model of the environment.
Q-learning has been applied to a wide range of problems, including: Game playing,
Robotics, Finance, Healthcare