Reinforcement Learning: Teaching Machines to Learn from Experience
Understanding the Decision-Making Power of AI
Presented by:
RA2311027010051 - DIYA SHARMA
RA2311027010063 - SHRUTI RAJ
RA2311027010065 - HARSH KUMAR
What is Reinforcement Learning?
Definition
Reinforcement learning (RL) is a branch of machine learning in which an agent learns by interacting with an environment.
Key Idea
Learn by trial and error, much as humans and animals do.
Real-Life Examples of RL
AlphaGo defeating human champions
Robots learning to walk or grasp objects
AI mastering video games like Atari, Dota, Minecraft
Self-driving cars navigating roads
Components of Reinforcement Learning
Agent: The learner or decision maker
Environment: The world the agent interacts with
State (s): The current situation of the agent
Action (a): The possible moves the agent can make
Reward (r): Feedback from the environment
Policy (π): The strategy the agent follows
Value (V/Q): The long-term value of a state or action
The Reinforcement Learning Loop
Observe State
Agent perceives current environment state.
Choose Action
Agent selects and performs an action.
Receive Reward
Agent gets feedback from environment.
Update Knowledge
Agent adjusts policy or value estimates.
Repeat
Goal: maximize cumulative future rewards.
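To make the loop concrete, the following minimal Python sketch runs one episode; GridEnv, its one-dimensional grid, and its reward scheme are invented for illustration and do not come from any standard library.

import random

class GridEnv:
    """Toy 1-D corridor: the agent starts at cell 0 and earns reward 1 for reaching cell 4."""
    def reset(self):
        self.pos = 0
        return self.pos

    def step(self, action):
        # action is -1 (move left) or +1 (move right), clipped to the grid edges
        self.pos = max(0, min(4, self.pos + action))
        done = self.pos == 4
        reward = 1.0 if done else 0.0
        return self.pos, reward, done

env = GridEnv()
state = env.reset()                         # 1. Observe state
done = False
while not done:
    action = random.choice([-1, 1])         # 2. Choose action (a random policy here)
    state, reward, done = env.step(action)  # 3. Receive reward and next state
    # 4. Update knowledge would happen here (see the Q-Learning slide)
    # 5. Repeat until the episode ends

A learning agent would replace the random choice with a policy that improves as rewards accumulate.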
Mathematical Foundation: Markov Decision Processes
A Markov Decision Process (MDP) is a mathematical framework used to describe decision-making in situations where outcomes are partly random and partly under the control of a decision-maker (agent).
It provides a formal model for environments in reinforcement learning.
🧱 MDP Components (5-Tuple)
An MDP is defined as a 5-tuple:
MDP = ⟨S, A, P, R, γ⟩
States (S): Possible situations the agent can be in.
Actions (A): Choices available to the agent.
Transition Probabilities (P): Likelihood of moving between states.
Reward Function (R): Feedback received after actions.
Discount Factor (γ): Importance of future rewards.
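As a sketch, the 5-tuple can be written down directly as plain Python data; the two-state weather example below is invented purely to illustrate the shape of each component.

# States, actions, transitions P(s' | s, a), rewards R(s, a), and discount γ
S = ["sunny", "rainy"]
A = ["walk", "drive"]
P = {
    ("sunny", "walk"):  {"sunny": 0.8, "rainy": 0.2},
    ("sunny", "drive"): {"sunny": 0.9, "rainy": 0.1},
    ("rainy", "walk"):  {"sunny": 0.3, "rainy": 0.7},
    ("rainy", "drive"): {"sunny": 0.5, "rainy": 0.5},
}
R = {
    ("sunny", "walk"): 2.0, ("sunny", "drive"): 1.0,
    ("rainy", "walk"): -1.0, ("rainy", "drive"): 0.5,
}
gamma = 0.9
mdp = (S, A, P, R, gamma)

Each row of P sums to 1, reflecting that the next state is drawn from a probability distribution conditioned on the current state and action.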
Types of Reinforcement Learning
Model-Free vs Model-Based: Different approaches to learning environment dynamics.
Value-Based: Example: Q-Learning.
Policy-Based: Example: REINFORCE.
Actor-Critic: Combines value and policy methods. Examples: PPO, A3C.
Q-Learning Algorithm
Q-Learning is a model-free reinforcement learning algorithm that learns the optimal action-selection policy for an agent interacting with an environment. Its goal is to learn a policy that tells the agent the best action to take in each state in order to maximize the total cumulative reward over time.
Key Concepts:
State (s): The condition or configuration of the environment at any given time.
Action (a): The decision or move made by the agent.
Reward (r): The feedback received after taking an action in a state.
Q-value (Q(s, a)): The expected future reward for taking action a in state s. It represents the quality of the action taken in a given state.
Update Rule
Q(s,a) ← Q(s,a) + α [ r + γ max_a' Q(s',a') - Q(s,a) ]
Parameters
α: learning rate
γ: discount factor
r: reward
s', a': next state and next action
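The update rule translates directly into a few lines of Python. Below is a minimal tabular Q-Learning sketch on an invented 1-D grid task; the hyperparameter values (α = 0.1, γ = 0.9, ε = 0.1) are illustrative choices, not prescribed by the algorithm.

import random
from collections import defaultdict

alpha, gamma, epsilon = 0.1, 0.9, 0.1   # learning rate, discount factor, exploration rate
actions = [-1, 1]                       # move left / move right on a 1-D grid
Q = defaultdict(float)                  # Q-table: Q[(state, action)], default 0.0

def step(state, action):
    """Toy environment: reach cell 4 on a 5-cell grid to earn reward 1."""
    next_state = max(0, min(4, state + action))
    reward = 1.0 if next_state == 4 else 0.0
    return next_state, reward, next_state == 4

for episode in range(500):
    state, done = 0, False
    while not done:
        # ε-greedy selection: explore with probability ε, otherwise act greedily
        if random.random() < epsilon:
            action = random.choice(actions)
        else:
            action = max(actions, key=lambda a: Q[(state, a)])
        next_state, reward, done = step(state, action)
        # Q(s,a) ← Q(s,a) + α [ r + γ max_a' Q(s',a') - Q(s,a) ]
        best_next = max(Q[(next_state, a)] for a in actions)
        Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
        state = next_state

After training, the greedy policy max_a Q(s, a) consistently moves right toward the rewarding cell.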
Summary and Next Steps
RL teaches agents to learn from experience.
Key components include the agent, environment, rewards, and policies.
Mathematical foundations guide algorithm design.
Explore advanced RL methods and applications next.