0% found this document useful (0 votes)

29 views8 pages

Reinforcement Learning

Uploaded by

ashima.arya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views8 pages

Reinforcement Learning

Uploaded by

ashima.arya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

What is Reinforcement Learning?

Reinforcement Learning is defined as a Machine Learning method that is

concerned with how software agents should take actions in an environment.
Reinforcement Learning is a part of the deep learning method that helps you to
maximize some portion of the cumulative reward.
This neural network learning method helps you to learn how to attain a complex
objective or maximize a specific dimension over many steps.

mportant Components of Deep Reinforcement

Learning Method

Here are some important terms used in Reinforcement AI:

 Agent: It is an assumed entity which performs actions in an environment

to gain some reward.
 Environment (e): A scenario that an agent has to face.
 Reward (R): An immediate return given to an agent when he or she
performs specific action or task.
 State (s): State refers to the current situation returned by the
environment.
 Policy (π): It is a strategy which applies by the agent to decide the next
action based on the current state.
 Value (V): It is expected long-term return with discount, as compared to
the short-term reward.
 Value Function: It specifies the value of a state that is the total amount of
reward. It is an agent which should be expected beginning from that
state.
 Model of the environment: This mimics the behavior of the
environment. It helps you to make inferences to be made and also
determine how the environment will behave.
 Model based methods: It is a method for solving reinforcement learning
problems which use model-based methods.
 Q value or action value (Q): Q value is quite similar to value. The only
difference between the two is that it takes an additional parameter as a
current action.

How Reinforcement Learning works?

Let’s see some simple example which helps you to illustrate the reinforcement
learning mechanism.

Consider the scenario of teaching new tricks to your cat

10 Most Common Interview Questions and Answers ????

 As cat doesn’t understand English or any other human language, we can’t

tell her directly what to do. Instead, we follow a different strategy.
 We emulate a situation, and the cat tries to respond in many different
ways. If the cat’s response is the desired way, we will give her fish.
 Now whenever the cat is exposed to the same situation, the cat executes
a similar action with even more enthusiastically in expectation of getting
more reward(food).
 That’s like learning that cat gets from “what to do” from positive
experiences.
 At the same time, the cat also learns what not do when faced with
negative experiences.
Example of Reinforcement Learning

How Reinforcement Learning works

In this case,

 Your cat is an agent that is exposed to the environment. In this case, it is

your house. An example of a state could be your cat sitting, and you use a
specific word in for cat to walk.
 Our agent reacts by performing an action transition from one “state” to
another “state.”
 For example, your cat goes from sitting to walking.
 The reaction of an agent is an action, and the policy is a method of
selecting an action given a state in expectation of better outcomes.
 After the transition, they may get a reward or penalty in return.

Reinforcement Learning Algorithms

There are three approaches to implement a Reinforcement Learning algorithm.

Value-Based:
In a value-based Reinforcement Learning method, you should try to maximize a
value function V(s). In this method, the agent is expecting a long-term return of
the current states under policy π.
Policy-based:
In a policy-based RL method, you try to come up with such a policy that the
action performed in every state helps you to gain maximum reward in the
future.

Two types of policy-based methods are:

 Deterministic: For any state, the same action is produced by the policy π.
 Stochastic: Every action has a certain probability, which is determined by
the following equation.Stochastic Policy :
n{a\s) = P\A, = a\S, =S]
Model-Based:
In this Reinforcement Learning method, you need to create a virtual model for
each environment. The agent learns to perform in that specific environment.

Characteristics of Reinforcement Learning

Here are important characteristics of reinforcement learning

 There is no supervisor, only a real number or reward signal

 Sequential decision making
 Time plays a crucial role in Reinforcement problems
 Feedback is always delayed, not instantaneous
 Agent’s actions determine the subsequent data it receives

Types of Reinforcement Learning

Two types of reinforcement learning methods are:

Positive:
It is defined as an event, that occurs because of specific behavior. It increases
the strength and the frequency of the behavior and impacts positively on the
action taken by the agent.

This type of Reinforcement helps you to maximize performance and sustain

change for a more extended period. However, too much Reinforcement may
lead to over-optimization of state, which can affect the results.
Negative:
Negative Reinforcement is defined as strengthening of behavior that occurs
because of a negative condition which should have stopped or avoided. It helps
you to define the minimum stand of performance. However, the drawback of
this method is that it provides enough to meet up the minimum behavior.

Learning Models of Reinforcement

There are two important learning models in reinforcement learning:

 Markov Decision Process

 Q learning

Markov Decision Process

The following parameters are used to get a solution:

 Set of actions- A
 Set of states -S
 Reward- R
 Policy- n
 Value- V

The mathematical approach for mapping a solution in reinforcement Learning is

recon as a Markov Decision Process or (MDP).
Q-Learning
Q learning is a value-based method of supplying information to inform which
action an agent should take.

Let’s understand this method by the following example:

 There are five rooms in a building which are connected by doors.

 Each room is numbered 0 to 4
 The outside of the building can be one big outside area (5)
 Doors number 1 and 4 lead into the building from room 5

Next, you need to associate a reward value to each door:

 Doors which lead directly to the goal have a reward of 100

 Doors which is not directly connected to the target room gives zero
reward
 As doors are two-way, and two arrows are assigned for each room
 Every arrow in the above image contains an instant reward value

Explanation:

In this image, you can view that room represents a state

Agent’s movement from one room to another represents an action

In the below-given image, a state is described as a node, while the arrows show
the action.
For example, an agent traverse from room number 2 to 5

 Initial state = state 2

 State 2-> state 3
 State 3 -> state (2,1,4)
 State 4-> state (0,5,3)
 State 1-> state (5,3)
 State 0-> state 4

Reinforcement Learning vs. Supervised Learning

Parameters Reinforcement Learning Supervised Learning

reinforcement learning helps you to take your In this method, a decision is

Decision style
decisions sequentially. input given at the beginning

Works on Works on interacting with the environment. Works on examples or given

In RL method learning decision is dependent. Supervised learning the dec

Dependency on
Therefore, you should give labels to all the independent of each other,
decision
dependent decisions. for every decision.

Supports and work better in AI, where human It is mostly operated with an
Best suited
interaction is prevalent. software system or applicat
Parameters Reinforcement Learning Supervised Learning

Example Chess game Object recognition

Applications of Reinforcement Learning

Here are applications of Reinforcement Learning:

 Robotics for industrial automation.

 Business strategy planning
 Machine learning and data processing
 It helps you to create training systems that provide custom instruction
and materials according to the requirement of students.
 Aircraft control and robot motion control

Why use Reinforcement Learning?

Here are prime reasons for using Reinforcement Learning:

 It helps you to find which situation needs an action

 Helps you to discover which action yields the highest reward over the
longer period.
 Reinforcement Learning also provides the learning agent with a reward
function.
 It also allows it to figure out the best method for obtaining large rewards.

When Not to Use Reinforcement Learning?

You can’t apply reinforcement learning model is all the situation. Here are some
conditions when you should not use reinforcement learning model.

Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Annex 2 HG School Implementation Tool Sample SY 2021 2022
100% (1)
Annex 2 HG School Implementation Tool Sample SY 2021 2022
2 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
12 pages
RL Vishnu Sankar
No ratings yet
RL Vishnu Sankar
26 pages
L-14 - Reinforcement-L-d-07062024-111949am
No ratings yet
L-14 - Reinforcement-L-d-07062024-111949am
22 pages
Reinforcement learning is an autonomous
No ratings yet
Reinforcement learning is an autonomous
3 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
5 pages
Reinforcement Learning
100% (1)
Reinforcement Learning
25 pages
Unit-5
No ratings yet
Unit-5
58 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
11 pages
Unit-5 Mla
No ratings yet
Unit-5 Mla
22 pages
Introduction To Reinforcement Learning: Presented by - Rohit Mahto
No ratings yet
Introduction To Reinforcement Learning: Presented by - Rohit Mahto
9 pages
Unit V Reinforcement Learning and Genetic Algorithm
No ratings yet
Unit V Reinforcement Learning and Genetic Algorithm
40 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
15 pages
UNIT-4
No ratings yet
UNIT-4
56 pages
Unit 5 ML 3year
No ratings yet
Unit 5 ML 3year
17 pages
21ai020 & Reinforcement Learning UNIT 1-LM:1
No ratings yet
21ai020 & Reinforcement Learning UNIT 1-LM:1
8 pages
ML Assignment 2
No ratings yet
ML Assignment 2
6 pages
RL Unit 1
100% (1)
RL Unit 1
26 pages
unit4(AI)2024.docx-1
No ratings yet
unit4(AI)2024.docx-1
22 pages
3GP ML Reinforcement Learning
No ratings yet
3GP ML Reinforcement Learning
3 pages
Sara Reinforcement Learning
No ratings yet
Sara Reinforcement Learning
69 pages
Exp-14 Reinforcement Learning
No ratings yet
Exp-14 Reinforcement Learning
11 pages
Reinforcement Learning: Nazia Bibi
100% (1)
Reinforcement Learning: Nazia Bibi
61 pages
Winter Semester 2023-24_CSE4037_ETH_AP2023246000594_2024-01-05_Reference-Material-I
No ratings yet
Winter Semester 2023-24_CSE4037_ETH_AP2023246000594_2024-01-05_Reference-Material-I
35 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
2 pages
RL & DL Notes
No ratings yet
RL & DL Notes
73 pages
UNIT-3
No ratings yet
UNIT-3
29 pages
Assignment_15_Modern_AI
No ratings yet
Assignment_15_Modern_AI
3 pages
RL & DL Notes
No ratings yet
RL & DL Notes
43 pages
Reinforcement learning
No ratings yet
Reinforcement learning
9 pages
Module_1 - Reinforcement Learning and Markov Decision Process
No ratings yet
Module_1 - Reinforcement Learning and Markov Decision Process
19 pages
Unit - 5 Re-Inforcement Learning
No ratings yet
Unit - 5 Re-Inforcement Learning
3 pages
Unit 5
No ratings yet
Unit 5
45 pages
AI unit -3.docx
No ratings yet
AI unit -3.docx
102 pages
Unit 5 - Reinforcement Learning
No ratings yet
Unit 5 - Reinforcement Learning
15 pages
ML-10
No ratings yet
ML-10
9 pages
SL-Week01
No ratings yet
SL-Week01
13 pages
Reinforcement
No ratings yet
Reinforcement
9 pages
Unit I
No ratings yet
Unit I
8 pages
REINFORCEMENT LEARNING-1
No ratings yet
REINFORCEMENT LEARNING-1
19 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
32 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
2 pages
Types of Data:: Reference Website
No ratings yet
Types of Data:: Reference Website
15 pages
Reinforcement learning
No ratings yet
Reinforcement learning
7 pages
Unit 6
No ratings yet
Unit 6
34 pages
UNIT-V-Reinforcement Learning
No ratings yet
UNIT-V-Reinforcement Learning
4 pages
Reinforced Learning
No ratings yet
Reinforced Learning
25 pages
Reinforcement Learning MY101
No ratings yet
Reinforcement Learning MY101
15 pages
Reinforcement Learning (RL) : Agent
No ratings yet
Reinforcement Learning (RL) : Agent
35 pages
R22ML-5
No ratings yet
R22ML-5
24 pages
IntroductiontoRL-BR
No ratings yet
IntroductiontoRL-BR
22 pages
AI Week 15
No ratings yet
AI Week 15
3 pages
UNIT V reinforcement learning
No ratings yet
UNIT V reinforcement learning
8 pages
Unit-5 (AI)
No ratings yet
Unit-5 (AI)
21 pages
Reinforcement Learning - Basics
No ratings yet
Reinforcement Learning - Basics
7 pages
Unit 5-1
No ratings yet
Unit 5-1
8 pages
RL Week_1
No ratings yet
RL Week_1
53 pages
Reinforcement learning-WPS Office
No ratings yet
Reinforcement learning-WPS Office
1 page
First Reinforcement Learning Blog Post
No ratings yet
First Reinforcement Learning Blog Post
2 pages
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
From Everand
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
Luka Nikolic
No ratings yet
G3 12abm-B Research Paper
No ratings yet
G3 12abm-B Research Paper
27 pages
Math Workshop Model
No ratings yet
Math Workshop Model
6 pages
LAC Reflection Journal
No ratings yet
LAC Reflection Journal
8 pages
ST Xavier's College, Ahmedabad: Quick Facts of Course
No ratings yet
ST Xavier's College, Ahmedabad: Quick Facts of Course
3 pages
The Role of Technology in Modern Education
No ratings yet
The Role of Technology in Modern Education
2 pages
20-Minute High-Impact Survey
No ratings yet
20-Minute High-Impact Survey
1 page
Crop Tool and Lasso Tool Lesson Plan
No ratings yet
Crop Tool and Lasso Tool Lesson Plan
2 pages
DLP 4.1 - 4.2
No ratings yet
DLP 4.1 - 4.2
4 pages
C P Ob Og 2017
No ratings yet
C P Ob Og 2017
15 pages
Field Study 5.
100% (1)
Field Study 5.
46 pages
RESEARCH METHODOLOGY
No ratings yet
RESEARCH METHODOLOGY
6 pages
Lesson 23
No ratings yet
Lesson 23
2 pages
Week 4 - Accreditation and Outcome Based Learning - Unit 7
No ratings yet
Week 4 - Accreditation and Outcome Based Learning - Unit 7
7 pages
New PPTX Presentation (3) 2
No ratings yet
New PPTX Presentation (3) 2
1 page
(I) Course Introduction and Overview: DR Premalatha.P Assistant Professor SH&M Nit Ap
No ratings yet
(I) Course Introduction and Overview: DR Premalatha.P Assistant Professor SH&M Nit Ap
67 pages
Gagne - Conditions of Learning
No ratings yet
Gagne - Conditions of Learning
29 pages
University of Moratuwa: Department of Electronic and Telecommunication Engineering
No ratings yet
University of Moratuwa: Department of Electronic and Telecommunication Engineering
1 page
A Narrative On The Orientation of The Learning Delivery Modalities Course
No ratings yet
A Narrative On The Orientation of The Learning Delivery Modalities Course
5 pages
Republic of The Philippines Department of Education: Bonifacio Javier National High School
No ratings yet
Republic of The Philippines Department of Education: Bonifacio Javier National High School
13 pages
DepEd Widens Learners' Access To Quality Education Through Alternative Delivery Mode - BusinessMirror
No ratings yet
DepEd Widens Learners' Access To Quality Education Through Alternative Delivery Mode - BusinessMirror
1 page
What Matters Now
No ratings yet
What Matters Now
1 page
Week 5-General Biology 1
No ratings yet
Week 5-General Biology 1
2 pages
Guitar Sight Reading Specimen For Grade 4 Examination
100% (2)
Guitar Sight Reading Specimen For Grade 4 Examination
2 pages
Kennedy-Lugar Youth Exchange and Study (YES) Program Preliminary Application 2018 - 2019
No ratings yet
Kennedy-Lugar Youth Exchange and Study (YES) Program Preliminary Application 2018 - 2019
4 pages
Effect of The Blended Learning Approach On Pupils Science Academic Performance in Eti-Osa Local Govt Area.
No ratings yet
Effect of The Blended Learning Approach On Pupils Science Academic Performance in Eti-Osa Local Govt Area.
45 pages
Feduc 06 750328
No ratings yet
Feduc 06 750328
7 pages
DLP
No ratings yet
DLP
2 pages
Transferable Skills: What Skills and Qualities Are Important To Employers?
No ratings yet
Transferable Skills: What Skills and Qualities Are Important To Employers?
2 pages
Communication Skills Training Material English PDF
0% (4)
Communication Skills Training Material English PDF
2 pages