Lecture1 Introduction Part1

The document provides an overview of reinforcement learning (RL) and its distinction from other machine learning branches, such as supervised and unsupervised learning. It emphasizes the unique characteristics of RL, including the absence of a supervisor, delayed feedback, and the impact of an agent's actions on future states. Additionally, it discusses the applications of RL in various fields, including gaming, robotics, and autonomous systems.

Uploaded by

mscai2024.avinesh


Introduction to Reinforcement Learning
Revolution History
Branches of ML
Branches of ML - Supervised Learning
• In supervised learning, models learn from labeled training data, where input-output pairs are provided.
• The algorithm generalizes from this labeled data to make predictions or classifications on new, unseen data.
• Commonly used in tasks like image recognition, natural language processing, and regression analysis.
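As a minimal sketch of learning from input-output pairs, the toy classifier below labels a new input with the label of its nearest training input (1-nearest-neighbour). The data, the labels, and the function name `predict` are made up for illustration and are not part of the lecture.

```python
# Toy supervised learning: 1-nearest-neighbour classification.
# All data below is illustrative, not taken from the lecture.

def predict(train, x):
    """Return the label of the training input closest to x."""
    _, nearest_label = min(train, key=lambda pair: abs(pair[0] - x))
    return nearest_label

# Labeled training data: (input, output) pairs.
train = [(1.0, "small"), (2.0, "small"), (8.0, "large"), (9.0, "large")]

# Predictions on new, unseen inputs.
print(predict(train, 1.5))
print(predict(train, 7.0))
```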
Branches of ML - Unsupervised Learning
• Unsupervised learning (UL) deals with unlabeled data, aiming to discover patterns, structures, or relationships within the data itself.
• Clustering and dimensionality reduction are common tasks in UL.
• Applications include customer segmentation, anomaly detection, and feature extraction.
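To make clustering concrete, here is a small sketch of k-means on one-dimensional data. No labels are given; the algorithm only groups nearby points. The points, the starting centers, and the function name `kmeans_1d` are assumptions chosen for illustration.

```python
# Toy unsupervised learning: k-means clustering on 1-D points.
# Points and initial centers are illustrative assumptions.

def kmeans_1d(points, centers, iterations=10):
    """Alternate between assigning points to the nearest center
    and moving each center to the mean of its assigned points."""
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            idx = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[idx].append(p)
        # An empty cluster keeps its previous center.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters

points = [1.0, 1.2, 0.8, 9.0, 9.5, 8.5]
centers, clusters = kmeans_1d(points, centers=[0.0, 5.0])
print(centers)
print(clusters)
```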
Branches of ML - Reinforcement Learning
• RL involves an agent learning to make decisions by interacting with an environment.
• The agent receives feedback in the form of rewards or penalties, which guides it toward optimal decision-making strategies.
• RL is well-suited for scenarios where actions influence future states, making it applicable in gaming, robotics, and autonomous systems.
Can Machines Think?

The Imitation Game – movie

"Computing Machinery and Intelligence" – paper

What is intelligence, according to you?

– To be able to make decisions to achieve a goal

What is RL?
Example

Learning by interacting with the environment

RL Characteristics
• What makes reinforcement learning different from other machine learning paradigms?
– There is no supervisor, only a reward signal
– Feedback is delayed, not instantaneous
– Time really matters: the data is sequential
– The agent's actions affect the subsequent data it receives
Agent Environment Loop
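The loop can be sketched in a few lines: the agent picks an action, the environment returns a new observation and a reward, and this repeats until the episode ends. Everything below (the `CorridorEnv` class, its `reset`/`step` methods, and both policies) is a hypothetical toy written for illustration, not an API from the lecture or from any library.

```python
import random

class CorridorEnv:
    """Hypothetical toy environment: walk from position 0 to `goal`."""

    def __init__(self, goal=3):
        self.goal = goal
        self.position = 0

    def reset(self):
        self.position = 0
        return self.position

    def step(self, action):
        # action is -1 (left) or +1 (right); position cannot go below 0.
        self.position = max(0, self.position + action)
        done = self.position == self.goal
        reward = 1.0 if done else 0.0  # reward arrives only at the goal
        return self.position, reward, done

def run_episode(env, policy, max_steps=100):
    """Run one agent-environment loop; return (total reward, steps taken)."""
    observation = env.reset()
    total_reward = 0.0
    steps = 0
    for _ in range(max_steps):
        action = policy(observation)                    # agent acts
        observation, reward, done = env.step(action)    # environment responds
        total_reward += reward
        steps += 1
        if done:
            break
    return total_reward, steps

def always_right(observation):
    return 1

def random_policy(observation):
    return random.choice([-1, 1])

print(run_episode(CorridorEnv(), always_right))  # (1.0, 3)
print(run_episode(CorridorEnv(), random_policy))
```

Note how the sketch reflects the characteristics listed earlier: there is no supervisor telling the agent the correct action, the reward is delayed until the goal is reached, and each action changes the observation the agent sees next.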
Reward Hypothesis
• Any goal can be formalized as the outcome of maximizing a cumulative reward
• Equivalently, we can minimize a cumulative penalty (a negative reward)
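A short sketch of what "cumulative" means here: the agent compares whole sequences of rewards, not single steps. The two reward sequences and their names are invented for illustration. The example also shows that maximizing total reward and minimizing total penalty (taken as the negated reward) select the same option.

```python
# Illustrative only: the reward sequences below are made up.

def cumulative_reward(rewards):
    """Sum of all rewards collected along one sequence of steps."""
    return sum(rewards)

candidates = {
    "plan_a": [0, 0, 5],  # nothing early, large delayed reward
    "plan_b": [1, 1, 1],  # small immediate rewards
}

# Maximize cumulative reward.
best_by_reward = max(
    candidates, key=lambda name: cumulative_reward(candidates[name])
)

# Minimize cumulative penalty, where penalty = -reward.
best_by_penalty = min(
    candidates,
    key=lambda name: cumulative_reward([-r for r in candidates[name]]),
)

print(best_by_reward, best_by_penalty)  # plan_a plan_a
```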

RL Problems
• Flying a helicopter – reward: inverse distance
• Walking robot – reward: distance, speed
• Board games – maximize the score, or +1 (win) / -1 (lose)
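The reward signals named above could be written as small functions. This is only a sketch under stated assumptions: the slide does not say which distance the helicopter reward uses, so `distance_to_target` is a hypothetical quantity, the small `eps` is added to avoid division by zero, and the 0 reward for any outcome other than a win or loss is an assumption.

```python
# Hypothetical reward functions for two of the problems above.

def helicopter_reward(distance_to_target, eps=1e-6):
    """Inverse-distance reward: larger when the distance is smaller.
    `distance_to_target` and `eps` are illustrative assumptions."""
    return 1.0 / (distance_to_target + eps)

def board_game_reward(outcome):
    """+1 for a win, -1 for a loss; 0 otherwise (an assumption)."""
    return {"win": 1.0, "lose": -1.0}.get(outcome, 0.0)

print(helicopter_reward(1.0) > helicopter_reward(2.0))  # True
print(board_game_reward("win"), board_game_reward("lose"))  # 1.0 -1.0
```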
Reasons to learn
• Find a solution
– A program that plays chess very well
– A manufacturing robot with a specific purpose
• Adapt online to handle unforeseen circumstances
– A chess program that learns to adapt to you
– Candy Crush
– A robot that learns to navigate unknown terrains
What is RL?

A science and framework for learning to make decisions from interaction

Thank You
