Course Code: 21CSE417T
Course Name: REINFORCEMENT LEARNING TECHNIQUES
Course Category: E (Professional Elective)
L-T-P-C: 2-1-0-3
Pre-requisite Courses: Nil    Co-requisite Courses: Nil    Progressive Courses: Nil
Course Offering Department: School of Computing    Data Book / Codes / Standards: Nil
Course Learning Rationale (CLR): The purpose of learning this course is to:
CLR-1: introduce the fundamentals of Reinforcement Learning
CLR-2: illustrate model-based prediction and control using dynamic programming
CLR-3: illustrate model-free prediction and control
CLR-4: introduce planning and learning with tabular methods
CLR-5: explain approximation of a value function

Program Outcomes (PO): PO-1 Engineering Knowledge, PO-2 Problem Analysis, PO-3 Design/Development of Solutions, PO-4 Conduct Investigations of Complex Problems, PO-5 Modern Tool Usage, PO-6 The Engineer and Society, PO-7 Environment & Sustainability, PO-8 Ethics, PO-9 Individual & Team Work, PO-10 Communication, PO-11 Project Mgt. & Finance, PO-12 Life Long Learning
Program Specific Outcomes (PSO): PSO-1, PSO-2, PSO-3
Course Outcomes (CO): At the end of this course, learners will be able to:
CO-1: understand basic concepts of reinforcement learning (PO-1: 3, PO-2: 2, PO-4: 2, PSO-3: 2)
CO-2: perform model-based prediction and control using dynamic programming (PO-1: 3, PO-2: 3, PO-4: 3, PSO-3: 2)
CO-3: apply model-free prediction and control (PO-1: 3, PO-2: 3, PO-4: 3, PSO-3: 3)
CO-4: comprehend the use of tabular methods (PO-1: 3, PO-2: 3, PO-4: 3, PSO-3: 3)
CO-5: understand how a value function can be approximated (PO-1: 3, PO-2: 3, PO-4: 3, PSO-3: 3)
Unit-1 - Introduction 9 Hour
Introduction to Reinforcement Learning, examples - Elements of reinforcement learning - Limitations and Scope - An extended example - multi-armed bandits - k-armed bandit problem - action-value methods - the
10-armed testbed - incremental implementation - tracking a nonstationary problem - optimistic initial values - upper-confidence-bound action selection - associative search (contextual bandits)
T1: Implementing the 10-armed testbed
T2: Comparing performance for different values of ε
T3: Upper-confidence-bound action selection performance comparison with ε-greedy
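A minimal Python sketch (NumPy assumed; not part of the prescribed syllabus) of the kind of experiment T1 and T2 call for: an ε-greedy agent with incremental sample-average updates on a 10-armed testbed, compared across a few illustrative ε values.

```python
# Hypothetical sketch of T1/T2: epsilon-greedy action-value estimation on a
# 10-armed testbed with incremental sample-average updates.
import numpy as np

def run_bandit(k=10, steps=1000, epsilon=0.1, rng=None):
    rng = rng or np.random.default_rng(0)
    q_true = rng.normal(0.0, 1.0, k)          # true action values q*(a)
    q_est = np.zeros(k)                       # estimates Q(a)
    counts = np.zeros(k)                      # N(a)
    rewards = np.empty(steps)
    for t in range(steps):
        if rng.random() < epsilon:            # explore
            a = int(rng.integers(k))
        else:                                 # exploit
            a = int(np.argmax(q_est))
        r = rng.normal(q_true[a], 1.0)        # reward ~ N(q*(a), 1)
        counts[a] += 1
        q_est[a] += (r - q_est[a]) / counts[a]   # incremental implementation
        rewards[t] = r
    return rewards

# T2-style comparison: average reward over 200 independent bandit problems
for eps in (0.0, 0.01, 0.1):
    avg = np.mean([run_bandit(epsilon=eps, rng=np.random.default_rng(s)).mean()
                   for s in range(200)])
    print(f"epsilon={eps}: mean reward {avg:.3f}")
```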
Unit-2 - Markov Decision Process and Model-Based Prediction and Control 9 Hour
Finite Markov Decision Process - The Agent–Environment Interface - Goals and Rewards - Returns and Episodes - Unified Notation for Episodic and Continuing Tasks - Policies and Value Functions - Optimal
Policies and Optimal Value Functions - Optimality and Approximation - Dynamic Programming - Policy Evaluation (Prediction) - Policy Improvement - Policy Iteration - Value Iteration - Generalized Policy Iteration -
Efficiency of Dynamic Programming - Asynchronous Dynamic Programming
T4: MDP for Recycling Robot
T5: Policies and value functions for Gridworld example
T6: Policy evaluation for Gridworld example
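As one possible illustration of T5/T6 (a sketch, not the prescribed solution), the following Python/NumPy code runs iterative policy evaluation for the equiprobable random policy on a 4x4 gridworld with reward -1 per step and terminal states in two opposite corners; the grid size and stopping threshold are assumptions.

```python
# Hypothetical sketch of T5/T6: iterative policy evaluation on a 4x4 gridworld.
import numpy as np

N = 4
TERMINAL = {(0, 0), (N - 1, N - 1)}
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

def step(s, a):
    if s in TERMINAL:
        return s, 0.0
    r, c = s[0] + a[0], s[1] + a[1]
    if not (0 <= r < N and 0 <= c < N):       # bumping a wall leaves the state unchanged
        r, c = s
    return (r, c), -1.0

def policy_evaluation(theta=1e-4, gamma=1.0):
    V = np.zeros((N, N))
    while True:
        delta = 0.0
        for row in range(N):
            for col in range(N):
                s = (row, col)
                if s in TERMINAL:
                    continue
                v_new = sum(0.25 * (rew + gamma * V[s2])      # equiprobable random policy
                            for s2, rew in (step(s, a) for a in ACTIONS))
                delta = max(delta, abs(v_new - V[s]))
                V[s] = v_new                   # in-place sweep
        if delta < theta:
            return V

print(np.round(policy_evaluation(), 1))
```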
Unit-3 - Model-Free Prediction and Control 9 Hour
Model-free learning - Model-free prediction - Monte Carlo methods - Monte Carlo Prediction - Monte Carlo Estimation of Action Values - Temporal-Difference Learning - TD Prediction - Advantages of TD Prediction
Methods - Optimality of TD(0) - n-step Bootstrapping - n-step TD Prediction - n-step Sarsa - Model-free control - Monte Carlo Control - Monte Carlo Control without Exploring Starts - Off-policy learning - Importance
sampling - Off-policy Monte Carlo Control - Sarsa: On-policy TD Control - Q-learning: Off-policy TD control
T7: Monte Carlo Policy Evaluation for Blackjack
T8: TD Prediction for Driving Home example
T9: Sarsa vs Q-learning using Cliff Walking example
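A minimal sketch of T9 (Python/NumPy assumed; the grid dimensions, step size, and ε are illustrative choices, not specified by the syllabus): tabular Sarsa and Q-learning on a cliff-walking gridworld, differing only in the bootstrap target.

```python
# Hypothetical sketch of T9: Sarsa vs Q-learning on 4x12 cliff walking.
# Stepping onto the cliff gives -100 and resets the agent to the start.
import numpy as np

ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]

def step(s, a):
    r = min(max(s[0] + a[0], 0), ROWS - 1)
    c = min(max(s[1] + a[1], 0), COLS - 1)
    if r == 3 and 0 < c < 11:                 # fell off the cliff
        return START, -100.0, False
    return (r, c), -1.0, (r, c) == GOAL

def eps_greedy(Q, s, eps, rng):
    if rng.random() < eps:
        return int(rng.integers(len(ACTIONS)))
    return int(np.argmax(Q[s]))

def train(method="q_learning", episodes=500, alpha=0.5, gamma=1.0, eps=0.1, seed=0):
    rng = np.random.default_rng(seed)
    Q = np.zeros((ROWS, COLS, len(ACTIONS)))
    for _ in range(episodes):
        s = START
        a = eps_greedy(Q, s, eps, rng)
        done = False
        while not done:
            s2, r, done = step(s, ACTIONS[a])
            a2 = eps_greedy(Q, s2, eps, rng)
            if method == "q_learning":        # off-policy target: greedy over Q(s', .)
                target = r + gamma * np.max(Q[s2]) * (not done)
            else:                             # Sarsa: on-policy target uses the action taken next
                target = r + gamma * Q[s2][a2] * (not done)
            Q[s][a] += alpha * (target - Q[s][a])
            s, a = s2, a2
    return Q

for m in ("q_learning", "sarsa"):
    Q = train(m)
    print(m, "greedy value at start:", round(float(np.max(Q[START])), 1))
```

Sarsa bootstraps on the action actually taken next (on-policy), while Q-learning bootstraps on the greedy action (off-policy), which is why the two methods learn different routes along the cliff.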
Unit-4 - Planning and Learning with Tabular Methods 9 Hour
Models and planning - Dyna: Integrated Planning, Acting and Learning - When the model is wrong - Prioritized Sweeping - Real-time Dynamic Programming - Monte Carlo Tree Search
T10: Simple maze using Dyna-Q
T11: Prioritized sweeping on Maze example
T12: Real-time Dynamic Programming for Racetrack example
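A minimal sketch of T10 (Python/NumPy assumed; the maze layout, +1 goal reward, and number of planning updates are illustrative assumptions): Dyna-Q interleaves direct Q-learning updates from real experience with planning updates drawn from a learned deterministic model.

```python
# Hypothetical sketch of T10: Dyna-Q on a small deterministic maze.
import numpy as np

ROWS, COLS = 6, 9
START, GOAL = (2, 0), (0, 8)
WALLS = {(1, 2), (2, 2), (3, 2), (4, 5), (0, 7), (1, 7), (2, 7)}   # assumed layout
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]

def step(s, a):
    r = min(max(s[0] + a[0], 0), ROWS - 1)
    c = min(max(s[1] + a[1], 0), COLS - 1)
    if (r, c) in WALLS:
        r, c = s
    return (r, c), (1.0 if (r, c) == GOAL else 0.0), (r, c) == GOAL

def dyna_q(episodes=50, n_planning=10, alpha=0.1, gamma=0.95, eps=0.1, seed=0):
    rng = np.random.default_rng(seed)
    Q = np.zeros((ROWS, COLS, len(ACTIONS)))
    model = {}                                   # (state, action) -> (reward, next state)
    for _ in range(episodes):
        s, done = START, False
        while not done:
            a = (int(rng.integers(len(ACTIONS))) if rng.random() < eps
                 else int(np.argmax(Q[s])))
            s2, r, done = step(s, ACTIONS[a])
            Q[s][a] += alpha * (r + gamma * np.max(Q[s2]) * (not done) - Q[s][a])
            model[(s, a)] = (r, s2)              # deterministic model update
            for _ in range(n_planning):          # planning with simulated experience
                (ps, pa), (pr, ps2) = list(model.items())[rng.integers(len(model))]
                # Q at the goal stays zero, so no terminal flag is needed here.
                Q[ps][pa] += alpha * (pr + gamma * np.max(Q[ps2]) - Q[ps][pa])
            s = s2
    return Q

Q = dyna_q()
print("Greedy value at start:", round(float(np.max(Q[START])), 3))
```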
Unit-5 - Value Function Approximation 9 Hour
On-policy Prediction with Approximation - Value Function Approximation - The Prediction Objective (VE) - Stochastic-gradient and Semi-gradient Methods - Linear Methods - Least-Squares TD
T13: State aggregation on the 1000-state Random Walk
T14: Bootstrapping on the 1000-state Random Walk
T15: Least squares TD example
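A minimal sketch of T13 (Python/NumPy assumed; step size and episode count are illustrative): semi-gradient TD(0) with state aggregation on the 1000-state random walk, where each group of 100 states shares one weight.

```python
# Hypothetical sketch of T13: semi-gradient TD(0) with state aggregation on
# the 1000-state random walk (start at 500; each step jumps up to 100 states
# left or right; terminating off the left end gives -1, off the right end +1).
import numpy as np

N_STATES, START, GROUP = 1000, 500, 100       # 10 groups of 100 states each

def group(s):
    return (s - 1) // GROUP                   # aggregated feature index

def semi_gradient_td0(episodes=5000, alpha=0.01, gamma=1.0, seed=0):
    rng = np.random.default_rng(seed)
    w = np.zeros(N_STATES // GROUP)           # one weight per group
    for _ in range(episodes):
        s = START
        while True:
            jump = int(rng.integers(1, GROUP + 1)) * (1 if rng.random() < 0.5 else -1)
            s2 = s + jump
            if s2 < 1:                        # terminated off the left end
                r, v_next, done = -1.0, 0.0, True
            elif s2 > N_STATES:               # terminated off the right end
                r, v_next, done = 1.0, 0.0, True
            else:
                r, v_next, done = 0.0, w[group(s2)], False
            w[group(s)] += alpha * (r + gamma * v_next - w[group(s)])   # semi-gradient TD(0)
            if done:
                break
            s = s2
    return w

print(np.round(semi_gradient_td0(), 2))       # approximate value of each group
```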
Learning Resources
1. Richard S. Sutton and Andrew G. Barto, Reinforcement Learning: An Introduction, 2nd edition, The MIT Press, 2015.
2. Martijn van Otterlo, Marco Wiering, Reinforcement Learning: State-of-the-Art, Springer-Verlag Berlin Heidelberg, 2012.
3. Stuart J. Russell and Peter Norvig, Artificial Intelligence: A Modern Approach, 3rd edition, Pearson, 2015.
4. I. Goodfellow, Y. Bengio, A. Courville, Deep Learning, MIT Press Ltd., 2016.
5. https://deepmind.com/learning-resources/-introduction-reinforcement-learning-david-silver
6. Reinforcement Learning with MATLAB, MathWorks Inc., 2020.
Learning Assessment
                          Continuous Learning Assessment (CLA)                         Summative
Bloom's                   Formative                      Life-Long Learning            Final Examination
Level of Thinking         CLA-1 Average of unit test     CLA-2                         (40% weightage)
                          (50%)                          (10%)
                          Theory      Practice           Theory      Practice          Theory      Practice
Level 1  Remember         40%         -                  40%         -                 40%         -
Level 2  Understand       40%         -                  40%         -                 40%         -
Level 3  Apply            20%         -                  20%         -                 20%         -
Level 4  Analyze          -           -                  -           -                 -           -
Level 5  Evaluate         -           -                  -           -                 -           -
Level 6  Create           -           -                  -           -                 -           -
Total                     100%                           100%                          100%
Course Designers
Experts from Industry: Mr. Ghulam Ahmed Ansari, Applied Research Engineer, LinkedIn
Experts from Higher Technical Institutions: Dr. Manikantan Srinivasan, Adjunct Faculty, CSE, IIT Madras
Internal Experts: Dr. Saad Y. Sait, SRMIST