(CS 5008) Reinforcement Learning : Assignment 5
Q1) Consider the problem of approximating the Y values in terms of the X values, i.e., Yi ≈ Xi θ. For a given set of data points (the actual values are not important), such as the one shown in Figure 1, write down the expression for the squared loss L(θ).
Figure 1: Fit a line through origin.
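As a sanity check, here is a minimal NumPy sketch of the loss L(θ) = Σᵢ (Yi − Xi θ)² for a line through the origin; the data values below are made up purely for illustration.

```python
import numpy as np

# Hypothetical 1-D data points; any values would do here.
X = np.array([1.0, 2.0, 3.0, 4.0])
Y = np.array([1.1, 1.9, 3.2, 3.9])

def squared_loss(theta):
    """L(theta) = sum_i (Y_i - X_i * theta)^2."""
    residuals = Y - X * theta
    return np.sum(residuals ** 2)

# The loss is a parabola in theta; its minimiser is the least-squares slope.
theta_star = np.dot(X, Y) / np.dot(X, X)
```

Plotting `squared_loss` over a range of θ values makes the convex, bowl-shaped structure of the loss visible.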
Q2) Now consider the approximation Yi ≈ θ(0) + θ(1) Xi (say, to fit a line for the data set in Figure 2). Suppose we are interested in minimising the squared loss: i) what is the expression for L(θ)? (note that θ = (θ(0), θ(1)) ∈ R2 has two co-ordinates), ii) what is the matrix formulation of the same problem; in particular, what is the X matrix?
Figure 2: Fit a line that does not pass through origin.
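A small sketch of the matrix formulation, with made-up data: the intercept θ(0) is absorbed by prepending a column of ones to the design matrix.

```python
import numpy as np

# Hypothetical data; the point is the shape of the design matrix, not the values.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.0, 2.9, 5.1, 7.0])

# Matrix formulation: each row of X is (1, x_i), so Y ≈ X θ with θ = (θ(0), θ(1)).
X = np.column_stack([np.ones_like(x), x])

def squared_loss(theta):
    return np.sum((y - X @ theta) ** 2)

# Least-squares solution of min_theta ||y - X theta||^2.
theta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
```

The first column of X handles the offset, so fitting a line not through the origin is the same least-squares problem in two parameters.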
Q3) Give an example of (Xi , Yi ), i = 1, . . . , 10, with (Xi , Yi ) ∈ R2 × R, such that the approximation Yi ≈ Xi⊤ θ has nonzero error.
Q4) Consider a 2 × 2 grid with 4 states, and let the features of the positions be X(1) = (1, 1), X(2) = (1, 2), X(3) = (2, 1), X(4) = (2, 2). Write down the loss function to project V (1) = 2, V (2) = 3, V (3) = 3, V (4) = 4, i.e., approximate V ≈ X⊤ θ. Find θ∗ = arg min_{θ∈R2} ‖V − X⊤ θ‖²₂.
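A quick numerical check of this projection, with the rows of the matrix Phi below being the four feature vectors given in the question:

```python
import numpy as np

# Feature matrix: one row per state, as given in the question.
Phi = np.array([[1, 1],
                [1, 2],
                [2, 1],
                [2, 2]], dtype=float)
V = np.array([2, 3, 3, 4], dtype=float)

# theta* = argmin_theta ||V - Phi theta||_2^2, via the normal equations.
theta_star = np.linalg.solve(Phi.T @ Phi, Phi.T @ V)
# For these particular values the fit is exact: Phi @ theta_star reproduces V.
```

Here V happens to lie in the column space of Phi, so the projection error is zero; with other V values the residual would generally be nonzero.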
Q5) What are the eigenvalues of the 3 × 3 all-ones matrix A = [1 1 1; 1 1 1; 1 1 1], and what is an eigenvector?
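A one-liner to check the spectrum of the all-ones matrix numerically:

```python
import numpy as np

A = np.ones((3, 3))

# A is rank 1, so its eigenvalues are 3 (once) and 0 (twice).
eigvals = np.sort(np.linalg.eigvalsh(A))

# (1, 1, 1) is an eigenvector: A v = 3 v.
v = np.ones(3)
```

Since every row of A sums to 3, the all-ones vector is an eigenvector with eigenvalue 3, and the remaining eigenvalues vanish because the rank is 1.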
Q6) Consider a 3-state Markov chain where the probability of going from any state to any state is 1/3. Write down the probability transition matrix P. What is the stationary distribution d, i.e., the d satisfying d⊤ = d⊤ P? What are the eigenvalues of P?
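A numerical check for this chain: every entry of P is 1/3, the uniform distribution is stationary, and P has the same spectrum as the (scaled) all-ones matrix from Q5.

```python
import numpy as np

# Uniform transitions: every entry of P is 1/3.
P = np.full((3, 3), 1.0 / 3.0)

# The uniform distribution is stationary: d^T P = d^T.
d = np.full(3, 1.0 / 3.0)

# P = (1/3) * all-ones matrix, so its eigenvalues are 1, 0, 0.
eigvals = np.sort(np.linalg.eigvals(P).real)
```

The Perron eigenvalue 1 corresponds to the stationary distribution; the repeated 0 eigenvalue reflects the chain forgetting its starting state in a single step.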
Q7) For any real matrix A ∈ Rd×d and vector x ∈ Rd, show that x⊤ A x = x⊤ A⊤ x.
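The identity follows because x⊤Ax is a scalar and therefore equals its own transpose. A quick randomised check (random A and x, purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4))
x = rng.standard_normal(4)

# x^T A x is a 1x1 quantity, hence equal to its transpose x^T A^T x.
lhs = x @ A @ x
rhs = x @ A.T @ x
```

Equivalently, only the symmetric part (A + A⊤)/2 of A contributes to the quadratic form.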
Q8) Let X = [1 2; 3 4] and Y = [4 3; 2 1]. Verify that XY = X1 Y1 + X2 Y2 , where X1 , X2 are the columns of X and Y1 , Y2 are the rows of Y .
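The column-times-row (outer-product) expansion of the matrix product can be verified directly:

```python
import numpy as np

X = np.array([[1, 2],
              [3, 4]])
Y = np.array([[4, 3],
              [2, 1]])

# Outer-product expansion: XY = (col 1 of X)(row 1 of Y) + (col 2 of X)(row 2 of Y).
expansion = np.outer(X[:, 0], Y[0, :]) + np.outer(X[:, 1], Y[1, :])
```

Both sides equal [[8, 5], [20, 13]]; each term of the expansion is a rank-1 matrix.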
Q9) Consider a 2 × 2 grid with 4 states, and let the features of the positions be X1 = (1, 1), X2 = (1, 2), X3 = (2, 1), X4 = (2, 2). The probability of going from any state to any other state is equal. Let s = st be the current state and s′ = st+1 be the next state; calculate E[X(s) X(s′)⊤] for all s = 1, 2, 3, 4.
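A sketch of the computation, under the assumption that "any other state" excludes self-loops, so each of the three other states is reached with probability 1/3 (if self-transitions were allowed, the probabilities would be 1/4 instead):

```python
import numpy as np

# One feature vector per state, as given in the question.
X = np.array([[1, 1],
              [1, 2],
              [2, 1],
              [2, 2]], dtype=float)

# Assumption: from state s the chain jumps to each OTHER state with prob. 1/3.
P = (np.ones((4, 4)) - np.eye(4)) / 3.0

def expected_outer(s):
    """E[X(s) X(s')^T | s_t = s] = sum_{s'} P[s, s'] * outer(X[s], X[s'])."""
    return sum(P[s, sp] * np.outer(X[s], X[sp]) for sp in range(4))

results = [expected_outer(s) for s in range(4)]
```

For example, from state 1 the average next-state feature is ((1,2)+(2,1)+(2,2))/3 = (5/3, 5/3), so E[X(1)X(s′)⊤] has every entry equal to 5/3.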
Q10) Consider learning the mean of a Uniform[0, 1] random variable. We are given i.i.d. samples Yt ∼ Uniform[0, 1], and consider the following update rule:
Vt+1 = Vt + αt (Yt − Vt) (1)
Estimate the expected squared error in Vt after t = 100 steps for a constant step size αt = α = 0.1.
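A Monte-Carlo sketch of this estimate, assuming the initial value V0 = 0 (the question does not specify it):

```python
import numpy as np

rng = np.random.default_rng(0)

def mse_at_T(alpha=0.1, T=100, n_runs=20000):
    """Monte-Carlo estimate of E[(V_T - 1/2)^2] for V <- V + alpha*(Y - V)."""
    V = np.zeros(n_runs)          # assumption: V_0 = 0 in every run
    for _ in range(T):
        Y = rng.random(n_runs)    # Y_t ~ Uniform[0, 1], i.i.d.
        V += alpha * (Y - V)
    return np.mean((V - 0.5) ** 2)

mse = mse_at_T()
# With a constant step size the bias shrinks like (1 - alpha)^T, but the variance
# does not vanish: it settles near alpha/(2 - alpha) * Var(Y) = (0.1/1.9)/12 ≈ 0.0044.
```

By t = 100 the bias term (0.9)¹⁰⁰ · 0.5 ≈ 1.3 × 10⁻⁵ is negligible, so the expected squared error is dominated by this residual variance of roughly 0.0044.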