(CS 5008) Reinforcement Learning: Assignment 3: t+1 T t+1 T 0 0 0 4

This document contains 6 questions related to Markov chains and reinforcement learning. The questions cover: 1) calculating the distribution after 4 time steps of a random walk on integers with 0.5 probability of moving left or right, 2) determining the number of states and transition matrix for a queue with maximum length 9, 3) verifying that the distribution at time t+1 is also a distribution if the distribution at time t is a distribution, 4) using Bayes' rule to calculate the probability of the initial state given observations, 5) calculating the probability of the state at time t given observations up to time k<t, and 6) working through an example of rain and umbrellas for filtering, prediction, smoothing and maximum likelihood

Uploaded by

Sparsh Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views1 page

(CS 5008) Reinforcement Learning: Assignment 3: t+1 T t+1 T 0 0 0 4

Uploaded by

Sparsh Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

(CS 5008) Reinforcement Learning : Assignment 3

Markov Chain

Q1) Consider a random walk on the set of integer Z described as follow: P rob(st+1 = s + 1|st =
s) = 0.5 and P rob(st+1 = s−1|st = s) = 0.5. Start from various initial distributions µ0 (remember
s0 ∼ µ0 ), and find out µ4 , the distribution after 4 time steps.
Q2) Consider a single queue with maximum length n = 9. What is the total number of states? Let the
queue evolve in discrete time steps t = 0, 1, . . ., and let probability of arrival of a customer between
times t and t + 1 be p, and let the probability that a customer is serviced between t and t + 1 be q.
Also, let arrival and service be independent of each other. Describe the probability transition matrix
for this system.
Q3) We know that µ>
t+1 = µt P. Verify that µt+1 is a distribution if µt is a distribution.
Q4) Consider the filtering problem discussed in the class? Verify that P rob(s0 |o0 , s0 ∼ µ0 ) is
nothing but the Bayes rule.
Q5) Consider the filtering problem discussed in the class? How will we find P rob(st |ok , s0 ∼ µ0 ),
where k < t.
Q6) Please work out the “rain and umbrella" explained in class for i) Filtering ii) Prediction iii)
Smoothing and iv) Maximum likelihood sequence (Viterbi Algorithm). Play around with different
numbers.

Tutorial 3: Markov Chain: Universiti Tunku Abdul Rahman (Utar) UDPS 2133mathematical Programming
No ratings yet
Tutorial 3: Markov Chain: Universiti Tunku Abdul Rahman (Utar) UDPS 2133mathematical Programming
4 pages
Assignment 1 - PE - Applied Statistics
No ratings yet
Assignment 1 - PE - Applied Statistics
2 pages
Machine Learning Exam Prep
No ratings yet
Machine Learning Exam Prep
35 pages
U3 Markov Chain
No ratings yet
U3 Markov Chain
15 pages
MA 2213 - Tutorial 3
No ratings yet
MA 2213 - Tutorial 3
2 pages
Directed Work On Markov Process - 073928
No ratings yet
Directed Work On Markov Process - 073928
13 pages
Lecture07 HMM S
No ratings yet
Lecture07 HMM S
26 pages
Chapter 1
No ratings yet
Chapter 1
13 pages
Hidden Markov Models
No ratings yet
Hidden Markov Models
51 pages
Markov Chain Problems and Solutions
No ratings yet
Markov Chain Problems and Solutions
5 pages
Lecture 29 - P - DS
No ratings yet
Lecture 29 - P - DS
3 pages
SS 2020/2021 St. Year 3D Test: 3. For Each Graph of The Given CTMC
No ratings yet
SS 2020/2021 St. Year 3D Test: 3. For Each Graph of The Given CTMC
1 page
Cosm QB-5
No ratings yet
Cosm QB-5
2 pages
Solution Set P DS 11
No ratings yet
Solution Set P DS 11
3 pages
Probability and Random Processes PS13
No ratings yet
Probability and Random Processes PS13
1 page
Fuzzy Stat Prob
No ratings yet
Fuzzy Stat Prob
24 pages
Dissecting Reinforcement Learning-Part8
No ratings yet
Dissecting Reinforcement Learning-Part8
16 pages
Lec7 - 10 - HMM Learning
No ratings yet
Lec7 - 10 - HMM Learning
88 pages
Markov Chains
No ratings yet
Markov Chains
73 pages
Stanford - Discrete Time Markov Chains PDF
No ratings yet
Stanford - Discrete Time Markov Chains PDF
23 pages
Programming Assignment 5
No ratings yet
Programming Assignment 5
8 pages
APS Assignment
No ratings yet
APS Assignment
2 pages
Bioinformatics HMM Updated
No ratings yet
Bioinformatics HMM Updated
28 pages
Markov Decission Process. Unit 3
No ratings yet
Markov Decission Process. Unit 3
37 pages
hw3 Solution
No ratings yet
hw3 Solution
7 pages
Solution Set P DS 13
No ratings yet
Solution Set P DS 13
11 pages
Latihan 4 Februari 2025
No ratings yet
Latihan 4 Februari 2025
1 page
Markov Chain Models: BMI/CS 576 WWW - Biostat.wisc - Edu/bmi576/ Cdewey@biostat - Wisc.edu Fall 2010
No ratings yet
Markov Chain Models: BMI/CS 576 WWW - Biostat.wisc - Edu/bmi576/ Cdewey@biostat - Wisc.edu Fall 2010
36 pages
19MAT301 - Practice Sheet 2 & 3
No ratings yet
19MAT301 - Practice Sheet 2 & 3
10 pages
Lec2 Markov ChainsI
No ratings yet
Lec2 Markov ChainsI
48 pages
6761 4 MarkovChains
No ratings yet
6761 4 MarkovChains
76 pages
Ee5110 Quiz 3
No ratings yet
Ee5110 Quiz 3
2 pages
502
No ratings yet
502
3 pages
ps3 Mathy
No ratings yet
ps3 Mathy
14 pages
Assignment 2 Search
No ratings yet
Assignment 2 Search
5 pages
ps3 Sol
No ratings yet
ps3 Sol
21 pages
Bioinformatics-Lesson 07 - Hidden Markov Model
No ratings yet
Bioinformatics-Lesson 07 - Hidden Markov Model
28 pages
Lec7 MarkovChains
No ratings yet
Lec7 MarkovChains
14 pages
Classproblem2 PDF
No ratings yet
Classproblem2 PDF
2 pages
Problems Markov Chains
No ratings yet
Problems Markov Chains
35 pages
CS 229, Summer 2019 Problem Set #3 Solutions
No ratings yet
CS 229, Summer 2019 Problem Set #3 Solutions
19 pages
ProblemsMarkovChains PDF
No ratings yet
ProblemsMarkovChains PDF
35 pages
Continuous-Time Markov Chains: Solved Problems
No ratings yet
Continuous-Time Markov Chains: Solved Problems
1 page
Chaptert 02 Markov Chain
No ratings yet
Chaptert 02 Markov Chain
13 pages
Week 5
No ratings yet
Week 5
8 pages
Sol A6 MarkovChain
No ratings yet
Sol A6 MarkovChain
24 pages
Ioc Ai Ese 2023 24 13 04 2024
No ratings yet
Ioc Ai Ese 2023 24 13 04 2024
3 pages
DSBD Unit-Ii 2
No ratings yet
DSBD Unit-Ii 2
47 pages
Markov Chains
No ratings yet
Markov Chains
22 pages
Markov Chains
No ratings yet
Markov Chains
6 pages
Markov Chains and Stochastic Processes Tutorial
No ratings yet
Markov Chains and Stochastic Processes Tutorial
4 pages
4703 07 Notes MC PDF
No ratings yet
4703 07 Notes MC PDF
7 pages
Solution Review Set P DS
No ratings yet
Solution Review Set P DS
10 pages
Solution Set P DS 10
No ratings yet
Solution Set P DS 10
4 pages
Tutorial 1
No ratings yet
Tutorial 1
3 pages
Markov Chains Differentiations
No ratings yet
Markov Chains Differentiations
17 pages
Discrete Time Markov Chains & Algorithms
No ratings yet
Discrete Time Markov Chains & Algorithms
30 pages
Gene Finding and HMMS: 6.096 - Algorithms For Computational Biology - Lecture 7
No ratings yet
Gene Finding and HMMS: 6.096 - Algorithms For Computational Biology - Lecture 7
69 pages
TR19 003
No ratings yet
TR19 003
99 pages
Mreza
No ratings yet
Mreza
21 pages
3 of 3 - Trade Secretes and Conflict of Interest, CSR PDF
No ratings yet
3 of 3 - Trade Secretes and Conflict of Interest, CSR PDF
38 pages
PP Online Test2 Aw Sample Essays and Commentary Issue
No ratings yet
PP Online Test2 Aw Sample Essays and Commentary Issue
8 pages
(CS 5008) Reinforcement Learning: Assignment 5
No ratings yet
(CS 5008) Reinforcement Learning: Assignment 5
2 pages
1 of 3. PHILOSOPHY OF TECHNOLOGY
No ratings yet
1 of 3. PHILOSOPHY OF TECHNOLOGY
3 pages
(CS 5008) Reinforcement Learning: Assignment 1: 1 Single Random Variable
No ratings yet
(CS 5008) Reinforcement Learning: Assignment 1: 1 Single Random Variable
1 page

(CS 5008) Reinforcement Learning: Assignment 3: t+1 T t+1 T 0 0 0 4

Uploaded by

(CS 5008) Reinforcement Learning: Assignment 3: t+1 T t+1 T 0 0 0 4

Uploaded by

(CS 5008) Reinforcement Learning : Assignment 3

You might also like