"A policy gradient method for semi-Markov decision processes with ..."

Sumeetpal S. Singh, Vladislav Z. B. Tadic, Arnaud Doucet (2007)

Details and statistics

DOI: 10.1016/J.EJOR.2006.02.023

access: closed

type: Journal Article

metadata version: 2023-09-30