Abstract
In the RoboCupRescue simulation, the PoliceForce agents have to decide which roads to clear to help other agents navigate in the city. In this article, we present how we have modelled their environment as a POMDP and, more importantly, our new online POMDP algorithm that enables them to make good decisions in real time during the simulation. Our algorithm is based on a look-ahead search to find the best action to execute at each cycle; we thus avoid the overwhelming complexity of computing a policy for every possible situation. To show the efficiency of our algorithm, we present results on standard POMDPs and in the RoboCupRescue simulation environment.
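To illustrate the kind of online look-ahead search the abstract describes, here is a minimal sketch of depth-limited expectimax over belief states. It uses the classic two-state "tiger" POMDP as a stand-in domain, not the authors' RoboCupRescue model, and the function names (`belief_update`, `lookahead`) are illustrative, not taken from the paper.

```python
# A minimal online POMDP look-ahead sketch on the classic "tiger" problem.
# The agent stands before two doors; a tiger hides behind one. Listening is
# noisy (85% accurate) and costs -1; opening the wrong door costs -100.

S = ["tiger-left", "tiger-right"]            # hidden states
A = ["listen", "open-left", "open-right"]    # actions
O = ["hear-left", "hear-right"]              # observations

def T(s, a, s2):
    """Transition model: listening leaves the state unchanged;
    opening a door resets the problem (tiger re-placed uniformly)."""
    if a == "listen":
        return 1.0 if s == s2 else 0.0
    return 0.5

def Z(a, s2, o):
    """Observation model: listening reports the tiger's side with p=0.85;
    other actions give uninformative observations."""
    if a != "listen":
        return 0.5
    correct = (s2 == "tiger-left") == (o == "hear-left")
    return 0.85 if correct else 0.15

def R(s, a):
    """Reward model."""
    if a == "listen":
        return -1.0
    if (a == "open-left") == (s == "tiger-left"):
        return -100.0   # opened the tiger's door
    return 10.0

def belief_update(b, a, o):
    """Bayes filter: new belief after doing a and observing o."""
    b2 = {s2: Z(a, s2, o) * sum(T(s, a, s2) * b[s] for s in S) for s2 in S}
    norm = sum(b2.values())
    return {s: p / norm for s, p in b2.items()}

def lookahead(b, depth, gamma=0.95):
    """Depth-limited expectimax over the belief state.
    Returns (estimated value, best action) from belief b."""
    if depth == 0:
        return 0.0, None
    best_q, best_a = float("-inf"), None
    for a in A:
        q = sum(b[s] * R(s, a) for s in S)          # expected immediate reward
        for o in O:
            # probability of observing o after doing a in belief b
            po = sum(Z(a, s2, o) * T(s, a, s2) * b[s] for s in S for s2 in S)
            if po > 0:
                v, _ = lookahead(belief_update(b, a, o), depth - 1, gamma)
                q += gamma * po * v
        if q > best_q:
            best_q, best_a = q, a
    return best_q, best_a

b0 = {"tiger-left": 0.5, "tiger-right": 0.5}        # uniform initial belief
value, action = lookahead(b0, depth=2)
print(action)  # -> listen (information is worth more than a 50/50 gamble)
```

At each simulation cycle the agent would run such a search from its current belief, execute the best action found, observe the outcome, and update its belief, so no offline policy over all reachable beliefs is ever computed. This is the general online-POMDP pattern; the paper's actual algorithm and heuristics may differ.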
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
Cite this paper
Paquet, S., Tobin, L., Chaib-draa, B. (2006). An Online POMDP Algorithm Used by the PoliceForce Agents in the RoboCupRescue Simulation. In: Bredenfeld, A., Jacoff, A., Noda, I., Takahashi, Y. (eds) RoboCup 2005: Robot Soccer World Cup IX. RoboCup 2005. Lecture Notes in Computer Science, vol 4020. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11780519_18
DOI: https://doi.org/10.1007/11780519_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35437-6
Online ISBN: 978-3-540-35438-3