Formal Explanations of Neural Network Policies for Planning
Formal Explanations of Neural Network Policies for Planning
Renee Selvey, Alban Grastien, Sylvie Thiébaux
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Main Track. Pages 5446-5456.
https://doi.org/10.24963/ijcai.2023/605
Deep learning is increasingly used to learn policies for planning problems, yet policies represented by neural networks are difficult to interpret, verify and trust. Existing formal approaches to post-hoc explanations provide concise reasons for a single decision made by an ML model. However, understanding planning policies require explaining sequences of decisions. In this paper, we formulate the problem of finding explanations for the sequence of decisions recommended by a learnt policy in a given state. We show that, under certain assumptions, a minimal explanation for a sequence can be computed by solving a number of single decision explanation problems which is linear in the length of the sequence. We present experimental results of our implementation of this approach for ASNet policies for classical planning domains.
Keywords:
Planning and Scheduling: PS: Model-based reasoning
Machine Learning: ML: Explainable/Interpretable machine learning
Planning and Scheduling: PS: Learning in planning and scheduling