Counterfactual Explanations for Reinforcement Learning Agents


연구 분야: Artificial Intelligence



학회: AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems


초록

Reinforcement learning (RL) algorithms often use neural networks to represent agent's policy, making them difficult to interpret. Counterfactual explanations are human-friendly explanations which offer users actionable advice on how to change their features to obtain a desired output from a black-box model. However, methods for generating counterfactuals in RL ignore the stochastic and sequential nature of RL tasks, and can generate counterfactuals which are difficult to obtain, affecting user effort and trust. My dissertation focuses on developing methods that take into account the complexities of RL framework and provide counterfactual explanations that are easy to reach and confidently produce the desired output


Author Profile
Jasmina Gajcin

Trinity College Dublin Dublin Ireland

Ireland

📄 논문 정보

발행 연도 2023년
인용수 0
출판 국가 Ireland
사이트 ACM
좋아요 수 0

연관 논문 목록 (539건)