ITRT(IT Research Trends)

Counterfactual Explanations for Reinforcement Learning Agents

연구 분야: Artificial Intelligence

논문 키워드: #learning #neural #algorithms #complexities #counterfactuals

학회: AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems

초록

Reinforcement learning (RL) algorithms often use neural networks to represent agent's policy, making them difficult to interpret. Counterfactual explanations are human-friendly explanations which offer users actionable advice on how to change their features to obtain a desired output from a black-box model. However, methods for generating counterfactuals in RL ignore the stochastic and sequential nature of RL tasks, and can generate counterfactuals which are difficult to obtain, affecting user effort and trust. My dissertation focuses on developing methods that take into account the complexities of RL framework and provide counterfactual explanations that are easy to reach and confidently produce the desired output

Jasmina Gajcin

Trinity College Dublin Dublin Ireland

Ireland

📄 논문 정보

발행 연도	2023년
인용수	0
출판 국가	Ireland
사이트	ACM
좋아요 수	0

Counterfactual Explanations for Reinforcement Learning Agents

Counterfactual Explanations for Reinforcement Learning Agents

📄 논문 정보

연관 논문 목록 (539건) 내 서재 담기

연관 논문 목록 (539건)