Interpretable Reinforcement Learning: Bridging the Gap between Performance and Transparency


연구 분야: Artificial Intelligence



학회: 2024 1st International Conference on Communications and Computer Science (InCCCS)


초록

Complex sequential decision-making tasks have proven remarkably successful for Reinforcement Learning (RL). However, the intrinsic complexity of RL algorithms frequently results in opaque, black-box models that are difficult to read and comprehend. In order to achieve a balance between performance and transparency, this study examines the changing environment of interpretable reinforcement learning. We introduce IRL and thoroughly assess five different IRL approaches, providing insights into each method's advantages and disadvantages. We evaluate RL and IRL's performance and transparency. Our results underline the revolutionary potential of IRL by highlighting its capacity to raise user confidence, responsibility, and safety by offering interpretable insights into RL agent judgement.


Author Profile
Shreyas R S

Dept. of Artificial Intelligence and Machine Learning Bangalore Institute of Technology Bangalore India

Andorra
Author Profile
Jyothi D G

Dept. of Artificial Intelligence and Machine Learning Bangalore Institute of Technology Bangalore India

Andorra

📄 논문 정보

발행 연도 2024년
인용수 156
출판 국가 Andorra
사이트 IEEE
좋아요 수 0

연관 논문 목록 (440건)