Symbolic Task Inference in Deep Reinforcement Learning


연구 분야: Artificial Intelligence



학회: Journal of Artificial Intelligence Research, Volume 80


초록

This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability.


Author Profile
Hosein Hasanbeig

University of Oxford

정보 없음
Author Profile
Natasha Yogananda Jeppu

University of Oxford

정보 없음
Author Profile
Alessandro Abate

University of Oxford

정보 없음

📄 논문 정보

발행 연도 2024년
인용수 1
출판 국가
사이트 ACM
좋아요 수 0

연관 논문 목록 (400건)