Action Priors for Large Action Spaces in Robotics


연구 분야: Artificial Intelligence



학회: AAMAS '21: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems


초록

In robotics, it is often not possible to learn useful policies using pure model-free reinforcement learning without significant reward shaping or curriculum learning. As a consequence, many researchers rely on expert demonstrations to guide learning. However, acquiring expert demonstrations can be expensive. This paper proposes an alternative approach where the solutions of previously solved tasks are used to produce an action prior that can facilitate exploration in future tasks. The action prior is a probability distribution over actions that summarizes the set of policies found solving previous tasks. Our results indicate that this approach can be used to solve robotic manipulation problems that would otherwise be infeasible without expert demonstrations. Source code is available at https://github.com/ondrejba/action_priors.


Author Profile
Ondrej Biza

Northeastern University Boston MA USA

Morocco
Author Profile
Dian Wang

Northeastern University Boston MA USA

Morocco
Author Profile
Robert J Platt

Northeastern University Boston MA USA

Morocco

📄 논문 정보

발행 연도 2021년
인용수 0
출판 국가 Morocco
사이트 ACM
좋아요 수 0

연관 논문 목록 (160건)