Continuity and Smoothness Analysis and Possible Improvement of Traditional Reinforcement Learning Methods


연구 분야: Artificial Intelligence



학회: 2020 IEEE International Conference on Mechatronics and Automation (ICMA)


초록

At present, the deep reinforcement learning method has become one of the important branches in the field of artificial intelligence. Its model-free feature makes it considered as one of the ways to achieve the goal of strong artificial intelligence. In addition to discrete decision-making tasks, deep reinforcement learning has been gradually applied to continuous control tasks. Nevertheless, compared with classical control strategies and methods, the instability of deep reinforcement learning limits its extensive application in real scenarios. The instability of reinforcement learning mainly comes from two aspects: the first is the inherent discontinuity of reinforcement learning action strategies; the second is the randomness of reinforcement learning action strategies. This paper will discuss theoretical reasons of this instability and evaluate the instability of the current mainstream reinforcement learning algorithms by time-frequency analysis. Finally, we give an improved framework based on stochastic differential equation, and theoretically solve the inherent discontinuity of reinforcement learning action strategy.


Author Profile
Tianhao Chen

School of Mechatronic Engineering and Automation Shanghai Key Laboratory of Intelligent Manufacturing and Robotics Shanghai University Shanghai China

Andorra
Author Profile
Wenchuan Jia

School of Mechatronic Engineering and Automation Shanghai Key Laboratory of Intelligent Manufacturing and Robotics Shanghai University Shanghai China

Andorra
Author Profile
Jianjun Yuan

School of Mechatronic Engineering and Automation Shanghai Key Laboratory of Intelligent Manufacturing and Robotics Shanghai University Shanghai China

Andorra

📄 논문 정보

발행 연도 2020년
인용수 3
출판 국가 Andorra
사이트 IEEE
좋아요 수 0

연관 논문 목록 (470건)