Multi-agent reinforcement learning behavioral control for nonlinear second-order systems


연구 분야: Software Development



학회: Frontiers of Information Technology & Electronic Engineering


초록

Reinforcement learning behavioral control (RLBC) is limited to an individual agent without any swarm mission, because it models the behavior priority learning as a Markov decision process. In this paper, a novel multi-agent reinforcement learning behavioral control (MARLBC) method is proposed to overcome such limitations by implementing joint learning. Specifically, a multi-agent reinforcement learning mission supervisor (MARLMS) is designed for a group of nonlinear second-order systems to assign the behavior priorities at the decision layer. Through modeling behavior priority switching as a cooperative Markov game, the MARLMS learns an optimal joint behavior priority to reduce dependence on human intelligence and high-performance computing hardware. At the control layer, a group of second-order reinforcement learning controllers are designed to learn the optimal control policies to track position and velocity signals simultaneously. In particular, input saturation constraints are strictly implemented via designing a group of adaptive compensators. Numerical simulation results show that the proposed MARLBC has a lower switching frequency and control cost than finite-time and fixed-time behavioral control and RLBC methods.


Author Profile
Zhenyi Zhang (张祯毅)

College of Electrical Engineering and Automation Fuzhou University Fuzhou 350108 China

Andorra
Author Profile
Jie Huang (黄捷)

5G+ Industrial Internet Institute Fuzhou University Fuzhou 350108 China

China
Author Profile
Congjie Pan (潘聪捷)

College of Electrical Engineering and Automation Fuzhou University Fuzhou 350108 China

Andorra

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Andorra, China
사이트 Springer
좋아요 수 0

연관 논문 목록 (97건)