Exploring multi-agent reinforcement learning for unrelated parallel machine scheduling


연구 분야: Artificial Intelligence



학회: The Journal of Supercomputing


초록

Scheduling problems present significant challenges in optimization, particularly in resource allocation and production management. This study addresses the Unrelated Parallel Machine Scheduling Problem with setup times and resource constraints (UPMSR) using a Multi-Agent Reinforcement Learning (MARL) framework. We develop a reinforcement learning (RL) environment for dynamic scheduling and compare MARL with Single-Agent RL approaches through various neural network policies. Results show that Single-Agent algorithms, particularly the Maskable Proximal Policy Optimization (PPO) variant, excel in smaller-scale scenarios, balancing decision quality and computational efficiency. Multi-agent PPO exhibits scalable potential but faces challenges in cooperative learning, underscoring the complexities of coordination in distributed decision-making tasks. This work provides insights into the strengths and limitations of MARL techniques, emphasizing their adaptability to dynamic environments and the need to balance sophistication with scalability in scheduling optimization.


Author Profile
Maria Zampella

Vicomtech Foundation Basque Research and Technology Alliance (BRTA) 20009 Donostia-San Sebastián Spain

Andorra
Author Profile
Urtzi Otamendi

Department of Physics “Ettore Pancini” University of Naples “Federico II” 80126 Naples Italy

Italy
Author Profile
Xabier Belaunzaran

Vicomtech Foundation Basque Research and Technology Alliance (BRTA) 20009 Donostia-San Sebastián Spain

Andorra

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Italy, Andorra
사이트 Springer
좋아요 수 0

연관 논문 목록 (476건)