ITRT(IT Research Trends)

Significance extraction based on data augmentation for reinforcement learning

연구 분야: Artificial Intelligence

논문 키워드: #learning #algorithm #training #experimental #hard

학회: Frontiers of Information Technology & Electronic Engineering

초록

Deep reinforcement learning has shown remarkable capabilities in visual tasks, but it does not have a good generalization ability in the context of interference signals in the input images; this approach is therefore hard to be applied to trained agents in a new environment. To enable agents to distinguish between noise signals and important pixels in images, data augmentation techniques and the establishment of auxiliary networks are proven effective solutions. We introduce a novel algorithm, namely, saliency-extracted Q-value by augmentation (SEQA), which encourages the agent to explore unknown states more comprehensively and focus its attention on important information. Specifically, SEQA masks out interfering features and extracts salient features and then updates the mask decoder network with critic losses to encourage the agent to focus on important features and make correct decisions. We evaluate our algorithm on the DeepMind Control generalization benchmark (DMControl-GB), and the experimental results show that our algorithm greatly improves training efficiency and stability. Meanwhile, our algorithm is superior to state-of-the-art reinforcement learning methods in terms of sample efficiency and generalization in most DMControl-GB tasks.

📄 논문 정보

발행 연도	2025년
인용수	0
출판 국가	Andorra
사이트	Springer
좋아요 수	0

Significance extraction based on data augmentation for reinforcement learning

Significance extraction based on data augmentation for reinforcement learning

Yuxi Han (韩玉玺)

Dequan Li (李德权)

Yang Yang (杨洋)

📄 논문 정보

연관 논문 목록 (398건)

Significance extraction based on data augmentation for reinforcement learning

Significance extraction based on data augmentation for reinforcement learning

📄 논문 정보

연관 논문 목록 (398건) 내 서재 담기

연관 논문 목록 (398건)