Enhancing Adversarial Robustness of Vision-Language Models through Low-Rank Adaptation


연구 분야: Artificial Intelligence



학회: ICMR '25: Proceedings of the 2025 International Conference on Multimedia Retrieval


초록

Vision-Language Models (VLMs) play a crucial role in the advancement of Artificial General Intelligence (AGI). As AGI rapidly evolves, addressing security concerns has emerged as one of the most significant challenges for VLMs. In this paper, we present extensive experiments that expose the vulnerabilities of conventional adaptation methods for VLMs, highlighting significant security risks. Moreover, as VLMs grow in size, the application of traditional adversarial adaptation techniques incurs substantial computational costs. To address these issues, we propose a parameter-efficient adversarial adaptation method called AdvLoRA based on Low-Rank Adaptation. We investigate and reveal the inherent low-rank properties involved in adversarial adaptation for VLMs. Different from LoRA, we enhance the efficiency and robustness of adversarial adaptation by introducing a novel reparameterization method that leverages parameter clustering and alignment. Additionally, we propose an adaptive parameter update strategy to further bolster robustness. These innovations enable our AdvLoRA to mitigate issues related to model security and resource wastage. Extensive experiments confirm the effectiveness and efficiency of AdvLoRA.


Author Profile
Yuheng Ji

Institute of Automation Chinese Academy of Sciences Beijing China and School of Artificial Intelligence University of Chinese Academy of Sciences Beijing China

Andorra
Author Profile
Yue Liu

Institute of Data Science National University of Singapore Singapore Singapore

Singapore
Author Profile
Zhicheng Zhang

Institute of Automation Chinese Academy of Sciences Beijing China and School of Artificial Intelligence University of Chinese Academy of Sciences Beijing China

Andorra

📄 논문 정보

발행 연도 2025년
인용수 1
출판 국가 Singapore, Andorra, China
사이트 ACM
좋아요 수 0

연관 논문 목록 (35건)