Affective image recognition with multi-attribute knowledge in deep neural networks


Research field: Artificial Intelligence



Venue: Multimedia Tools and Applications


Abstract

Incorporating visual attributes such as objects and scene features into deep models has proven valuable for affective image recognition. Existing works generally do so either by fine-tuning popular CNNs for emotion recognition or by connecting external attributes through additional, specially designed modules. However, they neither account for the diversity of emotional representations across different styles of affective images nor exploit the inter-hierarchical correlations within deep models. In this paper, we propose a multi-attribute model that incorporates different visual concepts to address these limitations. The model consists of two branch modules spanning local to global views: one trains a gram encoder to capture local visual details, while the other simultaneously trains a semantic tokenizer to extract global semantics. A fusion layer then aggregates these attributes to represent image sentiment. Unlike existing methods, our model is composed of stacked CNNs without additional backbones, and it learns hierarchical attributes from its own intermediate features. Furthermore, inspired by deep metric learning, we design an emotional contrast loss that accounts for the dynamic polarity embedded in affective images, and we optimize the model with this loss together with cross-entropy loss. A comprehensive evaluation on five datasets shows that our model outperforms the compared methods.
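The abstract names three ingredients without giving formulas: a Gram-style encoding of local features, a metric-learning loss over emotional polarity, and a cross-entropy term. The sketch below is only a rough illustration of how such pieces are commonly written, not the authors' implementation. The function names, the `margin` and `weight` parameters, and the use of a generic pairwise contrastive loss in place of the paper's "emotional contrast loss" are all assumptions.

```python
import math


def gram_matrix(features):
    """Gram matrix of a feature map given as C channels of N flattened activations.

    G[i][j] = <F_i, F_j> / N measures how strongly channels i and j co-activate,
    a common way to summarise local, texture-like detail.
    """
    n = len(features[0])
    return [
        [sum(a * b for a, b in zip(fi, fj)) / n for fj in features]
        for fi in features
    ]


def cross_entropy(logits, target):
    """Softmax cross-entropy for one sample, using the log-sum-exp trick."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def contrastive_loss(emb_a, emb_b, same_polarity, margin=1.0):
    """Generic pairwise contrastive loss (assumed stand-in, not the paper's loss).

    Pairs with the same polarity are pulled together; pairs with opposite
    polarity are pushed apart until they are at least `margin` away.
    """
    d = math.sqrt(sum((x - y) ** 2 for x, y in zip(emb_a, emb_b)))
    if same_polarity:
        return d ** 2
    return max(0.0, margin - d) ** 2


def total_loss(logits, target, emb_a, emb_b, same_polarity, weight=0.5):
    """Cross-entropy plus a weighted contrastive term (the weighting is hypothetical)."""
    return cross_entropy(logits, target) + weight * contrastive_loss(
        emb_a, emb_b, same_polarity
    )
```

In this toy form, `gram_matrix` would be applied to an intermediate CNN feature map, and `total_loss` would be averaged over sampled image pairs. How the paper actually samples pairs or models "dynamic polarity" is not stated in the abstract.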


Author Profile
Hao Zhang

School of Information Science and Engineering Yunnan University Kunming China

Andorra
Author Profile
Gaifang Luo

School of Software Shanxi Agricultural University Jinzhong China

China
Author Profile
Yingying Yue

School of Mathematics and Information Technology Yuxi Normal University Yuxi China

Andorra

📄 Paper information

Publication year: 2023
Citations: 5
Publication country: Andorra, China
Site: Springer

Related papers (295)