Semantic image representation for image recognition and retrieval using multilayer variational auto-encoder, InceptionNet and low-level image features


연구 분야: Artificial Intelligence



학회: The Journal of Supercomputing


초록

This paper presents a novel image descriptor that enhances performance in image recognition and retrieval by combining deep learning and handcrafted features. Our method integrates high-level semantic features extracted via InceptionResNet-V2 with color and texture features to create a comprehensive representation of image content. The descriptor’s effectiveness is demonstrated through extensive experiments across a range of image recognition and retrieval tasks. Our approach is tested on six benchmark datasets, including Corel-1 K, VS, OT, QT, SUN-397, and ILSVRC-2012 for single-label classification, and COCO and NUS-WIDE for multi-label classification, achieving high performances. The results establish that the proposed method is versatile and robust, excelling in single-label and multi-label recognition as well as image retrieval tasks, and outperforms several state-of-the-art methods. This work provides a significant advancement in image representation, with broad applicability in various computer vision domains.


Author Profile
Davar Giveki

Department of Computer Engineering Malayer University Malayer Iran

Iran
Author Profile
Sajad Esfandyari

Department of Computer Engineering Malayer University Malayer Iran

Iran

📄 논문 정보

발행 연도 2024년
인용수 2
출판 국가 Iran
사이트 Springer
좋아요 수 0

연관 논문 목록 (293건)