Enhancing image recognition with CapsResNet: a hybrid model integrating CNNs, residual blocks, and capsule layers


연구 분야: Artificial Intelligence



학회: Signal, Image and Video Processing


초록

This paper presents CapsResNet, a novel hybrid image recognition network that integrates Convolutional Neural Networks (CNNs), Residual Blocks, and Capsule Layers. CapsResNet is designed to address the limitations of traditional CNNs in capturing spatial hierarchies and relationships between objects within images. By incorporating residual blocks, the model mitigates the vanishing gradient problem, enabling the training of deeper and more complex networks. The capsule layers, on the other hand, provide a richer vector representation of features, preserving spatial relationships and improving the model’s ability to recognize complex shapes and structures. The proposed model is comprehensively evaluated on five benchmark datasets: Fashion-MNIST, CIFAR-10, CIFAR-100, EMNIST, and SVHN. CapsResNet achieves 94.02% accuracy on CIFAR-10 and 96.85% on SVHN, consistently surpassing baseline CNNs and CapsNet-based models in terms of accuracy, precision, recall, and F1-score. An ablation study further confirms the essential role of both residual blocks and capsule layers in enhancing the model’s robustness and performance. Overall, CapsResNet represents a significant step toward more interpretable and efficient image recognition solutions.


Author Profile
Fouad Khattari

Laboratory LIPIM National School of Applied Sciences Khouribga Sultan Moulay Slimane University Beni-Mellal Morocco

Morocco
Author Profile
Youssef El Hadfi

Laboratory LIPIM National School of Applied Sciences Khouribga Sultan Moulay Slimane University Beni-Mellal Morocco

Morocco
Author Profile
Aissam Hadri

Lab SIV Ibnou Zohr University Ouarzazate Morocco

Morocco

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Morocco
사이트 Springer
좋아요 수 0

연관 논문 목록 (179건)