Enhancing semantic scene segmentation for indoor autonomous systems using advanced attention-supported improved UNet


연구 분야: Strategies



학회: Signal, Image and Video Processing


초록

This paper introduces EFFB7-UNet, an advanced semantic segmentation framework tailored for Indoor Autonomous Vision Systems (IAVSs) utilizing the U-Net architecture. The framework employs EfficientNetB4 as its encoder, significantly enhancing feature extraction. It integrates a spatial and channel Squeeze-and-Excitation (scSE) attention block, emphasizing critical areas and features to refine segmentation outcomes. Comprehensive evaluations using the NYUv2 Dataset and various augmented datasets were conducted. This study systematically compares EFFB7-UNet’s performance with multiple U-Net encoders, including ResNet50, ResNet101, MobileNet V2, VGG16, VGG19, and EfficientNets B0-B6. The findings reveal that EFFB7-UNet not only surpasses these configurations in terms of accuracy but also highlights the effectiveness of the scSE attention block in achieving superior segmentation results. Without the utilization of depth information, EFFB7-UNet achieves a 12% improvement in mean Intersection over Union (mIOU). This significant enhancement demonstrates EFFB7-UNet’s adaptability across various domains, implying substantial progress in enhancing the effectiveness and reliability of Intelligent Autonomous Vision Systems (IAVS) technologies.


Author Profile
Hoang N. Tran

FPT University Can Tho 94000 Vietnam

Canada
Author Profile
Thu A. N. Le

FPT University Can Tho 94000 Vietnam

Canada
Author Profile
Nghi V. Nguyen

FPT University Can Tho 94000 Vietnam

Canada

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Canada
사이트 Springer
좋아요 수 0

연관 논문 목록 (45건)