Multi-object behaviour recognition based on object detection cascaded image classification in classroom scenes


연구 분야: Artificial Intelligence



학회: Applied Intelligence


초록

For multi-object behaviour recognition in classroom scenes, crowded objects have heavy occlusion, invisible keypoints, scale variation, which directly overwhelms the recognition performance. Due to the dense student objects and similar student behaviours, multi-object behaviour recognition brings great challenges. Therefore, we proposed multi-object behaviour recognition based on object detection cascaded image classification. Specifically, object detection extracts student objects, followed by Vision Transformer (ViT) classification of student behaviour. To ensure the accuracy of behaviour recognition, it is first necessary to improve the detection performance of object detection. This paper proposes the Shallow Auxiliary Module for object detection to assist the backbone network in extracting hybrid multi-scale feature information. The multi-scale and multi-channel feature information is fused to alleviate object overlap and scale variation. We propose a Scale Assignment Fusion Mechanism that non-heuristically guides objects to learn the optimal feature layer. Furthermore, the Anchor-free Dynamic Label Assignment can suppress the prediction of low-quality bounding boxes, stabling training and improving detection performance. The proposed student object detector achieves the state-of-the-art mAP of 88.03 and AP of 57.64, outperforming state-of-the-art object detection methods. Our multi-object behaviour recognition method achieves the recognition of four behaviour classes, which is significantly better than the results of other comparison methods.


Author Profile
Min Dang

School of Computer Science and Technology Xidian University No. 266 Xinglong Section Xifeng Road Xi’an 710126 Shaanxi China

Andorra
Author Profile
Gang Liu

Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province No. 266 Xinglong Section Xifeng Road Xi’an 710126 Shaanxi China

Andorra
Author Profile
Hao Li

School of Computer Science and Technology Xidian University No. 266 Xinglong Section Xifeng Road Xi’an 710126 Shaanxi China

Andorra

📄 논문 정보

발행 연도 2024년
인용수 17
출판 국가 Andorra, China
사이트 Springer
좋아요 수 0

연관 논문 목록 (131건)