3D Human Pose Estimation from Multiple Dynamic Views via Single-view Pretraining with Procrustes Alignment


Research field: Strategies



Conference: MM '24: Proceedings of the 32nd ACM International Conference on Multimedia


Abstract

3D human pose estimation from multiple cameras with unknown calibration has received less attention than it deserves. The few existing data-driven solutions do not fully exploit available 3D training data, and they typically train from scratch for every novel multi-view scene, which impedes both accuracy and efficiency. We show how to exploit 3D training data to the fullest and associate multiple dynamic views efficiently to achieve high precision on novel scenes using a simple yet effective framework, dubbed Multiple Dynamic View Pose estimation (MDVPose). MDVPose uses data from novel scenarios to finetune a single-view pretrained motion encoder in a multi-view setting, aligns an arbitrary number of views in a unified coordinate system via Procrustes alignment, and imposes multi-view consistency. The proposed method achieves 22.1 mm P-MPJPE and 34.2 mm MPJPE on the challenging in-the-wild Ski-Pose PTZ dataset, outperforming the state-of-the-art method by 24.8% in P-MPJPE (-7.3 mm) and 19.0% in MPJPE (-8.0 mm). It also outperforms the state-of-the-art methods by a large margin (-18.2 mm P-MPJPE and -28.3 mm MPJPE) on the EgoBody dataset. In addition, MDVPose achieves robust performance on the Human3.6M dataset, which features multiple static cameras. Code is available at https://github.com/iGame-Lab/MDVPose.
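For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how MPJPE and P-MPJPE are commonly computed for a single pose. It is not taken from the MDVPose repository: the function names (`procrustes_align`, `mpjpe`, `p_mpjpe`), the `(J, 3)` joint-array layout, and the use of NumPy are all assumptions made for illustration. The sketch uses the standard similarity Procrustes solution (scale, rotation, translation via SVD); how the authors apply Procrustes alignment across views may differ.

```python
import numpy as np


def procrustes_align(pred, target):
    """Similarity-align pred (J, 3) to target (J, 3).

    Hypothetical helper: finds scale s, rotation R, and translation that
    minimize ||s * pred @ R + t - target|| in the least-squares sense.
    """
    mu_p = pred.mean(axis=0)
    mu_t = target.mean(axis=0)
    p = pred - mu_p          # centered prediction
    t = target - mu_t        # centered target

    # SVD of the 3x3 cross-covariance between the two centered point sets.
    U, S, Vt = np.linalg.svd(p.T @ t)

    # Guard against reflections: force det(R) = +1.
    d = np.sign(np.linalg.det(U @ Vt))
    D = np.array([1.0, 1.0, d])
    R = (U * D) @ Vt         # scales the last column of U by d

    # Optimal isotropic scale for the chosen rotation.
    s = (S * D).sum() / (p ** 2).sum()

    return s * p @ R + mu_t


def mpjpe(pred, target):
    """Mean per-joint position error: mean Euclidean distance over joints."""
    return np.linalg.norm(pred - target, axis=-1).mean()


def p_mpjpe(pred, target):
    """MPJPE measured after Procrustes alignment of pred to target."""
    return mpjpe(procrustes_align(pred, target), target)
```

Under these assumptions, a prediction that differs from the ground truth only by a global rotation, scale, and translation has a nonzero MPJPE but a P-MPJPE of approximately zero, which is why P-MPJPE is usually the lower of the two reported numbers.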


Authors

Renshu Gu
Hangzhou Dianzi University, Hangzhou, China

Jiajun Zhu
Hangzhou Dianzi University, Hangzhou, China

Yixuan Si
Hangzhou Dianzi University, Hangzhou, China

Paper information

Publication year: 2024
Citations: 0
Country of publication: China
Site: ACM
