CycleHand: Increasing 3D Pose Estimation Ability on In-the-wild Monocular Image through Cyclic Flow


연구 분야: Strategies



학회: MM '22: Proceedings of the 30th ACM International Conference on Multimedia


초록

Current methods for 3D hand pose estimation fail to generalize well to in-the-wild new scenarios due to varying camera viewpoints, self-occlusions, and complex environments. To address this problem, we propose CycleHand to improve the generalization ability of the model in a self-supervised manner. Our motivation is based on an observation: if one globally rotates the whole hand and reversely rotates it back, the estimated 3D poses of fingers should keep consistent before and after the rotation because the wrist-relative hand poses stay unchanged during global 3D rotation. Hence, we propose arbitrary-rotation self-supervised consistency learning to improve the model's robustness for varying viewpoints. Another innovation of CycleHand is that we propose a high-fidelity texture map to render the photorealistic rotated hand with different lighting conditions, backgrounds, and skin tones to further enhance the effectiveness of our self-supervised task. To reduce the potential negative effects brought by the domain shift of synthetic images, we use the idea of contrastive learning to learn a synthetic-real consistent feature extractor in extracting domain-irrelevant hand representations. Experiments show that CycleHand can largely improve the hand pose estimation performance in both canonical datasets and real-world applications.


Author Profile
Daiheng Gao

XR Lab Alibaba Group Beijing China

China
Author Profile
Xindi Zhang

Queen Mary University of London London United Kingdom

United Kingdom
Author Profile
Xingyu Chen

Xiaobing AI Beijing China

Anguilla

📄 논문 정보

발행 연도 2022년
인용수 4
출판 국가 Anguilla, United Kingdom, China, Germany
사이트 ACM
좋아요 수 0

연관 논문 목록 (19건)