ULDC: uncertainty-based learning for deep clustering


연구 분야: Artificial Intelligence



학회: Applied Intelligence


초록

Deep clustering has gained prominence due to its impressive capability to handle high-dimensional real-world data. However, in the absence of ground-truth labels, existing clustering methods struggle to discern false positives that resemble the target cluster and false negatives that visually differ but maintain semantic consistency. The unreliable projections caused by visual ambiguity disrupt representation learning, leading to sub-optimal clustering outcomes. To address this challenge, we propose a novel method called uncertainty-based learning for deep clustering (ULDC), which aims to discover more optimal cluster structures within data from an uncertainty perspective. Specifically, we utilize the Dirichlet distribution to quantify the uncertainty of feature projections in the latent space, providing a probabilistic framework for modeling uncertainty during the clustering process. We then develop uncertainty-based learning to mitigate the interference caused by false positives and negatives in the clustering tasks. Additionally, a semantic calibration module is introduced to achieve a global alignment of cross-instance semantics, facilitating the learning of clustering-favorite representations. Extensive experiments on five widely-used benchmarks demonstrate the effectiveness of ULDC. The source code is available from https://github.com/YL616/ULDC.


Author Profile
Luyao Chang

School of Information and Software Engineering University of Electronic Science and Technology of China Chengdu 610054 Sichuan China

Andorra
Author Profile
Xinzheng Niu

School of Computer Science and Engineering University of Electronic Science and Technology of China Chengdu 611731 Sichuan China

Andorra
Author Profile
Zhenghua Li

Department of Biliary Surgery West China Hospital Sichuan University Chengdu 610041 Sichuan China

China

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Andorra, China
사이트 Springer
좋아요 수 0

연관 논문 목록 (283건)