Control Color: Multimodal Diffusion-Based Interactive Image Colorization


연구 분야: Software Development



학회: International Journal of Computer Vision


초록

Despite the existence of numerous colorization methods, several limitations still exist, such as lack of user interaction, inflexibility in local colorization, unnatural color rendering, insufficient color variation, and color overflow. To solve these issues, we introduce Control Color (CtrlColor), a multi-modal colorization method that leverages the pre-trained Stable Diffusion (SD) model, offering promising capabilities in highly controllable interactive image colorization (Fig. 1). While several diffusion-based methods have been proposed, supporting colorization in multiple modalities remains non-trivial. In this study, we aim to tackle both unconditional and conditional image colorization (text prompts, strokes, exemplars) and address color overflow and incorrect color within a unified framework. Apart from accepting text prompts as conditions, we present an effective way to encode user strokes to enable precise local color manipulation and employ a practical method to learn the implicit color distribution in exemplars, adding versatility to our approach. We also introduce a novel module based on self-attention and a content-guided deformable autoencoder to address the long-standing issues of color overflow and inaccurate coloring. Extensive comparisons show that our model outperforms state-of-the-art image colorization methods both qualitatively and quantitatively. Project page: https://zhexinliang.github.io/Control_Color/


Author Profile
Zhexin Liang

S-Lab Nanyang Technological University Singapore Singapore

Singapore
Author Profile
Zhaochen Li

S-Lab Nanyang Technological University Singapore Singapore

Singapore
Author Profile
Shangchen Zhou

S-Lab Nanyang Technological University Singapore Singapore

Singapore

📄 논문 정보

발행 연도 2025년
인용수 4
출판 국가 Singapore, China
사이트 Springer
좋아요 수 0

연관 논문 목록 (4건)