Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild


연구 분야: Strategies



학회: European Conference on Computer Vision


초록

Despite the success in large language models, constructing a data-efficient generalist for dense visual prediction presents a distinct challenge due to the variation in label structures across different tasks. In this study, we explore a universal model that can flexibly adapt to unseen dense label structures with a few examples, enabling it to serve as a data-efficient vision generalist in diverse real-world scenarios. To this end, we base our method on a powerful meta-learning framework and explore several axes to improve its performance and versatility for real-world problems, such as flexible adaptation mechanisms and scalability. We evaluate our model across a spectrum of unseen real-world scenarios where low-shot learning is desirable, including video, 3D, medical, biological, and user-interactive tasks. Equipped with a generic architecture and an effective adaptation mechanism, our model flexibly adapts to all of these tasks with at most 50 labeled images, showcasing a significant advancement over existing data-efficient generalist approaches. Codes are available at https://github.com/GitGyun/chameleon.


Author Profile
Donggyun Kim

School of Computing KAIST Daejeon South Korea

Korea
Author Profile
Seongwoong Cho

School of Computing KAIST Daejeon South Korea

Korea
Author Profile
Semin Kim

School of Computing KAIST Daejeon South Korea

Korea

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 China, Korea
사이트 Springer
좋아요 수 0

연관 논문 목록 (91건)