Enhancing Task Performance in Continual Instruction Fine-tuning Through Format Uniformity


연구 분야: Artificial Intelligence



학회: SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval


초록

In recent advancements, large language models (LLMs) have demonstrated remarkable capabilities in diverse tasks, primarily through interactive question-answering with humans. This development marks significant progress towards artificial general intelligence (AGI). Despite their superior performance, LLMs often exhibit limitations when adapted to domain-specific tasks through instruction fine-tuning (IF). The primary challenge lies in the discrepancy between the data distribution in general and domain-specific contexts, leading to suboptimal accuracy in specialized tasks. To address this, continual instruction fine-tuning (CIF), particularly supervised fine-tuning (SFT), on targeted domain-specific instruction datasets is necessary. Our ablation study reveals that the structure of these instruction datasets critically influences CIF performance, with substantial data distributional shifts resulting in notable performance degradation. In this paper, we introduce a novel framework that enhances CIF by promoting format uniformity. We assess our approach using the Llama2 chat model across various domain-specific instruction datasets. The results demonstrate not only an improvement in task-specific performance under CIF but also a reduction in catastrophic forgetting (CF). This study contributes to the optimization of LLMs for domain-specific applications, highlighting the significance of data structure and distribution in CIF.


Author Profile
Xiaoyu Tan

INF Technology (Shanghai) Co. Ltd. Shanghai China

China
Author Profile
Leijun Cheng

School of Electronic and Electrical Engineering Shanghai University of Engineering Science Shanghai China

Andorra
Author Profile
Xihe Qiu

School of Electronic and Electrical Engineering Shanghai University of Engineering Science Shanghai China

Andorra

📄 논문 정보

발행 연도 2024년
인용수 1
출판 국가 Andorra, China
사이트 ACM
좋아요 수 0

연관 논문 목록 (89건)