ETL: Ensemble–at–Training for Low-cost Open-source Large Language Models


연구 분야: Databases



학회: 2025 15th International Conference on Electrical Engineering (ICEENG)


초록

Text summarization with Large Language Models (LLMs) has proven superior to traditional LSTM models, even surpassing human annotators’ preferences over original reference summaries in popular datasets. However, small open-source LLMs are often less capable than proprietary models like GPT-4. To address this, a method of combining multiple LLM outputs via an aggregator LLM has been proposed, though this increases computational requirements during inference. We propose a novel fine-tuning approach that eliminates the need for ensemble models during inference by shifting the ensemble mechanism to the training phase. In our method, a single fine-tuned model is deployed at inference, reducing computational overhead and complexity. Unlike traditional methods relying on static datasets, our approach leverages outputs from multiple pre-trained LLMs during training, enabling the model to generate high-quality summaries without requiring multiple models at inference. This ensemble-at-training paradigm improves summarization performance while ensuring deployment efficiency. Experimental results show that our approach outperforms baseline models and traditional fine-tuning techniques, offering a scalable and practical solution for efficient LLM fine-tuning.


Author Profile
Amr M. Zaki

Queen’s University Kingston Canada

Canada
Author Profile
Mahmoud I. Khalil

Department of Computer and Systems Engineering Ain-Shams University Cairo Egypt

Andorra
Author Profile
Hazem Abbas

Queen’s University Kingston Canada

Canada

📄 논문 정보

발행 연도 2025년
인용수 14
출판 국가 Andorra, Canada
사이트 IEEE
좋아요 수 0

연관 논문 목록 (67건)