ITRT(IT Research Trends)

ETL: Ensemble–at–Training for Low-cost Open-source Large Language Models

연구 분야: Databases

논문 키워드: #outperforms #efficient #popular #efficiency #improves

학회: 2025 15th International Conference on Electrical Engineering (ICEENG)

초록

Text summarization with Large Language Models (LLMs) has proven superior to traditional LSTM models, even surpassing human annotators’ preferences over original reference summaries in popular datasets. However, small open-source LLMs are often less capable than proprietary models like GPT-4. To address this, a method of combining multiple LLM outputs via an aggregator LLM has been proposed, though this increases computational requirements during inference. We propose a novel fine-tuning approach that eliminates the need for ensemble models during inference by shifting the ensemble mechanism to the training phase. In our method, a single fine-tuned model is deployed at inference, reducing computational overhead and complexity. Unlike traditional methods relying on static datasets, our approach leverages outputs from multiple pre-trained LLMs during training, enabling the model to generate high-quality summaries without requiring multiple models at inference. This ensemble-at-training paradigm improves summarization performance while ensuring deployment efficiency. Experimental results show that our approach outperforms baseline models and traditional fine-tuning techniques, offering a scalable and practical solution for efficient LLM fine-tuning.

📄 논문 정보

발행 연도	2025년
인용수	14
출판 국가	Andorra, Canada
사이트	IEEE
좋아요 수	0

ETL: Ensemble–at–Training for Low-cost Open-source Large Language Models

ETL: Ensemble–at–Training for Low-cost Open-source Large Language Models

Amr M. Zaki

Mahmoud I. Khalil

Hazem Abbas

📄 논문 정보

연관 논문 목록 (67건)

ETL: Ensemble–at–Training for Low-cost Open-source Large Language Models

ETL: Ensemble–at–Training for Low-cost Open-source Large Language Models

📄 논문 정보

연관 논문 목록 (67건) 내 서재 담기

연관 논문 목록 (67건)