On the Construction of Text-to-SQL Tools Based on Large Language Models for Real-World Relational Databases


연구 분야: Databases



학회: International Conference on Web Information Systems and Technologies


초록

A strategy to construct natural language database interfaces is to use Large Language Models (LLMs) to translate the end-user questions into SQL queries. Such interfaces will be called LLM-based text-to-SQL tools. This article analyses the limitations and proposes solutions to improve the performance of LLM-based text-to-SQL tools for real-world relational databases that have large, complex schemas often expressed in terms different from those adopted by end-users to formulate their questions. The article considers implementations based on Prompt Engineering, including Retrieval-Augmented Generation, and LLM fine-tuning. Finally, it describes experiments that analyze the accuracy of some implementations on two benchmarks built upon databases with large schemas.


Author Profile
Lucas Feijó

Instituto Tecgraf PUC-Rio Rio de Janeiro RJ 22451-900 Brazil

Brazil
Author Profile
Eduardo Nascimento

Instituto Tecgraf PUC-Rio Rio de Janeiro RJ 22451-900 Brazil

Brazil
Author Profile
Gustavo Coelho

Departamento de Informática PUC-Rio Rio de Janeiro RJ 22451-900 Brazil

Brazil

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Brazil
사이트 Springer
좋아요 수 0

연관 논문 목록 (462건)