Gar $$\scriptstyle ++$$ : Natural Language to SQL Translation with Efficient Generate-and-Rank


연구 분야: Databases



학회: Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data


초록

Web applications heavily depend on databases, yet the conventional database interface often presents challenges for efficient data utilization. It is imperative to address the considerable demand emanating from a vast array of end users seeking seamless input of their requirements and effortless retrieval of query results. Natural Language (NL) Interfaces to Databases serve to make databases accessible to end users. Mainstream approaches typically prioritize building language translation models for converting NL queries to SQL queries, while a novel generate-and-rank approach is proposed to achieve this through a procedure involving generation and ranking. Despite yielding superior translation results on the public benchmark, this generate-and-rank approach encounters efficiency issues that may impede its practical application. In this paper, we introduce GAR , which extends the existing generate-and-rank approach for a more efficient generation and robust ranking procedure. Specifically, GAR utilizes the bloom filter to accelerate the data generation process by reducing unnecessary function calls. Additionally, GAR provides a brand-new implementation of the ranking module, specifically the re-ranking model, empowered with enhanced language understanding ability. We evaluate the effectiveness of GAR on three public benchmarks, namely GEO, SPIDER, and MT-TEQL. GAR achieved an overall accuracy of 66.6% on GEO, 80.6% on SPIDER, and 78.4% on MT-TEQL, respectively.


Author Profile
Yuankai Fan

Fudan University Shanghai China

China
Author Profile
Tonghui Ren

Fudan University Shanghai China

China
Author Profile
Can Huang

Fudan University Shanghai China

China

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 China
사이트 Springer
좋아요 수 0

연관 논문 목록 (473건)