Towards Reliable SQL Synthesis: Fuzzing-Based Evaluation and Disambiguation


연구 분야: Databases



학회: International Conference on Fundamental Approaches to Software Engineering


초록

In recent years, more people have seen their work depend on data manipulation tasks. However, many of these users do not have the background in programming required to write complex programs, particularly SQL queries. One way of helping these users is automatically synthesizing the SQL query given a small set of examples. Several program synthesizers for SQL have been recently proposed, but they do not leverage multicore architectures. This paper proposes CUBES, a parallel program synthesizer for the domain of SQL queries using input-output examples. Since input-output examples are an under-specification of the desired SQL query, sometimes, the synthesized query does not match the user’s intent. CUBES incorporates a new disambiguation procedure based on fuzzing techniques that interacts with the user and increases the confidence that the returned query matches the user intent. We perform an extensive evaluation on around 4000 SQL queries from different domains. Experimental results show that our parallel approach can scale up to 16 processes with super-linear speedups for many hard instances, and that our disambiguation approach is critical to achieving an accuracy of around 60%, significantly larger than other SQL synthesizers.


Author Profile
Ricardo Brancas

INESC-ID / Instituto Superior Técnico Universidade de Lisboa Lisbon Portugal

Germany
Author Profile
Miguel Terra-Neves

OutSystems Linda-a-Velha Portugal

Portugal
Author Profile
Miguel Ventura

OutSystems Linda-a-Velha Portugal

Portugal

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Germany, Portugal, United States
사이트 Springer
좋아요 수 0

연관 논문 목록 (188건)