ANTLR-Based Complex SQL Data Lineage Parsing and Visualization


연구 분야: Databases



학회: 2025 5th International Symposium on Computer Technology and Information Science (ISCTIS)


초록

With the widespread adoption of complex SQL queries in modern data environments, traditional data lineage tools face challenges in parsing accuracy and coverage, struggling to meet the demands of fine-grained data management and increasingly stringent data regulations. To address this challenge, this paper proposes an ANTLR-based automated method for accurately parsing complex SQL queries and extracting fine-grained, column-level data lineage. The core contributions of this method include: constructing and optimizing ANTLR grammar rules adapted to complex SQL scenarios, implementing a column-level lineage extraction algorithm that accurately traces data transformation logic, and designing a corresponding visualization scheme to support data dependency exploration, impact analysis, and issue tracing. This method effectively enhances the accuracy and depth of data lineage analysis, providing strong technical support for enterprises to strengthen data governance, compliance auditing, optimize data quality, and accelerate data development and debugging processes.


Author Profile
Zengqian Huang

The 32nd Research Institute of China Electronics Technology Group Corporation Shanghai China

China
Author Profile
Xiaoli Chai

The 32nd Research Institute of China Electronics Technology Group Corporation Shanghai China

China
Author Profile
Ding Qiu

The 32nd Research Institute of China Electronics Technology Group Corporation Shanghai China

China

📄 논문 정보

발행 연도 2025년
인용수 22
출판 국가 China
사이트 IEEE
좋아요 수 0

연관 논문 목록 (323건)