Fast detection of denial constraint violations


Research field: Safety



Venue: Proceedings of the VLDB Endowment, Volume 15, Issue 4


Abstract

The detection of constraint-based errors is a critical task in many data cleaning solutions. Previous works perform the task either using traditional data management systems or using specialized systems that speed up error detection. Unfortunately, both approaches may fail to execute in a reasonable time or even exhaust the available memory in the attempt. To address the main drawbacks of previous approaches, we present the FAst Constraint-based Error DeTector (FACET) to detect violations of denial constraints (DCs). FACET uses column sketch information to organize a pipeline of special operators for DC predicates and it implements these operators using a set of efficient algorithms and data structures that adapt to different data characteristics and predicate structures. We evaluate our system on a diverse array of datasets and constraints, showing its robustness and performance gains compared to different types of DBMSs and to a specialized system.
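To make the task concrete, the following is a minimal sketch of denial-constraint violation detection. It is not FACET's algorithm: it is a simple baseline that hash-partitions rows on the equality predicates and then checks the inequality predicates pairwise within each partition. The example constraint, the column names (`state`, `salary`, `tax`), and the function name `find_violations` are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import permutations
import operator

# Hypothetical DC: no two tuples t, u may satisfy
#   t.state == u.state AND t.salary > u.salary AND t.tax < u.tax
EQUALITY_COLUMNS = ["state"]
INEQUALITY_PREDICATES = [("salary", operator.gt), ("tax", operator.lt)]


def find_violations(rows):
    """Return sorted ordered pairs (i, j) of row indices that satisfy every DC predicate."""
    # Step 1: partition rows by the values of the equality columns.
    buckets = defaultdict(list)
    for idx, row in enumerate(rows):
        key = tuple(row[col] for col in EQUALITY_COLUMNS)
        buckets[key].append(idx)

    # Step 2: within each partition, test the inequality predicates on
    # every ordered pair. This step is quadratic in the partition size.
    violations = []
    for indices in buckets.values():
        for i, j in permutations(indices, 2):
            if all(op(rows[i][col], rows[j][col])
                   for col, op in INEQUALITY_PREDICATES):
                violations.append((i, j))
    return sorted(violations)


rows = [
    {"state": "NY", "salary": 5000, "tax": 300},
    {"state": "NY", "salary": 4000, "tax": 350},
    {"state": "NY", "salary": 3000, "tax": 200},
    {"state": "CA", "salary": 6000, "tax": 100},
]
print(find_violations(rows))  # [(0, 1)]
```

Row 0 earns more than row 1 in the same state yet pays less tax, so the pair (0, 1) violates the constraint. The pairwise check inside each partition is the part that becomes expensive on large or skewed data, which is the kind of cost the abstract says specialized operators and data structures aim to reduce.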


Authors

Eduardo H M Pena
Federal University of Technology, Campo Mourão, Paraná, Brazil

Eduardo Cunha de Almeida
Federal University of Paraná, Curitiba, Paraná, Brazil

Felix Naumann
University of Potsdam, Germany

Paper information

Publication year: 2021
Citations: 13
Countries of publication: Germany, Brazil
Site: ACM
