Discovering Top-k Rules using Subjective and Objective Criteria


Research field: Verification



Venue: Proceedings of the ACM on Management of Data, Volume 1, Issue 1


Abstract

This paper studies two questions about rule discovery. Can we characterize the usefulness of rules using quantitative criteria? How can we discover rules using those criteria? As a testbed, we consider entity enhancing rules (REEs), which subsume common association rules and data quality rules as special cases. We characterize REEs using a bi-criteria model, with both objective measures such as support and confidence, and subjective measures for the user's needs; we learn the subjective measure and the weight vectors via active learning. Based on the bi-criteria model, we develop a top-k algorithm to discover top-ranked REEs, and an any-time algorithm for successive discovery via lazy evaluation. We parallelize these algorithms such that they are guaranteed to reduce runtime when more processors are used. Using real-life and synthetic datasets, we show that the algorithms are able to find top-ranked rules and speed up conventional rule-discovery methods by 134X on average.
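
The bi-criteria idea in the abstract can be illustrated with a small sketch. This is not the paper's REE discovery algorithm: it assumes plain single-tuple association rules, a support defined as the fraction of rows satisfying both sides of a rule, a hand-assigned subjective score in place of the learned one, and a fixed weight vector in place of weights learned via active learning. All names (`support_confidence`, `bi_criteria_score`, `top_k_rules`) and the toy data are hypothetical.

```python
import heapq
from typing import Callable, Dict, List, Tuple

Row = Dict[str, str]
Predicate = Callable[[Row], bool]
# A rule here is (name, precondition, consequence, subjective score in [0, 1]).
Rule = Tuple[str, Predicate, Predicate, float]


def support_confidence(rows: List[Row], pre: Predicate, cons: Predicate) -> Tuple[float, float]:
    """Objective measures for a rule 'pre -> cons' over a table (simplified definitions)."""
    matched = [r for r in rows if pre(r)]
    satisfied = [r for r in matched if cons(r)]
    support = len(satisfied) / len(rows) if rows else 0.0
    confidence = len(satisfied) / len(matched) if matched else 0.0
    return support, confidence


def bi_criteria_score(support: float, confidence: float, subjective: float,
                      weights: Tuple[float, float, float]) -> float:
    """Weighted sum of objective and subjective measures (assumed linear form)."""
    w_sup, w_conf, w_subj = weights
    return w_sup * support + w_conf * confidence + w_subj * subjective


def top_k_rules(rows: List[Row], rules: List[Rule],
                weights: Tuple[float, float, float], k: int) -> List[Tuple[float, str]]:
    """Score every candidate rule and keep the k highest-scoring ones."""
    scored = []
    for name, pre, cons, subjective in rules:
        support, confidence = support_confidence(rows, pre, cons)
        scored.append((bi_criteria_score(support, confidence, subjective, weights), name))
    return heapq.nlargest(k, scored)


# Toy table and candidate rules, invented for illustration only.
rows = [
    {"city": "Edinburgh", "country": "UK"},
    {"city": "Edinburgh", "country": "UK"},
    {"city": "Shenzhen", "country": "China"},
    {"city": "Shenzhen", "country": "UK"},
]
rules = [
    ("edinburgh_implies_uk",
     lambda r: r["city"] == "Edinburgh", lambda r: r["country"] == "UK", 0.9),
    ("shenzhen_implies_china",
     lambda r: r["city"] == "Shenzhen", lambda r: r["country"] == "China", 0.5),
]
weights = (0.25, 0.25, 0.5)  # assumed fixed weights; the paper learns them

best = top_k_rules(rows, rules, weights, k=1)
print(best[0][1])
```

This sketch scores every candidate exhaustively. The top-k and any-time algorithms described in the abstract avoid that cost (the abstract mentions lazy evaluation), which the sketch does not attempt to reproduce.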


Author Profile
Wenfei Fan

Shenzhen Institute of Computing Sciences

Author Profile
Ziyan Han

University of Edinburgh

Author Profile
Yaoshu Wang

& Beihang University Shenzhen China

China

📄 Paper Information

Publication year: 2023
Citations: 7
Country of publication: China
Site: ACM
