An accuracy-privacy optimization framework considering user’s privacy requirements for data stream mining


연구 분야: Databases



학회: Journal of Big Data


초록

Data stream mining is a critical process utilized by organizations to derive insights from real-time data. Consequently, preserving the privacy of sensitive information while maintaining high accuracy remains a persistent challenge. Privacy-preserving data mining techniques modify data to increase privacy, a process that invariably decreases the accuracy of data mining algorithms. Though different techniques have been proposed to preserve privacy, there is a lack of well-formulated frameworks to optimize the trade-off between accuracy and privacy. This paper introduces a novel Accuracy-Privacy Optimization Framework (APOF) that allows users to define privacy requirements and predicts achievable accuracy levels, enabling fine-tuning of this balance. The logistic cumulative noise addition was used as the data perturbation method that has experimentally shown better performance and Hoeffding trees as the classifier. Additionally, a data fitting module using kernel regression is integrated, a unique approach that predicts accuracy levels based on user-defined privacy thresholds. Experimental results show that the proposed framework archives an optimal privacy level above 97% while minimising the accuracy loss across various datasets. By addressing critical gaps in privacy-preserving data mining, this study offers significant contributions to real-world applications, facilitating secure and efficient data utilization in dynamic environments.


Author Profile
Waruni Hewage

Department of IT Otago Polytechnic Auckland International Campus 350 Queen Street 1010 Auckland New Zealand

Italy
Author Profile
R. Sinha

School of Information Technology Deakin University Melbourne 610101 Australia

Australia
Author Profile
M. Asif Naeem

National University of Computer & Emerging Sciences (NUCES) 3 A. K. Brohi Road H-11/4 Islamabad Pakistan

Pakistan

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Australia, Italy, Pakistan
사이트 Springer
좋아요 수 0

연관 논문 목록 (191건)