ITRT(IT Research Trends)

Obfuscating the Dataset: Impacts and Applications

연구 분야: Analysis

논문 키워드: #learning #training #experiments #leakage #guarding

학회: ACM Transactions on Intelligent Systems and Technology, Volume 14, Issue 5

초록

Obfuscating a dataset by adding random noises to protect the privacy of sensitive samples in the training dataset is crucial to prevent data leakage to untrusted parties when dataset sharing is essential. We conduct comprehensive experiments to investigate how the dataset obfuscation can affect the resultant model weights —in terms of the model accuracy, ℓ2-distance-based model distance, and level of data privacy—and discuss the potential applications with the proposed Privacy, Utility, and Distinguishability (PUD)-triangle diagram to visualize the requirement preferences. Our experiments are based on the popular MNIST and CIFAR-10 datasets under both independent and identically distributed (IID) and non-IID settings. Significant results include a tradeoff between the model accuracy and privacy level and a tradeoff between the model difference and privacy level. The results indicate broad application prospects for training outsourcing and guarding against attacks in federated learning both of which have been increasingly attractive in many areas, particularly learning in edge computing.

📄 논문 정보

발행 연도	2023년
인용수	4
출판 국가	Australia, China
사이트	ACM
좋아요 수	0

Obfuscating the Dataset: Impacts and Applications

Obfuscating the Dataset: Impacts and Applications

Renping Liu

Wei Ni

Ping Yu

Caijun Sun

Xu Wang

Guangsheng Yu

📄 논문 정보

연관 논문 목록 (78건)

Obfuscating the Dataset: Impacts and Applications

Obfuscating the Dataset: Impacts and Applications

📄 논문 정보

연관 논문 목록 (78건) 내 서재 담기

연관 논문 목록 (78건)