Query Obfuscation for Information Retrieval Through Differential Privacy


연구 분야: Analysis



학회: European Conference on Information Retrieval


초록

Protecting the privacy of a user querying an Information Retrieval (IR) system is of utmost importance. The problem is exacerbated when the IR system is not cooperative in satisfying the user’s privacy requirements. To address this, obfuscation techniques split the user’s sensitive query into multiple non-sensitive ones that can be safely transmitted to the IR system. To generate such queries, current approaches rely on lexical databases, such as WordNet, or heuristics of word co-occurrences. At the same time, advances in Natural Language Processing (NLP) have shown the power of Differential Privacy (DP) in releasing privacy-preserving text for completely different purposes, such as spam detection and sentiment analysis. We investigate for the first time whether DP mechanisms, originally designed for specific NLP tasks, can effectively be used in IR to obfuscate queries. We also assess their performance compared to state-of-the-art techniques in IR. Our empirical evaluation shows that the Vickrey DP mechanism based on the Mahalanobis norm with privacy budget achieves state-of-the-art privacy protection and improved effectiveness. Furthermore, differently from previous approaches that are substantially on/off, by changing the privacy budget , DP allows users to adjust their desired level of privacy protection, offering a trade-off between effectiveness and privacy.


Author Profile
Guglielmo Faggioli

University of Padua Padua Italy

Italy
Author Profile
Nicola Ferro

University of Padua Padua Italy

Italy

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Italy
사이트 Springer
좋아요 수 0

연관 논문 목록 (9건)