Adversarial machine learning for spam filters


연구 분야: Artificial Intelligence



학회: ARES '20: Proceedings of the 15th International Conference on Availability, Reliability and Security


초록

Email spam filters based on machine learning techniques are widely deployed in today's organizations. As our society relies more on artificial intelligence (AI), the security of AI, especially the machine learning algorithms, becomes increasingly important and remains largely untested. Adversarial machine learning, on the other hand, attempts to defeat machine learning models through malicious input. In this paper, we experiment how adversarial scenario may impact the security of machine learning based mechanisms such as email spam filters. Using natural language processing (NLP) and Baysian model as an example, we developed and tested three invasive techniques, i.e., synonym replacement, ham word injection and spam word spacing. Our adversarial examples and results suggest that these techniques are effective in fooling the machine learning models. The study calls for more research on understanding and safeguarding machine learning based security mechanisms in the presence of adversaries.


Author Profile
Bhargav Kuchipudi

Central Michigan University

정보 없음
Author Profile
Ravi Teja Nannapaneni

Central Michigan University

정보 없음
Author Profile
Qi Liao

Central Michigan University

정보 없음

📄 논문 정보

발행 연도 2020년
인용수 20
출판 국가
사이트 ACM
좋아요 수 0

연관 논문 목록 (216건)