Text mining for plagiarism detection: multivariate pattern detection for recognition of text similarities


연구 분야: Safety



학회: ASONAM '18: Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining


초록

The problem of plagiarism the recent years has been intensified by the availability of information in digital form and the accessibility of the electronic libraries through the Internet. As a result, plagiarism detection has been transformed into a big data analytics problem since the number of digital sources is extravagant and a new document needs to be compared with millions of other existing documents. In this paper, a text mining methodology is proposed that can detect all common patterns between a document and the documents in a reference database. The technique is based on a pattern detection algorithm and the corresponding data structure that enables the algorithm to detect all common patterns. The methodology has been applied in a well-defined dataset providing very promising results identifying difficult cases of plagiarism such as technical disguise.


Author Profile
Konstantinos F Xylogiannopoulos

University of Calgary Calgary Canada

Canada
Author Profile
Panagiotis Karampelas

Hellenic Air Force Academy Dekelia Greece

Greece
Author Profile
Reda Sleiman Alhajj

University of Calgary Calgary Canada

Canada

📄 논문 정보

발행 연도 2020년
인용수 2
출판 국가 Greece, Canada
사이트 ACM
좋아요 수 0

연관 논문 목록 (185건)