Data for Digital Forensics: Why a Discussion on “How Realistic is Synthetic Data” is Dispensable


연구 분야: Safety



학회: Digital Threats: Research and Practice, Volume 4, Issue 3


초록

Digital forensics depends on data sets for various purposes like concept evaluation, educational training, and tool validation. Researchers have gathered such data sets into repositories and created data simulation frameworks for producing large amounts of data. Synthetic data often face skepticism due to its perceived deviation from real-world data, raising doubts about its realism. This paper addresses this concern, arguing that there is no definitive answer. We focus on four common digital forensic use cases that rely on data. Through these, we elucidate the specifications and prerequisites of data sets within their respective contexts. Our discourse uncovers that both real-world and synthetic data are indispensable for advancing digital forensic science, software, tools, and the competence of practitioners. Additionally, we provide an overview of available data set repositories and data generation frameworks, contributing to the ongoing dialogue on digital forensic data sets’ utility.


Author Profile
Thomas Göbel

Research Institute CODE University of the Bundeswehr Munich Germany

Germany
Author Profile
Harald Baier

Research Institute CODE University of the Bundeswehr Munich Germany

Germany
Author Profile
Frank Breitinger

School of Criminal Justice University of Lausanne Switzerland

Switzerland

📄 논문 정보

발행 연도 2023년
인용수 8
출판 국가 Germany, Switzerland
사이트 ACM
좋아요 수 0

연관 논문 목록 (239건)