A Survey on Speech Deepfake Detection


연구 분야: Strategies



학회: ACM Computing Surveys, Volume 57, Issue 7


초록

The availability of smart devices leads to an exponential increase in multimedia content. However, advancements in deep learning have also enabled the creation of highly sophisticated Deepfake content, including speech Deepfakes, which pose a serious threat by generating realistic voices and spreading misinformation. To combat this, numerous challenges have been organized to advance speech Deepfake detection techniques. In this survey, we systematically analyze more than 200 papers published up to March 2024. We provide a comprehensive review of each component in the detection pipeline, including model architectures, optimization techniques, generalizability, evaluation metrics, performance comparisons, available datasets, and open source availability. For each aspect, we assess recent progress and discuss ongoing challenges. In addition, we explore emerging topics such as partial Deepfake detection, cross-dataset evaluation, and defenses against adversarial attacks, while suggesting promising research directions. This survey not only identifies the current state of the art to establish strong baselines for future experiments but also offers clear guidance for researchers aiming to enhance speech Deepfake detection systems.


Author Profile
Menglu Li

Department of Electrical Computer and Biomedical Engineering Toronto Metropolitan University Toronto Canada

Andorra
Author Profile
Yasaman Ahmadiadli

Department of Electrical Computer and Biomedical Engineering Toronto Metropolitan University Toronto Canada

Andorra
Author Profile
Xiaoping Zhang

Shenzhen Key Laboratory of Ubiquitous Data Enabling Tsinghua Shenzhen International Graduate School Tsinghua University Shenzhen China and Department of Electrical Computer and Biomedical Engineering Toronto Metropolitan University Toronto Canada

Andorra

📄 논문 정보

발행 연도 2025년
인용수 5
출판 국가 Andorra
사이트 ACM
좋아요 수 0

연관 논문 목록 (11건)