Detecting audio forgery using deep learning techniques with attention mechanisms on ResNet++


연구 분야: Artificial Intelligence



학회: Signal, Image and Video Processing


초록

The emergence of deepfake technology, along with other forms of audio forgeries, raises new concerns for digital security and forensic examination. This paper proposes an effective audio forgery detection framework that utilizes a modified ResNet++ deep learning model with attention mechanisms and advanced preprocessing techniques. The algorithm includes a robust detection method that incorporates multi-scale feature fusion and transformer-based attention to a Mel-spectrogram. The model successfully overcomes challenges posed by intricate forgery patterns and overlapping background noise, demonstrating enhanced performance in noisy conditions. In addition, the inclusion of systematic preprocessing steps increases estimation accuracy, which is critical for real-time processing. The proposed algorithm shows good results, boasting 99.92% training accuracy, 95.45% validation accuracy, 99.52% precision, and a false positive rate of only 1.58%.


Author Profile
Deep Das

Department of Artificial Intelligence Sardar Vallabhbhai National Institute of Technology Surat 395007 Gujarat India

India
Author Profile
Rahul Dixit

Department of Artificial Intelligence Sardar Vallabhbhai National Institute of Technology Surat 395007 Gujarat India

India

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 India
사이트 Springer
좋아요 수 0

연관 논문 목록 (368건)