Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module


연구 분야: Analysis



학회: International Conference on Security and Privacy in New Computing Environments


초록

The correct distinction between highly realistic computer-generated (CG) images and photographic (PG) images has become an important area of research. In recent years, most of the CG image forensics methods are proposed based on deep learning, but the detection performances of these methods still need to be improved, especially in terms of robustness and generalization. To tackle these issues, we leverage the Vision Transformer (ViT) model, which excels in capturing the global features of images, and design a Forensic Feature Pre-processing (FFP) module to further improve the detection performance. Experiments are conducted on a large-scale CG image benchmark (LSCGB), which is a challenging dataset for CG image detection. The proposed approach can achieve high detection accuracy. Extensive experiments on different public datasets and common post-processing operations demonstrate our approach can achieve significantly better generalization and robustness than the state-of-the-art approaches.


Author Profile
Yifang Chen

Guangdong Polytechnic Normal University GuangZhou 510665 GuangDong China

China
Author Profile
Guanchen Wen

Guangdong Polytechnic Normal University GuangZhou 510665 GuangDong China

China
Author Profile
Yong Wang

Guangdong Polytechnic Normal University GuangZhou 510665 GuangDong China

China

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 China
사이트 Springer
좋아요 수 0

연관 논문 목록 (231건)