연구 분야: Software Development
학회: The Journal of Supercomputing
With the rapid development of intelligent security systems, the demand for vehicle re-identification has surged exponentially. Vehicle re-identification involves recognizing the same vehicle across different camera perspectives, necessitating robust local feature processing. While transformers have shown promising results in this field, their inherent self-attention mechanism tends to dilute high-frequency texture details, hindering local feature extraction. Additionally, challenges such as occlusion and misalignment can lead to information loss and noise introduction, reducing re-identification accuracy. To address these issues, we introduce the frequency transformer with local feature enhancement (LFFT). The proposed framework comprises a frequency layer and a jigsaw select patches module (JSPM). The frequency layer enhances the weights of high-frequency component features using fast Fourier transform to improve local feature extraction at the lower layers. Meanwhile, the attention layer at the higher layers continues to extract global features. The JSPM incorporates discriminative patches obtained from attention layers into randomly shuffled and reorganized groups, enhancing the global discriminative capability of local features. The method does not utilize additional information or auxiliary networks. Experimental evaluations on two vehicle re-identification datasets, VeRi-776 and VehicleID, demonstrate the effectiveness of our method compared to recent approaches. The code is available at https://github.com/xianghlin/LFFT, accompanied by detailed usage instructions.
| 발행 연도 | 2025년 |
|---|---|
| 인용수 | 0 |
| 출판 국가 | China |
| 사이트 | Springer |
| 좋아요 수 | 0 |