Adaptive transformer with Pyramid Fusion for cloth-changing Person Re-Identification

Guoqing Zhang, Jieqiong Zhou, Yuhui Zheng, Gaven Martin, Ruili Wang

Research output: Journal PublicationArticlepeer-review

Abstract

Recently, Transformer-based methods have made great progress in person re-identification (Re-ID), especially in handling identity changes in clothing-changing scenarios. Most current studies usually use biometric information-assisted methods such as human pose estimation to enhance the local perception ability of clothes-changing Re-ID. However, it is usually difficult for them to establish the connection between local biometric information and global identity semantics during training, resulting in the lack of local perception ability during the inference phase, which limits the improvement of model performance. In this paper, we propose a Transformer-based Adaptive-Aware Attention and Pyramid Fusion Network (A3PFN) for CC Re-ID, which can capture and integrate multi-scale visual information to enhance recognition ability. Firstly, to improve the information utilization efficiency of the model in cloth-changing scenarios, we propose a Multi-Layer Dynamic Concentration module (MLDC) to evaluate the importance features at each layer in real time and reduce the computational overlap between related layers. Secondly, we propose a Local Pyramid Aggregation Module (LPAM) to extract multi-scale features, aiming to maintain global perceptual capability and focus on key local information. In this module, we also combine the Fast Fourier Transform (FFT) with self-attention mechanism to more effectively identify and analyze pedestrian gait and other structural details in the frequency domain and reduce the computational complexity of processing high-dimensional data in the self-attention mechanism. Finally, we build a new dataset incorporating diverse atmospheric conditions (for instance wind and rain) to more realistically simulate natural scenarios for the changing of clothes. Extensive experiments on multiple cloth-changing datasets clearly confirm the superior performance of A3PFN. The dataset and related code are available on the website: https://github.com/jieqiongz1999/vcclothes-w-r.

Original languageEnglish
Article number111443
JournalPattern Recognition
Volume163
DOIs
Publication statusPublished - Jul 2025

Keywords

  • Cloth changing
  • Person re-identification
  • Vision transformer

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Adaptive transformer with Pyramid Fusion for cloth-changing Person Re-Identification'. Together they form a unique fingerprint.

Cite this