Combating spatial redundancy with spectral norm attention in convolutional learners

Jiansheng Fang, Dan Zeng, Xiao Yan, Yubing Zhang, Hongbo Liu, Bo Tang, Ming Yang, Jiang Liu

Research output: Journal PublicationArticlepeer-review

1 Citation (Scopus)

Abstract

There is an inherent and longstanding challenge for vision learners to exploit informative features from digital images with spatial redundancy. Given pre-processing image methods require task-specific customization and may rise unanticipated poor performance due to redundancy removal, we explore improving vision learners to combat spatial redundancy during vision learning, a task-agnostic and robust solution. Among popular vision learners, vision transformers with self-attention can mitigate pixel redundancy by capturing global dependencies, while convolutional learners fall into locality via a limited receptive field. To this end, based on investigating inter-pixel spatial redundancy of images, in this work, we propose spectral norm attention (SNA), a novel yet efficient attention block to help convolutional neural networks (CNNs) highlight informative features. We can seamlessly plug SNA into off-the-shelf CNNs to suppress the contributions of redundant features by globally differentiating and weighting. In particular, SNA performs singular value decomposition (SVD) on intermediate features of each image within a mini-batch to obtain its spectral norm. The features in the direction of the spectral norm are most informative, while the discriminative features in other directions leave less. Hence, we apply the rank-one approximation of the spectral norm direction as attention weights to enhance informative features. Besides, we adopt the power iteration algorithm to approximate the spectral norm to significantly reduce the matrix computation overhead during training, thus keeping inference speed on par with vanilla CNNs. We extensively evaluate our SNA on four mainstream natural datasets to demonstrate the effectiveness and favourability of our SNA against its counterparts. In addition, the experimental results of image classification and object detection show our SNA can bring more gains to medical images with heavy redundancy than other state-of-the-art attention modules.

Original languageEnglish
Pages (from-to)105-116
Number of pages12
JournalNeurocomputing
Volume511
DOIs
Publication statusPublished - 28 Oct 2022
Externally publishedYes

Keywords

  • Attention
  • Convolutional Neural Networks
  • Singular value decomposition
  • Spatial redundancy
  • Spectral norm

ASJC Scopus subject areas

  • Computer Science Applications
  • Cognitive Neuroscience
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Combating spatial redundancy with spectral norm attention in convolutional learners'. Together they form a unique fingerprint.

Cite this