Spatiotemporal Saliency Based Multi-stream Networks for Action Recognition

Zhenbing Liu, Zeya Li, Ming Zong, Wanting Ji, Ruili Wang, Yan Tian

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Human action recognition is a challenging research topic since videos often contain clutter backgrounds, which impairs the performance of human action recognition. In this paper, we propose a novel spatiotemporal saliency based multi-stream ResNet for human action recognition, which combines three different streams: a spatial stream with RGB frames as input, a temporal stream with optical flow frames as input, and a spatiotemporal saliency stream with spatiotemporal saliency maps as input. The spatiotemporal saliency stream is responsible for capturing the spatiotemporal object foreground information from spatiotemporal saliency maps which are generated by a geodesic distance based video segmentation method. Such architecture can reduce the background interference in videos and provide the spatiotemporal object foreground information for human action recognition. Experimental results on UCF101 and HMDB51 datasets demonstrate that the complementary spatiotemporal information can further improve the performance of action recognition, and our proposed method obtains the competitive performance compared with the state-of-the-art methods.

Original languageEnglish
Title of host publicationPattern Recognition - ACPR 2019 Workshops, Proceedings
EditorsMichael Cree, Fay Huang, Junsong Yuan, Wei Qi Yan
PublisherSpringer
Pages74-84
Number of pages11
ISBN (Print)9789811536502
DOIs
Publication statusPublished - 2020
Externally publishedYes
Event5th Asian Conference on Pattern Recognition,ACPR 2019 - Auckland, New Zealand
Duration: 26 Nov 201929 Nov 2019

Publication series

NameCommunications in Computer and Information Science
Volume1180 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference5th Asian Conference on Pattern Recognition,ACPR 2019
Country/TerritoryNew Zealand
CityAuckland
Period26/11/1929/11/19

Keywords

  • Action recognition
  • ResNet
  • Spatiotemporal saliency map image

ASJC Scopus subject areas

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Spatiotemporal Saliency Based Multi-stream Networks for Action Recognition'. Together they form a unique fingerprint.

Cite this