An unsupervised video foreground co-localization and segmentation process by incorporating motion cues and frame features

Chao Zhang, Qian Zhang, Chi Zheng, Guoping Qiu

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Video foreground segmentation is one of the key problems in video processing. In this paper, we proposed a novel and fully unsupervised approach for foreground object co-localization and segmentation of unconstrained videos. We firstly compute both the actual edges and motion boundaries of the video frames, and then align them by their HOG feature maps. Then, by filling the occlusions generated by the aligned edges, we obtained more precise masks about the foreground object. Such motion-based masks could be derived as the motion-based likelihood. Moreover, the color-base likelihood is adopted for the segmentation process. Experimental Results show that our approach outperforms most of the State-of-the-art algorithms.

Original languageEnglish
Title of host publicationNinth International Conference on Graphic and Image Processing, ICGIP 2017
EditorsHui Yu, Junyu Dong
PublisherSPIE
ISBN (Electronic)9781510617414
DOIs
Publication statusPublished - 2018
Event9th International Conference on Graphic and Image Processing, ICGIP 2017 - Qingdao, China
Duration: 14 Oct 201716 Oct 2017

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume10615
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference9th International Conference on Graphic and Image Processing, ICGIP 2017
Country/TerritoryChina
CityQingdao
Period14/10/1716/10/17

Keywords

  • Foreground Detection
  • Motion Boundaries
  • Object Co-Localization
  • Segmentation
  • Video Processing

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'An unsupervised video foreground co-localization and segmentation process by incorporating motion cues and frame features'. Together they form a unique fingerprint.

Cite this