Hybrid generative–discriminative hash tracking with spatio-temporal contextual cues

Manna Dai; Shuying Cheng; Xiangjian He

doi:10.1007/s00521-016-2452-z

Hybrid generative–discriminative hash tracking with spatio-temporal contextual cues

Manna Dai, Shuying Cheng, Xiangjian He

Research output: Journal Publication › Article › peer-review

2 Citations (Scopus)

Abstract

Visual object tracking is of a great application value in video monitoring systems. Recent work on video tracking has taken into account spatial relationship between the targeted object and its background. In this paper, the spatial relationship is combined with the temporal relationship between features on different video frames so that a real-time tracker is designed based on a hash algorithm with spatio-temporal cues. Different from most of the existing work on video tracking, which is regarded as a mechanism for image matching or image classification alone, we propose a hierarchical framework and conduct both matching and classification tasks to generate a coarse-to-fine tracking system. We develop a generative model under a modified particle filter with hash fingerprints for the coarse matching by the maximum a posteriori and a discriminative model for the fine classification by maximizing a confidence map based on a context model. The confidence map reveals the spatio-temporal dynamics of the target. Because hash fingerprint is merely a binary vector and the modified particle filter uses only a small number of particles, our tracker has a low computation cost. By conducting experiments on eight challenging video sequences from a public benchmark, we demonstrate that our tracker outperforms eight state-of-the-art trackers in terms of both accuracy and speed.

Original language	English
Pages (from-to)	389-399
Number of pages	11
Journal	Neural Computing and Applications
Volume	29
Issue number	2
DOIs	https://doi.org/10.1007/s00521-016-2452-z
Publication status	Published - 1 Jan 2018
Externally published	Yes

Keywords

Confidence map
Hash algorithm
Hierarchical framework
Maximum a posteriori (MAP)
Spatio-temporal cues

ASJC Scopus subject areas

Software
Artificial Intelligence

Access to Document

10.1007/s00521-016-2452-z

Cite this

@article{c912fe6c0e2b4a019391c5ab03acc398,

title = "Hybrid generative–discriminative hash tracking with spatio-temporal contextual cues",

abstract = "Visual object tracking is of a great application value in video monitoring systems. Recent work on video tracking has taken into account spatial relationship between the targeted object and its background. In this paper, the spatial relationship is combined with the temporal relationship between features on different video frames so that a real-time tracker is designed based on a hash algorithm with spatio-temporal cues. Different from most of the existing work on video tracking, which is regarded as a mechanism for image matching or image classification alone, we propose a hierarchical framework and conduct both matching and classification tasks to generate a coarse-to-fine tracking system. We develop a generative model under a modified particle filter with hash fingerprints for the coarse matching by the maximum a posteriori and a discriminative model for the fine classification by maximizing a confidence map based on a context model. The confidence map reveals the spatio-temporal dynamics of the target. Because hash fingerprint is merely a binary vector and the modified particle filter uses only a small number of particles, our tracker has a low computation cost. By conducting experiments on eight challenging video sequences from a public benchmark, we demonstrate that our tracker outperforms eight state-of-the-art trackers in terms of both accuracy and speed.",

keywords = "Confidence map, Hash algorithm, Hierarchical framework, Maximum a posteriori (MAP), Spatio-temporal cues",

author = "Manna Dai and Shuying Cheng and Xiangjian He",

note = "Publisher Copyright: {\textcopyright} 2016, The Natural Computing Applications Forum.",

year = "2018",

month = jan,

day = "1",

doi = "10.1007/s00521-016-2452-z",

language = "English",

volume = "29",

pages = "389--399",

journal = "Neural Computing and Applications",

issn = "0941-0643",

publisher = "Springer London",

number = "2",

}

TY - JOUR

T1 - Hybrid generative–discriminative hash tracking with spatio-temporal contextual cues

AU - Dai, Manna

AU - Cheng, Shuying

AU - He, Xiangjian

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Visual object tracking is of a great application value in video monitoring systems. Recent work on video tracking has taken into account spatial relationship between the targeted object and its background. In this paper, the spatial relationship is combined with the temporal relationship between features on different video frames so that a real-time tracker is designed based on a hash algorithm with spatio-temporal cues. Different from most of the existing work on video tracking, which is regarded as a mechanism for image matching or image classification alone, we propose a hierarchical framework and conduct both matching and classification tasks to generate a coarse-to-fine tracking system. We develop a generative model under a modified particle filter with hash fingerprints for the coarse matching by the maximum a posteriori and a discriminative model for the fine classification by maximizing a confidence map based on a context model. The confidence map reveals the spatio-temporal dynamics of the target. Because hash fingerprint is merely a binary vector and the modified particle filter uses only a small number of particles, our tracker has a low computation cost. By conducting experiments on eight challenging video sequences from a public benchmark, we demonstrate that our tracker outperforms eight state-of-the-art trackers in terms of both accuracy and speed.

AB - Visual object tracking is of a great application value in video monitoring systems. Recent work on video tracking has taken into account spatial relationship between the targeted object and its background. In this paper, the spatial relationship is combined with the temporal relationship between features on different video frames so that a real-time tracker is designed based on a hash algorithm with spatio-temporal cues. Different from most of the existing work on video tracking, which is regarded as a mechanism for image matching or image classification alone, we propose a hierarchical framework and conduct both matching and classification tasks to generate a coarse-to-fine tracking system. We develop a generative model under a modified particle filter with hash fingerprints for the coarse matching by the maximum a posteriori and a discriminative model for the fine classification by maximizing a confidence map based on a context model. The confidence map reveals the spatio-temporal dynamics of the target. Because hash fingerprint is merely a binary vector and the modified particle filter uses only a small number of particles, our tracker has a low computation cost. By conducting experiments on eight challenging video sequences from a public benchmark, we demonstrate that our tracker outperforms eight state-of-the-art trackers in terms of both accuracy and speed.

KW - Confidence map

KW - Hash algorithm

KW - Hierarchical framework

KW - Maximum a posteriori (MAP)

KW - Spatio-temporal cues

UR - http://www.scopus.com/inward/record.url?scp=84978792975&partnerID=8YFLogxK

U2 - 10.1007/s00521-016-2452-z

DO - 10.1007/s00521-016-2452-z

M3 - Article

AN - SCOPUS:84978792975

SN - 0941-0643

VL - 29

SP - 389

EP - 399

JO - Neural Computing and Applications

JF - Neural Computing and Applications

IS - 2

ER -

Hybrid generative–discriminative hash tracking with spatio-temporal contextual cues

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this