Unsupervised video hashing by exploiting spatio-temporal feature

Chao Ma; Yun Gu; Wei Liu; Jie Yang; Xiangjian He

doi:10.1007/978-3-319-46675-0_56

Unsupervised video hashing by exploiting spatio-temporal feature

Chao Ma, Yun Gu, Wei Liu, Jie Yang, Xiangjian He

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

6 Citations (Scopus)

Abstract

Video hashing is a common solution for content-based video retrieval by encoding high-dimensional feature vectors into short binary codes. Videos not only have spatial structure inside each frame but also have temporal correlation structure between frames, while the latter has been largely neglected by many existing methods. Therefore, in this paper we propose to perform video hashing by incorporating the temporal structure as well as the conventional spatial structure. Specifically, the spatial features of videos are obtained by utilizing Convolutional Neural Network (CNN), and the temporal features are established via Long-Short Term Memory (LSTM). The proposed spatio-temporal feature learning framework can be applied to many existing unsupervised hashing methods such as Iterative Quantization (ITQ), Spectral Hashing (SH), and others. Experimental results on the UCF-101 dataset indicate that by simultaneously employing the temporal features and spatial features, our hashing method is able to significantly improve the performance of existing methods which only deploy the spatial feature.

Original language	English
Title of host publication	Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings
Editors	Akira Hirose, Minho Lee, Derong Liu, Kenji Doya, Kazushi Ikeda, Seiichi Ozawa
Publisher	Springer Verlag
Pages	511-518
Number of pages	8
ISBN (Print)	9783319466743
DOIs	https://doi.org/10.1007/978-3-319-46675-0_56
Publication status	Published - 2016
Externally published	Yes

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	9949 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Keywords

Spatio-temporal feature
Unsupervised method
Video hashing

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-319-46675-0_56

Cite this

Ma, C., Gu, Y., Liu, W., Yang, J., & He, X. (2016). Unsupervised video hashing by exploiting spatio-temporal feature. In A. Hirose, M. Lee, D. Liu, K. Doya, K. Ikeda, & S. Ozawa (Eds.), Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings (pp. 511-518). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9949 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-46675-0_56

Ma, Chao ; Gu, Yun ; Liu, Wei et al. / Unsupervised video hashing by exploiting spatio-temporal feature. Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. editor / Akira Hirose ; Minho Lee ; Derong Liu ; Kenji Doya ; Kazushi Ikeda ; Seiichi Ozawa. Springer Verlag, 2016. pp. 511-518 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{84f727035ca84ea6bf40c6a9d5446019,

title = "Unsupervised video hashing by exploiting spatio-temporal feature",

abstract = "Video hashing is a common solution for content-based video retrieval by encoding high-dimensional feature vectors into short binary codes. Videos not only have spatial structure inside each frame but also have temporal correlation structure between frames, while the latter has been largely neglected by many existing methods. Therefore, in this paper we propose to perform video hashing by incorporating the temporal structure as well as the conventional spatial structure. Specifically, the spatial features of videos are obtained by utilizing Convolutional Neural Network (CNN), and the temporal features are established via Long-Short Term Memory (LSTM). The proposed spatio-temporal feature learning framework can be applied to many existing unsupervised hashing methods such as Iterative Quantization (ITQ), Spectral Hashing (SH), and others. Experimental results on the UCF-101 dataset indicate that by simultaneously employing the temporal features and spatial features, our hashing method is able to significantly improve the performance of existing methods which only deploy the spatial feature.",

keywords = "Spatio-temporal feature, Unsupervised method, Video hashing",

author = "Chao Ma and Yun Gu and Wei Liu and Jie Yang and Xiangjian He",

note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG 2016.",

year = "2016",

doi = "10.1007/978-3-319-46675-0_56",

language = "English",

isbn = "9783319466743",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "511--518",

editor = "Akira Hirose and Minho Lee and Derong Liu and Kenji Doya and Kazushi Ikeda and Seiichi Ozawa",

booktitle = "Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings",

address = "Germany",

}

Ma, C, Gu, Y, Liu, W, Yang, J & He, X 2016, Unsupervised video hashing by exploiting spatio-temporal feature. in A Hirose, M Lee, D Liu, K Doya, K Ikeda & S Ozawa (eds), Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9949 LNCS, Springer Verlag, pp. 511-518. https://doi.org/10.1007/978-3-319-46675-0_56

Unsupervised video hashing by exploiting spatio-temporal feature. / Ma, Chao; Gu, Yun; Liu, Wei et al.
Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. ed. / Akira Hirose; Minho Lee; Derong Liu; Kenji Doya; Kazushi Ikeda; Seiichi Ozawa. Springer Verlag, 2016. p. 511-518 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9949 LNCS).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Unsupervised video hashing by exploiting spatio-temporal feature

AU - Ma, Chao

AU - Gu, Yun

AU - Liu, Wei

AU - Yang, Jie

AU - He, Xiangjian

N1 - Publisher Copyright: © Springer International Publishing AG 2016.

PY - 2016

Y1 - 2016

N2 - Video hashing is a common solution for content-based video retrieval by encoding high-dimensional feature vectors into short binary codes. Videos not only have spatial structure inside each frame but also have temporal correlation structure between frames, while the latter has been largely neglected by many existing methods. Therefore, in this paper we propose to perform video hashing by incorporating the temporal structure as well as the conventional spatial structure. Specifically, the spatial features of videos are obtained by utilizing Convolutional Neural Network (CNN), and the temporal features are established via Long-Short Term Memory (LSTM). The proposed spatio-temporal feature learning framework can be applied to many existing unsupervised hashing methods such as Iterative Quantization (ITQ), Spectral Hashing (SH), and others. Experimental results on the UCF-101 dataset indicate that by simultaneously employing the temporal features and spatial features, our hashing method is able to significantly improve the performance of existing methods which only deploy the spatial feature.

AB - Video hashing is a common solution for content-based video retrieval by encoding high-dimensional feature vectors into short binary codes. Videos not only have spatial structure inside each frame but also have temporal correlation structure between frames, while the latter has been largely neglected by many existing methods. Therefore, in this paper we propose to perform video hashing by incorporating the temporal structure as well as the conventional spatial structure. Specifically, the spatial features of videos are obtained by utilizing Convolutional Neural Network (CNN), and the temporal features are established via Long-Short Term Memory (LSTM). The proposed spatio-temporal feature learning framework can be applied to many existing unsupervised hashing methods such as Iterative Quantization (ITQ), Spectral Hashing (SH), and others. Experimental results on the UCF-101 dataset indicate that by simultaneously employing the temporal features and spatial features, our hashing method is able to significantly improve the performance of existing methods which only deploy the spatial feature.

KW - Spatio-temporal feature

KW - Unsupervised method

KW - Video hashing

UR - http://www.scopus.com/inward/record.url?scp=84992743705&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-46675-0_56

DO - 10.1007/978-3-319-46675-0_56

M3 - Conference contribution

AN - SCOPUS:84992743705

SN - 9783319466743

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 511

EP - 518

BT - Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings

A2 - Hirose, Akira

A2 - Lee, Minho

A2 - Liu, Derong

A2 - Doya, Kenji

A2 - Ikeda, Kazushi

A2 - Ozawa, Seiichi

PB - Springer Verlag

ER -

Ma C, Gu Y, Liu W, Yang J, He X. Unsupervised video hashing by exploiting spatio-temporal feature. In Hirose A, Lee M, Liu D, Doya K, Ikeda K, Ozawa S, editors, Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. Springer Verlag. 2016. p. 511-518. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-46675-0_56

Unsupervised video hashing by exploiting spatio-temporal feature

Abstract

Publication series

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this