Learning a discriminative dictionary with CNN for image classification

Shuai Yu; Tao Zhang; Chao Ma; Lei Zhou; Jie Yang; Xiangjian He

doi:10.1007/978-3-319-46672-9_22

Learning a discriminative dictionary with CNN for image classification

Shuai Yu, Tao Zhang, Chao Ma, Lei Zhou, Jie Yang, Xiangjian He

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

3 Citations (Scopus)

Abstract

In this paper, we propose a novel framework for image recognition based on an extended sparse model. First, inspired by the impressive results of CNN over different tasks in computer vision, we use the CNN models pre-trained on large datasets to generate features. Then we propose an extended sparse model which learns a dictionary from the CNN features by incorporating the reconstruction residual term and the coefficients adjustment term. Minimizing the reconstruction residual term guarantees that the class-specific sub-dictionary has good representation power for the samples from the corresponding class and minimizing the coefficients adjustment term encourages samples from different classes to be reconstructed by different class-specific sub-dictionaries. With this learned dictionary, not only the representation residual but also the representation coefficients will be discriminative. Finally, a metric involving these discriminative information is introduced for image classification. Experiments on Caltech101 and PASCAL VOC 2012 datasets show the effectiveness of the proposed method on image classification.

Original language	English
Title of host publication	Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings
Editors	Seiichi Ozawa, Kazushi Ikeda, Derong Liu, Akira Hirose, Kenji Doya, Minho Lee
Publisher	Springer Verlag
Pages	185-194
Number of pages	10
ISBN (Print)	9783319466712
DOIs	https://doi.org/10.1007/978-3-319-46672-9_22
Publication status	Published - 2016
Externally published	Yes
Event	23rd International Conference on Neural Information Processing, ICONIP 2016 - Kyoto, Japan Duration: 16 Oct 2016 → 21 Oct 2016

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	9948 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	23rd International Conference on Neural Information Processing, ICONIP 2016
Country/Territory	Japan
City	Kyoto
Period	16/10/16 → 21/10/16

Keywords

Convolutional Neural Networks
Image classification
Sparse model
Unsupervised dictionary learning

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-319-46672-9_22

Cite this

Yu, S., Zhang, T., Ma, C., Zhou, L., Yang, J., & He, X. (2016). Learning a discriminative dictionary with CNN for image classification. In S. Ozawa, K. Ikeda, D. Liu, A. Hirose, K. Doya, & M. Lee (Eds.), Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings (pp. 185-194). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9948 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-46672-9_22

Yu, Shuai ; Zhang, Tao ; Ma, Chao et al. / Learning a discriminative dictionary with CNN for image classification. Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. editor / Seiichi Ozawa ; Kazushi Ikeda ; Derong Liu ; Akira Hirose ; Kenji Doya ; Minho Lee. Springer Verlag, 2016. pp. 185-194 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{91ee026f86264633b9834994d4efb27a,

title = "Learning a discriminative dictionary with CNN for image classification",

abstract = "In this paper, we propose a novel framework for image recognition based on an extended sparse model. First, inspired by the impressive results of CNN over different tasks in computer vision, we use the CNN models pre-trained on large datasets to generate features. Then we propose an extended sparse model which learns a dictionary from the CNN features by incorporating the reconstruction residual term and the coefficients adjustment term. Minimizing the reconstruction residual term guarantees that the class-specific sub-dictionary has good representation power for the samples from the corresponding class and minimizing the coefficients adjustment term encourages samples from different classes to be reconstructed by different class-specific sub-dictionaries. With this learned dictionary, not only the representation residual but also the representation coefficients will be discriminative. Finally, a metric involving these discriminative information is introduced for image classification. Experiments on Caltech101 and PASCAL VOC 2012 datasets show the effectiveness of the proposed method on image classification.",

keywords = "Convolutional Neural Networks, Image classification, Sparse model, Unsupervised dictionary learning",

author = "Shuai Yu and Tao Zhang and Chao Ma and Lei Zhou and Jie Yang and Xiangjian He",

note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG 2016.; 23rd International Conference on Neural Information Processing, ICONIP 2016 ; Conference date: 16-10-2016 Through 21-10-2016",

year = "2016",

doi = "10.1007/978-3-319-46672-9_22",

language = "English",

isbn = "9783319466712",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "185--194",

editor = "Seiichi Ozawa and Kazushi Ikeda and Derong Liu and Akira Hirose and Kenji Doya and Minho Lee",

booktitle = "Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings",

address = "Germany",

}

Yu, S, Zhang, T, Ma, C, Zhou, L, Yang, J & He, X 2016, Learning a discriminative dictionary with CNN for image classification. in S Ozawa, K Ikeda, D Liu, A Hirose, K Doya & M Lee (eds), Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9948 LNCS, Springer Verlag, pp. 185-194, 23rd International Conference on Neural Information Processing, ICONIP 2016, Kyoto, Japan, 16/10/16. https://doi.org/10.1007/978-3-319-46672-9_22

Learning a discriminative dictionary with CNN for image classification. / Yu, Shuai; Zhang, Tao; Ma, Chao et al.
Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. ed. / Seiichi Ozawa; Kazushi Ikeda; Derong Liu; Akira Hirose; Kenji Doya; Minho Lee. Springer Verlag, 2016. p. 185-194 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9948 LNCS).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Learning a discriminative dictionary with CNN for image classification

AU - Yu, Shuai

AU - Zhang, Tao

AU - Ma, Chao

AU - Zhou, Lei

AU - Yang, Jie

AU - He, Xiangjian

N1 - Publisher Copyright: © Springer International Publishing AG 2016.

PY - 2016

Y1 - 2016

N2 - In this paper, we propose a novel framework for image recognition based on an extended sparse model. First, inspired by the impressive results of CNN over different tasks in computer vision, we use the CNN models pre-trained on large datasets to generate features. Then we propose an extended sparse model which learns a dictionary from the CNN features by incorporating the reconstruction residual term and the coefficients adjustment term. Minimizing the reconstruction residual term guarantees that the class-specific sub-dictionary has good representation power for the samples from the corresponding class and minimizing the coefficients adjustment term encourages samples from different classes to be reconstructed by different class-specific sub-dictionaries. With this learned dictionary, not only the representation residual but also the representation coefficients will be discriminative. Finally, a metric involving these discriminative information is introduced for image classification. Experiments on Caltech101 and PASCAL VOC 2012 datasets show the effectiveness of the proposed method on image classification.

AB - In this paper, we propose a novel framework for image recognition based on an extended sparse model. First, inspired by the impressive results of CNN over different tasks in computer vision, we use the CNN models pre-trained on large datasets to generate features. Then we propose an extended sparse model which learns a dictionary from the CNN features by incorporating the reconstruction residual term and the coefficients adjustment term. Minimizing the reconstruction residual term guarantees that the class-specific sub-dictionary has good representation power for the samples from the corresponding class and minimizing the coefficients adjustment term encourages samples from different classes to be reconstructed by different class-specific sub-dictionaries. With this learned dictionary, not only the representation residual but also the representation coefficients will be discriminative. Finally, a metric involving these discriminative information is introduced for image classification. Experiments on Caltech101 and PASCAL VOC 2012 datasets show the effectiveness of the proposed method on image classification.

KW - Convolutional Neural Networks

KW - Image classification

KW - Sparse model

KW - Unsupervised dictionary learning

UR - http://www.scopus.com/inward/record.url?scp=84992677771&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-46672-9_22

DO - 10.1007/978-3-319-46672-9_22

M3 - Conference contribution

AN - SCOPUS:84992677771

SN - 9783319466712

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 185

EP - 194

BT - Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings

A2 - Ozawa, Seiichi

A2 - Ikeda, Kazushi

A2 - Liu, Derong

A2 - Hirose, Akira

A2 - Doya, Kenji

A2 - Lee, Minho

PB - Springer Verlag

T2 - 23rd International Conference on Neural Information Processing, ICONIP 2016

Y2 - 16 October 2016 through 21 October 2016

ER -

Yu S, Zhang T, Ma C, Zhou L, Yang J, He X. Learning a discriminative dictionary with CNN for image classification. In Ozawa S, Ikeda K, Liu D, Hirose A, Doya K, Lee M, editors, Neural Information Processing - 23rd International Conference, ICONIP 2016, Proceedings. Springer Verlag. 2016. p. 185-194. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-46672-9_22

Learning a discriminative dictionary with CNN for image classification

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this