Facial expression recognition using a hybrid CNN-SIFT aggregator

Tee Connie; Mundher Al-Shabi; Wooi Ping Cheah; Michael Goh

doi:10.1007/978-3-319-69456-6_12

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

103 Citations (Scopus)

Abstract

Deriving an effective facial expression recognition component is important for a successful human-computer interaction system. Nonetheless, recognizing facial expressions remains a challenging task. This paper describes a novel approach to the facial expression recognition task. The proposed method is motivated by the success of Convolutional Neural Networks (CNN) on the face recognition problem. Unlike other works, we focus on achieving good accuracy while requiring only a small amount of training data. Scale Invariant Feature Transform (SIFT) features are used to increase performance on small datasets, as SIFT does not require extensive training data to generate useful features. In this paper, both Dense SIFT and regular SIFT are studied and compared when merged with CNN features. Moreover, an aggregator of the models is developed. The proposed approach is tested on the FER-2013 and CK+ datasets. Results demonstrate the superiority of CNN with Dense SIFT over conventional CNN and CNN with SIFT. Accuracy increases further when all the models are aggregated, which yields state-of-the-art results on both datasets: 73.4% on FER-2013 and 99.1% on CK+.
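
The abstract mentions two operations without giving details: merging SIFT features with CNN features, and aggregating several models. The sketch below illustrates one plausible reading and is not the authors' implementation. It assumes that merging means concatenating feature vectors and that the aggregator averages per-class probabilities across models; the function names, the three-class setup, and the toy scores are all hypothetical.

```python
import math


def softmax(logits):
    """Turn raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_features(cnn_features, sift_features):
    """Merge two feature vectors by concatenation (assumed fusion rule)."""
    return list(cnn_features) + list(sift_features)


def aggregate(prob_vectors):
    """Average class probabilities across models (assumed aggregation rule)."""
    n_models = len(prob_vectors)
    n_classes = len(prob_vectors[0])
    return [sum(p[c] for p in prob_vectors) / n_models for c in range(n_classes)]


def predict(prob_vector):
    """Return the index of the most probable class."""
    return max(range(len(prob_vector)), key=prob_vector.__getitem__)


# Toy feature vectors standing in for a CNN embedding and a SIFT descriptor.
fused = fuse_features([0.1, 0.7], [0.3, 0.0, 0.9])

# Toy scores from three hypothetical models over three hypothetical classes.
cnn_only = softmax([2.0, 1.0, 0.1])
cnn_sift = softmax([1.2, 1.5, 0.3])
cnn_dense_sift = softmax([0.5, 2.5, 0.2])

ensemble = aggregate([cnn_only, cnn_sift, cnn_dense_sift])
label = predict(ensemble)
```

In this toy setup the CNN-only model favours class 0, while the averaged ensemble favours class 1 because the two SIFT-augmented models agree on it. Averaging is only one possible aggregation rule; voting or weighted combinations would fit the abstract's wording equally well.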

Original language	English
Title of host publication	Multi-disciplinary Trends in Artificial Intelligence - 11th International Workshop, MIWAI 2017, Proceedings
Editors	Somnuk Phon-Amnuaisuk, Swee-Peng Ang, Soo-Young Lee
Publisher	Springer Verlag
Pages	139-149
Number of pages	11
ISBN (Print)	9783319694559
DOIs	https://doi.org/10.1007/978-3-319-69456-6_12
Publication status	Published - 2017
Externally published	Yes
Event	11th Multi-disciplinary International Workshop on Artificial Intelligence, MIWAI 2017 - Gadong, Brunei Darussalam
Duration	20 Nov 2017 → 22 Nov 2017

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	10607 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	11th Multi-disciplinary International Workshop on Artificial Intelligence, MIWAI 2017
Country/Territory	Brunei Darussalam
City	Gadong
Period	20/11/17 → 22/11/17

Keywords

CNN
Dense SIFT
Facial expression recognition
SIFT

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-319-69456-6_12

Cite this

Connie, T., Al-Shabi, M., Cheah, W. P., & Goh, M. (2017). Facial expression recognition using a hybrid CNN-SIFT aggregator. In S. Phon-Amnuaisuk, S.-P. Ang, & S.-Y. Lee (Eds.), Multi-disciplinary Trends in Artificial Intelligence - 11th International Workshop, MIWAI 2017, Proceedings (pp. 139-149). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10607 LNAI). Springer Verlag. https://doi.org/10.1007/978-3-319-69456-6_12

@inproceedings{2a468accfa784e96a0e7b0e17dacd89c,
  title = "Facial expression recognition using a hybrid CNN-SIFT aggregator",
  abstract = "Deriving an effective facial expression recognition component is important for a successful human-computer interaction system. Nonetheless, recognizing facial expression remains a challenging task. This paper describes a novel approach towards facial expression recognition task. The proposed method is motivated by the success of Convolutional Neural Networks (CNN) on the face recognition problem. Unlike other works, we focus on achieving good accuracy while requiring only a small sample data for training. Scale Invariant Feature Transform (SIFT) features are used to increase the performance on small data as SIFT does not require extensive training data to generate useful features. In this paper, both Dense SIFT and regular SIFT are studied and compared when merged with CNN features. Moreover, an aggregator of the models is developed. The proposed approach is tested on the FER-2013 and CK+ datasets. Results demonstrate the superiority of CNN with Dense SIFT over conventional CNN and CNN with SIFT. The accuracy even increased when all the models are aggregated which generates state-of-art results on FER-2013 and CK+ datasets, where it achieved 73.4% on FER-2013 and 99.1% on CK+.",
  keywords = "CNN, Dense SIFT, Facial expression recognition, SIFT",
  author = "Tee Connie and Mundher Al-Shabi and Cheah, {Wooi Ping} and Michael Goh",
  note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG 2017.; 11th Multi-disciplinary International Workshop on Artificial Intelligence, MIWAI 2017 ; Conference date: 20-11-2017 Through 22-11-2017",
  year = "2017",
  doi = "10.1007/978-3-319-69456-6_12",
  language = "English",
  isbn = "9783319694559",
  series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
  publisher = "Springer Verlag",
  pages = "139--149",
  editor = "Somnuk Phon-Amnuaisuk and Swee-Peng Ang and Soo-Young Lee",
  booktitle = "Multi-disciplinary Trends in Artificial Intelligence - 11th International Workshop, MIWAI 2017, Proceedings",
  address = "Germany",
}

