Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition

Weicheng Xie; Wenting Chen; Linlin Shen; Jinming Duan; Meng Yang

doi:10.1016/j.patcog.2020.107701

Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition

Weicheng Xie, Wenting Chen, Linlin Shen, Jinming Duan, Meng Yang

Research output: Journal Publication › Article › peer-review

21 Citations (Scopus)

Abstract

For facial expression recognition, the sparseness constraints of the features or weights can improve the generalization ability of a deep network. However, the optimization of the hyper-parameters in fusing different sparseness strategies demands much computation, when the traditional gradient-based algorithms are used. In this work, an iterative framework with surrogate network is proposed for the optimization of hyper-parameters in fusing different sparseness strategies. In each iteration, a network with significantly smaller model complexity is fitted to the original large network based on four Euclidean losses, where the hyper-parameters are optimized with heuristic optimizers. Since the surrogate network uses the same deep metrics and embeds the same hyper-parameters as the original network, the optimized hyper-parameters are then used for the training of the original deep network in the next iteration. While the performance of the proposed algorithm is justified with a tiny model, i.e. LeNet on the FER2013 database, our approach achieved competitive performances on six publicly available expression datasets, i.e., FER2013, CK+, Oulu-CASIA, MMI, AFEW and AffectNet.

Original language	English
Article number	107701
Journal	Pattern Recognition
Volume	111
DOIs	https://doi.org/10.1016/j.patcog.2020.107701
Publication status	Published - Mar 2021
Externally published	Yes

Keywords

Deep sparseness strategies
Expression recognition
Heuristic optimizer
Hyper-parameter optimization
Surrogate network

ASJC Scopus subject areas

Software
Signal Processing
Computer Vision and Pattern Recognition
Artificial Intelligence

Access to Document

10.1016/j.patcog.2020.107701

Cite this

@article{4d2d646d216e46ffbc264646cebf743a,

title = "Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition",

abstract = "For facial expression recognition, the sparseness constraints of the features or weights can improve the generalization ability of a deep network. However, the optimization of the hyper-parameters in fusing different sparseness strategies demands much computation, when the traditional gradient-based algorithms are used. In this work, an iterative framework with surrogate network is proposed for the optimization of hyper-parameters in fusing different sparseness strategies. In each iteration, a network with significantly smaller model complexity is fitted to the original large network based on four Euclidean losses, where the hyper-parameters are optimized with heuristic optimizers. Since the surrogate network uses the same deep metrics and embeds the same hyper-parameters as the original network, the optimized hyper-parameters are then used for the training of the original deep network in the next iteration. While the performance of the proposed algorithm is justified with a tiny model, i.e. LeNet on the FER2013 database, our approach achieved competitive performances on six publicly available expression datasets, i.e., FER2013, CK+, Oulu-CASIA, MMI, AFEW and AffectNet.",

keywords = "Deep sparseness strategies, Expression recognition, Heuristic optimizer, Hyper-parameter optimization, Surrogate network",

author = "Weicheng Xie and Wenting Chen and Linlin Shen and Jinming Duan and Meng Yang",

note = "Publisher Copyright: {\textcopyright} 2020 Elsevier Ltd",

year = "2021",

month = mar,

doi = "10.1016/j.patcog.2020.107701",

language = "English",

volume = "111",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition

AU - Xie, Weicheng

AU - Chen, Wenting

AU - Shen, Linlin

AU - Duan, Jinming

AU - Yang, Meng

PY - 2021/3

Y1 - 2021/3

N2 - For facial expression recognition, the sparseness constraints of the features or weights can improve the generalization ability of a deep network. However, the optimization of the hyper-parameters in fusing different sparseness strategies demands much computation, when the traditional gradient-based algorithms are used. In this work, an iterative framework with surrogate network is proposed for the optimization of hyper-parameters in fusing different sparseness strategies. In each iteration, a network with significantly smaller model complexity is fitted to the original large network based on four Euclidean losses, where the hyper-parameters are optimized with heuristic optimizers. Since the surrogate network uses the same deep metrics and embeds the same hyper-parameters as the original network, the optimized hyper-parameters are then used for the training of the original deep network in the next iteration. While the performance of the proposed algorithm is justified with a tiny model, i.e. LeNet on the FER2013 database, our approach achieved competitive performances on six publicly available expression datasets, i.e., FER2013, CK+, Oulu-CASIA, MMI, AFEW and AffectNet.

AB - For facial expression recognition, the sparseness constraints of the features or weights can improve the generalization ability of a deep network. However, the optimization of the hyper-parameters in fusing different sparseness strategies demands much computation, when the traditional gradient-based algorithms are used. In this work, an iterative framework with surrogate network is proposed for the optimization of hyper-parameters in fusing different sparseness strategies. In each iteration, a network with significantly smaller model complexity is fitted to the original large network based on four Euclidean losses, where the hyper-parameters are optimized with heuristic optimizers. Since the surrogate network uses the same deep metrics and embeds the same hyper-parameters as the original network, the optimized hyper-parameters are then used for the training of the original deep network in the next iteration. While the performance of the proposed algorithm is justified with a tiny model, i.e. LeNet on the FER2013 database, our approach achieved competitive performances on six publicly available expression datasets, i.e., FER2013, CK+, Oulu-CASIA, MMI, AFEW and AffectNet.

KW - Deep sparseness strategies

KW - Expression recognition

KW - Heuristic optimizer

KW - Hyper-parameter optimization

KW - Surrogate network

UR - http://www.scopus.com/inward/record.url?scp=85092737122&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2020.107701

DO - 10.1016/j.patcog.2020.107701

M3 - Article

AN - SCOPUS:85092737122

SN - 0031-3203

VL - 111

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 107701

ER -

Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this