Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection

Zihan Wang; Siyang Song; Cheng Luo; Yuzhi Zhou; Shiling Wu; Weicheng Xie; Linlin Shen

doi:10.1109/CVPRW59228.2023.00627

Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection

Zihan Wang, Siyang Song, Cheng Luo, Yuzhi Zhou, Shiling Wu, Weicheng Xie, Linlin Shen

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

3 Citations (Scopus)

Abstract

This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW). Our approach consists of three main modules: (i) a pre-trained facial representation encoder which produce a strong facial representation from each input face image in the input sequence; (ii) an AU-specific feature generator that specifically learns a set of AU features from each facial representation; and (iii) a spatio-temporal graph learning module that constructs a spatio-temporal graph representation. This graph representation describes AUs contained in all frames and predicts the occurrence of each AU based on both the modeled spatial information within the corresponding face and the learned temporal dynamics among frames. The experimental results show that our approach outperformed the baseline and the spatio-temporal graph representation learning allows our model to generate the best results among all ablated systems. Our model ranks at the 4th place in the AU recognition track at the 5th ABAW Competition. Our code is publicly available at https://github.com/wzh125/ABAW-5.

Original language	English
Title of host publication	Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023
Publisher	IEEE Computer Society
Pages	5899-5907
Number of pages	9
ISBN (Electronic)	9798350302493
DOIs	https://doi.org/10.1109/CVPRW59228.2023.00627
Publication status	Published - 2023
Externally published	Yes
Event	2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023 - Vancouver, Canada Duration: 18 Jun 2023 → 22 Jun 2023

Publication series

Name	IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Volume	2023-June
ISSN (Print)	2160-7508
ISSN (Electronic)	2160-7516

Conference

Conference	2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023
Country/Territory	Canada
City	Vancouver
Period	18/06/23 → 22/06/23

ASJC Scopus subject areas

Computer Vision and Pattern Recognition
Electrical and Electronic Engineering

Access to Document

10.1109/CVPRW59228.2023.00627

Cite this

Wang, Z., Song, S., Luo, C., Zhou, Y., Wu, S., Xie, W., & Shen, L. (2023). Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection. In Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023 (pp. 5899-5907). (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops; Vol. 2023-June). IEEE Computer Society. https://doi.org/10.1109/CVPRW59228.2023.00627

Wang, Zihan ; Song, Siyang ; Luo, Cheng et al. / Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection. Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023. IEEE Computer Society, 2023. pp. 5899-5907 (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops).

@inproceedings{87aa18b019db4ceab40afcf3f71fb4fc,

title = "Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection",

abstract = "This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW). Our approach consists of three main modules: (i) a pre-trained facial representation encoder which produce a strong facial representation from each input face image in the input sequence; (ii) an AU-specific feature generator that specifically learns a set of AU features from each facial representation; and (iii) a spatio-temporal graph learning module that constructs a spatio-temporal graph representation. This graph representation describes AUs contained in all frames and predicts the occurrence of each AU based on both the modeled spatial information within the corresponding face and the learned temporal dynamics among frames. The experimental results show that our approach outperformed the baseline and the spatio-temporal graph representation learning allows our model to generate the best results among all ablated systems. Our model ranks at the 4th place in the AU recognition track at the 5th ABAW Competition. Our code is publicly available at https://github.com/wzh125/ABAW-5.",

author = "Zihan Wang and Siyang Song and Cheng Luo and Yuzhi Zhou and Shiling Wu and Weicheng Xie and Linlin Shen",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023 ; Conference date: 18-06-2023 Through 22-06-2023",

year = "2023",

doi = "10.1109/CVPRW59228.2023.00627",

language = "English",

series = "IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops",

publisher = "IEEE Computer Society",

pages = "5899--5907",

booktitle = "Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023",

address = "United States",

}

Wang, Z, Song, S, Luo, C, Zhou, Y, Wu, S, Xie, W & Shen, L 2023, Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection. in Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, vol. 2023-June, IEEE Computer Society, pp. 5899-5907, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023, Vancouver, Canada, 18/06/23. https://doi.org/10.1109/CVPRW59228.2023.00627

Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection. / Wang, Zihan; Song, Siyang; Luo, Cheng et al.
Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023. IEEE Computer Society, 2023. p. 5899-5907 (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops; Vol. 2023-June).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection

AU - Wang, Zihan

AU - Song, Siyang

AU - Luo, Cheng

AU - Zhou, Yuzhi

AU - Wu, Shiling

AU - Xie, Weicheng

AU - Shen, Linlin

PY - 2023

Y1 - 2023

N2 - This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW). Our approach consists of three main modules: (i) a pre-trained facial representation encoder which produce a strong facial representation from each input face image in the input sequence; (ii) an AU-specific feature generator that specifically learns a set of AU features from each facial representation; and (iii) a spatio-temporal graph learning module that constructs a spatio-temporal graph representation. This graph representation describes AUs contained in all frames and predicts the occurrence of each AU based on both the modeled spatial information within the corresponding face and the learned temporal dynamics among frames. The experimental results show that our approach outperformed the baseline and the spatio-temporal graph representation learning allows our model to generate the best results among all ablated systems. Our model ranks at the 4th place in the AU recognition track at the 5th ABAW Competition. Our code is publicly available at https://github.com/wzh125/ABAW-5.

AB - This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW). Our approach consists of three main modules: (i) a pre-trained facial representation encoder which produce a strong facial representation from each input face image in the input sequence; (ii) an AU-specific feature generator that specifically learns a set of AU features from each facial representation; and (iii) a spatio-temporal graph learning module that constructs a spatio-temporal graph representation. This graph representation describes AUs contained in all frames and predicts the occurrence of each AU based on both the modeled spatial information within the corresponding face and the learned temporal dynamics among frames. The experimental results show that our approach outperformed the baseline and the spatio-temporal graph representation learning allows our model to generate the best results among all ablated systems. Our model ranks at the 4th place in the AU recognition track at the 5th ABAW Competition. Our code is publicly available at https://github.com/wzh125/ABAW-5.

UR - http://www.scopus.com/inward/record.url?scp=85170828128&partnerID=8YFLogxK

U2 - 10.1109/CVPRW59228.2023.00627

DO - 10.1109/CVPRW59228.2023.00627

M3 - Conference contribution

AN - SCOPUS:85170828128

T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

SP - 5899

EP - 5907

BT - Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023

PB - IEEE Computer Society

T2 - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023

Y2 - 18 June 2023 through 22 June 2023

ER -

Wang Z, Song S, Luo C, Zhou Y, Wu S, Xie W et al. Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection. In Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2023. IEEE Computer Society. 2023. p. 5899-5907. (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops). doi: 10.1109/CVPRW59228.2023.00627

Spatial-Temporal Graph-Based AU Relationship Learning for Facial Action Unit Detection

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this