3DFaceMAE: Pre-training of Masked Autoencoder Using Patch-Based Random Masking Reconstruction and Super-resolution for 3D Face Recognition

Ziqi Gao; Qiufu Li; Linlin Shen; Junpeng Yang

doi:10.1007/978-981-97-8795-1_33

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

Abstract

Compared to 2D face recognition, 3D face recognition is more robust to variations such as pose and illumination. However, owing to limited training data, the accuracy of existing 3D face recognition methods is still unsatisfactory. In this paper, we introduce 3DFaceMAE, the first masked autoencoder (MAE)-based 3D face recognition method using point clouds. Specifically, we first synthesize a large-scale 3D point cloud facial dataset and combine it with the small-scale real data. In the pre-training of 3DFaceMAE, we extract the key facial regions from the input 3D facial point cloud using normal difference techniques, and reconstruct these key regions using patch-based random masking reconstruction and super-resolution. We finally fine-tune the encoder of 3DFaceMAE on the real 3D face point cloud data. In the experiments, we test 3DFaceMAE on three 3D face datasets and achieve a result as high as 91.17% on the Lock3DFace dataset, the first reported result surpassing 90%. In addition, the experimental results indicate that 3DFaceMAE has strong cross-quality generalization performance. We also validate the effectiveness of the different components of 3DFaceMAE through an ablation study.
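The abstract does not say how patches are built or what masking ratio is used, so the following is only a minimal sketch of what patch-based random masking on a point cloud could look like. It assumes a common recipe from point cloud MAE work (farthest point sampling for patch centers, nearest-neighbour grouping, then hiding a random subset of patches). The function names, the 64 patches of 32 points, and the 0.6 mask ratio are all illustrative assumptions, not values from the paper.

```python
import numpy as np


def farthest_point_sampling(points, num_centers, rng):
    """Pick num_centers point indices that are spread out over the cloud."""
    n = points.shape[0]
    chosen = np.empty(num_centers, dtype=np.int64)
    chosen[0] = rng.integers(n)
    # Squared distance from every point to its nearest already-chosen center.
    min_dist = np.full(n, np.inf)
    for i in range(1, num_centers):
        diff = points - points[chosen[i - 1]]
        min_dist = np.minimum(min_dist, np.einsum("ij,ij->i", diff, diff))
        chosen[i] = int(np.argmax(min_dist))
    return chosen


def group_into_patches(points, center_idx, patch_size):
    """Return (num_centers, patch_size) indices of each center's nearest points."""
    centers = points[center_idx]
    d2 = ((centers[:, None, :] - points[None, :, :]) ** 2).sum(axis=-1)
    return np.argsort(d2, axis=1)[:, :patch_size]


def random_patch_mask(num_patches, mask_ratio, rng):
    """Boolean mask over patches; True means the patch is hidden from the encoder."""
    num_masked = int(round(mask_ratio * num_patches))
    mask = np.zeros(num_patches, dtype=bool)
    mask[rng.permutation(num_patches)[:num_masked]] = True
    return mask


# Toy stand-in for a facial point cloud (hypothetical data, not a real scan).
rng = np.random.default_rng(0)
cloud = rng.normal(size=(2048, 3))

center_idx = farthest_point_sampling(cloud, num_centers=64, rng=rng)
patch_idx = group_into_patches(cloud, center_idx, patch_size=32)
mask = random_patch_mask(len(center_idx), mask_ratio=0.6, rng=rng)

visible_patches = cloud[patch_idx[~mask]]  # would be fed to the encoder
masked_patches = cloud[patch_idx[mask]]    # would serve as reconstruction targets
```

In a setup like this, only `visible_patches` would reach the encoder, and a decoder would be trained to reconstruct `masked_patches`. The key-region extraction and super-resolution steps described in the abstract are not covered by this sketch.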

Original language	English
Title of host publication	Pattern Recognition and Computer Vision - 7th Chinese Conference, PRCV 2024, Proceedings
Editors	Zhouchen Lin, Hongbin Zha, Ming-Ming Cheng, Ran He, Cheng-Lin Liu, Kurban Ubul, Wushouer Silamu, Jie Zhou
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	488-503
Number of pages	16
ISBN (Print)	9789819787944
DOIs	https://doi.org/10.1007/978-981-97-8795-1_33
Publication status	Published - 2025
Externally published	Yes
Event	7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024 - Urumqi, China (18 Oct 2024 → 20 Oct 2024)

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	15041 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024
Country/Territory	China
City	Urumqi
Period	18/10/24 → 20/10/24

Keywords

3D point cloud
Face recognition

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Cite this

Gao, Z., Li, Q., Shen, L., & Yang, J. (2025). 3DFaceMAE: Pre-training of Masked Autoencoder Using Patch-Based Random Masking Reconstruction and Super-resolution for 3D Face Recognition. In Z. Lin, H. Zha, M.-M. Cheng, R. He, C.-L. Liu, K. Ubul, W. Silamu, & J. Zhou (Eds.), Pattern Recognition and Computer Vision - 7th Chinese Conference, PRCV 2024, Proceedings (pp. 488-503). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15041 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-97-8795-1_33

@inproceedings{48cdfe1acb7647c9972932849ae2f0d6,
title = "3DFaceMAE: Pre-training of Masked Autoencoder Using Patch-Based Random Masking Reconstruction and Super-resolution for 3D Face Recognition",
abstract = "Compared to 2D face recognition, 3D face recognition is more robust to variations such as pose and illumination. However, owing to limited training data, the accuracy of existing 3D face recognition methods is still unsatisfactory. In this paper, we introduce 3DFaceMAE, the first masked autoencoder (MAE)-based 3D face recognition method using point clouds. Specifically, we first synthesize a large-scale 3D point cloud facial dataset and combine it with the small-scale real data. In the pre-training of 3DFaceMAE, we extract the key facial regions from the input 3D facial point cloud using normal difference techniques, and reconstruct these key regions using patch-based random masking reconstruction and super-resolution. We finally fine-tune the encoder of 3DFaceMAE on the real 3D face point cloud data. In the experiments, we test 3DFaceMAE on three 3D face datasets and achieve a result as high as 91.17\% on the Lock3DFace dataset, the first reported result surpassing 90\%. In addition, the experimental results indicate that 3DFaceMAE has strong cross-quality generalization performance. We also validate the effectiveness of the different components of 3DFaceMAE through an ablation study.",
keywords = "3D point cloud, Face recognition",
author = "Ziqi Gao and Qiufu Li and Linlin Shen and Junpeng Yang",
note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.; 7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024 ; Conference date: 18-10-2024 Through 20-10-2024",
year = "2025",
doi = "10.1007/978-981-97-8795-1\_33",
language = "English",
isbn = "9789819787944",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "488--503",
editor = "Zhouchen Lin and Hongbin Zha and Ming-Ming Cheng and Ran He and Cheng-Lin Liu and Kurban Ubul and Wushouer Silamu and Jie Zhou",
booktitle = "Pattern Recognition and Computer Vision - 7th Chinese Conference, PRCV 2024, Proceedings",
address = "Germany",
}
