Skip to main navigation Skip to search Skip to main content

Online Emotion-Driven Generation of Multiple Appropriate Facial Reactions

  • Jiajian Huang
  • , Siyang Song
  • , Xiangyu Kong
  • , Weicheng Xie
  • , Linlin Shen
  • , Zitong Yu

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

The multiple appropriate facial reaction online generation task aims to generate real-time, appropriate, and diverse facial reactions for virtual listeners in response to audio-visual behaviours expressed by a human speaker. While recent approaches have focused on improving reaction diversity and coarse synchronicity, they often fail to capture emotionally coherent responses that align with both the emotion type and intensity level of the speaker. In this work, we propose an emotion-driven framework that treats the speaker’s emotional state as the core driving force behind listener behavior. Our framework integrates a pre-trained audio emotion encoder (PAEE) and visual emotion encoder (PVEE) to extract fine-grained emotional representations from speech and facial expressions. We further design a lightweight, online-capable Motion Representation Module (MRM), optimized for real-time generation, that captures emotional intensity through facial motion amplitude and variation, enabling our system to dynamically modulate the strength of listener reactions with low latency. Besides, an Unpredictable Motion Generator (UMG) further introduces minor, stochastic perturbations, making the generated reactions more lifelike and individualized. Extensive experiments demonstrate that our method achieves significant improvements in reaction appropriateness and diversity, while maintaining real-time performance. The codes are available at this link.

Original languageEnglish
Title of host publicationBiometric Recognition - 19th Chinese Conference, CCBR 2025, Proceedings
EditorsWei Jia, Lu Leng, Weidong Min, Jun Chu, Jie Gui, Xiangbo Shu, Xianye Ben, Zhenan Sun, Yuming Fang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages183-194
Number of pages12
ISBN (Print)9789819561223
DOIs
Publication statusPublished - 2026
Externally publishedYes
Event19th Chinese Conference on Biometric Recognition, CCBR 2025 - Nanchang, China
Duration: 21 Nov 202523 Nov 2025

Publication series

NameLecture Notes in Computer Science
Volume16360 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Chinese Conference on Biometric Recognition, CCBR 2025
Country/TerritoryChina
CityNanchang
Period21/11/2523/11/25

Free Keywords

  • Emotion-driven modeling
  • Online facial reaction generation

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Online Emotion-Driven Generation of Multiple Appropriate Facial Reactions'. Together they form a unique fingerprint.

Cite this