Ensemble methods for spoken emotion recognition in call-centres

Donn Morrison, Ruili Wang, Liyanage C. De Silva

Research output: Journal PublicationArticlepeer-review

282 Citations (Scopus)

Abstract

Machine-based emotional intelligence is a requirement for more natural interaction between humans and computer interfaces and a basic level of accurate emotion perception is needed for computer systems to respond adequately to human emotion. Humans convey emotional information both intentionally and unintentionally via speech patterns. These vocal patterns are perceived and understood by listeners during conversation. This research aims to improve the automatic perception of vocal emotion in two ways. First, we compare two emotional speech data sources: natural, spontaneous emotional speech and acted or portrayed emotional speech. This comparison demonstrates the advantages and disadvantages of both acquisition methods and how these methods affect the end application of vocal emotion recognition. Second, we look at two classification methods which have not been applied in this field: stacked generalisation and unweighted vote. We show how these techniques can yield an improvement over traditional classification methods.

Original languageEnglish
Pages (from-to)98-112
Number of pages15
JournalSpeech Communication
Volume49
Issue number2
DOIs
Publication statusPublished - Feb 2007
Externally publishedYes

Keywords

  • Affect recognition
  • Emotion recognition
  • Ensemble methods
  • Speech databases
  • Speech processing

ASJC Scopus subject areas

  • Software
  • Modelling and Simulation
  • Communication
  • Language and Linguistics
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Ensemble methods for spoken emotion recognition in call-centres'. Together they form a unique fingerprint.

Cite this