Fusion approach for improving the performance in voice-biometrics

Di Liu; Siu Yeung Cho; Dong Mei Sun; Zheng Ding Qiu

Department of Electrical and Electronic Engineering

Research output: Chapter in Book/Conference proceeding › Book Chapter › peer-review

1 Citation (Scopus)

Abstract

Voice biometrics, also called speaker recognition, is the process of determining who spoke in a recorded utterance. The technique is widely used in areas such as access management, access control, and forensic detection. Restricted to a single feature type as the input pattern, either a low-level acoustic feature (e.g., Mel Frequency Cepstral Coefficients or Linear Predictive Coefficients) or a high-level feature (e.g., phonetic), voice biometrics has been studied for several decades in the speech recognition community, with many sophisticated approaches such as the Gaussian Mixture Model, Hidden Markov Model, and Support Vector Machine. However, relying on only one kind of feature has created a bottleneck for further performance improvement. To break through it, the fusion approach has been introduced into voice biometrics. The objective of this paper is to show the rationale behind using fusion methods. From the point of view of biometrics, it systematically classifies the existing approaches into three fusion levels: feature level, matching-score level, and decision-making level. After the fundamentals are described, the fusion techniques at each level are described. Several experimental results are then presented to show the effectiveness of the fusion techniques.
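The three fusion levels named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the chapter: the function names, the weighted-sum rule for score fusion, and the majority-vote rule for decision fusion are assumptions chosen as common textbook examples, and the scores are assumed to be already normalised to a comparable range.

```python
def feature_level_fusion(features_a, features_b):
    """Feature level: concatenate two feature vectors before matching.

    The inputs stand in for, e.g., an MFCC vector and an LPC vector
    (hypothetical example inputs, not the chapter's data).
    """
    return list(features_a) + list(features_b)


def score_level_fusion(scores, weights):
    """Matching-score level: weighted average of per-matcher scores.

    Assumes every score has already been normalised to the same range.
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    return sum(s * w for s, w in zip(scores, weights)) / total_weight


def decision_level_fusion(decisions):
    """Decision level: strict majority vote over accept/reject decisions.

    A tie counts as a rejection (an assumption of this sketch).
    """
    accepts = sum(1 for d in decisions if d)
    return accepts * 2 > len(decisions)


if __name__ == "__main__":
    print(feature_level_fusion([0.1, 0.2], [0.3]))
    print(score_level_fusion([0.8, 0.4], [1.0, 1.0]))
    print(decision_level_fusion([True, True, False]))
```

The sketch only shows where in the pipeline each kind of fusion operates: on the inputs, on the matcher outputs, or on the final accept/reject decisions.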

Original language	English
Title of host publication	Biometrics
Subtitle of host publication	Theory, Applications, and Issues
Publisher	Nova Science Publishers, Inc.
Pages	57-80
Number of pages	24
ISBN (Print)	9781617287657
Publication status	Published - 2011

ASJC Scopus subject areas

General Biochemistry, Genetics and Molecular Biology

Cite this

@inbook{a1336443e55746819f131f536d4543f6,

title = "Fusion approach for improving the performance in voice-biometrics",

abstract = "Voice biometrics, also called speaker recognition, is the process of determining who spoke in a recorded utterance. The technique is widely used in areas such as access management, access control, and forensic detection. Restricted to a single feature type as the input pattern, either a low-level acoustic feature (e.g., Mel Frequency Cepstral Coefficients or Linear Predictive Coefficients) or a high-level feature (e.g., phonetic), voice biometrics has been studied for several decades in the speech recognition community, with many sophisticated approaches such as the Gaussian Mixture Model, Hidden Markov Model, and Support Vector Machine. However, relying on only one kind of feature has created a bottleneck for further performance improvement. To break through it, the fusion approach has been introduced into voice biometrics. The objective of this paper is to show the rationale behind using fusion methods. From the point of view of biometrics, it systematically classifies the existing approaches into three fusion levels: feature level, matching-score level, and decision-making level. After the fundamentals are described, the fusion techniques at each level are described. Several experimental results are then presented to show the effectiveness of the fusion techniques.",

author = "Di Liu and Cho, {Siu Yeung} and Sun, {Dong Mei} and Qiu, {Zheng Ding}",

year = "2011",

language = "English",

isbn = "9781617287657",

pages = "57--80",

booktitle = "Biometrics",

publisher = "Nova Science Publishers, Inc.",

address = "United States",

}

TY - CHAP

T1 - Fusion approach for improving the performance in voice-biometrics

AU - Liu, Di

AU - Cho, Siu Yeung

AU - Sun, Dong Mei

AU - Qiu, Zheng Ding

PY - 2011

Y1 - 2011

N2 - Voice biometrics, also called speaker recognition, is the process of determining who spoke in a recorded utterance. The technique is widely used in areas such as access management, access control, and forensic detection. Restricted to a single feature type as the input pattern, either a low-level acoustic feature (e.g., Mel Frequency Cepstral Coefficients or Linear Predictive Coefficients) or a high-level feature (e.g., phonetic), voice biometrics has been studied for several decades in the speech recognition community, with many sophisticated approaches such as the Gaussian Mixture Model, Hidden Markov Model, and Support Vector Machine. However, relying on only one kind of feature has created a bottleneck for further performance improvement. To break through it, the fusion approach has been introduced into voice biometrics. The objective of this paper is to show the rationale behind using fusion methods. From the point of view of biometrics, it systematically classifies the existing approaches into three fusion levels: feature level, matching-score level, and decision-making level. After the fundamentals are described, the fusion techniques at each level are described. Several experimental results are then presented to show the effectiveness of the fusion techniques.

AB - Voice biometrics, also called speaker recognition, is the process of determining who spoke in a recorded utterance. The technique is widely used in areas such as access management, access control, and forensic detection. Restricted to a single feature type as the input pattern, either a low-level acoustic feature (e.g., Mel Frequency Cepstral Coefficients or Linear Predictive Coefficients) or a high-level feature (e.g., phonetic), voice biometrics has been studied for several decades in the speech recognition community, with many sophisticated approaches such as the Gaussian Mixture Model, Hidden Markov Model, and Support Vector Machine. However, relying on only one kind of feature has created a bottleneck for further performance improvement. To break through it, the fusion approach has been introduced into voice biometrics. The objective of this paper is to show the rationale behind using fusion methods. From the point of view of biometrics, it systematically classifies the existing approaches into three fusion levels: feature level, matching-score level, and decision-making level. After the fundamentals are described, the fusion techniques at each level are described. Several experimental results are then presented to show the effectiveness of the fusion techniques.

UR - http://www.scopus.com/inward/record.url?scp=84892104176&partnerID=8YFLogxK

M3 - Book Chapter

AN - SCOPUS:84892104176

SN - 9781617287657

SP - 57

EP - 80

BT - Biometrics

PB - Nova Science Publishers, Inc.

ER -
