Combining novel acoustic features using SVM to detect speaker changing points

Haishan Zhong; David Cho; Vladimir Pervouchine; Graham Leedham

Combining novel acoustic features using SVM to detect speaker changing points

Haishan Zhong, David Cho, Vladimir Pervouchine, Graham Leedham

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

Abstract

Automatic speaker change point detection separates different speakers from continuous speech signal by utilising the speaker characteristics. It is often a necessary step before using a speaker recognition system. Acoustic features of the speech signal such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) are commonly used to represent a speaker. However, the features are affected by speech content, environment, type of recording device, etc. So far, no features have been discovered, which values depend only on the speaker. In this paper four novel feature types proposed in recent journals and conference papers for speaker verification problem, are applied to the problem of speaker change point detection. The features are also used to form a combination scheme using an SVM classifier. The results shows that the proposed scheme improves the performance of speaker changing point detection as compared to the system that uses MFCC features only. Some of the novel features of low dimensionality give comparable speaker change point detection accuracy to the high-dimensional MFCC features.

Original language	English
Title of host publication	BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing
Pages	224-227
Number of pages	4
Publication status	Published - 2008
Externally published	Yes
Event	BIOSIGNALS 2008 - 1st International Conference on Bio-inspired Systems and Signal Processing - Funchal, Madeira, Portugal Duration: 28 Jan 2008 → 31 Jan 2008

Publication series

Name	BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing
Volume	1

Conference

Conference	BIOSIGNALS 2008 - 1st International Conference on Bio-inspired Systems and Signal Processing
Country/Territory	Portugal
City	Funchal, Madeira
Period	28/01/08 → 31/01/08

Keywords

Feature evaluation
Feature extraction
Speaker recognition

ASJC Scopus subject areas

Signal Processing
Biomedical Engineering
Control and Systems Engineering

Cite this

Zhong, H., Cho, D., Pervouchine, V., & Leedham, G. (2008). Combining novel acoustic features using SVM to detect speaker changing points. In BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing (pp. 224-227). (BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing; Vol. 1).

Zhong, Haishan ; Cho, David ; Pervouchine, Vladimir et al. / Combining novel acoustic features using SVM to detect speaker changing points. BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing. 2008. pp. 224-227 (BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing).

@inproceedings{219d97b6c3f74db09459e3b7b26c0a2c,

title = "Combining novel acoustic features using SVM to detect speaker changing points",

abstract = "Automatic speaker change point detection separates different speakers from continuous speech signal by utilising the speaker characteristics. It is often a necessary step before using a speaker recognition system. Acoustic features of the speech signal such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) are commonly used to represent a speaker. However, the features are affected by speech content, environment, type of recording device, etc. So far, no features have been discovered, which values depend only on the speaker. In this paper four novel feature types proposed in recent journals and conference papers for speaker verification problem, are applied to the problem of speaker change point detection. The features are also used to form a combination scheme using an SVM classifier. The results shows that the proposed scheme improves the performance of speaker changing point detection as compared to the system that uses MFCC features only. Some of the novel features of low dimensionality give comparable speaker change point detection accuracy to the high-dimensional MFCC features.",

keywords = "Feature evaluation, Feature extraction, Speaker recognition",

author = "Haishan Zhong and David Cho and Vladimir Pervouchine and Graham Leedham",

year = "2008",

language = "English",

isbn = "9789898111180",

series = "BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing",

pages = "224--227",

booktitle = "BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing",

note = "BIOSIGNALS 2008 - 1st International Conference on Bio-inspired Systems and Signal Processing ; Conference date: 28-01-2008 Through 31-01-2008",

}

Zhong, H, Cho, D, Pervouchine, V & Leedham, G 2008, Combining novel acoustic features using SVM to detect speaker changing points. in BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing. BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing, vol. 1, pp. 224-227, BIOSIGNALS 2008 - 1st International Conference on Bio-inspired Systems and Signal Processing, Funchal, Madeira, Portugal, 28/01/08.

Combining novel acoustic features using SVM to detect speaker changing points. / Zhong, Haishan; Cho, David; Pervouchine, Vladimir et al.
BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing. 2008. p. 224-227 (BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing; Vol. 1).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Combining novel acoustic features using SVM to detect speaker changing points

AU - Zhong, Haishan

AU - Cho, David

AU - Pervouchine, Vladimir

AU - Leedham, Graham

PY - 2008

Y1 - 2008

N2 - Automatic speaker change point detection separates different speakers from continuous speech signal by utilising the speaker characteristics. It is often a necessary step before using a speaker recognition system. Acoustic features of the speech signal such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) are commonly used to represent a speaker. However, the features are affected by speech content, environment, type of recording device, etc. So far, no features have been discovered, which values depend only on the speaker. In this paper four novel feature types proposed in recent journals and conference papers for speaker verification problem, are applied to the problem of speaker change point detection. The features are also used to form a combination scheme using an SVM classifier. The results shows that the proposed scheme improves the performance of speaker changing point detection as compared to the system that uses MFCC features only. Some of the novel features of low dimensionality give comparable speaker change point detection accuracy to the high-dimensional MFCC features.

AB - Automatic speaker change point detection separates different speakers from continuous speech signal by utilising the speaker characteristics. It is often a necessary step before using a speaker recognition system. Acoustic features of the speech signal such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) are commonly used to represent a speaker. However, the features are affected by speech content, environment, type of recording device, etc. So far, no features have been discovered, which values depend only on the speaker. In this paper four novel feature types proposed in recent journals and conference papers for speaker verification problem, are applied to the problem of speaker change point detection. The features are also used to form a combination scheme using an SVM classifier. The results shows that the proposed scheme improves the performance of speaker changing point detection as compared to the system that uses MFCC features only. Some of the novel features of low dimensionality give comparable speaker change point detection accuracy to the high-dimensional MFCC features.

KW - Feature evaluation

KW - Feature extraction

KW - Speaker recognition

UR - http://www.scopus.com/inward/record.url?scp=70350464022&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:70350464022

SN - 9789898111180

T3 - BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing

SP - 224

EP - 227

BT - BIOSIGNALS 2008 - Proceedings of the 1st International Conference on Bio-inspired Systems and Signal Processing

T2 - BIOSIGNALS 2008 - 1st International Conference on Bio-inspired Systems and Signal Processing

Y2 - 28 January 2008 through 31 January 2008

ER -

Combining novel acoustic features using SVM to detect speaker changing points

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this