Automatic Speech-Based Smoking Status Identification

Zhizhong Ma, Satwinder Singh, Yuanhang Qiu, Feng Hou, Ruili Wang, Christopher Bullen, Joanna Ting Wai Chu

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 3
  • Captures
    • Readers: 2
see details

Abstract

Identifying the smoking status of a speaker from speech has a range of applications including smoking status validation, smoking cessation tracking, and speaker profiling. Previous research on smoking status identification mainly focuses on employing the speaker's low-level acoustic features such as fundamental frequency (F0), jitter, and shimmer. However, the use of high-level acoustic features, such as Mel Frequency Cepstral Coefficients (MFCC) and filter bank (Fbank) for smoking status identification, has rarely been explored. In this study, we utilise both high-level acoustic features (i.e., MFCC, Fbank) and low-level acoustic features (i.e., F0, jitter, shimmer) for smoking status identification. Furthermore, we propose a deep neural network approach for smoking status identification by employing ResNet along with these acoustic features. We also explore a data augmentation technique for smoking status identification to further improve the performance. Finally, we present a comparison of identification accuracy results for each feature settings, and obtain the best accuracy of 82.3%, a relative improvement of 12.7% and 29.8% on the initial audio classification approach and rule-based approach, respectively.

Original languageEnglish
Title of host publicationIntelligent Computing - Proceedings of the 2022 Computing Conference
EditorsKohei Arai
PublisherSpringer Science and Business Media Deutschland GmbH
Pages193-203
Number of pages11
ISBN (Print)9783031104664
DOIs
Publication statusPublished - 2022
Externally publishedYes
EventComputing Conference, 2022 - Virtual, Online
Duration: 14 Jul 202215 Jul 2022

Publication series

NameLecture Notes in Networks and Systems
Volume508 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceComputing Conference, 2022
CityVirtual, Online
Period14/07/2215/07/22

Keywords

  • Acoustic features
  • Smoking status identification
  • Speech processing

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Automatic Speech-Based Smoking Status Identification'. Together they form a unique fingerprint.

Cite this

Ma, Z., Singh, S., Qiu, Y., Hou, F., Wang, R., Bullen, C., & Chu, J. T. W. (2022). Automatic Speech-Based Smoking Status Identification. In K. Arai (Ed.), Intelligent Computing - Proceedings of the 2022 Computing Conference (pp. 193-203). (Lecture Notes in Networks and Systems; Vol. 508 LNNS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-10467-1_11