Abstract
Depression is a serious mental disorder that affects millions of people all over the world. Traditional clinical diagnosis methods are subjective, complicated and need extensive participation of experts. Audio-visual automatic depression analysis systems predominantly base their predictions on very brief sequential segments, sometimes as little as one frame. Such data contains much redundant information, causes a high computational load, and negatively affects the detection accuracy. Final decision making at the sequence level is then based on the fusion of frame or segment level predictions. However, this approach loses longer term behavioural correlations, as the behaviours themselves are abstracted away by the frame-level predictions. We propose to on the one hand use automatically detected human behaviour primitives such as Gaze directions, Facial action units (AU), etc. as low-dimensional multi-channel time series data, which can then be used to create two sequence descriptors. The first calculates the sequence-level statistics of the behaviour primitives and the second casts the problem as a Convolutional Neural Network problem operating on a spectral representation of the multichannel behaviour signals. The results of depression detection (binary classification) and severity estimation (regression) experiments conducted on the AVEC 2016 DAIC-WOZ database show that both methods achieved significant improvement compared to the previous state of the art in terms of the depression severity estimation.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 158-165 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781538623350 |
| DOIs | |
| Publication status | Published - 5 Jun 2018 |
| Externally published | Yes |
| Event | 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 - Xi'an, China Duration: 15 May 2018 → 19 May 2018 |
Publication series
| Name | Proceedings - 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 |
|---|
Conference
| Conference | 13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018 |
|---|---|
| Country/Territory | China |
| City | Xi'an |
| Period | 15/05/18 → 19/05/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Free Keywords
- Automatic depression analysis
- Convolution Nerual Networks
- Human behaviour signals
- Spectrum maps
- Statistic
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Control and Optimization
Fingerprint
Dive into the research topics of 'Human behaviour-based automatic depression analysis using hand-crafted statistics and deep learned spectral features'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver