Intelligent system for depression scale estimation with facial expressions and case study in industrial intelligence

Lang He, Chenguang Guo, Prayag Tiwari, Hari Mohan Pandey, Wei Dang

Research output: Contribution to journalArticleScientificpeer-review


As a mental disorder, depression has affected people's lives, works, and so on. Researchers have proposed various industrial intelligent systems in the pattern recognition field for audiovisual depression detection. This paper presents an end‐to‐end trainable intelligent system to generate high‐level representations over the entire video clip. Specifically, a three‐dimensional (3D) convolutional neural network equipped with a module spatiotemporal feature aggregation module (STFAM) is trained from scratch on audio/visual emotion challenge (AVEC)2013 and AVEC2014 data, which can model the discriminative patterns closely related to depression. In the STFAM, channel and spatial attention mechanism and an aggregation method, namely 3D DEP‐NetVLAD, are integrated to learn the compact characteristic based on the feature maps. Extensive experiments on the two databases (i.e., AVEC2013 and AVEC2014) are illustrated that the proposed intelligent system can efficiently model the underlying depression patterns and obtain better performances over the most video‐based depression recognition approaches. Case studies are presented to describes the applicability of the proposed intelligent system for industrial intelligence.
Original languageEnglish
Number of pages18
JournalInternational Journal of Intelligent Systems
Publication statusE-pub ahead of print - 8 Apr 2021
MoE publication typeA1 Journal article-refereed

Fingerprint Dive into the research topics of 'Intelligent system for depression scale estimation with facial expressions and case study in industrial intelligence'. Together they form a unique fingerprint.

Cite this