Reducing noisy annotations for depression estimation from facial images

Lang He, Prayag Tiwari*, Chonghua Lv, Wen Shuai Wu, Liyong Guo

*Tämän työn vastaava kirjoittaja

Tutkimustuotos: LehtiartikkeliArticleScientificvertaisarvioitu

11 Sitaatiot (Scopus)
146 Lataukset (Pure)

Abstrakti

Depression has been considered the most dominant mental disorder over the past few years. To help clinicians effectively and efficiently estimate the severity scale of depression, various automated systems based on deep learning have been proposed. To estimate the severity of depression, i.e., the depression severity score (Beck Depression Inventory-II), various deep architectures have been designed to perform regression using the Euclidean loss. However, they do not consider the label distribution, and they do not learn the relationships between the facial images and BDI-II scores, which can be resulting in the noisy labeling for automatic depression estimation (ADE). To mitigate this problem, we propose an automated deep architecture, namely the self-adaptation network (SAN), to improve this uncertain labeling for ADE. Specifically, the architecture consists of four modules: (1) ResNet-18 and ResNet-50 are adopted in the deep feature extraction module (DFEM) to extract informative deep features; (2) a self-attention module (SAM) is adopted to learn the weights from the mini-batch; (3) a square ranking regularization module (SRRM) to create high partitions and low partitions is proposed; and (4) a re-label module (RM) is used to re-label the uncertain annotations for ADE in the low partitions. We conduct extensive experiments on depression databases (i.e., AVEC2013 and AVEC2014) and obtain a performance comparable to the performances of other ADE methods in assessing the severity of depression. More importantly, the proposed method can learn valuable depression patterns from facial videos and obtain a performance comparable to the performances of other methods for depression recognition.

AlkuperäiskieliEnglanti
Sivut120-129
Sivumäärä10
JulkaisuNeural Networks
Vuosikerta153
DOI - pysyväislinkit
TilaJulkaistu - syysk. 2022
OKM-julkaisutyyppiA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Sormenjälki

Sukella tutkimusaiheisiin 'Reducing noisy annotations for depression estimation from facial images'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.
  • INTERVENE: International consortium for integrative genomics prediction

    Kaski, S. (Vastuullinen tutkija)

    01/01/202131/12/2025

    Projekti: EU: Framework programmes funding

  • DATALIT: Data Literacy for Responsible Decision-Making

    Marttinen, P. (Vastuullinen tutkija), Ji, S. (Projektin jäsen), Gröhn, T. (Projektin jäsen), Honkamaa, J. (Projektin jäsen), Kumar, Y. (Projektin jäsen), Pöllänen, A. (Projektin jäsen), Ojala, F. (Projektin jäsen), Raj, V. (Projektin jäsen) & Tiwari, P. (Projektin jäsen)

    01/10/202030/09/2023

    Projekti: Academy of Finland: Strategic research funding

  • eMOM: eMOM

    Marttinen, P. (Vastuullinen tutkija), Alizadeh Ashrafi, R. (Projektin jäsen), Hizli, C. (Projektin jäsen) & Zhang, G. (Projektin jäsen)

    05/02/201831/01/2023

    Projekti: Business Finland: Other research funding

Siteeraa tätä