Deep Contextual Attention for Human-Object Interaction Detection

Tiancai Wang, Rao Muhammad Anwer, Muhammad Haris Khan, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Jorma Laaksonen

Research output: Article in book/conference proceedings; Conference contribution; Scientific; peer-reviewed

1 Citation (Scopus)
21 Downloads (Pure)

Abstract

This work proposes to combine neural networks with the compositional hierarchy of human bodies for efficient and complete human parsing. We formulate the approach as a neural information fusion framework. Our model assembles the information from three inference processes over the hierarchy: direct inference (directly predicting each part of a human body using image information), bottom-up inference (assembling knowledge from constituent parts), and top-down inference (leveraging context from parent nodes). The bottom-up and top-down inferences explicitly model the compositional and decompositional relations in human bodies, respectively. In addition, the fusion of multi-source information is conditioned on the inputs, i.e., by estimating and considering the confidence of the sources. The whole model is end-to-end differentiable, explicitly modeling information flows and structures. Our approach is extensively evaluated on four popular datasets, outperforming the state of the art in all cases, at a processing speed of 23 fps. Our code and results have been released to help ease future research in this direction.
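
The confidence-conditioned fusion mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' released code: the function names `softmax` and `fuse_predictions`, the scalar per-source confidence scores, and the toy numbers are all assumptions made for illustration. Each of the three inference sources (direct, bottom-up, top-down) contributes a score vector, and the sources are combined with weights obtained by normalizing their confidence scores.

```python
import math


def softmax(scores):
    """Turn raw confidence scores into weights that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_predictions(predictions, confidences):
    """Confidence-weighted average of per-source score vectors.

    predictions: one score vector per source, all of equal length
    confidences: one raw confidence score per source (hypothetical input)
    """
    weights = softmax(confidences)
    length = len(predictions[0])
    return [
        sum(w * p[i] for w, p in zip(weights, predictions))
        for i in range(length)
    ]


# Toy scores for two labels from the three inference processes (made-up values).
direct = [0.7, 0.3]
bottom_up = [0.4, 0.6]
top_down = [0.6, 0.4]

# Equal confidences reduce the fusion to a plain average of the three sources.
fused = fuse_predictions([direct, bottom_up, top_down], [0.0, 0.0, 0.0])
print(fused)
```

Raising one source's confidence score moves the fused result toward that source's prediction. In a learned model, the confidence scores would presumably be estimated from the input rather than supplied by hand.
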
Original language: English
Title of host publication: Proceedings of the International Conference on Computer Vision (ICCV2019)
Publisher: IEEE
Pages: 5694-5702
ISBN (electronic): 9781728148038
DOI (permanent link): https://doi.org/10.1109/ICCV.2019.00579
Publication status: Published - February 2020
Ministry of Education and Culture (OKM) publication type: A4 Article in conference proceedings
Event: IEEE International Conference on Computer Vision - Seoul, South Korea
Duration: 27 October 2019 to 2 November 2019
http://iccv2019.thecvf.com/

Publication series

Name: Proceedings of the IEEE International Conference on Computer Vision
Volume: 2019-October
ISSN (electronic): 1550-5499

Conference

Conference: IEEE International Conference on Computer Vision
Abbreviated title: ICCV
Country: South Korea
City: Seoul
Period: 27/10/2019 to 02/11/2019
Web address: http://iccv2019.thecvf.com/

  • Projects

    MeMAD Laaksonen

    Laaksonen, J., Sjöberg, M., Laria Mantecon, H. & Pehlivan Tort, S.

    01/01/2018 to 31/12/2020

    Project: EU: Framework programmes funding

    Equipment

    Science-IT

    Mikko Hakala (Manager)

    School of Science

    Equipment/facilities: Facility

  • Cite this

    Wang, T., Anwer, R. M., Khan, M. H., Khan, F. S., Pang, Y., Shao, L., & Laaksonen, J. (2020). Deep Contextual Attention for Human-Object Interaction Detection. In Proceedings of the International Conference on Computer Vision (ICCV2019) (pp. 5694-5702). (Proceedings of the IEEE International Conference on Computer Vision; Vol. 2019-October). IEEE. https://doi.org/10.1109/ICCV.2019.00579