Projects per year
Abstract
This work proposes to combine neural networks with the compositional hierarchy of human bodies for efficient and complete human parsing. We formulate the approach as a neural information fusion framework. Our model assembles the information from three inference processes over the hierarchy: direct inference (directly predicting each part of a human body using image information), bottom-up inference (assembling knowledge from constituent parts), and top-down inference (leveraging context from parent nodes). The bottom-up and top-down inferences explicitly model the compositional and decompositional relations in human bodies, respectively. In addition, the fusion of multi-source information is conditioned on the inputs, i.e., by estimating and considering the confidence of the sources. The whole model is end-to-end differentiable, explicitly modeling information flows and structures. Our approach is extensively evaluated on four popular datasets, outperforming the state-of-the-arts in all cases, with a fast processing speed of 23fps. Our code and results have been released to help ease future research in this direction.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Conference on Computer Vision (ICCV2019) |
Publisher | IEEE |
Pages | 5693-5701 |
Number of pages | 9 |
ISBN (Electronic) | 978-1-7281-4803-8 |
ISBN (Print) | 978-1-7281-4804-5 |
DOIs | |
Publication status | Published - Feb 2020 |
MoE publication type | A4 Article in a conference publication |
Event | IEEE International Conference on Computer Vision - Seoul, Korea, Republic of Duration: 27 Oct 2019 → 2 Nov 2019 http://iccv2019.thecvf.com/ |
Publication series
Name | Proceedings of the IEEE International Conference on Computer Vision |
---|---|
Volume | 2019-October |
ISSN (Electronic) | 1550-5499 |
Conference
Conference | IEEE International Conference on Computer Vision |
---|---|
Abbreviated title | ICCV |
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 27/10/2019 → 02/11/2019 |
Internet address |
Fingerprint
Dive into the research topics of 'Deep Contextual Attention for Human-Object Interaction Detection'. Together they form a unique fingerprint.Projects
- 2 Finished
-
MeMAD Laaksonen
Laaksonen, J., Sjöberg, M., Pehlivan Tort, S. & Laria Mantecon, H.
01/01/2018 → 31/03/2021
Project: EU: Framework programmes funding
-
Deep neural networks in scene graph generation for perception of visual multimedia semantics
Anwer, R., Laaksonen, J., Wang, T., Pehlivan Tort, S. & Sjöberg, M.
01/01/2018 → 31/12/2019
Project: Academy of Finland: Other research funding