TY - JOUR
T1 - Probabilistic Mapping of Human Visual Attention from Head Pose Estimation
AU - Veronese, Andrea
AU - Racca, Mattia
AU - Pieters, Roel
AU - Kyrki, Ville
PY - 2017/10/30
Y1 - 2017/10/30
N2 - Effective interaction between a human and a robot requires the bidirectional perception and interpretation of actions and behavior. While actions can be identified as a directly observable activity, this might not be sufficient to deduce actions in a scene. For example, orienting our face toward a book might suggest the action toward “reading.” For a human observer, this deduction requires the direction of gaze, the object identified as a book and the intersection between gaze and book. With this in mind, we aim to estimate and map human visual attention as directed to a scene, and assess how this relates to the detection of objects and their related actions. In particular, we consider human head pose as measurement to infer the attention of a human engaged in a task and study which prior knowledge should be included in such a detection system. In a user study, we show the successful detection of attention to objects in a typical office task scenario (i.e., reading, working with a computer, studying an object). Our system requires a single external RGB camera for head pose measurements and a pre-recorded 3D point cloud of the environment.
AB - Effective interaction between a human and a robot requires the bidirectional perception and interpretation of actions and behavior. While actions can be identified as a directly observable activity, this might not be sufficient to deduce actions in a scene. For example, orienting our face toward a book might suggest the action toward “reading.” For a human observer, this deduction requires the direction of gaze, the object identified as a book and the intersection between gaze and book. With this in mind, we aim to estimate and map human visual attention as directed to a scene, and assess how this relates to the detection of objects and their related actions. In particular, we consider human head pose as measurement to infer the attention of a human engaged in a task and study which prior knowledge should be included in such a detection system. In a user study, we show the successful detection of attention to objects in a typical office task scenario (i.e., reading, working with a computer, studying an object). Our system requires a single external RGB camera for head pose measurements and a pre-recorded 3D point cloud of the environment.
KW - Object detection
KW - attention detection
KW - visual attention mapping
KW - head pose
KW - human-robot interface
UR - http://www.scopus.com/inward/record.url?scp=85061920299&partnerID=8YFLogxK
U2 - 10.3389/frobt.2017.00053
DO - 10.3389/frobt.2017.00053
M3 - Article
AN - SCOPUS:85061920299
SN - 2296-9144
VL - 4
JO - Frontiers in Robotics and AI
JF - Frontiers in Robotics and AI
IS - OCT
M1 - 53
ER -