Abstract
This paper explores the role of eye gaze in human-robot interactions and proposes a novel system for detecting objects gazed by the human using solely visual feedback. The system leverages on face detection, human attention prediction, and online object detection, and it allows the robot to perceive and interpret human gaze accurately, paving the way for establishing joint attention with human partners. Additionally, a novel dataset collected with the humanoid robot iCub is introduced, comprising over 22,000 images from ten participants gazing at different annotated objects. This dataset serves as a benchmark for evaluating the performance of the proposed pipeline. The paper also includes an experimental analysis of the pipeline's effectiveness in a human-robot interaction setting, examining the performance of each component. Furthermore, the developed system is deployed on the humanoid robot iCub, and a supplementary video showcases its functionality. The results demonstrate the potential of the proposed approach to enhance social awareness and responsiveness in social robotics, as well as improve assistance and support in collaborative scenarios, promoting efficient human-robot collaboration. The code and the collected dataset will be released upon acceptance.
Abstract (translated)
本论文探讨了人类-机器人交互中的目光注视作用,并提出了一种新型的系统,仅使用视觉反馈来检测人类注视的对象。该系统利用面部检测、人类注意力预测和在线对象检测技术,使机器人能够准确感知和解释人类注视的对象,为与人类合作伙伴建立联合关注铺平了道路。此外,与直立型机器人iCub一起收集了一个新的数据集,其中包括超过22,000张由十名参与者注视不同注释对象的图像。这个数据集作为评估 proposed 管道性能的标准。本文还包含了在人类-机器人交互环境中实验分析管道有效性的文献,考察了每个组件的性能。此外,该系统被部署在直立型机器人 iCub 上,一个补充视频展示了其功能。结果表明,该方法的潜力在于增强社会机器人中的社会意识和响应能力,改进协作场景中的协助和支持,促进高效的人类-机器人协作。代码和收集的数据集将在接受后发布。
URL
https://arxiv.org/abs/2308.13318