Abstract
With the escalated demand of human-machine interfaces for intelligent systems, development of gaze controlled system have become a necessity. Gaze, being the non-intrusive form of human interaction, is one of the best suited approach. Appearance based deep learning models are the most widely used for gaze estimation. But the performance of these models is entirely influenced by the size of labeled gaze dataset and in effect affects generalization in performance. This paper aims to develop a semi-supervised contrastive learning framework for estimation of gaze direction. With a small labeled gaze dataset, the framework is able to find a generalized solution even for unseen face images. In this paper, we have proposed a new contrastive loss paradigm that maximizes the similarity agreement between similar images and at the same time reduces the redundancy in embedding representations. Our contrastive regression framework shows good performance in comparison to several state of the art contrastive learning techniques used for gaze estimation.
Abstract (translated)
随着对智能系统的人机交互需求不断增加, gaze控制系统已经成为必要的开发方向。 gaze 是人类交互的非侵入形式,因此是一种非常适合的方法。基于外观的深度学习模型是 gaze 估计最常用的方法之一。但这些模型的性能完全取决于标记的 gaze 数据集的大小,并且实际上会影响其性能的泛化能力。本文旨在开发一个半监督的对比学习框架,用于估计 gaze 方向。只要有少量的标记 gaze 数据集,框架就能够为 unseen 的面容图像找到通用的解决方案。本文提出了一种新的对比损失范式,该范式最大化相似图像之间的相似度一致性,同时减少嵌入表示中的冗余。我们的对比回归框架相对于用于 gaze 估计的一些最先进的对比学习技术表现出良好的性能。
URL
https://arxiv.org/abs/2308.02784