Paper Reading AI Learner

A Review of Driver Gaze Estimation and Application in Gaze Behavior Understanding

2023-07-04 04:29:03
Pavan Kumar Sharma, Pranamesh Chakraborty

Abstract

Driver gaze plays an important role in different gaze-based applications such as driver attentiveness detection, visual distraction detection, gaze behavior understanding, and building driver assistance system. The main objective of this study is to perform a comprehensive summary of driver gaze fundamentals, methods to estimate driver gaze, and it's applications in real world driving scenarios. We first discuss the fundamentals related to driver gaze, involving head-mounted and remote setup based gaze estimation and the terminologies used for each of these data collection methods. Next, we list out the existing benchmark driver gaze datasets, highlighting the collection methodology and the equipment used for such data collection. This is followed by a discussion of the algorithms used for driver gaze estimation, which primarily involves traditional machine learning and deep learning based techniques. The estimated driver gaze is then used for understanding gaze behavior while maneuvering through intersections, on-ramps, off-ramps, lane changing, and determining the effect of roadside advertising structures. Finally, we have discussed the limitations in the existing literature, challenges, and the future scope in driver gaze estimation and gaze-based applications.

Abstract (translated)

司机的目光在各种不同的 gaze based 应用中发挥着重要作用,例如司机注意力检测、视觉干扰检测、目光行为理解以及构建司机辅助系统。本研究的主要目标是对司机的目光 fundamentals、估计司机目光的方法以及它在真实世界驾驶场景中的应用进行 comprehensive summary。我们首先讨论了与司机的目光相关的 fundamentals,包括基于头戴式和远程setup 的目光估算方法,以及这些数据收集方法所使用的术语。接下来,我们列出了现有的司机目光基准数据集,重点介绍了这些数据的收集方法和所使用的设备。这之后,我们讨论了用于司机目光估算的算法,这主要涉及传统的机器学习和深度学习 based 技术。估计的司机目光被用来理解目光行为,在穿过路口、入口、出口、换车道以及确定路边广告结构的影响时进行操纵。最后,我们讨论了现有文献中的限制、挑战以及司机目光估算和 gaze-based 应用程序的未来 scope。

URL

https://arxiv.org/abs/2307.01470

PDF

https://arxiv.org/pdf/2307.01470.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot