Paper Reading AI Learner

APPD: Adaptive and Precise Pupil Boundary Detection using Entropy of Contour Gradients

2018-08-09 15:26:35
Cihan Topal, Halil Ibrahim Cakir, Cuneyt Akinlar

Abstract

Eye tracking spreads through a vast area of applications from ophthalmology, assistive technologies to gaming and virtual reality. Precisely detecting the pupil's contour and center is the very first step in many of these tasks, hence needs to be performed accurately. Although detection of pupil is a simple problem when it is entirely visible; occlusions and oblique view angles complicate the solution. In this study, we propose APPD, an adaptive and precise pupil boundary detection method that is able to infer whether entire pupil is in clearly visible by a heuristic that estimates the shape of a contour in a computationally efficient way. Thus, a faster detection is performed with the assumption of no occlusions. If the heuristic fails, a more comprehensive search among extracted image features is executed to maintain accuracy. Furthermore, the algorithm can find out if there is no pupil as an helpful information for many applications. We provide a dataset containing 3904 high resolution eye images collected from 12 subjects and perform an extensive set of experiments to obtain quantitative results in terms of accuracy, localization and timing. The proposed method outperforms three other state of the art algorithms and has an average execution time $\sim$5 ms in single-thread on a standard laptop computer for 720p images.

Abstract (translated)

眼动追踪通过眼科,辅助技术,游戏和虚拟现实等广泛应用。精确检测瞳孔的轮廓和中心是许多这些任务的第一步,因此需要准确地执行。虽然在完全可见的情况下检测瞳孔是一个简单的问题;闭塞和倾斜视角使解决方案复杂化。在这项研究中,我们提出APPD,一种自适应和精确的瞳孔边界检测方法,能够通过以计算有效的方式估计轮廓形状的启发式来推断整个瞳孔是否清晰可见。因此,在没有遮挡的情况下执行更快的检测。如果启发式失败,则执行提取的图像特征中的更全面的搜索以保持准确性。此外,该算法可以找出是否没有瞳孔作为许多应用的有用信息。我们提供了一个数据集,其中包含从12个受试者收集的3904个高分辨率眼睛图像,并执行一系列广泛的实验,以获得准确性,定位和时间方面的定量结果。所提出的方法优于其他三种最先进的算法,并且在用于720p图像的标准膝上型计算机上的单线程中具有平均执行时间$ \ sim $ 5ms。

URL

https://arxiv.org/abs/1709.06366

PDF

https://arxiv.org/pdf/1709.06366.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot