Using Computer Vision to Automate Hand Detection and Tracking of Surgeon Movements in Videos of Open Surgery

2020-12-13 03:10:09

Michael Zhang, Xiaotian Cheng, Daniel Copeland, Arjun Desai, Melody Y. Guan, Gabriel A. Brat, Serena Yeung

arXiv_CV

arXiv_CV CNN Detection Object_Detection Tracking Prediction

Abstract
Abstract (translated)
URL
PDF

Abstract

Open, or non-laparoscopic surgery, represents the vast majority of all operating room procedures, but few tools exist to objectively evaluate these techniques at scale. Current efforts involve human expert-based visual assessment. We leverage advances in computer vision to introduce an automated approach to video analysis of surgical execution. A state-of-the-art convolutional neural network architecture for object detection was used to detect operating hands in open surgery videos. Automated assessment was expanded by combining model predictions with a fast object tracker to enable surgeon-specific hand tracking. To train our model, we used publicly available videos of open surgery from YouTube and annotated these with spatial bounding boxes of operating hands. Our model's spatial detections of operating hands significantly outperforms the detections achieved using pre-existing hand-detection datasets, and allow for insights into intra-operative movement patterns and economy of motion.

Abstract (translated)

URL

https://arxiv.org/abs/2012.06948

PDF

https://arxiv.org/pdf/2012.06948.pdf