To show or not to show: Redacting sensitive text from videos of electronic displays

2022-08-19 07:53:04

Abhishek Mukhopadhyay, Shubham Agarwal, Patrick Dylan Zwick, Pradipta Biswas

arXiv_AI

Abstract
Abstract (translated)
URL
PDF

Abstract

With the increasing prevalence of video recordings there is a growing need for tools that can maintain the privacy of those recorded. In this paper, we define an approach for redacting personally identifiable text from videos using a combination of optical character recognition (OCR) and natural language processing (NLP) techniques. We examine the relative performance of this approach when used with different OCR models, specifically Tesseract and the OCR system from Google Cloud Vision (GCV). For the proposed approach the performance of GCV, in both accuracy and speed, is significantly higher than Tesseract. Finally, we explore the advantages and disadvantages of both models in real-world applications.

Abstract (translated)

URL

https://arxiv.org/abs/2208.10270

PDF

https://arxiv.org/pdf/2208.10270.pdf