Towards Optimizing OCR for Accessibility

2022-06-21 11:01:42

Peya Mowar, Tanuja Ganu, Saikat Guha

arXiv_CV

arXiv_CV OCR Speech

Abstract
Abstract (translated)
URL
PDF

Abstract

Visual cues such as structure, emphasis, and icons play an important role in efficient information foraging by sighted individuals and make for a pleasurable reading experience. Blind, low-vision and other print-disabled individuals miss out on these cues since current OCR and text-to-speech software ignore them, resulting in a tedious reading experience. We identify four semantic goals for an enjoyable listening experience, and identify syntactic visual cues that help make progress towards these goals. Empirically, we find that preserving even one or two visual cues in aural form significantly enhances the experience for listening to print content.

Abstract (translated)

URL

https://arxiv.org/abs/2206.10254

PDF

https://arxiv.org/pdf/2206.10254.pdf