MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding

2021-08-14 14:10:23

Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin

arXiv_CV

Abstract
Abstract (translated)
URL
PDF

Abstract

We present MMOCR-an open-source toolbox which provides a comprehensive pipeline for text detection and recognition, as well as their downstream tasks such as named entity recognition and key information extraction. MMOCR implements 14 state-of-the-art algorithms, which is significantly more than all the existing open-source OCR projects we are aware of to date. To facilitate future research and industrial applications of text recognition-related problems, we also provide a large number of trained models and detailed benchmarks to give insights into the performance of text detection, recognition and understanding. MMOCR is publicly released at this https URL.

Abstract (translated)

URL

https://arxiv.org/abs/2108.06543

PDF

https://arxiv.org/pdf/2108.06543.pdf