Optical_Character
Optical_Character
-
Noisy Parallel Data Alignment
Ruoyu Xie, Antonios Anastasopoulos
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
PDF
-
On the feasibility of attacking Thai LPR systems with adversarial examples
Chissanupong Jiamsuchon, Jakapan Suaboot, Norrathep Rattanavipanon
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Adversarial
Pose
Prediction
PDF
-
Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle
Alex Kogan
arXiv_AI
arXiv_AI
Recognition
OCR
Bert
Optical_Character
Pose
Inference
PDF
-
Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules
Ritesh Chandra, Sadhana Tiwari, Sonali Agarwal, Navjot Singh
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Ontology
Action
Medical
PDF
-
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling
Yilin Wen, Biao Luo, Yuqian Zhao
arXiv_AI
arXiv_AI
Recognition
Reinforcement_Learning
OCR
Optimization
Sparse
Optical_Character
Knowledge
Knowledge_Graph
Pose
Inference
Prediction
PDF
-
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
Gürkan Soykan, Deniz Yuret, Tevfik Metin Sezgin
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Detection
Relation
Inference
PDF
-
Bengali Handwritten Digit Recognition using CNN with Explainable AI
Md Tanvir Rouf Shawon, Raihan Tanvir, Md. Golam Rabiul Alam
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
-
Geometric Rectification of Creased Document Images based on Isometric Mapping
Dong Luo, Pengbo Bo
arXiv_CV
arXiv_CV
Recognition
OCR
3D
Optical_Character
Knowledge
Pose
PDF
-
SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering
Siwen Luo, Feiqi Cao, Felipe Nunez, Zean Wen, Josiah Poon, Caren Han
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Scene_Text
Action
Relation
VQA
Attention
QA
PDF
-
Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Hongkuan Zhang, Edward Whittaker, Ikuo Kitagishi
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
PDF
-
PACMAN: a framework for pulse oximeter digit detection and reading in a low-resource setting
Chiraphat Boonnag, Wanumaidah Saengmolee, Narongrid Seesawad, Amrest Chinkamol, Saendee Rattanasomrerk, Kanyakorn Veerakanjana, Kamonwan Thanontip, Warissara Limpornchitwilai, Piyalitt Ittichaiwong, Theerawit Wilaiprasitporn
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
Detection
Object_Detection
PDF
-
SoftCTC $unicode{x2013}$ Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels
Martin Kišš, Michal Hradiš, Karel Beneš, Petr Buchal, Michal Kula
arXiv_CV
arXiv_CV
Handwriting
Recognition
Optical_Character
Speech
Pose
Speech_Recognition
PDF
-
Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images
Shufan Li, Congxi Lu, Linkai Li, Haoshuai Zhou
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
Detection
Object_Detection
PDF
-
Text-Aware Dual Routing Network for Visual Question Answering
Luoqian Jiang, Yifan He, Jian Chen
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
-
Efficient few-shot learning for pixel-precise handwritten document layout analysis
Axel De Nardin, Silvia Zottin, Matteo Paier, Gian Luca Foresti, Emanuela Colombi, Claudio Piciarelli
arXiv_AI
arXiv_AI
Recognition
Optical_Character
Few-Shot
PDF
-
A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail
Zhibo Zhang, Ernesto Damiani, Hussam Al Hamadi, Chan Yeob Yeun, Fatma Taher
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
Pose
Detection
CNN
PDF
-
MCSCSet: A Specialist-annotated Dataset for Medical-domain Chinese Spelling Correction
Wangjie Jiang, Zhihao Ye, Zijing Ou, Ruihui Zhao, Jianguang Zheng, Yi Liu, Siheng Li, Bang Liu, Yujiu Yang, Yefeng Zheng
arXiv_CL
arXiv_CL
Recognition
Optical_Character
Knowledge
Pose
Attention
Medical
PDF
-
MenuAI: Restaurant Food Recommendation System via a Transformer-based Deep Learning Model
Xinwei Ju, Frank Po Wen Lo, Jianing Qiu, Peilun Shi, Jiachuan Peng, Benny Lo
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Optical_Character
Knowledge
Pose
Deep_Learning
Recommendation
PDF
-
Text Detection Forgot About Document OCR
Krzysztof Olejniczak, Milan Šulc
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
PDF
-
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections
Roberto Arroyo, Javier Yebes, Elena Martínez, Héctor Corrales, Javier Lorenzo
arXiv_CV
arXiv_CV
Enhancement
Recognition
OCR
Optical_Character
Action
Deep_Learning
Detection
Prediction
PDF
-
EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
Yashowardhan Shinde, Kishore Kulkarni
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Denoising
CNN
PDF
-
Chandojnanam: A Sanskrit Meter Identification and Utilization System
Hrishikesh Terdalkar, Arnab Bhattacharya
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Face
Matching
PDF
-
3D Rendering Framework for Data Augmentation in Optical Character Recognition
Andreas Spruck, Maximiliane Hawesch, Anatol Maier, Christian Riess, Jürgen Seiler, André Kaup
arXiv_CV
arXiv_CV
Recognition
OCR
3D
Optical_Character
Pose
PDF
-
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Prediction
PDF
-
Computer vision based vehicle tracking as a complementary and scalable approach to RFID tagging
Pranav Kant Gaur, Abhilash Bhardwaj, Pritam Shete, Mohini Laghate, Dinesh M Sarode
arXiv_CV
arXiv_CV
Tracking
Enhancement
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
GAN
Prediction
PDF
-
A Masked Bounding-Box Selection Based ResNet Predictor for Text Rotation Prediction
Michael Yang, Yuan Lin, ChiuMan Ho
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Deep_Learning
CNN
Prediction
PDF
-
A Black-Box Attack on Optical Character Recognition Systems
Samet Bayram, Kenneth Barner
arXiv_CV
arXiv_CV
Recognition
Optical_Character
Adversarial
Pose
Classification
Deep_Learning
PDF
-
An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Aly Mostafa, Omar Mohamed, Ali Ashraf, Ahmed Elbehery, Salma Jamal, Anas Salah, Amr S. Ghoneim
arXiv_CL
arXiv_CL
Transformer
Handwriting
Enhancement
Recognition
OCR
Optimization
Optical_Character
Pose
Image_Enhancement
Action
PDF
-
To show or not to show: Redacting sensitive text from videos of electronic displays
Abhishek Mukhopadhyay, Shubham Agarwal, Patrick Dylan Zwick, Pradipta Biswas
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
PDF
-
Character decomposition to resolve class imbalance problem in Hangul OCR
Geonuk Kim, Jaemin Son, Kanghyu Lee, Jaesik Min
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
-
You Actually Look Twice At it : using an object detection approach instead of region segmentation within the Kraken engine
Thibault Clérice (ENC, CJM, HiSoMA, UJML)
arXiv_CV
arXiv_CV
Segmentation
Recognition
Optical_Character
Pose
Classification
Detection
Object_Detection
PDF
-
Towards Multimodal Vision-Language Models Generating Non-Generic Text
Wes Robbins, Zanyar Zohourianshahzadi, Jugal Kalita
arXiv_AI
arXiv_AI
Image_Caption
Transformer
Recognition
Optical_Character
Caption
Language_Model
PDF
-
Detection of Furigana Text in Images
Nikolaj Kjøller Bjerregaard, Veronika Cheplygina, Stefan Heinrich
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
Detection
Object_Detection
GAN
PDF
-
BusiNet -- a Light and Fast Text Detection Network for Business Documents
Oshri Naparstek, Ophir Azulai, Daniel Rotman, Yevgeny Burshtein, Peter Staar, Udi Barzelay
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Adversarial
Detection
PDF
-
Sequence-aware multimodal page classification of Brazilian legal documents
Pedro H. Luz de Araujo, Ana Paula G. S. de Almeida, Fabricio A. Braz, Nilton C. da Silva, Flavio de Barros Vidal, Teofilo E. de Campos
arXiv_CL
arXiv_CL
Embedding
Recognition
RNN
Optical_Character
Pose
Classification
CNN
PDF
-
Multistep Automated Data Labelling Procedure for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image Annotation
Jikai Zhang, Maciej M. Mazurowski, Brian C. Allen, Benjamin Wildman-Torbiner
arXiv_CV
arXiv_CV
Segmentation
Recognition
Optical_Character
Pose
PDF
-
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition
Xu Yang, Daoyuan Wu, Xiao Yi, Jimmy H. M. Lee, Tan Lee
arXiv_CV
arXiv_CV
Face_Detection
Recognition
OCR
Optimization
Optical_Character
Pose
Face
Detection
Face_Recognition
PDF
-
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu, Chao Wang, Wenqiang Lei, Ziyang Liu, Tat Seng Chua
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
Face
Action
Detection
Object_Detection
Attention
Prediction
PDF
-
Transformer based Urdu Handwritten Text Optical Character Reader
Mohammad Daniyal Shaiq, Musa Dildar Ahmed Cheema, Ali Kamal
arXiv_AI
arXiv_AI
Transformer
Handwriting
OCR
Optical_Character
Pose
Action
PDF
-
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Self-Supervised
Pose
Detection
Object_Detection
Attention
Inference
PDF
-
Two Decades of Bengali Handwritten Digit Recognition: A Survey
A.B.M. Ashikur Rahman, Md. Bakhtiar Hasan, Sabbir Ahmed, Tasnim Ahmed, Md. Hamjajul Ashmafee, Mohammad Ridwan Kabir, Md. Hasanul Kabir
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Review
Survey
Deep_Learning
PDF
-
Introducing One Sided Margin Loss for Solving Classification Problems in Deep Networks
Ali Karimi, Zahra Mousavi Kouzehkanan, Reshad Hosseini, Hadi Asheri
arXiv_CV
arXiv_CV
Recognition
Optical_Character
Classification
PDF
-
Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness
Christoph Auer (1), Michele Dolfi (1), André Carvalho (2), Cesar Berrospi Ramis (1), Peter W. J. Staar (1) ((1) IBM Research, (2) SoftINSA Lda.)
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Knowledge
Face
PDF
-
Optical character recognition quality affects perceived usefulness of historical newspaper clippings
Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen
arXiv_CL
arXiv_CL
Recognition
Optical_Character
Knowledge
PDF
-
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Stefan Schweter, Luisa März, Katharina Schmid, Erion Çano
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
GAN
Language_Model
PDF
-
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
Video_Caption
OCR
Optical_Character
Scene_Text
Classification
Detection
Object_Detection
Caption
Image_Classification
Language_Model
PDF
-
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition
Md. Ismail Hossain, Mohammed Rakib, Sabbir Mollah, Fuad Rahman, Nabeel Mohammed
arXiv_AI
arXiv_AI
Handwriting
Recognition
OCR
RNN
Optical_Character
Knowledge
CNN
PDF
-
Detection Masking for Improved OCR on Noisy Documents
Daniel Rotman, Ophir Azulai, Inbar Shapira, Yevgeny Burshtein, Udi Barzelay
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Detection
PDF
-
A Hybrid Defense Method against Adversarial Attacks on Traffic Sign Classifiers in Autonomous Vehicles
Zadid Khan, Mashrur Chowdhury, Sakib Mahmud Khan
arXiv_CV
arXiv_CV
Transfer_Learning
Gradient_Descent
Recognition
Optical_Character
Adversarial
Classification
Autonomous
PDF
-
BankNote-Net: Open dataset for assistive universal currency recognition
Felipe Oviedo, Srinivas Vinnakota, Eugene Seleznev, Hemant Malhotra, Saqib Shaikh, Juan Lavista Ferres
arXiv_CV
arXiv_CV
Embedding
Recognition
Optical_Character
Contrastive_Learning
Few-Shot
PDF
-
Digitizing Historical Balance Sheet Data: A Practitioner's Guide
Sergio Correia, Stephan Luck
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
PDF
-
Benchmarking Algorithms for Automatic License Plate Recognition
Marcel Del Castillo Velarde, Gissel Velarde
arXiv_CV
arXiv_CV
Transfer_Learning
Recognition
Optical_Character
Pose
CNN
PDF
-
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Chengyang Fang, Gangyan Zeng, Yu Zhou, Daiqing Wu, Can Ma, Dayong Hu, Weiping Wang
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
-
Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting
Chuhui Xue, Yu Hao, Shijian Lu, Philip Torr, Song Bai
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Weakly_Supervised
Optical_Character
Pose
Scene_Text
Action
Detection
PDF
-
OCR quality affects perceived usefulness of historical newspaper clippings -- a user study
Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Knowledge
Face
PDF
-
Improving Amharic Handwritten Word Recognition Using Auxiliary Task
Mesay Samuel Gondere, Lars Schmidt-Thieme, Durga Prasad Sharma, Abiot Sinamo Boltena
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Classification
Deep_Learning
CNN
PDF
-
Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
Geewook Kim, Wonseok Hwang, Minjoon Seo, Seunghyun Park
arXiv_CL
arXiv_CL
Embedding
Recognition
Optical_Character
Pose
Matching
PDF
-
DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts
Wenzhen Zhu, Negin Sokhandan, Guang Yang, Sujitha Martin, Suchitra Sathyanarayana
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
PDF
-
Self-paced learning to improve text row detection in historical documents with missing lables
Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu
arXiv_CV
arXiv_CV
Recognition
Optical_Character
Pose
Detection
Object_Detection
GAN
PDF
-
An Assessment of the Impact of OCR Noise on Language Models
Konstantin Todorov, Giovanni Colavizza
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Optical_Character
Pose
Language_Model
PDF
-
Classroom Slide Narration System
Jobin K.V., Ajoy Mondal, C. V. Jawahar
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Recognition
OCR
Optical_Character
Pose
Face
Classification
PDF
-
On the Cross-dataset Generalization for License Plate Recognition
Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
-
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
Bao Hieu Tran, Thanh Le-Cong, Huu Manh Nguyen, Duc Anh Le, Thanh Hung Nguyen, Phi Le Nguyen
arXiv_CV
arXiv_CV
Transformer
Recognition
RNN
Optical_Character
Pose
Scene_Text
Attention
PDF
-
A Survey on Deep learning based Document Image Enhancement
Zahra Anvari, Vassilis Athitsos
arXiv_CV
arXiv_CV
Enhancement
Recognition
OCR
Restoration
Optical_Character
Review
Pose
Image_Enhancement
Survey
Action
Deep_Learning
Denoising
Attention
PDF
-
An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images
Zekun Li, Yao-Yi Chiang, Sasan Tavakkol, Basel Shbita, Johannes H. Uhl, Stefan Leyk, Craig A. Knoblock
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
PDF
-
Donut: Document Understanding Transformer without OCR
Geewook Kim, Teakgyu Hong, Moonbin Yim, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
-
Image preprocessing and modified adaptive thresholding for improving OCR
Rohan Lal Kshetry
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
-
Ice hockey player identification via transformers
Kanav Vats, William McNally, Pascale Walters, David A. Clausi, John S. Zelek
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Action
PDF
-
Handwritten Digit Recognition Using Improved Bounding Box Recognition Technique
Arkaprabha Basu, M. Sathya
arXiv_CV
arXiv_CV
Recognition
OCR
Optimization
Optical_Character
Prediction
PDF
-
Lexically Aware Semi-Supervised Learning for OCR Post-Correction
Shruti Rijhwani, Daisy Rosenblum, Antonios Anastasopoulos, Graham Neubig
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
Language_Model
PDF
-
Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts
Allen Kim, Charuta Pethe, Naoya Inoue, Steve Skiena
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Detection
Relation
Language_Model
PDF
-
Optical Character Recognition of 19th Century Classical Commentaries: the Current State of Affairs
Matteo Romanello, Sven Najem-Meyer, Bruce Robertson
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Face
PDF
-
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng
arXiv_CL
arXiv_CL
Segmentation
Recognition
Video_Caption
OCR
Optical_Character
Knowledge
Speech
Pose
Detection
Speech_Recognition
Caption
PDF
-
A Proposal of Automatic Error Correction in Text
Wulfrano A. Luna-Ramírez, Carlos R. Jaimez-González
arXiv_AI
arXiv_AI
Recognition
Optical_Character
Speech
Pose
Detection
Language_Model
PDF
-
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li, Tengchao Lv, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
RNN
Optical_Character
Pose
Text_Generation
Language_Model
PDF
-
Deep learning-based NLP Data Pipeline for EHR Scanned Document Information Extraction
Enshuo Hsu (1, 3, and 4), Ioannis Malagaris (1), Yong-Fang Kuo (1), Rizwana Sultana (2), Kirk Roberts (3) ((1) Office of Biostatistics, (2) Division of Pulmonary, Critical Care and Sleep Medicine, Department of Internal Medicine, University of Texas Medical Branch, Galveston, Texas, USA. (3) School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA. (4) Center for Outcomes Research, Houston Methodist, Houston, TX, USA.)
arXiv_CV
arXiv_CV
Recognition
OCR
Bert
RNN
Optical_Character
Pose
Action
Deep_Learning
Medical
PDF
-
Post-OCR Document Correction with large Ensembles of Character Sequence Models
Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Evangelos Milios, Axel J. Soto
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
PDF
-
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
Yuning Du, Chenxia Li, Ruoyu Guo, Cheng Cui, Weiwei Liu, Jun Zhou, Bin Lu, Yehua Yang, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Inference
PDF
-
A Multimodal Framework for Video Ads Understanding
Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Speech
Attention
Speech_Recognition
Prediction
PDF
-
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Xiaopeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rose
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Relation
VQA
QA
PDF
-
VisBuddy -- A Smart Wearable Assistant for the Visually Challenged
Ishwarya Sivakumar, Nishaali Meenakshisundaram, Ishwarya Ramesh, Shiloah Elizabeth D, Sunil Retmin Raj C
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Optical_Character
Knowledge
Pose
Face
Action
Deep_Learning
Detection
Object_Detection
Caption
PDF
-
Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents
Amit Gupte, Alexey Romanov, Sahitya Mantravadi, Dalitso Banda, Jianjie Liu, Raza Khan, Lakshmanan Ramu Meenal, Benjamin Han, Soundar Srinivasan
arXiv_CL
arXiv_CL
Recognition
OCR
Restoration
Optical_Character
Action
PDF
-
Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining
Guowei Xu, Wenbiao Ding, Weiping Fu, Zhongqin Wu, Zitao Liu
arXiv_AI
arXiv_AI
Recognition
OCR
Text_Classification
Optical_Character
Pose
Classification
PDF
-
An End-to-End Khmer Optical Character Recognition using Sequence-to-Sequence with Attention
Rina Buoy, Sokchea Kor, Nguonly Taing
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Attention
CNN
PDF
-
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Jiapeng Wang, Tianwei Wang, Guozhi Tang, Lianwen Jin, Weihong Ma, Kai Ding, Yichao Huang
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Action
Attention
GAN
Inference
PDF
-
Classification of Documents Extracted from Images with Optical Character Recognition Methods
Omer Aydin
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Optical_Character
Classification
PDF
-
Mixed Model OCR Training on Historical Latin Script for Out-of-the-Box Recognition and Finetuning
Christian Reul, Christoph Wick, Maximilian Nöth, Andreas Büttner, Maximilian Wehner, Uwe Springmann
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
-
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter
Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Attention
GAN
Inference
Prediction
PDF
-
PAM: Understanding Product Images in Cross Product Category Attribute Extraction
Rongmei Lin, Xiang He, Jie Feng, Nasser Zalmout, Yan Liang, Li Xiong, Xin Luna Dong
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Knowledge
Knowledge_Graph
Pose
Action
VQA
PDF
-
Classification of Contract-Amendment Relationships
Fuqi Song
arXiv_CL
arXiv_CL
Tracking
Recognition
OCR
Optical_Character
Pose
Classification
Relation
PDF
-
Bangla Natural Language Processing: A Comprehensive Review of Classical, Machine Learning, and Deep Learning Based Methods
Ovishake Sen, Mohtasim Fuad, MD. Nazrul Islam, Jakaria Rabbi, MD. Kamrul Hasan, Awal Ahmed Fime, Md. Tahmid Hasan Fuad, Delowar Sikder, MD. Akil Raihan Iftee
arXiv_AI
arXiv_AI
Recognition
Optical_Character
Knowledge
Review
Speech
Face
Action
Deep_Learning
Detection
Sentiment
Summarization
Speech_Recognition
PDF
-
Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling
Marcin Namysl, Sven Behnke, Joachim Köhler
arXiv_CL
arXiv_CL
Embedding
Recognition
OCR
Optical_Character
Language_Model
PDF
-
Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations
Pascal Fischer, Alen Smajic, Alexander Mehler, Giuseppe Abrami
arXiv_AI
arXiv_AI
Transfer_Learning
Recognition
OCR
Optical_Character
Deep_Learning
Detection
Attention
PDF
-
Simple Transparent Adversarial Examples
Jaydeep Borkar, Pin-Yu Chen
arXiv_AI
arXiv_AI
Embedding
Recognition
Optical_Character
Adversarial
Pose
Detection
Object_Detection
PDF
-
End-to-End Unsupervised Document Image Blind Denoising
Mehrdad J Gangeh, Marcin Plata, Hamid Motahari, Nigel P Duffy
arXiv_CV
arXiv_CV
Unsupervised
Recognition
OCR
Optical_Character
Pose
Deep_Learning
Denoising
PDF
-
Unknown-box Approximation to Improve Optical Character Recognition Performance
Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
-
STRIDE : Scene Text Recognition In-Device
Rachit S Munjal, Arun D Prabhu, Nikhil Arora, Sukumar Moharana, Gopi Ramena
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Scene_Text
Attention
Inference
PDF
-
Mining Legacy Issues in Open Pit Mining sites: Innovation & Support of Renaturalization and Land Utilization
Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann, Gerhard Heyer
arXiv_CL
arXiv_CL
Recognition
OCR
Text_Classification
Optical_Character
Action
Classification
PDF
-
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Detection
VQA
QA
PDF
-
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding
Zilong Wang, Mingjie Zhan, Houxing Ren, Zhaohui Hou, Yuwei Wu, Xingyan Zhang, Ding Liang
arXiv_AI
arXiv_AI
OCR
Optical_Character
Pose
Action
Relation
Relation_Extraction
GAN
PDF
-
An end-to-end Optical Character Recognition approach for ultra-low-resolution printed text images
Julian D. Gilbey, Carola-Bibiane Schönlieb
arXiv_CV
arXiv_CV
Recognition
Super_Resolution
OCR
Optical_Character
PDF
-
End-to-End Optical Character Recognition for Bengali Handwritten Words
Farisa Benta Safir, Abu Quwsar Ohi, M.F. Mridha, Muhammad Mostafa Monowar, Md. Abdul Hamid
arXiv_CV
arXiv_CV
NAS
Recognition
OCR
RNN
Optical_Character
Review
Pose
CNN
PDF
-
Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model
Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Svein Arne Brygfjeld
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Bert
Optical_Character
Classification
Language_Model
PDF
-
Document Layout Analysis via Dynamic Residual Feature Fusion
Xingjiao Wu, Ziling Hu, Xiangcheng Du, Jing Yang, Liang He
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
-
Interpretable Distance Metric Learning for Handwritten Chinese Character Recognition
Boxiang Dong, Aparna S. Varde, Danilo Stevanovic, Jiayin Wang, Liang Zhao
arXiv_AI
arXiv_AI
Handwriting
Recognition
OCR
Optical_Character
Pose
Face
Action
PDF
-
Combining Morphological and Histogram based Text Line Segmentation in the OCR Context
Pit Schneider
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
PDF
-
Deep Structured Feature Networks for Table Detection and Tabular Data Extraction from Scanned Financial Document Images
Siwen Luo, Mengting Wu, Yiwen Gong, Wanying Zhou, Josiah Poon
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
Optical_Character
Pose
Action
Detection
CNN
PDF
-
SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition
Denis Coquenet, Clément Chatelain, Thierry Paquet
arXiv_CV
arXiv_CV
Segmentation
Handwriting
Recognition
OCR
Optical_Character
Pose
CNN
PDF
-
Neural OCR Post-Hoc Correction of Historical Corpora
Lijun Lyu, Maria Koutraki, Martin Krickl, Besnik Fetahu
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
RNN
Optical_Character
Pose
Face
Attention
CNN
PDF
-
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports
Nathan Cooper, Carlos Bernal-Cárdenas, Oscar Chaparro, Kevin Moran, Denys Poshyvanyk
arXiv_AI
arXiv_AI
Recognition
Optical_Character
Pose
Face
Detection
PDF
-
On-Device Document Classification using multimodal features
Sugam Garg, Harichandana, Sumit Kumar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Classification
Inference
PDF
-
Iranis: A Large-scale Dataset of Farsi License Plate Characters
Ali Tourani, Sajjad Soroori, Asadollah Shahbahrami, Alireza Akoushideh
arXiv_AI
arXiv_AI
Surveillance
Recognition
Optical_Character
Pose
Classification
Deep_Learning
Detection
Object_Detection
CNN
Image_Classification
PDF
-
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Face
Action
Attention
CNN
PDF
-
Indonesian ID Card Extractor Using Optical Character Recognition and Natural Language Post-Processing
Firhan Maulana Rusli, Kevin Akbar Adhiguna, Hendy Irawan
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
PDF
-
FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition Systems
Lu Chen, Jiao Sun, Wei Xu
arXiv_CV
arXiv_CV
Recognition
OCR
Optimization
Optical_Character
Adversarial
Pose
PDF
-
Vartani Spellcheck -- Automatic Context-Sensitive Spelling Correction of OCR-generated Hindi Text Using BERT and Levenshtein Distance
Aditya Pal, Abhijit Mustafi
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Bert
Optical_Character
Pose
Detection
Language_Model
PDF
-
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
Optical_Character
VQA
Attention
Caption
QA
PDF
-
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Zhaokai Wang, Renda Bao, Qi Wu, Si Liu
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Embedding
Recognition
OCR
Optical_Character
Pose
NMT
Caption
PDF
-
Polarization-driven Semantic Segmentation via Efficient Attention-bridged Fusion
Kaite Xiang, Kailun Yang, Kaiwei Wang
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Optical_Character
Detection
Attention
Autonomous
PDF
-
A Panoramic Survey of Natural Language Processing in the Arab World
Kareem Darwish, Nizar Habash, Mourad Abbas, Hend Al-Khalifa, Huseein T. Al-Natsheh, Samhaa R. El-Beltagy, Houda Bouamor, Karim Bouzoubaa, Violetta Cavalli-Sforza, Wassim El-Hajj, Mustafa Jarrar, Hamdy Mubarak
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Speech
Survey
Sentiment
Speech_Recognition
PDF
-
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning
Baohua Sun, Michael Lin, Hao Sha, Lin Yang
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Optical_Character
Pose
Detection
Caption
PDF
-
Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Fuqi Song, Éric de la Clergerie
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
Pose
PDF
-
BanglaWriting: A multi-purpose offline Bangla handwriting dataset
M. F. Mridha, Abu Quwsar Ohi, M. Ameer Ali, Mazedul Islam Emon, Muhammad Mohsin Kabir
arXiv_CV
arXiv_CV
Segmentation
Handwriting
Recognition
Optical_Character
Pose
PDF
-
On-Device Language Identification of Text in Images using Diacritic Characters
Shubham Vatsal, Nikhil Arora, Gopi Ramena, Sukumar Moharana, Dhruval Jain, Naresh Purre, Rachit S Munjal
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Inference
PDF
-
An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish
Quan Duong, Mika Hämäläinen, Simon Hengchen
arXiv_CL
arXiv_CL
Unsupervised
Recognition
OCR
Optical_Character
Action
NMT
PDF
-
Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image Wordings
Yue Wang, Jing Li, Michael R. Lyu, Irwin King
arXiv_CV
arXiv_CV
Optical_Character
Pose
Action
Classification
Attention
Prediction
Matching
PDF
-
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
Zan-Xia Jin, Heran Wu, Chun Yang, Fang Zhou, Jingyan Qin, Lei Xiao, Xu-Cheng Yin
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Relation
VQA
QA
Matching
PDF
-
Persian Handwritten Digit, Character, and Words Recognition by Using Deep Learning Methods
Mehdi Bonyani, Simindokht Jahangard
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
-
TLGAN: document Text Localization using Generative Adversarial Nets
Dongyoung Kim, Myungsung Kwak, Eunji Won, Sejung Shin, Jeongyeon Nam
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Adversarial
Action
GAN
PDF
-
Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution
Xiaoyu Xiang, Qian Lin, Jan P. Allebach
arXiv_AI
arXiv_AI
Reconstruction
Face_Detection
Recognition
Super_Resolution
Optical_Character
Pose
Scene_Text
Face
Detection
PDF
-
Table Structure Recognition using Top-Down and Bottom-Up Cues
Sachin Raja, Ajoy Mondal, C. V. Jawahar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
Detection
GAN
PDF
-
Parameterized Reinforcement Learning for Optical System Optimization
Heribert Wankerl, Maike L. Stern, Ali Mahdavi, Christoph Eichler, Elmar W. Lang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Optical_Character
Pose
Action
PDF
-
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering
Wei Han, Hantao Huang, Tao Han
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
-
Multi-label Classification of Common Bengali Handwritten Graphemes: Dataset and Challenge
Samiul Alam, Tahsin Reasat, Asif Shahriyar Sushmit, Sadi Mohammad Siddiquee, Fuad Rahman, Mahady Hasan, Ahmed Imtiaz Humayun
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Classification
Deep_Learning
PDF
-
PP-OCR: A Practical Ultra Lightweight OCR System
Yuning Du, Chenxia Li, Ruoyu Guo, Xiaoting Yin, Weiwei Liu, Jun Zhou, Yifan Bai, Zilin Yu, Yehua Yang, Qingqing Dang, Haoshuang Wang
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
PDF
-
An Efficient Language-Independent Multi-Font OCR for Arabic Script
Hussein Osman, Karim Zaghw, Mostafa Hazem, Seifeldin Elsehely
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Action
PDF
-
Word Segmentation from Unconstrained Handwritten Bangla Document Images using Distance Transform
Pawan Kumar Singh, Shubham Sinha, Sagnik Pal Chowdhury, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
PDF
-
Handwritten Script Identification from Text Lines
Pawan Kumar Singh, Iman Chatterjee, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
-
Deep Transparent Prediction through Latent Representation Analysis
D. Kollias, N. Bouas, Y. Vlaxos, V. Brillakis, M. Seferis, I. Kollia, L. Sukissian, J. Wingate, S. Kollias
arXiv_CV
arXiv_CV
Unsupervised
Optical_Character
Pose
Deep_Learning
Prediction
PDF
-
Abstractive Information Extraction from Scanned Invoices using End-to-end Sequential Approach
Shreeshiv Patel, Dvijesh Bhatt
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Action
Deep_Learning
PDF
-
MRZ code extraction from visa and passport documents using convolutional neural networks
Yichuan Liu, Hailey James, Otkrist Gupta, Dan Raviv
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
Detection
CNN
PDF
-
OCR Graph Features for Manipulation Detection in Documents
Hailey James, Otkrist Gupta, Dan Raviv
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
PDF
-
EASTER: Efficient and Scalable Text Recognizer
Kartik Chaudhary, Raghav Bali
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Deep_Learning
CNN
PDF
-
On the Accuracy of CRNNs for Line-Based OCR: A Multi-Parameter Evaluation
Bernhard Liebl, Manuel Burghardt
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Face
PDF
-
Can You Read Me Now? Content Aware Rectification using Angle Supervision
Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor, Roee Litman
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
-
Weakly Supervised Construction of ASR Systems with Massive Video Data
Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang
arXiv_CL
arXiv_CL
Unsupervised
Recognition
OCR
Weakly_Supervised
Optical_Character
Knowledge
Speech
Pose
Speech_Recognition
PDF
-
Advancing Visual Specification of Code Requirements for Graphs
Dewi Yokelson
arXiv_CV
arXiv_CV
Recognition
Optical_Character
PDF
-
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection
K.V. Kadambari, Vishnu Vardhan Nimmalapudi
arXiv_CV
arXiv_CV
Transformer
Surveillance
Recognition
OCR
Optical_Character
Adversarial
Pose
Deep_Learning
Detection
Object_Detection
GAN
PDF
-
DivNoising: Diversity Denoising with Fully Convolutional Variational Autoencoders
Mangal Prakash, Alexander Krull, Florian Jug
arXiv_CV
arXiv_CV
Segmentation
Unsupervised
Recognition
OCR
Restoration
Optical_Character
Pose
Deep_Learning
Denoising
CNN
Prediction
PDF
-
Structured Multimodal Attentions for TextVQA
Chenyu Gao, Qi Zhu, Peng Wang, Hui Li, Yuliang Liu, Anton van den Hengel, Qi Wu
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Relation
VQA
Attention
QA
PDF
-
Plague Dot Text: Text mining and annotation of outbreak reports of the Third Plague Pandemic
Arlene Casey, Mike Bennett, Richard Tobin, Claire Grover, Iona Walker, Lukas Engelmann, Beatrice Alex
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Relation
PDF
-
Quantitative Analysis of Image Classification Techniques for Memory-Constrained Devices
Sebastian Müksch, Theo Olausson, John Wilhelm, Pavlos Andreadis
arXiv_CV
arXiv_CV
Recognition
RNN
Optical_Character
Speech
Pose
Quantitative
Classification
Speech_Recognition
CNN
Image_Classification
PDF
-
Deep Learning Based Vehicle Tracking System Using License Plate Detection And Recognition
Lalit Lakshmanan, Yash Vora, Raj Ghate
arXiv_CV
arXiv_CV
Tracking
Recognition
OCR
Optical_Character
Pose
Scene_Text
Deep_Learning
Detection
Object_Detection
PDF
-
A Hybrid Swarm and Gravitation based feature selection algorithm for Handwritten Indic Script Classification problem
Ritam Guha, Manosij Ghosh, Pawan Kumar Singh, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Classification
PDF
-
A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition
Steven I Reeves, Dongwook Lee, Anurag Singh, Kunal Verma
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
PDF
-
A Skip-connected Multi-column Network for Isolated Handwritten Bangla Character and Digit recognition
Animesh Singh, Ritesh Sarkhel, Nibaran Das, Mahantapas Kundu, Mita Nasipuri
arXiv_CV
arXiv_CV
Recognition
Optical_Character
Pose
Action
CNN
PDF
-
A Tool for Facilitating OCR Postediting in Historical Documents
Alberto Poncelas, Mohammad Aboomar, Jan Buts, James Hadley, Andy Way
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
Face
Language_Model
PDF
-
Object Detection and Recognition of Swap-Bodies using Camera mounted on a Vehicle
Ebin Zacharias, Didier Stricker, Martin Teuchler, Kripasindhu Sarkar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Face
Deep_Learning
Detection
Object_Detection
Attention
Autonomous
PDF
-
Image Processing Based Scene-Text Detection and Recognition with Tesseract
Ebin Zacharias, Martin Teuchler, Bénédicte Bernier
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Detection
PDF
-
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks
Wenwen Yu, Ning Lu, Xianbiao Qi, Ping Gong, Rong Xiao
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
Deep_Learning
Detection
CNN
PDF
-
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers
Bernhard Liebl, Manuel Burghardt
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Relation
Attention
PDF
-
Real-time information retrieval from Identity cards
Niloofar Tavakolian, Azadeh Nazemi, Donal Fitzpatrick
arXiv_CV
arXiv_CV
Face_Detection
Recognition
OCR
Optical_Character
Pose
Scene_Text
Face
Deep_Learning
Detection
Object_Detection
GAN
PDF
-
TextCaps: a Dataset for Image Captioning with Reading Comprehension
Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh
arXiv_CV
arXiv_CV
Image_Caption
Recognition
Optical_Character
Caption
PDF
-
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation
Sharon Fogel (1), Hadar Averbuch-Elor (2), Sarel Cohen, Shai Mazor (1), Roee Litman (1) ((1) Amazon Rekognition Israel, (2) Cornell University)
arXiv_CV
arXiv_CV
Semi_Supervised
Recognition
OCR
Optical_Character
Deep_Learning
GAN
Text_Generation
PDF
-
Constraints in Developing a Complete Bengali Optical Character Recognition System
Abu Saleh Md. Abir, Sanjana Rahman, Samia Ellin, Maisha Farzana, Md Hridoy Manik, Chowdhury Rafeed Rahman
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Review
PDF
-
Attacking Optical Character Recognition Systems with Adversarial Watermarks
Lu Chen, Wei Xu
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Adversarial
Pose
Detection
PDF
-
Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction
Kushagra Mahajan, Monika Sharma, Lovekesh Vig
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
PDF
-
Vehicle Re-identification: exploring feature fusion using multi-stream convolutional networks
Icaro O. de Oliveira, Rayson Laroca, David Menotti, Keiko V. O. Fonseca, Rodrigo Minetto
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Re-identification
CNN
PDF
-
Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory
Duc Nguyen, Nhan Tran, Hung Le
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Scene_Text
CNN
PDF
-
VASTA: A Vision and Language-assisted Smartphone Task Automation System
Alborz Rezazadeh Sereshkeh, Gary Leung, Krish Perumal, Caleb Phillips, Minfan Zhang, Afsaneh Fazly, Iqbal Mohomed
arXiv_AI
arXiv_AI
Recognition
Optical_Character
Face
Action
Detection
Object_Detection
PDF
-
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images
W. Ronny Huang, Yike Qi, Qianqian Li, Jonathan Degange
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Weakly_Supervised
Optical_Character
Pose
PDF
-
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction
Mika Hämäläinen, Simon Hengchen
arXiv_CL
arXiv_CL
Embedding
Unsupervised
Recognition
OCR
Optical_Character
NMT
PDF
-
Rosetta: Large scale system for text detection and recognition in images
Fedor Borisyuk, Albert Gordo, Viswanath Sivakumar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Face
Detection
Recommendation
PDF
-
NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term Memory
Adewale Akinfaderin, Olamilekan Wahab
arXiv_AI
arXiv_AI
Embedding
Recognition
OCR
RNN
Optical_Character
PDF
-
Chargrid-OCR: End-to-end trainable Optical Character Recognition through Semantic Segmentation and Object Detection
Christian Reisswig, Anoop R Katti, Marco Spinaci, Johannes Höhne
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Recognition
OCR
Optical_Character
Detection
Object_Detection
PDF
-
OCR4all -- An Open-Source Tool Providing a Automatic OCR Workflow for Historical Printings
Christian Reul, Dennis Christ, Alexander Hartelt, Nico Balbach, Maximilian Wehner, Uwe Springmann, Christoph Wick, Christine Grundig, Andreas Büttner, Frank Puppe
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Face
PDF
-
Self-supervised Data Bootstrapping for Deep Optical Character Recognition of Identity Documents
Oliver Mothes, Joachim Denzler
arXiv_CV
arXiv_CV
Recognition
Optical_Character
Self-Supervised
Pose
Classification
CNN
PDF
-
Mitigating Noisy Inputs for Question Answering
Denis Peskov, Joe Barrow, Pedro Rodriguez, Graham Neubig, Jordan Boyd-Graber
arXiv_CL
arXiv_CL
Recognition
Optical_Character
Speech
Speech_Recognition
QA
PDF
-
Answering Questions about Data Visualizations using Efficient Bimodal Fusion
Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan
arXiv_AI
arXiv_AI
Embedding
Recognition
Optical_Character
Pose
VQA
QA
PDF
-
A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples
Lalitha Giridhar, Aishwarya Dharani and, Velmathi Guruviah
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Speech
Pose
Classification
CNN
PDF
-
Brno Mobile OCR Dataset
Martin Kišš, Michal Hradiš, Oldřich Kodym
arXiv_CV
arXiv_CV
Recognition
OCR
Restoration
RNN
Optical_Character
Classification
Denoising
CNN
PDF
-
Efficient, Lexicon-Free OCR using Deep Learning
Marcin Namysl, Iuliu Konya
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Action
Deep_Learning
CNN
Language_Model
PDF
-
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume, Hazim Kemal Ekenel, Jean-Philippe Thiran
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Knowledge
Pose
Detection
PDF
-
Integration of Text-maps in Convolutional Neural Networks for Region Detection among Different Textual Categories
Roberto Arroyo, Javier Tovar, Francisco J. Delgado, Emilio J. Almazán, Diego G. Serrador, Antonio Hurtado
arXiv_CV
arXiv_CV
Enhancement
Recognition
OCR
Optical_Character
Pose
Detection
CNN
PDF
-
Producing Corpora of Medieval and Premodern Occitan
Jean-Baptiste Camps (CJM), Gilles Guilhem Couffignal (PLH)
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Deep_Learning
PDF
-
An Ensemble of Neural Networks for Non-Linear Segmentation of Overlapped Cursive Script
Amjad Rehman
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
PDF
-
Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features
Gregor Wiedemann, Gerhard Heyer
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
Optical_Character
CNN
PDF
-
SAML-QC: a Stochastic Assessment and Machine Learning based QC technique for Industrial Printing
Azhar Hussain
arXiv_CV
arXiv_CV
Recognition
Optical_Character
Pose
Detection
PDF
-
Optical Character Recognition for Telugu: Database, Algorithm and Application
Chandra Prakash Konkimalla, Manikanta Srikar Yellapragada, Trishal Gayam, Souraj Mandal, Sumohana S. Channappayya
arXiv_CV
arXiv_CV
Transfer_Learning
Recognition
OCR
Optical_Character
Deep_Learning
PDF
-
Pay Voice: Point of Sale Recognition for Visually Impaired People
Guilherme Folego, Filipe Costa, Bruno Costa, Alan Godoy, Luiz Pita
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
-
Deep Reader: Information extraction from Document images via relation extraction and Natural Language
Vishwanath D, Rohit Rahul, Gunjan Sehgal, Swati, Arindam Chowdhury, Monika Sharma, Lovekesh Vig, Gautam Shroff, Ashwin Srinivasan
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Face
Action
Relation
Relation_Extraction
PDF
-
Auto-Encoder-BoF/HMM System for Arabic Text Recognition
Najoua Rahal, Maroua Tounsi, Adel M. Alimi
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Sparse
Optical_Character
Pose
PDF
-
Binary Document Image Super Resolution for Improved Readability and OCR Performance
Ram Krishna Pandey, K Vignesh, A G Ramakrishnan, Chandrahasa B
arXiv_CV
arXiv_CV
Recognition
Super_Resolution
OCR
Optical_Character
Pose
Action
CNN
PDF
-
Deep Bayesian Uncertainty Estimation for Adaptation and Self-Annotation of Food Packaging Images
Fabio De Sousa Ribeiro, Francesco Caliva, Mark Swainson, Kjartan Gudmundsson, Georgios Leontidis, Stefanos Kollias
arXiv_CV
arXiv_CV
Optical_Character
Pose
Classification
Deep_Learning
CNN
Inference
Prediction
PDF
-
From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition
Mojtaba Heidarysafa, James Reed, Kamran Kowsari, April Celeste R.Leviton, Janet I. Warren, Donald E. Brown
arXiv_CV
arXiv_CV
Tracking
Recognition
Optical_Character
PDF
-
Image-based Natural Language Understanding Using 2D Convolutional Neural Networks
Erinc Merdivan, Anastasios Vafeiadis, Dimitrios Kalatzis, Sten Henke, Johannes Kropf, Konstantinos Votis, Dimitrios Giakoumis, Dimitrios Tzovaras, Liming Chen, Raouf Hamzaoui, Matthieu Geist
arXiv_CL
arXiv_CL
Recognition
Text_Classification
Optical_Character
Memory_Networks
Pose
Classification
CNN
PDF
-
Resolving Referring Expressions in Images With Labeled Elements
Nevan Wichers, Dilek Hakkani-Tur, Jindong Chen
arXiv_CV
arXiv_CV
Embedding
Recognition
Optical_Character
PDF
-
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin, Alessandro Pappalardo, Muhammad Mohsin Ghaffar, Giulio Gambardella, Norbert Wehn, Michaela Blott
arXiv_CV
arXiv_CV
Quantization
Recognition
RNN
Optical_Character
Classification
PDF
-
Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition
Christoph Wick, Christian Reul, Frank Puppe
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Classification
Deep_Learning
CNN
Prediction
PDF
-
French Word Recognition through a Quick Survey on Recurrent Neural Networks Using Long-Short Term Memory RNN-LSTM
Saman Sarraf
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Survey
Deep_Learning
PDF
-
Layered transition metal dichalcogenides: promising near-lattice-matched substrates for GaN growth
Priti Gupta, A. A. Rahman, Shruti Subramanian, Shalini Gupta, Arumugam Thamizhavel, Tatyana Orlova, Sergei Rouvimov, Suresh Vishwanath, Vladimir Protasenko, Masihhur R. Laskar, Huili Grace Xing, Debdeep Jena, Arnab Bhattacharya
arXiv_CV
arXiv_CV
Optical_Character
GAN
PDF
-
GaN directional couplers for integrated quantum photonics
Yanfeng Zhang, Loyd McKnight, Erman Engin, Ian M. Watson, Martin J. Cryan, Erdan Gu, Mark G. Thompson, Stephane Calvez, Jeremy L. O'Brien, Martin D. Dawson
arXiv_CV
arXiv_CV
Optical_Character
Pose
Face
GAN
PDF
-
GaN/AlGaN microcavities for enhancement of non linear optical effects
V. Tasco, I. Tarantini, A. Campa, A. Massaro, T. Stomeo, G. Epifani, A. Passaseo, M. Braccini, M.C. Larciprete, C. Sibilia, F.A. Bovino
arXiv_CV
arXiv_CV
Enhancement
Optical_Character
Pose
Face
GAN
PDF
-
Optical characterization of GaN by N+ implantation into GaAs at elevated temperature
S. Dhara, P. Magudapathy, R. Kesavamoorthy, S. Kalavathi, K. G. M. Nair, G. M. Hsu, L. C. Chen, K. H. Chen, K. Santhakumar, T. Soga
arXiv_CV
arXiv_CV
Optical_Character
GAN
PDF