OCR
OCR
2022-06-27
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition
Xu Yang, Daoyuan Wu, Xiao Yi, Jimmy H. M. Lee, Tan Lee
arXiv_CV
arXiv_CV
Face_Detection
Recognition
OCR
Optimization
Optical_Character
Pose
Face
Detection
Face_Recognition
PDF
2022-06-27
Differentially Private Condorcet Voting
Zhechen Li, Ao Liu, Lirong Xia, Yongzhi Cao, Hanpin Wang
arXiv_AI
arXiv_AI
OCR
Pose
Relation
PDF
2022-06-26
FAIR-BFL: Flexible and Incentive Redesign for Blockchain-based Federated Learning
Rongxin Xu, Shiva Raj Pokhrel, Qiujun Lan, Gang Li
arXiv_AI
arXiv_AI
OCR
PDF
2022-06-24
Dissecting U-net for Seismic Application: An In-Depth Study on Deep Learning Multiple Removal
Ricard Durall, Ammar Ghanim, Norman Ettrich, Janis Keuper
arXiv_CV
arXiv_CV
OCR
Knowledge
Deep_Learning
PDF
2022-06-21
An Automatic and Efficient BERT Pruning for Edge AI Systems
Shaoyi Huang, Ning Liu, Yueying Liang, Hongwu Peng, Hongjia Li, Dongkuan Xu, Mimi Xie, Caiwen Ding
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Pose
Deep_Learning
Relation
Inference
PDF
2022-06-21
Towards Optimizing OCR for Accessibility
Peya Mowar, Tanuja Ganu, Saikat Guha
arXiv_CV
arXiv_CV
OCR
Speech
PDF
2022-06-21
Broken News: Making Newspapers Accessible to Print-Impaired
Vishal Agarwal, Tanuja Ganu, Saikat Guha
arXiv_CV
arXiv_CV
Segmentation
OCR
Pose
Detection
PDF
2022-06-18
Camera Adaptation for Fundus-Image-Based CVD Risk Estimation
Zhihong Lin, Danli Shi, Donghao Zhang, Xianwen Shang, Mingguang He, Zongyuan Ge
arXiv_CV
arXiv_CV
OCR
Knowledge
Pose
Deep_Learning
Attention
PDF
2022-06-14
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu, Chao Wang, Wenqiang Lei, Ziyang Liu, Tat Seng Chua
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
Face
Action
Detection
Object_Detection
Attention
Prediction
PDF
2022-06-12
An Unsupervised Deep-Learning Method for Bone Age Assessment
Hao Zhu, Wan-Jing Nie, Yue-Jie Hou, Qi-Meng Du, Si-Jing Li, Chi-Chun Zhou
arXiv_CV
arXiv_CV
Unsupervised
OCR
Knowledge
Pose
Classification
CNN
PDF
2022-06-11
An Evaluation of OCR on Egocentric Data
Valentin Popescu, Dima Damen, Toby Perrett
arXiv_CV
arXiv_CV
OCR
PDF
2022-06-10
Human-AI Interaction Design in Machine Teaching
Karan Taneja, Harshvardhan Sikka, Ashok Goel
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Face
Action
PDF
2022-06-09
Transformer based Urdu Handwritten Text Optical Character Reader
Mohammad Daniyal Shaiq, Musa Dildar Ahmed Cheema, Ali Kamal
arXiv_AI
arXiv_AI
Transformer
Handwriting
OCR
Optical_Character
Pose
Action
PDF
2022-06-07
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Self-Supervised
Pose
Detection
Object_Detection
Attention
Inference
PDF
2022-06-06
Contrastive Graph Multimodal Model for Text Classification in Videos
Ye Liu, Changchong Lu, Chen Lin, Di Yin, Bo Ren
arXiv_CV
arXiv_CV
Video_Indexing
Recognition
OCR
Text_Classification
Knowledge
Contrastive_Learning
Action
Classification
Relation
PDF
2022-06-05
Two Decades of Bengali Handwritten Digit Recognition: A Survey
A.B.M. Ashikur Rahman, Md. Bakhtiar Hasan, Sabbir Ahmed, Tasnim Ahmed, Md. Hamjajul Ashmafee, Mohammad Ridwan Kabir, Md. Hasanul Kabir
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Review
Survey
Deep_Learning
PDF
2022-06-04
A Superimposed Divide-and-Conquer Image Recognition Method for SEM Images of Nanoparticles on The Surface of Monocrystalline silicon with High Aggregation Degree
Ruiling Xiao, Jiayang Niu
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Face
Contour
PDF
2022-06-03
Beyond Tabula Rasa: Reincarnating Reinforcement Learning
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc G. Bellemare
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Knowledge
Pose
PDF
2022-06-01
Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness
Christoph Auer (1), Michele Dolfi (1), André Carvalho (2), Cesar Berrospi Ramis (1), Peter W. J. Staar (1) ((1) IBM Research, (2) SoftINSA Lda.)
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Knowledge
Face
PDF
2022-06-01
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Self-Supervised
Pose
Language_Model
PDF
2022-05-31
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Stefan Schweter, Luisa März, Katharina Schmid, Erion Çano
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
GAN
Language_Model
PDF
2022-05-30
Easter2.0: Improving convolutional models for handwritten text recognition
Kartik Chaudhary, Raghav Bali
arXiv_AI
arXiv_AI
Transformer
Handwriting
Recognition
OCR
RNN
Pose
Classification
Few-Shot
CNN
PDF
2022-05-27
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
Video_Caption
OCR
Optical_Character
Scene_Text
Classification
Detection
Object_Detection
Caption
Image_Classification
Language_Model
PDF
2022-05-25
Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction
Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-05-25
DisinfoMeme: A Multimodal Dataset for Detecting Meme Intentionally Spreading Out Disinformation
Jingnong Qu, Liunian Harold Li, Jieyu Zhao, Sunipa Dev, Kai-Wei Chang
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Action
GAN
PDF
2022-05-25
Skin Cancer Diagnostics with an All-Inclusive Smartphone Application
Upender Kalwa, Christopher Legner, Taejoon Kong, Santosh Pandey
arXiv_CV
arXiv_CV
Segmentation
OCR
Classification
Detection
Medical
PDF
2022-05-23
Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Tuan Dinh, Jy-yong Sohn, Shashank Rajput, Timothy Ossowski, Yifei Ming, Junjie Hu, Dimitris Papailiopoulos, Kangwook Lee
arXiv_CL
arXiv_CL
Embedding
Unsupervised
OCR
Pose
PDF
2022-05-23
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition
Md. Ismail Hossain, Mohammed Rakib, Sabbir Mollah, Fuad Rahman, Nabeel Mohammed
arXiv_AI
arXiv_AI
Handwriting
Recognition
OCR
RNN
Optical_Character
Knowledge
CNN
PDF
2022-05-21
Improving Long Tailed Document-Level Relation Extraction via Easy Relation Augmentation and Contrastive Learning
Yangkai Du, Tengfei Ma, Lingfei Wu, Yiming Wu, Xuhong Zhang, Bo Long, Shouling Ji
arXiv_AI
arXiv_AI
OCR
Pose
Contrastive_Learning
Action
Relation
Relation_Extraction
PDF
2022-05-17
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong, Mingyuan Zhang, Liang Pan, Zhongang Cai, Lei Yang, Ziwei Liu
arXiv_CV
arXiv_CV
OCR
3D
Zero-Shot
Knowledge
Pose
Quantitative
Language_Model
PDF
2022-05-17
Detection Masking for Improved OCR on Noisy Documents
Daniel Rotman, Ophir Azulai, Inbar Shapira, Yevgeny Burshtein, Udi Barzelay
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Detection
PDF
2022-05-13
An empirical study of CTC based models for OCR of Indian languages
Minesh Mathew, CV Jawahar
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Classification
Prediction
PDF
2022-05-13
The Case for a Legal Compliance API for the Enforcement of the EU's Digital Services Act on Social Media Platforms
Catalina Goanta, Thales Bertaglia, Adriana Iamnitchi
arXiv_AI
arXiv_AI
OCR
Pose
Face
PDF
2022-05-12
AiSocrates: Towards Answering Ethical Quandary Questions
Yejin Bang, Nayeon Lee, Tiezheng Yu, Leila Khalatbari, Yan Xu, Dan Su, Elham J. Barezi, Andrea Madotto, Hayden Kee, Pascale Fung
arXiv_AI
arXiv_AI
OCR
Pose
Few-Shot
Language_Model
PDF
2022-05-11
Pre-trained Language Models as Re-Annotators
Chang Shu
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Contrastive_Learning
Action
Detection
Relation
Relation_Extraction
Language_Model
PDF
2022-05-09
A Novel Augmented Reality Ultrasound Framework Using an RGB-D Camera and a 3D-printed Marker
Yitian Zhou, Gaétan Lelu, Boris Labbé, Guillaume Pasquier, Pierre Le Gargasson, Albert Murienne, Laurent Launay
arXiv_CV
arXiv_CV
Tracking
Point_Cloud
OCR
3D
Pose
Medical
PDF
2022-05-06
Rethinking Fairness: An Interdisciplinary Survey of Critiques of Hegemonic ML Fairness Approaches
Lindsay Weinberg
arXiv_AI
arXiv_AI
OCR
Survey
Action
Classification
GAN
PDF
2022-05-06
Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language
Martin Malmsten, Chris Haffenden, Love Börjeson
arXiv_CL
arXiv_CL
Recognition
OCR
Speech
Speech_Recognition
Language_Model
PDF
2022-05-05
RoboCraft: Learning to See, Simulate, and Shape Elasto-Plastic Objects with Graph Networks
Haochen Shi, Huazhe Xu, Zhiao Huang, Yunzhu Li, Jiajun Wu
arXiv_AI
arXiv_AI
OCR
Pose
Action
PDF
2022-05-05
Text Detection on Technical Drawings for the Digitization of Brown-field Processes
Tobias Schlagenhauf, Markus Netzer, Jan Hillinger
arXiv_CV
arXiv_CV
Recognition
OCR
Knowledge
Detection
Object_Detection
Autonomous
PDF
2022-05-05
OCR Synthetic Benchmark Dataset for Indic Languages
Naresh Saini, Promodh Pinto, Aravinth Bheemaraj, Deepak Kumar, Dhiraj Daga, Saurabh Yadav, Srihari Nagaraj
arXiv_CV
arXiv_CV
OCR
PDF
2022-05-05
Relational Representation Learning in Visually-Rich Documents
Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren
arXiv_CL
arXiv_CL
Recognition
OCR
Represenation_Learning
Knowledge
Pose
Contrastive_Learning
Action
Detection
Relation
PDF
2022-05-04
Reproducibility Beyond the Research Community: Experience from NLP Beginners
Shane Storks, Keunwoo Peter Yu, Joyce Chai
arXiv_CL
arXiv_CL
OCR
Attention
PDF
2022-05-04
Few-Shot Document-Level Relation Extraction
Nicholas Popovic, Michael Färber
arXiv_AI
arXiv_AI
OCR
Pose
Action
Relation
Few-Shot
Relation_Extraction
PDF
2022-05-04
Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction
Liyan Xu, Jinho D. Choi
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-05-02
Music Interpretation Analysis. A Multimodal Approach To Score-Informed Resynthesis of Piano Recordings
Federico Simonetta
arXiv_SD
arXiv_SD
OCR
Pose
Action
PDF
2022-04-27
The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild
Spyridon Baxevanakis, Giorgos Kordopatis-Zilos, Panagiotis Galopoulos, Lazaros Apostolidis, Killian Levacher, Ipek B. Schlicht, Denis Teyssou, Ioannis Kompatsiaris, Symeon Papadopoulos
arXiv_CV
arXiv_CV
OCR
Adversarial
Pose
Deep_Learning
Detection
PDF
2022-04-27
Document-Level Relation Extraction with Sentences Importance Estimation and Focusing
Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-04-26
Approach to Predicting News -- A Precise Multi-LSTM Network With BERT
Chia-Lin Chen (1), Pei-Yu Huang (2), Yi-Ting Huang (3), Chun Lin (3) ((1) Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, (2) Management and Digital Innovation, University of London, Singapore, (3) Institute of Information Science, Academia Sinica, Taipei, Taiwan)
arXiv_CL
arXiv_CL
Transformer
Embedding
OCR
Bert
RNN
PDF
2022-04-21
German Parliamentary Corpus
Giuseppe Abrami, Mevlüt Bagci, Leon Hammerla, Alexander Mehler
arXiv_CL
arXiv_CL
OCR
PDF
2022-04-21
A Masked Image Reconstruction Network for Document-level Relation Extraction
Liang Zhang, Yidong Cheng
arXiv_CL
arXiv_CL
Reconstruction
OCR
Pose
Action
Relation
Relation_Extraction
Inference
PDF
2022-04-20
Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations
Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj Doğan, Jingcheng Du, Li Fang, Wang Kai, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, Richard Dufour, Yanis Labrak, Niladri Chatterjee, Kushagri Tandon, Fréjus Laleye, Loïc Rakotoson, Emmanuele Chersoni, Jinghang Gu, Annemarie Friedrich, Subhash Chandra Pujari, Mariia Chizhikova, Naveen Sivadasan, Naveen Sivadasan, Zhiyong Lu
arXiv_CL
arXiv_CL
Transformer
OCR
Review
Classification
GAN
Medical
PDF
2022-04-17
Does Recommend-Revise Produce Reliable Annotations? An Analysis on Missing Instances in DocRED
Quzhe Huang, Shibo Hao, Yuan Ye, Shengqi Zhu, Yansong Feng, Dongyan Zhao
arXiv_CL
arXiv_CL
OCR
Action
Relation
Relation_Extraction
Recommendation
PDF
2022-04-14
Multi-label topic classification for COVID-19 literature with Bioformer
Li Fang, Kai Wang
arXiv_CL
arXiv_CL
OCR
Bert
Classification
PDF
2022-04-06
Data-Centric Green AI: An Exploratory Empirical Study
Roberto Verdecchia, Luís Cruz, June Sallou, Michelle Lin, James Wickenden, Estelle Hotellier
arXiv_AI
arXiv_AI
OCR
PDF
2022-04-05
Region Rebalance for Long-Tailed Semantic Segmentation
Jiequan Cui, Yuhui Yuan, Zhisheng Zhong, Zhuotao Tian, Han Hu, Stephen Lin, Jiaya Jia
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
PDF
2022-04-03
A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus
Seth Kulick, Neville Ryant, Beatrice Santorini, Joel Wallenberg
arXiv_CL
arXiv_CL
Embedding
OCR
Knowledge
Speech
Pose
Relation
PDF
2022-04-03
A sequence-to-sequence approach for document-level relation extraction
John Giorgi, Gary D. Bader, Bo Wang
arXiv_CL
arXiv_CL
OCR
Action
Relation
Relation_Extraction
Medical
PDF
2022-04-01
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng, Adrian Wong, Stefan Welker, Krzysztof Choromanski, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
arXiv_AI
arXiv_AI
Image_Caption
OCR
Zero-Shot
Knowledge
Pose
Caption
Language_Model
PDF
2022-04-01
Unitail: Detecting, Reading, and Matching in Retail Scene
Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides
arXiv_CV
arXiv_CV
OCR
Detection
Object_Detection
Matching
PDF
2022-03-31
Digitizing Historical Balance Sheet Data: A Practitioner's Guide
Sergio Correia, Stephan Luck
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
PDF
2022-03-30
Automatic Facial Skin Feature Detection for Everyone
Qian Zheng, Ankur Purwar, Heng Zhao, Guang Liang Lim, Ling Li, Debasish Behera, Qian Wang, Min Tan, Rizhao Cai, Jennifer Werner, Dennis Sng, Maurice van Steensel, Weisi Lin, Alex C Kot
arXiv_CV
arXiv_CV
OCR
Detection
Recommendation
PDF
2022-03-26
A Densely Connected Criss-Cross Attention Network for Document-level Relation Extraction
Liang Zhang, Yidong Cheng
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Attention
PDF
2022-03-25
Quantifying Demonstration Quality for Robot Learning and Generalization
Maram Sakr, Zexi Jesse Li, H. F. Machiel Van der Loos, Dana Kulic, Elizabeth A. Croft
arXiv_RO
arXiv_RO
OCR
Pose
Relation
PDF
2022-03-25
Plagiarism Detection in the Bengali Language: A Text Similarity-Based Approach
Satyajit Ghosh, Aniruddha Ghosh, Bittaswer Ghosh, Abhishek Roy
arXiv_CL
arXiv_CL
OCR
Pose
Action
Detection
PDF
2022-03-25
Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
OCR
Contrastive_Learning
PDF
2022-03-24
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Chengyang Fang, Gangyan Zeng, Yu Zhou, Daiqing Wu, Can Ma, Dayong Hu, Weiping Wang
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
2022-03-21
Transformer-based HTR for Historical Documents
Phillip Benjamin Ströbel, Simon Clematide, Martin Volk, Tobias Hodel
arXiv_CV
arXiv_CV
Transformer
Transfer_Learning
OCR
PDF
2022-03-21
Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation
Qingyu Tan, Ruidan He, Lidong Bing, Hwee Tou Ng
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Action
Relation
Relation_Extraction
Attention
PDF
2022-03-20
Who will share Fake-News on Twitter? Psycholinguistic cues in online post histories discriminate Between actors in the misinformation ecosystem
Verena Schoenmueller, Simon J. Blanchard, Gita V. Johar
arXiv_CL
arXiv_CL
OCR
Emotion
Classification
Prediction
PDF
2022-03-20
Document Dewarping with Control Points
Guo-Wang Xie, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu
arXiv_CV
arXiv_CV
OCR
Sparse
Pose
Action
PDF
2022-03-15
Revitalize Region Feature for Democratizing Video-Language Pre-training
Guanyu Cai, Yixiao Ge, Alex Jinpeng Wang, Rui Yan, Xudong Lin, Ying Shan, Lianghua He, Xiaohu Qie, Jianping Wu, Mike Zheng Shou
arXiv_CV
arXiv_CV
OCR
Sparse
Regularization
Relation
Video_Retrieval
PDF
2022-03-14
CAR: Class-aware Regularizations for Semantic Segmentation
Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Xiangjian He, Linchao Bao
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Represenation_Learning
Regularization
Pose
Inference
Prediction
PDF
2022-03-14
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang
arXiv_CL
arXiv_CL
Transformer
Embedding
OCR
Pose
PDF
2022-03-11
Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision
Yufeng Cui, Lichen Zhao, Feng Liang, Yangguang Li, Jing Shao
arXiv_CV
arXiv_CV
Transformer
OCR
Pose
CNN
PDF
2022-03-11
Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection
Siyue Yu, Jimin Xiao, Bingfeng Zhang, Eng Gee Lim
arXiv_CV
arXiv_CV
Enhancement
OCR
Salient
Pose
Contrastive_Learning
Classification
Detection
Object_Detection
Attention
Prediction
PDF
2022-03-08
Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting
Chuhui Xue, Yu Hao, Shijian Lu, Philip Torr, Song Bai
arXiv_CV
arXiv_CV
Recognition
OCR
Weakly_Supervised
Optical_Character
Pose
Scene_Text
Action
Detection
PDF
2022-03-04
OCR quality affects perceived usefulness of historical newspaper clippings -- a user study
Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Knowledge
Face
PDF
2022-03-03
A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions
Francois St-Hilaire, Dung Do Vu, Antoine Frau, Nathan Burns, Farid Faraji, Joseph Potochny, Stephane Robert, Arnaud Roussel, Selene Zheng, Taylor Glazier, Junfel Vincent Romano, Robert Belfer, Muhammad Shayan, Ariella Smofsky, Tommy Delarosbil, Seulmin Ahn, Simon Eden-Walker, Kritika Sony, Ansona Onyi Ching, Sabina Elkins, Anush Stepanyan, Adela Matajova, Victor Chen, Hossein Sahraei, Robert Larson, Nadia Markova, Andrew Barkett, Laurent Charlin, Yoshua Bengio, Iulian Vlad Serban, Ekaterina Kochmar
arXiv_AI
arXiv_AI
OCR
Action
PDF
2022-03-02
Foundations for Grassroots Democratic Metaverse
Nimrod Talmon, Ehud Shapiro
arXiv_AI
arXiv_AI
OCR
Face
Autonomous
PDF
2022-03-02
TableFormer: Table Structure Understanding with Transformers
Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar
arXiv_CV
arXiv_CV
Transformer
OCR
RNN
Knowledge
Knowledge_Graph
Action
Deep_Learning
Detection
Object_Detection
GAN
PDF
2022-03-02
Centralized Fairness for Redistricting
Seyed A. Esmaeili, Hayley Grape, Brian Brubach
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-03-01
Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection
Yufei Liang, Jiangning Zhang, Shiwei Zhao, Runze Wu, Yong Liu, Shuwen Pan
arXiv_CV
arXiv_CV
Reconstruction
Unsupervised
OCR
Restoration
Pose
Action
Classification
Detection
Relation
GAN
PDF
2022-02-27
OCR Improves Machine Translation for Low-Resource Languages
Oana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán
arXiv_CL
arXiv_CL
OCR
PDF
2022-02-25
OCR-IDL: OCR Annotations for Industry Document Library Dataset
Ali Furkan Biten, Rubèn Tito, Lluis Gomez, Ernest Valveny, Dimosthenis Karatzas
arXiv_CV
arXiv_CV
OCR
Pose
PDF
2022-02-25
Improving Amharic Handwritten Word Recognition Using Auxiliary Task
Mesay Samuel Gondere, Lars Schmidt-Thieme, Durga Prasad Sharma, Abiot Sinamo Boltena
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Classification
Deep_Learning
CNN
PDF
2022-02-24
Design and Characterization of 3D Printed, Open-Source Actuators for Legged Locomotion
Karthik Urs, Challen Enninful Adu, Elliott J. Rouse, Talia Y. Moore
arXiv_RO
arXiv_RO
OCR
3D
Optimization
PDF
2022-02-22
CorefDRE: Document-level Relation Extraction with coreference resolution
Zhongxuan Xue, Rongzhen Li, Qizhu Dai, Zhong Jiang
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Attention
Inference
PDF
2022-02-18
BLPnet: A new DNN model and Bengali OCR engine for Automatic License Plate Recognition
Md. Saif Hassan Onim, Hussain Nyeem, Koushik Roy, Mahmudul Hasan, Abtahi Ishmam, Md. Akiful Hoque Akif, Tareque Bashar Ovi
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Detection
Attention
PDF
2022-02-18
SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions
Ripon K. Saha, Akira Ura, Sonal Mahajan, Chenguang Zhu, Linyi Li, Yang Hu, Hiroaki Yoshida, Sarfraz Khurshid, Mukul R. Prasad
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-02-18
How to Manage Tiny Machine Learning at Scale: An Industrial Perspective
Haoyu Ren, Darko Anicic, Thomas Runkler
arXiv_AI
arXiv_AI
OCR
Knowledge
Knowledge_Graph
Pose
Ontology
PDF
2022-02-17
CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving
Yinuo Zhao, Kun Wu, Zhiyuan Xu, Zhengping Che, Qi Lu, Jian Tang, Chi Harold Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Optimization
Relation
Attention
Autonomous
PDF
2022-02-16
ADIMA: Abuse Detection In Multilingual Audio
Vikram Gupta, Rini Sharon, Ramit Sawhney, Debdoot Mukherjee
arXiv_CL
arXiv_CL
Recognition
OCR
Zero-Shot
Speech
Pose
Quantitative
Detection
Speech_Recognition
PDF
2022-02-15
Shifting Trends of COVID-19 Tweet Sentiment with Respect to Voting Preferences in the 2020 Election Year of the United States
Megan Doman, Jacob Motley, Hong Qin, Mengjun Xie, Li Yang
arXiv_CL
arXiv_CL
OCR
Relation
Sentiment
PDF
2022-02-13
Omnifont Persian OCR System Using Primitives
Azarakhsh Keipour, Mohammad Eshghi, Sina Mohammadzadeh Ghadikolaei, Negin Mohammadi, Shahab Ensafi
arXiv_AI
arXiv_AI
Recognition
OCR
PDF
2022-02-12
State of AI Ethics Report
Abhishek Gupta (1, 2, 3), Connor Wright (1, 4), Marianna Bergamaschi Ganapini (1, 5), Masa Sweidan (1), Renjie Butalid (1) ((1) Montreal AI Ethics Institute, (2) Microsoft, (3) Green Software Foundation, (4) University of Exeter, (5) Union College)
arXiv_AI
arXiv_AI
OCR
Salient
PDF
2022-02-08
Tube-Balloon Logic for the Exploration of Fluidic Control Elements
Jovanna A. Tracz, Lukas Wille, Dylan Pathiraja, Savita V. Kendre, Ron Pfisterer, Ethan Turett, Gus T. Teran, Christoffer K. Abrahamsson, Samuel E. Root, Won-Kyu Lee, Daniel J. Preston, Haihui Joy Jiang, George M. Whitesides, Markus P. Nemitz
arXiv_RO
arXiv_RO
OCR
Autonomous
PDF
2022-02-06
Human rights, democracy, and the rule of law assurance framework for AI systems: A proposal
David Leslie, Christopher Burr, Mhairi Aitken, Michael Katell, Morgan Briggs, Cami Rincon
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-02-03
DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts
Wenzhen Zhu, Negin Sokhandan, Guang Yang, Sujitha Martin, Suchitra Sathyanarayana
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
PDF
2022-02-02
DCSAU-Net: A Deeper and More Compact Split-Attention U-Net for Medical Image Segmentation
Qing Xu, Wenting Duan, Na He
arXiv_CV
arXiv_CV
Segmentation
OCR
Pose
Attention
Medical
PDF
2022-01-28
Detection of fake faces in videos
M. Shamanth, Russel Mathias, Dr Vijayalakshmi MN
arXiv_CV
arXiv_CV
OCR
Adversarial
Face
Deep_Learning
Detection
GAN
PDF
2022-01-27
Human-centered mechanism design with Democratic AI
Raphael Koster, Jan Balaguer, Andrea Tacchetti, Ari Weinstein, Tina Zhu, Oliver Hauser, Duncan Williams, Lucy Campbell-Gillingham, Phoebe Thacker, Matthew Botvinick, Christopher Summerfield
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
PDF
2022-01-26
Continuous Examination by Automatic Quiz Assessment Using Spiral Codes and Image Processing
Fernando Alonso-Fernandez, Josef Bigun
arXiv_CV
arXiv_CV
OCR
PDF
2022-01-26
An Assessment of the Impact of OCR Noise on Language Models
Konstantin Todorov, Giovanni Colavizza
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Optical_Character
Pose
Language_Model
PDF
2022-01-26
The Norwegian Parliamentary Speech Corpus
Per Erik Solberg, Pablo Ortiz
arXiv_SD
arXiv_SD
Recognition
OCR
Speech
Speech_Recognition
PDF
2022-01-25
A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit Recognition
Md. Ferdous Wahid, Md. Fahim Shahriar, Md. Shohanur Islam Sobuj
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Action
Classification
PDF
2022-01-21
Classroom Slide Narration System
Jobin K.V., Ajoy Mondal, C. V. Jawahar
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Recognition
OCR
Optical_Character
Pose
Face
Classification
PDF
2022-01-18
Improve Sentence Alignment by Divide-and-conquer
Wu Zhang
arXiv_CL
arXiv_CL
Embedding
OCR
PDF
2022-01-13
Document-level Relation Extraction with Context Guided Mention Integration and Inter-pair Reasoning
Chao Zhao, Daojian Zeng, Lu Xu, Jianhua Dai
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-01-06
DReyeVR: Democratizing driving simulation in virtual reality for behavioural & interaction research
Gustavo Silvera, Abhijat Biswas, Henny Admoni
arXiv_AI
arXiv_AI
Tracking
OCR
Face
Action
Autonomous
PDF
2022-01-02
On the Cross-dataset Generalization for License Plate Recognition
Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
2021-12-23
ELSA: Enhanced Local Self-Attention for Vision Transformer
Jingkai Zhou, Pichao Wang, Fan Wang, Qiong Liu, Hao Li, Rong Jin
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
Pose
Attention
PDF
2021-12-23
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
arXiv_CV
arXiv_CV
Transformer
OCR
Pose
Scene_Text
Detection
VQA
Object_Detection
QA
PDF
2021-12-15
Lesan -- Machine Translation for Low Resource Languages
Asmelash Teka Hadgu, Abel Aregawi, Adam Beaudoin
arXiv_CL
arXiv_CL
Transformer
OCR
PDF
2021-12-15
Tracing Text Provenance via Context-Aware Lexical Substitution
Xi Yang, Jie Zhang, Kejiang Chen, Weiming Zhang, Zehua Ma, Feng Wang, Nenghai Yu
arXiv_CL
arXiv_CL
Transformer
OCR
Bert
Pose
Language_Model
PDF
2021-12-09
BLPnet: A New DNN model for Automatic License Plate Detection with Bengali OCR
Md Saif Hassan Onim, Hussain Nyeem, Koushik Roy, Mahmudul Hasan, Abtahi Ishmam, Md. Akiful Hoque Akif, Tareque Bashar Ovi
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Detection
PDF
2021-12-06
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu, Chenguang Zhu, Shuohang Wang, Siqi Sun, Hao Cheng, Xiaodong Liu, Jianfeng Gao, Pengcheng He, Michael Zeng, Xuedong Huang
arXiv_AI
arXiv_AI
Transformer
OCR
Knowledge
Pose
Attention
Prediction
QA
PDF
2021-12-06
Requirements for Open Political Information: Transparency Beyond Open Data
Andong Luis Li Zhao, Andrew Paley, Rachel Adler, Harper Pack, Sergio Servantez, Alexander Einarsson, Cameron Barrie, Marko Sterbentz, Kristian Hammond
arXiv_AI
arXiv_AI
OCR
Sketch
Knowledge
PDF
2021-12-06
A Survey on Deep learning based Document Image Enhancement
Zahra Anvari, Vassilis Athitsos
arXiv_CV
arXiv_CV
Enhancement
Recognition
OCR
Restoration
Optical_Character
Review
Pose
Image_Enhancement
Survey
Action
Deep_Learning
Denoising
Attention
PDF
2021-12-03
Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution
Sahan Bulathwela, María Pérez-Ortiz, Catherine Holloway, John Shawe-Taylor
arXiv_AI
arXiv_AI
OCR
PDF
2021-12-03
ROCA: Robust CAD Model Retrieval and Alignment from a Single Image
Can Gümeli, Angela Dai, Matthias Nießner
arXiv_CV
arXiv_CV
OCR
3D
Optimization
PDF
2021-12-03
An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images
Zekun Li, Yao-Yi Chiang, Sasan Tavakkol, Basel Shbita, Johannes H. Uhl, Stefan Leyk, Craig A. Knoblock
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
PDF
2021-12-01
On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification
Rutika Moharir, Arun D Prabhu, Sukumar Moharana, Gopi Ramena, Rachit S Munjal
arXiv_CV
arXiv_CV
Image_Caption
OCR
RNN
Scene_Text
Classification
Attention
CNN
Inference
PDF
2021-11-30
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
Sahar Abdelnabi, Rakibul Hasan, Mario Fritz
arXiv_CV
arXiv_CV
Image_Caption
OCR
Pose
Caption
PDF
2021-11-30
Donut: Document Understanding Transformer without OCR
Geewook Kim, Teakgyu Hong, Moonbin Yim, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
2021-11-30
Automatic Extraction of Medication Names in Tweets as Named Entity Recognition
Carol Anderson, Bo Liu, Anas Abidin, Hoo-Chang Shin, Virginia Adams
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Action
Classification
Medical
Language_Model
Prediction
PDF
2021-11-30
Chemical Identification and Indexing in PubMed Articles via BERT and Text-to-Text Approaches
Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin
arXiv_CL
arXiv_CL
Embedding
Recognition
OCR
Bert
Language_Model
PDF
2021-11-30
Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models
Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin
arXiv_CL
arXiv_CL
OCR
Bert
Action
Classification
Relation
Relation_Extraction
PDF
2021-11-28
Image preprocessing and modified adaptive thresholding for improving OCR
Rohan Lal Kshetry
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2021-11-26
BCH-NLP at BioCreative VII Track 3: medications detection in tweets using transformer networks and multi-task learning
Dongfang Xu, Shan Chen, Timothy Miller
arXiv_CL
arXiv_CL
Transformer
OCR
Text_Classification
Action
Classification
Detection
PDF
2021-11-26
When Creators Meet the Metaverse: A Survey on Computational Arts
Lik-Hang Lee, Zijun Lin, Rui Hu, Zhengya Gong, Abhishek Kumar, Tangyao Li, Sijia Li, Pan Hui
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Survey
PDF
2021-11-25
Unravelling multi-agent ranked delegations
Rachael Colley, Umberto Grandi, Arianna Novaro
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-11-25
Does constituency analysis enhance domain-specific pre-trained BERT models for relation extraction?
Anfu Tang (LISN), Louise Deléger, Robert Bossy, Pierre Zweigenbaum (LISN), Claire Nédellec
arXiv_AI
arXiv_AI
OCR
Bert
Pose
Action
Relation
Relation_Extraction
Prediction
PDF
2021-11-22
Ice hockey player identification via transformers
Kanav Vats, William McNally, Pascale Walters, David A. Clausi, John S. Zelek
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Action
PDF
2021-11-20
Improving Tagging Consistency and Entity Coverage for Chemical Identification in Full-text Articles
Hyunjae Kim, Mujeen Sung, Wonjin Yoon, Sungjoon Park, Jaewoo Kang
arXiv_CL
arXiv_CL
Recognition
OCR
PDF
2021-11-17
Discriminative Dictionary Learning based on Statistical Methods
G.Madhuri, Atul Negi
arXiv_CV
arXiv_CV
Reconstruction
Inpainting
OCR
Sparse
Review
Classification
Denoising
PDF
2021-11-17
Using Sampling to Estimate and Improve Performance of Automated Scoring Systems with Guarantees
Yaman Kumar Singla, Sriram Krishna, Rajiv Ratn Shah, Changyou Chen
arXiv_CL
arXiv_CL
OCR
Speech
Pose
PDF
2021-11-16
An AI-based Learning Companion Promoting Lifelong Learning Opportunities for All
Maria Perez-Ortiz, Erik Novak, Sahan Bulathwela, John Shawe-Taylor
arXiv_AI
arXiv_AI
OCR
PDF
2021-11-15
DFC: Deep Feature Consistency for Robust Point Cloud Registration
Zhu Xu, Zhengyao Bai, Huijie Liu, Qianjie Lu, Shenglan Fan
arXiv_CV
arXiv_CV
Segmentation
Point_Cloud
OCR
3D
Pose
Classification
Deep_Learning
Matching
PDF
2021-11-12
DriverGym: Democratising Reinforcement Learning for Autonomous Driving
Parth Kothari, Christian Perone, Luca Bergamini, Alexandre Alahi, Peter Ondruska
arXiv_CV
arXiv_CV
Reinforcement_Learning
OCR
Pose
Autonomous
PDF
2021-11-12
Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models
Igor Kulev, Berkay Köprü, Raul Rodriguez-Esteban, Diego Saldana, Yi Huang, Alessandro La Torraca, Elif Ozkirimli
arXiv_CL
arXiv_CL
OCR
Pose
Action
Language_Model
PDF
2021-11-12
A comprehensive study of clustering a class of 2D shapes
Agnieszka Kaliszewska, Monika Syga
arXiv_CV
arXiv_CV
OCR
3D
Pose
Contour
PDF
2021-11-11
CU-UD: text-mining drug and chemical-protein interactions with ensembles of BERT-based models
Mehmet Efruz Karabulut, K. Vijay-Shanker, Yifan Peng
arXiv_AI
arXiv_AI
OCR
Bert
Action
Relation
Language_Model
PDF
2021-11-11
Indian Licence Plate Dataset in the wild
Sanchit Tanwar, Ayush Tiwari, Ritesh Chowdhry
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
Detection
Object_Detection
PDF
2021-11-11
Yaw-Guided Imitation Learning for Autonomous Driving in Urban Environments
Yandong Liu, Chengzhong Xu, Hui Kong
arXiv_RO
arXiv_RO
OCR
Pose
Relation
Attention
Autonomous
PDF
2021-11-10
BagBERT: BERT-based bagging-stacking for multi-topic classification
Loïc Rakotoson, Charles Letaillieur, Sylvain Massip, Fréjus Laleye
arXiv_CL
arXiv_CL
Embedding
OCR
Bert
Knowledge
Pose
Classification
PDF
2021-11-10
Handwritten Digit Recognition Using Improved Bounding Box Recognition Technique
Arkaprabha Basu, M. Sathya
arXiv_CV
arXiv_CV
Recognition
OCR
Optimization
Optical_Character
Prediction
PDF
2021-11-08
Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization
Lei Sun
arXiv_CV
arXiv_CV
Point_Cloud
OCR
3D
Pose
Matching
PDF
2021-11-07
A Word on Machine Ethics: A Response to Jiang et al.
Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-11-07
NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework
Xingcheng Yao, Yanan Zheng, Xiaocong Yang, Zhilin Yang
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Classification
Language_Model
PDF
2021-11-04
Whistleblower protection in the digital age -- why 'anonymous' is not enough. Towards an interdisciplinary view of ethical dilemmas
Bettina Berendt, Stefan Schiffner
arXiv_AI
arXiv_AI
OCR
Face
Relation
GAN
Activity
PDF
2021-11-04
Lexically Aware Semi-Supervised Learning for OCR Post-Correction
Shruti Rijhwani, Daisy Rosenblum, Antonios Anastasopoulos, Graham Neubig
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
Language_Model
PDF
2021-11-03
A PubMedBERT-based Classifier with Data Augmentation Strategy for Detecting Medication Mentions in Tweets
Qing Han, Shubo Tian, Jinfeng Zhang
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
GAN
PDF
2021-11-03
Curriculum Offline Imitation Learning
Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Action
PDF
2021-11-02
Graph Tree Deductive Networks
Seokjun Kim, Jaeeun Jang, Hyeoncheol Kim
arXiv_AI
arXiv_AI
OCR
Relation
PDF
2021-10-31
R-BERT-CNN: Drug-target interactions extraction from biomedical literature
Jehad Aldahdooh, Ziaurrehman Tanoli, Jing Tang
arXiv_AI
arXiv_AI
OCR
Bert
Knowledge
Action
Deep_Learning
Relation
Medical
CNN
Language_Model
PDF
2021-10-28
DocScanner: Robust Document Image Rectification with Progressive Learning
Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, Houqiang Li
arXiv_CV
arXiv_CV
OCR
3D
Regularization
Pose
Quantitative
Inference
PDF
2021-10-25
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
Pose
Attention
PDF
2021-10-25
Ultra Light OCR Competition Technical Report
Shuhan Zhang, Yuxin Zou, Tianhe Wang, Yichao Xiong
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Scene_Text
GAN
PDF
2021-10-22
Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts
Allen Kim, Charuta Pethe, Naoya Inoue, Steve Skiena
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Detection
Relation
Language_Model
PDF
2021-10-21
HENet: Forcing a Network to Think More for Font Recognition
Jingchao Chen, Shiyi Mu, Shugong Xu, Youdong Ding
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Action
Inference
PDF
2021-10-18
Newsalyze: Effective Communication of Person-Targeting Biases in News Articles
Felix Hamborg, Kim Heinser, Anastasia Zhukova, Karsten Donnay, Bela Gipp
arXiv_AI
arXiv_AI
OCR
Review
PDF
2021-10-16
Learning UI Navigation through Demonstrations composed of Macro Actions
Wei Li
arXiv_AI
arXiv_AI
OCR
Pose
Action
Detection
PDF
2021-10-16
PAGnol: An Extra-Large French Generative Model
Julien Launay, E.L. Tommasone, Baptiste Pannier, François Boniface, Amélie Chatelain, Alessandro Cappelli, Iacopo Poli, Djamé Seddah
arXiv_CL
arXiv_CL
OCR
Bert
Summarization
PDF
2021-10-16
BAPGAN: GAN-based Bone Age Progression of Femur and Phalange X-ray Images
Shinji Nakazawa, Changhee Han, Joe Hasei, Ryuichi Nakahara, Toshifumi Ozaki
arXiv_CV
arXiv_CV
Embedding
OCR
Knowledge
Adversarial
Pose
VQA
GAN
CNN
PDF
2021-10-14
Making Document-Level Information Extraction Right for the Right Reasons
Liyan Tang, Dhruv Rajan, Suyash Mohan, Abhijeet Pradhan, R. Nick Bryan, Greg Durrett
arXiv_AI
arXiv_AI
OCR
Action
Relation
Inference
PDF
2021-10-13
An algorithm for a fairer and better voting system
Gabriel-Claudiu Grama
arXiv_AI
arXiv_AI
OCR
PDF
2021-10-13
Optical Character Recognition of 19th Century Classical Commentaries: the Current State of Affairs
Matteo Romanello, Sven Najem-Meyer, Bruce Robertson
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Face
PDF
2021-10-12
On the Security Risks of AutoML
Ren Pang, Zhaohan Xi, Shouling Ji, Xiapu Luo, Ting Wang
arXiv_CV
arXiv_CV
NAS
OCR
Adversarial
Relation
PDF
2021-10-08
Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks
Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong, Ran Xu
arXiv_AI
arXiv_AI
Transformer
OCR
Pose
Action
Recommendation
PDF
2021-10-08
Generational Frameshifts in Technology: Computer Science and Neurosurgery, The VR Use Case
Samuel R. Browd, Maya Sharma, Chetan Sharma
arXiv_AI
arXiv_AI
OCR
Action
PDF
2021-10-08
Towards Sample-efficient Apprenticeship Learning from Suboptimal Demonstration
Letian Chen, Rohan Paleja, Matthew Gombolay
arXiv_RO
arXiv_RO
OCR
Self-Supervised
Pose
Relation
PDF
2021-10-08
Machine Learning Featurizations for AI Hacking of Political Systems
Nathan E Sanders, Bruce Schneier
arXiv_AI
arXiv_AI
OCR
Pose
Action
Deep_Learning
PDF
2021-10-08
On the invertibility of a voice privacy system using embedding alignement
Pierre Champion (MULTISPEECH, LIUM), Thomas Thebaud (LIUM), Gaël Le Lan, Anthony Larcher (LIUM), Denis Jouvet (MULTISPEECH)
arXiv_SD
arXiv_SD
Embedding
Unsupervised
OCR
Pose
PDF
2021-10-07
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng
arXiv_CL
arXiv_CL
Segmentation
Recognition
Video_Caption
OCR
Optical_Character
Knowledge
Speech
Pose
Detection
Speech_Recognition
Caption
PDF
2021-10-04
Rerunning OCR -- A Machine Learning Approach to Quality Assessment and Enhancement Prediction
Pit Schneider
arXiv_AI
arXiv_AI
Enhancement
OCR
Prediction
PDF
2021-10-04
An Experimental Evaluation on Deepfake Detection using Deep Face Recognition
Sreeraj Ramachandran, Aakash Varma Nadimpalli, Ajita Rattani
arXiv_AI
arXiv_AI
Recognition
OCR
Face
Classification
Deep_Learning
Detection
Face_Recognition
CNN
PDF
2021-10-02
Asking questions on handwritten document collections
Minesh Mathew, Lluis Gomez, Dimosthenis Karatzas, CV Jawahar
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Pose
VQA
QA
PDF
2021-09-24
SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Yuxin Xiao, Zecheng Zhang, Yuning Mao, Carl Yang, Jiawei Han
arXiv_AI
arXiv_AI
OCR
Pose
Action
Relation
Relation_Extraction
Inference
Prediction
PDF
2021-09-21
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li, Tengchao Lv, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
RNN
Optical_Character
Pose
Text_Generation
Language_Model
PDF
2021-09-18
Atrial Fibrillation: A Medical and Technological Review
Samayan Bhattacharya, Sk Shahnawaz
arXiv_CV
arXiv_CV
OCR
Review
Detection
Relation
Attention
Medical
PDF
2021-09-15
An influencer-based approach to understanding radical right viral tweets
Laila Sprejer, Helen Margetts, Kleber Oliveira, David O'Sullivan, Bertie Vidgen
arXiv_CL
arXiv_CL
OCR
Pose
Attention
PDF
2021-09-14
Learning Bill Similarity with Annotated and Augmented Corpora of Bills
Jiseon Kim, Elden Griggs, In Song Kim, Alice Oh
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Classification
Relation
PDF
2021-09-14
Optimal To-Do List Gamification for Long Term Planning
Saksham Consul, Jugoslav Stojcheski, Valkyrie Felso, Falk Lieder
arXiv_AI
arXiv_AI
OCR
PDF
2021-09-14
Deep learning-based NLP Data Pipeline for EHR Scanned Document Information Extraction
Enshuo Hsu (1, 3, and 4), Ioannis Malagaris (1), Yong-Fang Kuo (1), Rizwana Sultana (2), Kirk Roberts (3) ((1) Office of Biostatistics, (2) Division of Pulmonary, Critical Care and Sleep Medicine, Department of Internal Medicine, University of Texas Medical Branch, Galveston, Texas, USA. (3) School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA. (4) Center for Outcomes Research, Houston Methodist, Houston, TX, USA.)
arXiv_CV
arXiv_CV
Recognition
OCR
Bert
RNN
Optical_Character
Pose
Action
Deep_Learning
Medical
PDF
2021-09-13
Post-OCR Document Correction with large Ensembles of Character Sequence Models
Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Evangelos Milios, Axel J. Soto
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
PDF
2021-09-13
Surveying the Research on Fake News in Social Media: a Tale of Networks and Language
Giancarlo Ruffo (1), Alfonso Semeraro (1), Anastasia Giachanou (2), Paolo Rosso (3) ((1) Università degli Studi di Torino, (2) Utrecht University, (3) Universitat Politècnica de València)
arXiv_CL
arXiv_CL
OCR
Face
Survey
GAN
PDF
2021-09-13
Tamizhi-Net OCR: Creating A Quality Large Scale Tamil-Sinhala-English Parallel Corpus Using Deep Learning Based Printed Character Recognition
Charangan Vasantharajan, Uthayasanker Thayasivam
arXiv_CL
arXiv_CL
Recognition
OCR
RNN
Pose
Action
Deep_Learning
PDF
2021-09-10
FR-Detect: A Multi-Modal Framework for Early Fake News Detection on Social Media Using Publishers Features
Ali Jarrahi, Leila Safari
arXiv_CL
arXiv_CL
OCR
Pose
Detection
Activity
CNN
PDF
2021-09-07
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
Yuning Du, Chenxia Li, Ruoyu Guo, Cheng Cui, Weiwei Liu, Jun Zhou, Bin Lu, Yehua Yang, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Inference
PDF
2021-08-31
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools
Nils Feldhus, Robert Schwarzenberg, Sebastian Möller
arXiv_CL
arXiv_CL
OCR
Knowledge
PDF
2021-08-30
The Application of Convolutional Neural Networks for Tomographic Reconstruction of Hyperspectral Images
Wei-Chih Huang, Mads Svanborg Peters, Mads Juul Ahlebaek, Mads Toudal Frandsen, René Lynge Eriksen, Bjarke Jørgensen
arXiv_CV
arXiv_CV
Reconstruction
OCR
Pose
CNN
PDF
2021-08-30
3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations
Kangxue Yin, Jun Gao, Maria Shugrina, Sameh Khamis, Sanja Fidler
arXiv_AI
arXiv_AI
Reconstruction
Style_Transfer
OCR
3D
Pose
Quantitative
PDF
2021-08-29
A Multimodal Framework for Video Ads Understanding
Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Speech
Attention
Speech_Recognition
Prediction
PDF
2021-08-27
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving
Arjit Sharma, Sahil Sharma
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Action
Autonomous
PDF
2021-08-26
Mining Contextual Information Beyond Image for Semantic Segmentation
Zhenchao Jin, Tao Gong, Dongdong Yu, Qi Chu, Jian Wang, Changhu Wang, Jie Shao
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
Classification
PDF
2021-08-26
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang, Furu Wei
arXiv_CL
arXiv_CL
OCR
Pose
Deep_Learning
Detection
Prediction
PDF
2021-08-22
External Knowledge Augmented Text Visual Question Answering
Arka Ujjal Dey, Ernest Valveny, Gaurav Harit
arXiv_CV
arXiv_CV
Transformer
OCR
Knowledge
Pose
VQA
QA
PDF
2021-08-22
Self-Regulation for Semantic Segmentation
Zhang Dong, Zhang Hanwang, Tang Jinhui, Hua Xiansheng, Sun Qianru
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Classification
PDF
2021-08-20
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Xiaopeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rose
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Relation
VQA
QA
PDF
2021-08-18
End-to-End License Plate Recognition Pipeline for Real-time Low Resource Video Based Applications
Alif Ashrafee, Akib Mohammed Khan, Mohammad Sabik Irbaz, MD Abdullah Al Nasim
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Detection
Inference
PDF
2021-08-18
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool
arXiv_CV
arXiv_CV
Reinforcement_Learning
OCR
Action
Autonomous
PDF
2021-08-18
Statistical analysis of locally parameterized shapes
Mohsen Taheri, Jörn Schulz
arXiv_CV
arXiv_CV
OCR
Pose
Classification
PDF
2021-08-18
AdapterHub Playground: Simple and Flexible Few-Shot Learning with Adapters
Tilman Beck, Bela Bohlender, Christina Viehmann, Vincent Hane, Yanik Adamson, Jaber Khuri, Jonas Brossmann, Jonas Pfeiffer, Iryna Gurevych
arXiv_CL
arXiv_CL
Transfer_Learning
OCR
Knowledge
Face
Few-Shot
Language_Model
Prediction
PDF
2021-08-17
VisBuddy -- A Smart Wearable Assistant for the Visually Challenged
Ishwarya Sivakumar, Nishaali Meenakshisundaram, Ishwarya Ramesh, Shiloah Elizabeth D, Sunil Retmin Raj C
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Optical_Character
Knowledge
Pose
Face
Action
Deep_Learning
Detection
Object_Detection
Caption
PDF
2021-08-16
An NLP approach to quantify dynamic salience of predefined topics in a text corpus
A. Bock, A. Palladino, S. Smith-Heisters, I. Boardman, E. Pellegrini, E.J. Bienenstock, A. Valenti
arXiv_CL
arXiv_CL
OCR
PDF
2021-08-14
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin
arXiv_CV
arXiv_CV
Recognition
OCR
Action
Detection
PDF
2021-08-10
BROS: A Layout-Aware Pre-trained Language Model for Understanding Documents
Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, Sungrae Park
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Pose
Language_Model
PDF
2021-08-06
Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents
Amit Gupte, Alexey Romanov, Sahitya Mantravadi, Dalitso Banda, Jianjie Liu, Raza Khan, Lakshmanan Ramu Meenal, Benjamin Han, Soundar Srinivasan
arXiv_CL
arXiv_CL
Recognition
OCR
Restoration
Optical_Character
Action
PDF
2021-08-03
Solo-learn: A Library of Self-supervised Methods for Visual Representation Learning
Victor G. Turrisi da Costa, Enrico Fini, Moin Nabi, Nicu Sebe, Elisa Ricci
arXiv_CV
arXiv_CV
OCR
Represenation_Learning
Self-Supervised
PDF
2021-07-30
Foundations of data imbalance and solutions for a data democracy
Ajay Kulkarni, Deri Chong, Feras A. Batarseh
arXiv_AI
arXiv_AI
OCR
Classification
PDF
2021-07-27
PDF-Malware: An Overview on Threats, Detection and Evasion Attacks
Nicolas Fleury, Theo Dubrunquez, Ihsen Alouani
arXiv_AI
arXiv_AI
OCR
Pose
Detection
PDF
2021-07-19
Machine Learning and Deep Learning Methods for Building Intelligent Systems in Medicine and Drug Discovery: A Comprehensive Survey
G Jignesh Chowdary, Suganya G, Premalatha M, Asnath Victy Phamila Y, Karunamurthy K
arXiv_AI
arXiv_AI
OCR
Optimization
Survey
Classification
Deep_Learning
Relation
Medical
Prediction
PDF
2021-07-15
Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining
Guowei Xu, Wenbiao Ding, Weiping Fu, Zhongqin Wu, Zitao Liu
arXiv_AI
arXiv_AI
Recognition
OCR
Text_Classification
Optical_Character
Pose
Classification
PDF
2021-07-13
Scene Text recognition with Full Normalization
Nathan Zachary, Gerald Carl, Russell Elijah, Hessi Roma, Robert Leer, James Amelia
arXiv_CV
arXiv_CV
Recognition
OCR
Scene_Text
PDF
2021-07-12
Hate versus Politics: Detection of Hate against Policy makers in Italian tweets
Armend Duzha, Cristiano Casadei, Michael Tosi, Fabio Celli
arXiv_CL
arXiv_CL
OCR
Speech
Classification
Detection
PDF
2021-07-12
MOOCRep: A Unified Pre-trained Embedding of MOOC Entities
Shalini Pandey, Jaideep Srivastava
arXiv_AI
arXiv_AI
Transformer
Embedding
OCR
Represenation_Learning
Knowledge
Pose
Relation
Language_Model
Prediction
Recommendation
PDF
2021-07-09
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset
Hannah Rose Kirk, Yennie Jun, Paulius Rauba, Gal Wachtel, Ruining Li, Xingjian Bai, Noah Broestl, Martin Doff-Sotta, Aleksandar Shtedritski, Yuki M. Asano
arXiv_CV
arXiv_CV
OCR
Pose
Face
Detection
Caption
PDF
2021-07-05
Vision Xformers: Efficient Attention for Image Classification
Pranav Jeevan, Amit Sethi (Indian Institute of Technology Bombay)
arXiv_AI
arXiv_AI
Transformer
Embedding
OCR
Classification
Attention
CNN
Image_Classification
PDF
2021-07-02
The Optimal Size of an Epistemic Congress
Manon Revel, Tao Lin, Daniel Halpern
arXiv_AI
arXiv_AI
OCR
PDF
2021-07-02
Data Centric Domain Adaptation for Historical Text with OCR Errors
Luisa März, Stefan Schweter, Nina Poerner, Benjamin Roth, Hinrich Schütze
arXiv_CL
arXiv_CL
Embedding
Unsupervised
Recognition
OCR
Pose
PDF
2021-06-29
New Arabic Medical Dataset for Diseases Classification
Jaafar Hammoud, Aleksandra Vatian, Natalia Dobrenko, Nikolai Vedernikov, Anatoly Shalyto, Natalia Gusarova
arXiv_CL
arXiv_CL
OCR
Bert
Classification
Deep_Learning
Medical
PDF
2021-06-27
DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation
Haitao Lin, Zichang Liu, Chilam Cheang, Lingwei Zhang, Yanwei Fu, Xiangyang Xue
arXiv_CV
arXiv_CV
OCR
3D
Pose
Inference
PDF
2021-06-26
The Feasibility and Inevitability of Stealth Attacks
Ivan Y. Tyukin, Desmond J. Higham, Eliyas Woldegeorgis, Alexander N. Gorban
arXiv_AI
arXiv_AI
OCR
Adversarial
Pose
Deep_Learning
PDF
2021-06-22
A Simple and Practical Approach to Improve Misspellings in OCR Text
Junxia Lin (1), Johannes Ledolter (2) ((1) Georgetown University Medical Center, Georgetown University, (2) Tippie College of Business, University of Iowa)
arXiv_CL
arXiv_CL
Unsupervised
OCR
PDF
2021-06-21
An End-to-End Khmer Optical Character Recognition using Sequence-to-Sequence with Attention
Rina Buoy, Sokchea Kor, Nguonly Taing
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Attention
CNN
PDF
2021-06-20
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Jiapeng Wang, Tianwei Wang, Guozhi Tang, Lianwen Jin, Weihong Ma, Kai Ding, Yichao Huang
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Action
Attention
GAN
Inference
PDF
2021-06-16
Eider: Evidence-enhanced Document-level Relation Extraction
Yiqing Xie, Jiaming Shen, Sha Li, Yuning Mao, Jiawei Han
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Prediction
PDF
2021-06-15
Classification of Documents Extracted from Images with Optical Character Recognition Methods
Omer Aydin
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Optical_Character
Classification
PDF
2021-06-15
Mixed Model OCR Training on Historical Latin Script for Out-of-the-Box Recognition and Finetuning
Christian Reul, Christoph Wick, Maximilian Nöth, Andreas Büttner, Maximilian Wehner, Uwe Springmann
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
2021-06-14
Pitfalls of Explainable ML: An Industry Perspective
Sahil Verma, Aditya Lahiri, John P. Dickerson, Su-In Lee
arXiv_AI
arXiv_AI
OCR
Prediction
PDF
2021-06-14
EuroCrops: A Pan-European Dataset for Time Series Crop Type Classification
Maja Schneider, Amelie Broszeit, Marco Körner
arXiv_CV
arXiv_CV
OCR
Classification
PDF
2021-06-10
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter
Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Attention
GAN
Inference
Prediction
PDF
2021-06-10
Hard Choices in Artificial Intelligence
Roel Dobbe, Thomas Krendl Gilbert, Yonatan Mintz
arXiv_AI
arXiv_AI
OCR
PDF
2021-06-10
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Ryota Yoshihashi, Tomohiro Tanaka, Kenji Doi, Takumi Fujino, Naoaki Yamashita
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Detection
PDF
2021-06-08
PAM: Understanding Product Images in Cross Product Category Attribute Extraction
Rongmei Lin, Xiang He, Jie Feng, Nasser Zalmout, Yan Liang, Li Xiong, Xin Luna Dong
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Knowledge
Knowledge_Graph
Pose
Action
VQA
PDF
2021-06-08
Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface
Peng Xu, Wenjie Zi, Hamidreza Shahidi, Ákos Kádár, Keyi Tang, Wei Yang, Jawad Ateeq, Harsh Barot, Meidan Alon, Yanshuai Cao
arXiv_CL
arXiv_CL
OCR
Face
Prediction
PDF
2021-06-08
Classification of Contract-Amendment Relationships
Fuqi Song
arXiv_CL
arXiv_CL
Tracking
Recognition
OCR
Optical_Character
Pose
Classification
Relation
PDF
2021-06-07
Document-level Relation Extraction as Semantic Segmentation
Ningyu Zhang, Xiang Chen, Xin Xie, Shumin Deng, Chuanqi Tan, Mosha Chen, Fei Huang, Luo Si, Huajun Chen
arXiv_CL
arXiv_CL
Transformer
Segmentation
Semantic_Segmentation
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2021-06-05
Denoising Word Embeddings by Averaging in a Shared Space
Avi Caciularu, Ido Dagan, Jacob Goldberger
arXiv_CL
arXiv_CL
Embedding
OCR
Denoising
PDF
2021-06-04
Language Model Metrics and Procrustes Analysis for Improved Vector Transformation of NLP Embeddings
Thomas Conley, Jugal Kalita
arXiv_CL
arXiv_CL
Embedding
OCR
Language_Model
PDF
2021-06-03
Defending Democracy: Using Deep Learning to Identify and Prevent Misinformation
Anusua Trivedi, Alyssa Suhm, Prathamesh Mahankal, Subhiksha Mukuntharaj, Meghana D. Parab, Malvika Mohan, Meredith Berger, Arathi Sethumadhavan, Ashish Jaiman, Rahul Dodhia
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Deep_Learning
Detection
PDF
2021-06-03
Discriminative Reasoning for Document-level Relation Extraction
Wang Xu, Kehai Chen, Tiejun Zhao
arXiv_CL
arXiv_CL
Recognition
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2021-06-02
Detecting Bot-Generated Text by Characterizing Linguistic Accommodation in Human-Bot Interactions
Paras Bhatt, Anthony Rios
arXiv_CL
arXiv_CL
OCR
Speech
Action
Detection
PDF
2021-06-02
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net
Tuan-Anh Nguyen Dang, Dat-Thanh Nguyen
arXiv_CV
arXiv_CV
Embedding
OCR
Pose
Action
Deep_Learning
Relation
Attention
PDF
2021-06-01
PanoDR: Spherical Panorama Diminished Reality for Indoor Scenes
V. Gkitsas, V. Sterzentsenko, N. Zioulis, G. Albanis, D. Zarpalas
arXiv_CV
arXiv_CV
Reconstruction
Inpainting
OCR
3D
Pose
Quantitative
PDF
2021-06-01
Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Action
PDF
2021-05-29
Correcting public opinion trends through Bayesian data assimilation
Robin Hendrickx, Rossella Arcucci, Julio Amador Dıaz Lopez, Yi-Ke Guo, Mark Kennedy
arXiv_AI
arXiv_AI
OCR
Pose
Survey
PDF
2021-05-26
What data do we need for training an AV motion planner?
Long Chen, Lukas Platinsky, Stefanie Speichert, Blazej Osinski, Oliver Scheel, Yawei Ye, Hugo Grimmett, Luca del Pero, Peter Ondruska
arXiv_CV
arXiv_CV
OCR
Autonomous
PDF
2021-05-25
Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling
Marcin Namysl, Sven Behnke, Joachim Köhler
arXiv_CL
arXiv_CL
Embedding
Recognition
OCR
Optical_Character
Language_Model
PDF
2021-05-25
Affine Transport for Sim-to-Real Domain Adaptation
Anton Mallasto, Karol Arndt, Markus Heinonen, Samuel Kaski, Ville Kyrki
arXiv_RO
arXiv_RO
OCR
PDF
2021-05-23
Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations
Pascal Fischer, Alen Smajic, Alexander Mehler, Giuseppe Abrami
arXiv_AI
arXiv_AI
Transfer_Learning
Recognition
OCR
Optical_Character
Deep_Learning
Detection
Attention
PDF
2021-05-19
End-to-End Unsupervised Document Image Blind Denoising
Mehrdad J Gangeh, Marcin Plata, Hamid Motahari, Nigel P Duffy
arXiv_CV
arXiv_CV
Unsupervised
Recognition
OCR
Optical_Character
Pose
Deep_Learning
Denoising
PDF
2021-05-19
Surprisingly Popular Voting Recovers Rankings, Surprisingly!
Hadi Hosseini, Debmalya Mandal, Nisarg Shah, Kevin Shi
arXiv_AI
arXiv_AI
OCR
Prediction
PDF
2021-05-17
Unknown-box Approximation to Improve Optical Character Recognition Performance
Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2021-05-17
STRIDE : Scene Text Recognition In-Device
Rachit S Munjal, Arun D Prabhu, Nikhil Arora, Sukumar Moharana, Gopi Ramena
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Scene_Text
Attention
Inference
PDF
2021-05-17
EasyFL: A Low-code Federated Learning Platform For Dummies
Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang
arXiv_AI
arXiv_AI
Tracking
OCR
Optimization
Pose
Action
PDF
2021-05-12
Mining Legacy Issues in Open Pit Mining sites: Innovation & Support of Renaturalization and Land Utilization
Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann, Gerhard Heyer
arXiv_CL
arXiv_CL
Recognition
OCR
Text_Classification
Optical_Character
Action
Classification
PDF
2021-05-12
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Detection
VQA
QA
PDF
2021-05-10
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding
Zilong Wang, Mingjie Zhan, Houxing Ren, Zhaohui Hou, Yuwei Wu, Xingyan Zhang, Ding Liang
arXiv_AI
arXiv_AI
OCR
Optical_Character
Pose
Action
Relation
Relation_Extraction
GAN
PDF
2021-05-10
An end-to-end Optical Character Recognition approach for ultra-low-resolution printed text images
Julian D. Gilbey, Carola-Bibiane Schönlieb
arXiv_CV
arXiv_CV
Recognition
Super_Resolution
OCR
Optical_Character
PDF
2021-05-10
DocReader: Bounding-Box Free Training of a Document Information Extraction Model
Shachar Klaiman, Marius Lehne
arXiv_CV
arXiv_CV
OCR
Action
PDF
2021-05-09
End-to-End Optical Character Recognition for Bengali Handwritten Words
Farisa Benta Safir, Abu Quwsar Ohi, M.F. Mridha, Muhammad Mostafa Monowar, Md. Abdul Hamid
arXiv_CV
arXiv_CV
NAS
Recognition
OCR
RNN
Optical_Character
Review
Pose
CNN
PDF
2021-05-09
High-performance symbolic-numerics via multiple dispatch
Shashi Gowda, Yingbo Ma, Alessandro Cheli, Maja Gwozdz, Viral B. Shah, Christopher Rackauckas
arXiv_CL
arXiv_CL
OCR
Optimization
Knowledge
Face
Action
PDF
2021-05-04
A Survey on End-User Robot Programming
Gopika Ajaykumar, Maureen Steele, Chien-Ming Huang (Johns Hopkins University)
arXiv_RO
arXiv_RO
OCR
Survey
PDF
2021-05-04
Towards Accountability in the Use of Artificial Intelligence for Public Administrations
Michele Loi, Matthias Spielkamp
arXiv_AI
arXiv_AI
OCR
Ontology
GAN
PDF
2021-05-02
BI-REC: Guided Data Analysis for Conversational Business Intelligence
Venkata Vamsikrishna Meduri, Abdul Quamar, Chuan Lei, Vasilis Efthymiou, Fatma Ozcan
arXiv_AI
arXiv_AI
Embedding
OCR
Pose
Face
Action
Prediction
Recommendation
PDF
2021-04-30
Participatory Budgeting with Donations and Diversity Constraints
Jiehua Chen, Martin Lackner, Jan Maly
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-04-30
Word-Level Alignment of Paper Documents with their Electronic Full-Text Counterparts
Mark-Christoph Müller, Sucheta Ghosh, Ulrike Wittig, Maja Rey
arXiv_CL
arXiv_CL
Unsupervised
OCR
Medical
PDF
2021-04-28
Motion-guided Non-local Spatial-Temporal Network for Video Crowd Counting
Haoyue Bai, S.-H. Gary Chan
arXiv_CV
arXiv_CV
OCR
Pose
Relation
PDF
2021-04-27
AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions
Martin Kišš, Karel Beneš, Michal Hradiš
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
PDF
2021-04-23
CapillaryNet: An Automated System to Analyze Microcirculation Videos from Handheld Vital Microscopy
Maged Helmy, Anastasiya Dykyy, Tuyen Trung Truong, Paulo Ferreira, Eric Jul
arXiv_CV
arXiv_CV
OCR
Medical
PDF
2021-04-23
OCRTOC: A Cloud-Based Competition and Benchmark for Robotic Grasping and Manipulation
Ziyuan Liu, Wei Liu, Yuzhe Qin, Fanbo Xiang, Songyan Xin, Maximo A. Roa, Berk Calli, Hao Su, Yu Sun, Ping Tan
arXiv_RO
arXiv_RO
OCR
Pose
GAN
PDF
2021-04-19
Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model
Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Svein Arne Brygfjeld
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Bert
Optical_Character
Classification
Language_Model
PDF
2021-04-18
Documenting the English Colossal Clean Crawled Corpus
Jesse Dodge, Maarten Sap, Ana Marasovic, William Agnew, Gabriel Ilharco, Dirk Groeneveld, Matt Gardner
arXiv_AI
arXiv_AI
OCR
Salient
Face
Language_Model
PDF
2021-04-16
Open data for Moroccan license plates for OCR applications : data collection, labeling, and model construction
Abdelkrim Alahyane, Mohamed El Fakir, Saad Benjelloun, Ikram Chairi
arXiv_AI
arXiv_AI
Segmentation
Recognition
OCR
PDF
2021-04-16
TeLCoS: OnDevice Text Localization with Clustering of Script
Rachit S Munjal, Manoj Goyal, Rutika Moharir, Sukumar Moharana
arXiv_AI
arXiv_AI
Recognition
OCR
Knowledge
Pose
Scene_Text
Action
PDF
2021-04-15
Tabletop Object Rearrangement: Team ACRV's Entry to OCRTOC
Zheyu Zhang, Rhys Newbury, Kerry He, Steven Martin, Gavin Suddrey, Jun Kwan, Peter Corke, Akansel Cosgun
arXiv_RO
arXiv_RO
OCR
GAN
PDF
2021-04-13
'Subverting the Jewtocracy': Online Antisemitism Detection Using Multimodal Deep Learning
Mohit Chandra, Dheeraj Pailla, Himanshu Bhatia, Aadilmehdi Sanchawala, Manish Gupta, Manish Shrivastava, Ponnurangam Kumaraguru
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Deep_Learning
Detection
PDF
2021-04-12
Escaping the Big Data Paradigm with Compact Transformers
Ali Hassani, Steven Walton, Nikhil Shah, Abulikemu Abuduweili, Jiachen Li, Humphrey Shi
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
PDF
2021-04-12
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra
arXiv_CL
arXiv_CL
OCR
NMT
PDF
2021-04-12
Diamond in the rough: Improving image realism by traversing the GAN latent space
Jeffrey Wen, Fabian Benitez-Quiroz, Qianli Feng, Aleix Martinez
arXiv_CV
arXiv_CV
Unsupervised
OCR
Optimization
Adversarial
Quantitative
GAN
PDF
2021-04-10
Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis
Xutan Peng, Guanyi Chen, Chenghua Lin, Mark Stevenson
arXiv_AI
arXiv_AI
Embedding
OCR
Knowledge
Knowledge_Graph
Pose
Relation
PDF
2021-04-09
Video-aided Unsupervised Grammar Induction
Songyang Zhang, Linfeng Song, Lifeng Jin, Kun Xu, Dong Yu, Jiebo Luo
arXiv_CV
arXiv_CV
Unsupervised
OCR
Speech
Pose
Face
Action
PDF
2021-04-08
Computation and Bribery of Voting Power in Delegative Simple Games
Gianlorenzo D'Angelo, Esmaeil Delfaraz, Hugo Gilbert
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-04-07
Streaming Self-Training via Domain-Agnostic Unlabeled Images
Zhiqiu Lin, Deva Ramanan, Aayush Bansal
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Recognition
OCR
Knowledge
Face
Classification
Medical
Image_Classification
PDF
2021-04-07
Document Layout Analysis via Dynamic Residual Feature Fusion
Xingjiao Wu, Ziling Hu, Xiangcheng Du, Jing Yang, Liang He
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2021-04-05
When Can Liquid Democracy Unveil the Truth?
Ruben Becker, Gianlorenzo D'Angelo, Esmaeil Delfaraz, Hugo Gilbert
arXiv_AI
arXiv_AI
OCR
GAN
PDF
2021-04-05
Procrustean Training for Imbalanced Deep Learning
Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao
arXiv_CV
arXiv_CV
OCR
Knowledge
Pose
Deep_Learning
Prediction
PDF
2021-04-02
Artificial intelligence, human rights, democracy, and the rule of law: a primer
David Leslie, Christopher Burr, Mhairi Aitken, Josh Cowls, Michael Katell, Morgan Briggs
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-03-31
PAUL: Procrustean Autoencoder for Unsupervised Lifting
Chaoyang Wang, Simon Lucey
arXiv_CV
arXiv_CV
Unsupervised
OCR
3D
Pose
Deep_Learning
PDF
2021-03-30
Deep regression on manifolds: a 3D rotation case study
Romain Brégier
arXiv_CV
arXiv_CV
OCR
3D
Pose
Deep_Learning
PDF
2021-03-29
A Multiplexed Network for End-to-End, Multilingual OCR
Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Detection
PDF
2021-03-29
Personalized Affect-Aware Socially Assistive Robot Tutors Aimed at Fostering Social Grit in Children with Autism
Zhonghao Shi, Manwei Cao, Sophia Pei, Xiaoyang Qiao, Thomas R Groechel, Maja J Matarić
arXiv_RO
arXiv_RO
OCR
Pose
Emotion
PDF
2021-03-23
A News Recommender System Considering Temporal Dynamics and Diversity
Shaina Raza
arXiv_AI
arXiv_AI
OCR
Prediction
Recommendation
PDF
2021-03-22
Fairness Perceptions of Algorithmic Decision-Making: A Systematic Review of the Empirical Literature
Christopher Starke, Janine Baleis, Birte Keller, Frank Marcinkowski
arXiv_AI
arXiv_AI
OCR
Review
Autonomous
PDF
2021-03-19
Congolese Swahili Machine Translation for Humanitarian Response
Alp Öktem, Eric DeLuca, Rodrigue Bashizi, Eric Paquin, Grace Tang
arXiv_CL
arXiv_CL
OCR
QA
PDF
2021-03-18
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shjian Lu, C.V. Jawahar
arXiv_AI
arXiv_AI
Recognition
OCR
Action
GAN
PDF
2021-03-18
KoDF: A Large-scale Korean DeepFake Detection Dataset
Patrick Kwon, Jaeseong You, Gyuhyeon Nam, Sungwoo Park, Gyeongsu Chae
arXiv_CV
arXiv_CV
OCR
Face
Detection
PDF
2021-03-17
On the Whitney extension problem for near isometries and beyond
Steven B. Damelin
arXiv_CV
arXiv_CV
OCR
Optimization
PDF
2021-03-17
Interpretable Distance Metric Learning for Handwritten Chinese Character Recognition
Boxiang Dong, Aparna S. Varde, Danilo Stevanovic, Jiayin Wang, Liang Zhao
arXiv_AI
arXiv_AI
Handwriting
Recognition
OCR
Optical_Character
Pose
Face
Action
PDF
2021-03-17
What s in My LiDAR Odometry Toolbox?
Pierre Dellenbach, Jean-Emmanuel Deschaud, Bastien Jacquet, François Goulette
arXiv_CV
arXiv_CV
OCR
3D
Review
SLAM
Deep_Learning
GAN
PDF
2021-03-17
Endangered Languages are not Low-Resourced!
Mika Hämäläinen
arXiv_CL
arXiv_CL
OCR
Relation
PDF
2021-03-16
Combining Morphological and Histogram based Text Line Segmentation in the OCR Context
Pit Schneider
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
PDF
2021-03-15
Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs
Lars Vögtlin, Manuel Drazyk, Vinaychandran Pondenkandath, Michele Alberti, Rolf Ingold
arXiv_AI
arXiv_AI
OCR
Deep_Learning
GAN
PDF
2021-03-13
uTHCD: A New Benchmarking for Tamil Handwritten OCR
Noushath Shaffi, Faizal Hajamohideen
arXiv_CV
arXiv_CV
Recognition
OCR
CNN
PDF
2021-03-11
Characterizing Partisan Political Narratives about COVID-19 on Twitter
Elise Jing, Yong-Yeol Ahn
arXiv_CL
arXiv_CL
OCR
Pose
PDF
2021-03-10
Adversarial Regression Learning for Bone Age Estimation
Youshan Zhang, Brian D. Davison
arXiv_CV
arXiv_CV
Reconstruction
OCR
Adversarial
Pose
PDF
2021-03-09
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering
Aman Jain, Mayank Kothyari, Vishwajeet Kumar, Preethi Jyothi, Ganesh Ramakrishnan, Soumen Chakrabarti
arXiv_CV
arXiv_CV
OCR
Knowledge
Knowledge_Graph
Action
Quantitative
VQA
QA
PDF
2021-03-09
TS-Net: OCR Trained to Switch Between Text Transcription Styles
Jan Kohút, Michal Hradiš
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Knowledge
Pose
PDF
2021-03-03
Self-play Learning Strategies for Resource Assignment in Open-RAN Networks
Xiaoyang Wang, Jonathan D Thomas, Robert J Piechocki, Shipra Kapoor, Raul Santos-Rodriguez, Arjun Parekh
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
PDF
2021-03-02
Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020 US Elections on the Basis of Offensive Speech and Stance Detection
Lara Grimminger, Roman Klinger
arXiv_CL
arXiv_CL
OCR
Bert
Speech
Detection
PDF
2021-02-28
Citizen Participation and Machine Learning for a Better Democracy
M. Arana-Catania, F.A. Van Lier, Rob Procter, Nataliya Tkachenko, Yulan He, Arkaitz Zubiaga, Maria Liakata
arXiv_CL
arXiv_CL
OCR
PDF
2021-02-27
A Simple But Effective Approach to n-shot Task-Oriented Dialogue Augmentation
Taha Aksu, Nancy F. Chen, Min-Yen Kan, Zhengyuan Liu
arXiv_CL
arXiv_CL
Tracking
OCR
Pose
PDF
2021-02-26
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Zero-Shot
Action_Recognition
Action
Classification
Caption
PDF
2021-02-23
Fair Set Selection: Meritocracy and Social Welfare
Thomas Kleine Buening, Meirav Segal, Debabrota Basu, Christos Dimitrakakis
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-02-20
Deep Structured Feature Networks for Table Detection and Tabular Data Extraction from Scanned Financial Document Images
Siwen Luo, Mengting Wu, Yiwen Gong, Wanying Zhou, Josiah Poon
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
Optical_Character
Pose
Action
Detection
CNN
PDF
2021-02-18
FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks
Lingjiao Chen, Matei Zaharia, James Zou
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Scene_Text
Classification
Image_Classification
Prediction
Matching
PDF
2021-02-17
SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition
Denis Coquenet, Clément Chatelain, Thierry Paquet
arXiv_CV
arXiv_CV
Segmentation
Handwriting
Recognition
OCR
Optical_Character
Pose
CNN
PDF
2021-02-17
Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning
Byungchan Ko, Jungseul Ok
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Regularization
Pose
PDF
2021-02-11
An End-to-end Model for Entity-level Relation Extraction using Multi-instance Learning
Markus Eberts, Adrian Ulges
arXiv_CL
arXiv_CL
OCR
Action
Relation
Relation_Extraction
PDF
2021-02-11
Representation Matters: Offline Pretraining for Sequential Decision Making
Mengjiao Yang, Ofir Nachum
arXiv_AI
arXiv_AI
Unsupervised
Reinforcement_Learning
OCR
Optimization
Prediction
PDF
2021-02-09
Bootstrapping Relation Extractors using Syntactic Search by Examples
Matan Eyal, Asaf Amrami, Hillel Taub-Tabib, Yoav Goldberg
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2021-02-06
The Arc of the Data Scientific Universe
David Leslie
arXiv_AI
arXiv_AI
OCR
Bert
GAN
PDF
2021-02-01
Neural OCR Post-Hoc Correction of Historical Corpora
Lijun Lyu, Maria Koutraki, Martin Krickl, Besnik Fetahu
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
RNN
Optical_Character
Pose
Face
Attention
CNN
PDF
2021-01-30
Epistocracy Algorithm: A Novel Hyper-heuristic Optimization Strategy for Solving Complex Optimization Problems
Seyed Ziae Mousavi Mojab, Seyedmohammad Shams, Hamid Soltanian-Zadeh, Farshad Fotouhi
arXiv_AI
arXiv_AI
OCR
Optimization
Knowledge
Pose
PDF
2021-01-29
General-Purpose OCR Paragraph Identification by Graph Convolution Networks
Renshen Wang, Yasuhisa Fujii, Ashok C. Popat
arXiv_CV
arXiv_CV
OCR
Pose
PDF
2021-01-28
Exploring Cross-Image Pixel Contrast for Semantic Segmentation
Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc Van Gool
arXiv_CV
arXiv_CV
Segmentation
Embedding
Unsupervised
Semantic_Segmentation
OCR
Optimization
Represenation_Learning
Pose
Relation
Attention
PDF
2021-01-26
El Volumen Louder Por Favor: Code-switching in Task-oriented SemanticParsing
Arash Einolghozati, Abhinav Arora, Lorena Sainz-Maza Lecanda, Anuj Kumar, Sonal Gupta
arXiv_AI
arXiv_AI
OCR
Zero-Shot
Pose
Few-Shot
Language_Model
PDF
2021-01-22
Censorship of Online Encyclopedias: Implications for NLP Models
Eddie Yang, Margaret E. Roberts
arXiv_AI
arXiv_AI
Embedding
OCR
Action
Attention
PDF
2021-01-15
Affordance-based Reinforcement Learning for Urban Driving
Tanmay Agarwal, Hitesh Arora, Jeff Schneider
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Autonomous
Prediction
PDF
2021-01-14
Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection
Ben Chen, Bin Chen, Dehong Gao, Qijin Chen, Chengfu Huo, Xiaonan Meng, Weijun Ren, Yang Zhou
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Knowledge
Adversarial
Pose
Quantitative
Detection
Language_Model
PDF
2021-01-09
An Unsupervised Normalization Algorithm for Noisy Text: A Case Study for Information Retrieval and Stance Detection
Anurag Roy, Shalmoli Ghosh, Kripabandhu Ghosh, Saptarshi Ghosh
arXiv_AI
arXiv_AI
Unsupervised
OCR
Pose
Action
Classification
Detection
PDF
2021-01-07
Robust Text CAPTCHAs Using Adversarial Examples
Rulin Shao, Zhouxing Shi, Jinfeng Yi, Pin-Yu Chen, Cho-Jui Hsieh
arXiv_CV
arXiv_CV
OCR
Adversarial
Pose
PDF
2021-01-06
On-Device Document Classification using multimodal features
Sugam Garg, Harichandana, Sumit Kumar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Classification
Inference
PDF
2021-01-04
Where Do Deep Fakes Look? Synthetic Face Detection via Gaze Tracking
Ilke Demir, Umur A. Ciftci
arXiv_AI
arXiv_AI
Face_Detection
Tracking
OCR
Pose
Face
Detection
Attention
PDF
2020-12-31
Improving Learning Experience in MOOCs with Educational Content Linking
Shang-Wen Li
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Face
Survey
GAN
PDF
2020-12-29
Present-Biased Optimization
Fedor V. Fomin, Pierre Fraigniaud, Petr A. Golovach
arXiv_AI
arXiv_AI
OCR
Optimization
Pose
Action
GAN
PDF
2020-12-28
Advanced Machine Learning Techniques for Fake News Detection: A Systematic Mapping Study
Michal Choras, Konstantinos Demestichas, Agata Gielczyk, Alvaro Herrero, Pawel Ksieniewicz, Konstantina Remoundou, Daniel Urda, Michal Wozniak
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Detection
GAN
PDF
2020-12-28
From Point to Space: 3D Moving Human Pose Estimation Using Commodity WiFi
Yiming Wang, Lingchao Guo, Zhaoming Lu, Xiangming Wen, Shuang Zhou, Wanyu Meng
arXiv_CV
arXiv_CV
OCR
3D
Pose_Estimation
Pose
PDF
2020-12-23
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Face
Action
Attention
CNN
PDF
2020-12-22
Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019
Nacira Abbas, Kholoud Alghamdi, Mortaza Alinam, Francesca Alloatti, Glenda Amaral, Claudia d'Amato, Luigi Asprino, Martin Beno, Felix Bensmann, Russa Biswas, Ling Cai, Riley Capshaw, Valentina Anita Carriero, Irene Celino, Amine Dadoun, Stefano De Giorgis, Harm Delva, John Domingue, Michel Dumontier, Vincent Emonet, Marieke van Erp, Paola Espinoza Arias, Omaima Fallatah, Sebastián Ferrada, Marc Gallofré Ocaña, Michalis Georgiou, Genet Asefa Gesese, Frances Gillis-Webber, Francesca Giovannetti, Marìa Granados Buey, Ismail Harrando, Ivan Heibi, Vitor Horta, Laurine Huber, Federico Igne, Mohamad Yaser Jaradeh, Neha Keshan, Aneta Koleva, Bilal Koteich, Kabul Kurniawan, Mengya Liu, Chuangtao Ma, Lientje Maas, Martin Mansfield, Fabio Mariani, Eleonora Marzi, Sepideh Mesbah, et al. (27 additional authors not shown)
arXiv_AI
arXiv_AI
OCR
Knowledge
Knowledge_Graph
PDF
2020-12-21
Document-Level Relation Extraction with Reconstruction
Wang Xu, Kehai Chen, Tiejun Zhao
arXiv_CL
arXiv_CL
Reconstruction
OCR
Pose
Action
Classification
Relation
Relation_Extraction
Attention
Inference
PDF
2020-12-19
Self-Supervision based Task-Specific Image Collection Summarization
Anurag Singh, Deepak Kumar Sharma, Sudhir Kumar Sharma, Joel J. P. C. Rodrigues
arXiv_CV
arXiv_CV
Embedding
OCR
Adversarial
Pose
Quantitative
Classification
Deep_Learning
GAN
Summarization
Inference
PDF
2020-12-18
Understood in Translation, Transformers for Domain Understanding
Dimitrios Christofidellis, Matteo Manica, Leonidas Georgopoulos, Hans Vandierendonck
arXiv_CL
arXiv_CL
Transformer
Unsupervised
OCR
RNN
Knowledge
Knowledge_Graph
Pose
Action
PDF
2020-12-17
Named Entity Recognition in the Legal Domain using a Pointer Generator Network
Stavroula Skylaki, Ali Oskooei, Omar Bari, Nadja Herger, Zac Kriegman (Thomson Reuters Labs)
arXiv_CL
arXiv_CL
Recognition
OCR
Action
PDF
2020-12-15
Amazon SageMaker Automatic Model Tuning: Scalable Black-box Optimization
Valerio Perrone, Huibin Shen, Aida Zolic, Iaroslav Shcherbatyi, Amr Ahmed, Tanya Bansal, Michele Donini, Fela Winkelmolen, Rodolphe Jenatton, Jean Baptiste Faddoul, Barbara Pogorzelska, Miroslav Miladinovic, Krishnaram Kenthapadi, Matthias Seeger, Cédric Archambeau
arXiv_AI
arXiv_AI
OCR
Optimization
Regularization
Pose
PDF
2020-12-15
Indonesian ID Card Extractor Using Optical Character Recognition and Natural Language Post-Processing
Firhan Maulana Rusli, Kevin Akbar Adhiguna, Hendy Irawan
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
PDF
2020-12-15
FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition Systems
Lu Chen, Jiao Sun, Wei Xu
arXiv_CV
arXiv_CV
Recognition
OCR
Optimization
Optical_Character
Adversarial
Pose
PDF
2020-12-14
Discovering Airline-Specific Business Intelligence from Online Passenger Reviews: An Unsupervised Text Analytics Approach
Sharan Srinivas, Surya Ramachandiran
arXiv_AI
arXiv_AI
Unsupervised
OCR
Review
Pose
Sentiment
Prediction
PDF
2020-12-14
Vartani Spellcheck -- Automatic Context-Sensitive Spelling Correction of OCR-generated Hindi Text Using BERT and Levenshtein Distance
Aditya Pal, Abhijit Mustafi
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Bert
Optical_Character
Pose
Detection
Language_Model
PDF
2020-12-11
Interdisciplinary Approaches to Understanding Artificial Intelligence's Impact on Society
Suresh Venkatasubramanian, Nadya Bliss, Helen Nissenbaum, Melanie Moses
arXiv_AI
arXiv_AI
Surveillance
OCR
Attention
PDF
2020-12-09
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
Optical_Character
VQA
Attention
Caption
QA
PDF
2020-12-08
EvoCraft: A New Challenge for Open-Endedness
Djordje Grbic, Rasmus Berg Palm, Elias Najarro, Claire Glanois, Sebastian Risi
arXiv_AI
arXiv_AI
OCR
Pose
Face
PDF
2020-12-08
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florencio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo
arXiv_CV
arXiv_CV
Image_Caption
OCR
Represenation_Learning
Pose
Scene_Text
Relation
VQA
Caption
Language_Model
Prediction
QA
Matching
PDF
2020-12-07
How To Solve Moral Conundrums with Computability Theory
Min Baek
arXiv_AI
arXiv_AI
OCR
Survey
PDF
2020-12-07
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Zhaokai Wang, Renda Bao, Qi Wu, Si Liu
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Embedding
Recognition
OCR
Optical_Character
Pose
NMT
Caption
PDF
2020-12-02
Analyzing Stylistic Variation across Different Political Regimes
Liviu P. Dinu, Ana-Sabina Uban
arXiv_CL
arXiv_CL
OCR
Pose
Classification
PDF
2020-11-30
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation
Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu
arXiv_CV
arXiv_CV
Reconstruction
OCR
3D
Pose
PDF
2020-11-29
Intrinsic Decomposition of Document Images In-the-Wild
Sagnik Das, Hassan Ahmed Sial, Ke Ma, Ramon Baldrich, Maria Vanrell, Dimitris Samaras
arXiv_CV
arXiv_CV
OCR
Self-Supervised
Pose
Deep_Learning
PDF
2020-11-28
OpenKBP: The open-access knowledge-based planning grand challenge
Aaron Babier, Binghao Zhang, Rafid Mahmood, Kevin L. Moore, Thomas G. Purdie, Andrea L. McNiven. Timothy C. Y. Chan
arXiv_CV
arXiv_CV
OCR
3D
Knowledge
Pose
Contour
Prediction
PDF
2020-11-27
A Survey of Deep Learning Approaches for OCR and Document Understanding
Nishant Subramani, Alexandre Matton, Malcolm Greaves, Adrian Lam
arXiv_CV
arXiv_CV
OCR
Review
Survey
Deep_Learning
PDF
2020-11-25
A Panoramic Survey of Natural Language Processing in the Arab World
Kareem Darwish, Nizar Habash, Mourad Abbas, Hend Al-Khalifa, Huseein T. Al-Natsheh, Samhaa R. El-Beltagy, Houda Bouamor, Karim Bouzoubaa, Violetta Cavalli-Sforza, Wassim El-Hajj, Mustafa Jarrar, Hamdy Mubarak
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Speech
Survey
Sentiment
Speech_Recognition
PDF
2020-11-22
Locally Linear Embedding and its Variants: Tutorial and Survey
Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley
arXiv_CV
arXiv_CV
Reconstruction
Embedding
OCR
Bert
Survey
PDF
2020-11-21
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning
Baohua Sun, Michael Lin, Hao Sha, Lin Yang
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Optical_Character
Pose
Detection
Caption
PDF
2020-11-20
On-Device Text Image Super Resolution
Dhruval Jain, Arun D Prabhu, Gopi Ramena, Manoj Goyal, Debi Prasanna Mohanty, Sukumar Moharana, Naresh Purre
arXiv_CV
arXiv_CV
Super_Resolution
OCR
Pose
Action
CNN
Inference
PDF
2020-11-18
Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Fuqi Song, Éric de la Clergerie
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
Pose
PDF
2020-11-17
PassGoodPool: Joint Passengers and Goods Fleet Management with Reinforcement Learning aided Pricing, Matching, and Route Planning
Kaushik Manchella, Marina Haliem, Vaneet Aggarwal, Bharat Bhargava
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Inference
Matching
PDF
2020-11-11
Classification Of Sleep-Wake State In A Ballistocardiogram System Based On Deep Learning
Nemath Ahmed, Aashit Singh, Srivyshnav KS, Gulshan Kumar, Gaurav Parchani, Vibhor Saran
arXiv_AI
arXiv_AI
OCR
Pose
Action
Classification
Deep_Learning
Prediction
PDF
2020-11-10
OCR Post Correction for Endangered Language Texts
Shruti Rijhwani, Antonios Anastasopoulos, Graham Neubig
arXiv_CL
arXiv_CL
Recognition
OCR
Pose
PDF
2020-11-10
On-Device Language Identification of Text in Images using Diacritic Characters
Shubham Vatsal, Nikhil Arora, Gopi Ramena, Sukumar Moharana, Dhruval Jain, Naresh Purre, Rachit S Munjal
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Inference
PDF
2020-11-08
Denoising Relation Extraction from Document-level Distant Supervision
Chaojun Xiao, Yuan Yao, Ruobing Xie, Xu Han, Zhiyuan Liu, Maosong Sun, Fen Lin, Leyu Lin
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Denoising
PDF
2020-11-06
An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish
Quan Duong, Mika Hämäläinen, Simon Hengchen
arXiv_CL
arXiv_CL
Unsupervised
Recognition
OCR
Optical_Character
Action
NMT
PDF
2020-11-06
OP-IMS @ DIACR-Ita: Back to the Roots: SGNS+OP+CD still rocks Semantic Change Detection
Jens Kaiser, Dominik Schlechtweg, Sabine Schulte im Walde
arXiv_CL
arXiv_CL
OCR
Detection
PDF
2020-11-04
Handwriting Classification for the Analysis of Art-Historical Documents
Christian Bartz, Hendrik Rätz, Christoph Meinel
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Text_Classification
Knowledge
Pose
Classification
Deep_Learning
PDF
2020-11-04
Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training
Dongha Choi, Hyunju Lee
arXiv_CL
arXiv_CL
OCR
Pose
Action
Deep_Learning
Medical
Prediction
PDF
2020-11-03
BioNerFlair: biomedical named entity recognition using flair embedding and sequence tagger
Harsh Patel
arXiv_CL
arXiv_CL
Transformer
Embedding
Recognition
OCR
Bert
RNN
Action
Medical
PDF
2020-11-02
Automated Transcription of Non-Latin Script Periodicals: A Case Study in the Ottoman Turkish Print Archive
Suphan Kirmizialtin, David Wrisley
arXiv_CL
arXiv_CL
OCR
Deep_Learning
PDF
2020-11-01
Pseudo-Bidirectional Decoding for Local Sequence Transduction
Wangchunshu Zhou, Tao Ge, Ke Xu
arXiv_CL
arXiv_CL
OCR
Regularization
Pose
PDF
2020-10-28
DeSMOG: Detecting Stance in Media On Global Warming
Yiwei Luo, Dallas Card, Dan Jurafsky
arXiv_AI
arXiv_AI
OCR
Bert
Detection
Attention
PDF
2020-10-24
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
Zan-Xia Jin, Heran Wu, Chun Yang, Fang Zhou, Jingyan Qin, Lei Xiao, Xu-Cheng Yin
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Relation
VQA
QA
Matching
PDF
2020-10-24
Persian Handwritten Digit, Character, and Words Recognition by Using Deep Learning Methods
Mehdi Bonyani, Simindokht Jahangard
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
2020-10-23
A Software Architecture for Autonomous Vehicles: Team LRM-B Entry in the First CARLA Autonomous Driving Challenge
Luis Alberto Rosero, Iago Pacheco Gomes, Júnior Anderson Rodrigues da Silva, Tiago Cesar dos Santos, Angelica Tiemi Mizuno Nakamura, Jean Amaro, Denis Fernando Wolf, Fernando Santos Osório
arXiv_AI
arXiv_AI
Point_Cloud
OCR
3D
Face
Action
Classification
Detection
GAN
CNN
Autonomous
Prediction
PDF
2020-10-22
Quantitative analysis of robot gesticulation behavior
Unai Zabala, Igor Rodriguez, José María Martínez-Otzeta, Itziar Irigoien, Elena Lazkano
arXiv_RO
arXiv_RO
OCR
Gesture
Adversarial
Pose
Quantitative
GAN
PDF
2020-10-22
TLGAN: document Text Localization using Generative Adversarial Nets
Dongyoung Kim, Myungsung Kwak, Eunji Won, Sejung Shin, Jeongyeon Nam
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Adversarial
Action
GAN
PDF
2020-10-21
Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling
Wenxuan Zhou, Kevin Huang, Tengyu Ma, Jing Huang
arXiv_CL
arXiv_CL
OCR
Pose
Action
Classification
Relation
Relation_Extraction
Attention
Medical
Language_Model
PDF
2020-10-18
Image-based Automated Species Identification: Can Virtual Data Augmentation Overcome Problems of Insufficient Sampling?
Morris Klasen, Dirk Ahrens, Jonas Eberle, Volker Steinhage
arXiv_CV
arXiv_CV
OCR
Deep_Learning
GAN
CNN
PDF
2020-10-17
DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement
Mohamed Ali Souibgui, Yousri Kessentini
arXiv_CV
arXiv_CV
Enhancement
OCR
Knowledge
Adversarial
Pose
GAN
PDF
2020-10-17
Learning from Suboptimal Demonstration via Self-Supervised Reward Regression
Letian Chen, Rohan Paleja, Matthew Gombolay
arXiv_RO
arXiv_RO
Reinforcement_Learning
OCR
Self-Supervised
Relation
PDF
2020-10-16
A Conglomerate of Multiple OCR Table Detection and Extraction
Smita Pallavi, Raj Ratn Pranesh, Sumit Kumar
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Face
Action
Detection
PDF
2020-10-15
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding
Zilong Wang, Mingjie Zhan, Xuebo Liu, Ding Liang
arXiv_AI
arXiv_AI
OCR
Action
Detection
Relation
GAN
PDF
2020-10-14
Power in Liquid Democracy
Yuzhe Zhang, Davide Grossi
arXiv_AI
arXiv_AI
OCR
PDF
2020-10-10
HAMLET: A Hierarchical Agent-based Machine Learning Platform
Ahmad Esmaeili, John C. Gallagher, John A. Springer, Eric T. Matson
arXiv_AI
arXiv_AI
OCR
Pose
Action
Autonomous
PDF
2020-10-09
CryptoCredit: Securely Training Fair Models
Leo de Castro, Jiahao Chen, Antigoni Polychroniadou
arXiv_AI
arXiv_AI
OCR
Knowledge
Relation
PDF
2020-10-09
Table Structure Recognition using Top-Down and Bottom-Up Cues
Sachin Raja, Ajoy Mondal, C. V. Jawahar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
Detection
GAN
PDF
2020-10-06
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering
Wei Han, Hantao Huang, Tao Han
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
2020-10-05
Understanding bias in facial recognition technologies
David Leslie
arXiv_CV
arXiv_CV
Surveillance
Recognition
OCR
Bert
Face
Detection
Recommendation
PDF
2020-10-01
Multi-label Classification of Common Bengali Handwritten Graphemes: Dataset and Challenge
Samiul Alam, Tahsin Reasat, Asif Shahriyar Sushmit, Sadi Mohammad Siddiquee, Fuad Rahman, Mahady Hasan, Ahmed Imtiaz Humayun
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Classification
Deep_Learning
PDF
2020-09-29
Double Graph Based Reasoning for Document-level Relation Extraction
Shuang Zeng, Runxin Xu, Baobao Chang, Lei Li
arXiv_AI
arXiv_AI
OCR
Pose
Action
Relation
Relation_Extraction
Inference
PDF
2020-09-25
Democratizing Artificial Intelligence in Healthcare: A Study of Model Development Across Two Institutions Incorporating Transfer Learning
Vikash Gupta1, Holger Roth, Varun Buch3, Marcio A.B.C. Rockenbach, Richard D White, Dong Yang, Olga Laur, Brian Ghoshhajra, Ittai Dayan, Daguang Xu, Mona G. Flores, Barbaros Selnur Erdal
arXiv_CV
arXiv_CV
Segmentation
Transfer_Learning
OCR
Sparse
Deep_Learning
Medical
PDF
2020-09-23
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition
Bingcong Li, Xin Tang, Xianbiao Qi, Yihao Chen, Rong Xiao
arXiv_AI
arXiv_AI
Transformer
Embedding
Recognition
OCR
Pose
Scene_Text
Classification
Attention
PDF
2020-09-21
PP-OCR: A Practical Ultra Lightweight OCR System
Yuning Du, Chenxia Li, Ruoyu Guo, Xiaoting Yin, Weiwei Liu, Jun Zhou, Yifan Bai, Zilin Yu, Yehua Yang, Qingqing Dang, Haoshuang Wang
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
PDF
2020-09-18
An Efficient Language-Independent Multi-Font OCR for Arabic Script
Hussein Osman, Karim Zaghw, Mostafa Hazem, Seifeldin Elsehely
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Action
PDF
2020-09-17
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching
Mariona Coll Ardanuy, Kasra Hosseini, Katherine McDonough, Amrey Krause, Daniel van Strien, Federico Nanni
arXiv_CL
arXiv_CL
OCR
Deep_Learning
Attention
Matching
PDF
2020-09-17
Word Segmentation from Unconstrained Handwritten Bangla Document Images using Distance Transform
Pawan Kumar Singh, Shubham Sinha, Sagnik Pal Chowdhury, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
PDF
2020-09-16
A New Approach for Texture based Script Identification At Block Level using Quad Tree Decomposition
Pawan Kumar Singh, Supratim Das, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
OCR
Classification
PDF
2020-09-16
Handwritten Script Identification from Text Lines
Pawan Kumar Singh, Iman Chatterjee, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2020-09-14
Fast Implementation of 4-bit Convolutional Neural Networks for Mobile Devices
Anton Trusov, Elena Limonova, Dmitry Slugin, Dmitry Nikolaev, Vladimir V. Arlazarov
arXiv_CV
arXiv_CV
Quantization
Recognition
OCR
Pose
CNN
Inference
PDF
2020-09-12
Abstractive Information Extraction from Scanned Invoices using End-to-end Sequential Approach
Shreeshiv Patel, Dvijesh Bhatt
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Action
Deep_Learning
PDF
2020-09-11
MRZ code extraction from visa and passport documents using convolutional neural networks
Yichuan Liu, Hailey James, Otkrist Gupta, Dan Raviv
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
Detection
CNN
PDF
2020-09-10
OCR Graph Features for Manipulation Detection in Documents
Hailey James, Otkrist Gupta, Dan Raviv
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
PDF
2020-09-07
Democratizing AI: Non-expert design of prediction tasks
James P. Bagrow
arXiv_AI
arXiv_AI
OCR
Pose
Prediction
PDF
2020-09-02
Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning
Nathan Lambert, Craig Schindler, Daniel Drew, Kristofer Pister
arXiv_RO
arXiv_RO
Reinforcement_Learning
OCR
Action
PDF
2020-09-01
On Open and Strong-Scaling Tools for Atom Probe Crystallography: High-Throughput Methods for Indexing Crystal Structure and Orientation
Markus Kühbach, Matthew Kasemer, Baptiste Gault, Andrew Breen
arXiv_AI
arXiv_AI
OCR
Review
Face
Action
Quantitative
Relation
PDF
2020-09-01
Practical Cross-modal Manifold Alignment for Grounded Language
Andre T. Nguyen, Luke E. Richards, Gaoussou Youssouf Kebe, Edward Raff, Kasra Darvish, Frank Ferraro, Cynthia Matuszek
arXiv_CV
arXiv_CV
Embedding
OCR
Pose
PDF
2020-08-27
Entity and Evidence Guided Relation Extraction for DocRED
Kevin Huang, Guangtao Wang, Tengyu Ma, Jing Huang
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Action
Relation
Relation_Extraction
Attention
Language_Model
Prediction
PDF
2020-08-22
DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda Classification
Anastasios Bairaktaris, Symeon Symeonidis, Avi Arampatzis
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Classification
Detection
GAN
PDF
2020-08-18
EASTER: Efficient and Scalable Text Recognizer
Kartik Chaudhary, Raghav Bali
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Deep_Learning
CNN
PDF
2020-08-13
Can weight sharing outperform random architecture search? An investigation with TuNAS
Gabriel Bender, Hanxiao Liu, Bo Chen, Grace Chu, Shuyang Cheng, Pieter-Jan Kindermans, Quoc Le
arXiv_CV
arXiv_CV
NAS
OCR
Pose
Classification
Detection
Image_Classification
PDF
2020-08-12
Optimal to-do list gamification
Jugoslav Stojcheski, Valkyrie Felso, Falk Lieder
arXiv_AI
arXiv_AI
OCR
Face
PDF
2020-08-11
PlugSonic: a web- and mobile-based platform for binaural audio and sonic narratives
Marco Comunità, Andrea Gerino, Veranika Lim, Lorenzo Picinali
arXiv_SD
arXiv_SD
OCR
3D
Knowledge
PDF
2020-08-06
On the Accuracy of CRNNs for Line-Based OCR: A Multi-Parameter Evaluation
Bernhard Liebl, Manuel Burghardt
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Face
PDF
2020-08-05
Can You Read Me Now? Content Aware Rectification using Angle Supervision
Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor, Roee Litman
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
2020-08-04
Weakly Supervised Construction of ASR Systems with Massive Video Data
Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang
arXiv_CL
arXiv_CL
Unsupervised
Recognition
OCR
Weakly_Supervised
Optical_Character
Knowledge
Speech
Pose
Speech_Recognition
PDF
2020-08-03
State-of-the-art Techniques in Deep Edge Intelligence
Ahnaf Hannan Lodhi, Barış Akgün, Öznur Özkasap
arXiv_AI
arXiv_AI
OCR
Deep_Learning
GAN
PDF
2020-07-30
Brand Intelligence Analytics
A. Fronzetti Colladon, F. Grippa
arXiv_CL
arXiv_CL
OCR
PDF
2020-07-30
Photon: A Robust Cross-Domain Text-to-SQL System
Jichuan Zeng, Xi Victoria Lin, Caiming Xiong, Richard Socher, Michael R. Lyu, Irwin King, Steven C.H. Hoi
arXiv_AI
arXiv_AI
OCR
Pose
Face
Relation
PDF
2020-07-24
Out-of-Plane Magnetic Anisotropy in Ordered Ensembles of Fe$_y$N Nanocrystals Embedded in GaN
A. Navarro-Quezada, K. Gas, T. Truglas, V. Bauernfeind, M. Matzer, D. Kreil, A. Ney, H. Groiss, M. Sawicki, A. Bonanni
arXiv_CV
arXiv_CV
OCR
GAN
PDF
2020-07-23
Spatially Aware Multimodal Transformers for TextVQA
Yash Kant, Dhruv Batra, Peter Anderson, Alex Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal
arXiv_CV
arXiv_CV
Transformer
OCR
Pose
Relation
VQA
Attention
QA
PDF
2020-07-22
FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition
Wenqing Zhang, Yang Qiu, Song Bai, Rui Zhang, Xiaolin Wei, Xiang Bai
arXiv_CV
arXiv_CV
Recognition
OCR
Knowledge
Pose
Scene_Text
PDF
2020-07-21
Explainable Rumor Detection using Inter and Intra-feature Attention Networks
Mingxuan Chen, Ning Wang, K.P. Subbalakshmi
arXiv_AI
arXiv_AI
OCR
Detection
Attention
PDF
2020-07-21
Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations
Sungheon Park, Minsik Lee, Nojun Kwak
arXiv_CV
arXiv_CV
Reconstruction
OCR
3D
Pose
Face
Deep_Learning
PDF
2020-07-19
Political Framing: US COVID19 Blame Game
Chereen Shurafa, Kareem Darwish, Wajdi Zaghouani
arXiv_CL
arXiv_CL
OCR
PDF
2020-07-18
On a Novel Application of Wasserstein-Procrustes for Unsupervised Cross-Lingual Learning
Guillem Ramírez, Rumen Dangovski, Preslav Nakov, Marin Soljačić
arXiv_AI
arXiv_AI
Embedding
Unsupervised
OCR
PDF
2020-07-17
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection
K.V. Kadambari, Vishnu Vardhan Nimmalapudi
arXiv_CV
arXiv_CV
Transformer
Surveillance
Recognition
OCR
Optical_Character
Adversarial
Pose
Deep_Learning
Detection
Object_Detection
GAN
PDF
2020-07-08
The Automation of Acceleration: AI and the Future of Society
Nicholas Kluge Corrêa
arXiv_AI
arXiv_AI
OCR
Review
PDF
2020-07-07
Self-organizing Democratized Learning: Towards Large-scale Distributed Learning Systems
Minh N. H. Nguyen, Shashi Raj Pandey, Tri Nguyen Dang, Eui-Nam Huh, Choong Seon Hong, Nguyen H. Tran, Walid Saad
arXiv_AI
arXiv_AI
OCR
Pose
GAN
PDF
2020-07-01
Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval
Siddhant Bansal, Praveen Krishnan, C.V. Jawahar
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
PDF
2020-06-25
An Analysis of SVD for Deep Rotation Estimation
Jake Levinson, Carlos Esteves, Kefan Chen, Noah Snavely, Angjoo Kanazawa, Afshin Rostamizadeh, Ameesh Makadia
arXiv_CV
arXiv_CV
Unsupervised
OCR
3D
Quantitative
Deep_Learning
PDF
2020-06-18
Robust Unsupervised Learning of Temporal Dynamic Interactions
Aritra Guha, Rayleigh Lei, Jiacheng Zhu, XuanLong Nguyen, Ding Zhao
arXiv_RO
arXiv_RO
Unsupervised
OCR
Represenation_Learning
Action
PDF
2020-06-16
Improving accuracy and speeding up Document Image Classification through parallel systems
Javier Ferrando, Juan Luis Dominguez, Jordi Torres, Raul Garcia, David Garcia, Daniel Garrido, Jordi Cortada, Mateo Valero
arXiv_CV
arXiv_CV
Transfer_Learning
OCR
Bert
Pose
Classification
Deep_Learning
CNN
Image_Classification
Prediction
PDF
2020-06-13
Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural Networks
Lili Wang, Ruibo Liu, Soroush Vosoughi
arXiv_CV
arXiv_CV
Transfer_Learning
Recognition
OCR
Salient
Prediction
PDF
2020-06-11
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
Youngmin Baek, Daehyun Nam, Sungrae Park, Junyeop Lee, Seung Shin, Jeonghun Baek, Chae Young Lee, Hwalsuk Lee
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Quantitative
Detection
Matching
PDF
2020-06-10
DivNoising: Diversity Denoising with Fully Convolutional Variational Autoencoders
Mangal Prakash, Alexander Krull, Florian Jug
arXiv_CV
arXiv_CV
Segmentation
Unsupervised
Recognition
OCR
Restoration
Optical_Character
Pose
Deep_Learning
Denoising
CNN
Prediction
PDF
2020-06-09
Predicting and Analyzing Law-Making in Kenya
Oyinlola Babafemi, Adewale Akinfaderin
arXiv_CL
arXiv_CL
OCR
Attention
PDF
2020-06-09
Tamil Vowel Recognition With Augmented MNIST-like Data Set
Muthiah Annamalai
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Classification
Deep_Learning
PDF
2020-06-09
Contestable Black-Boxes
Andrea Aler Tubella, Andreas Theodorou, Virginia Dignum, Loizos Michael
arXiv_AI
arXiv_AI
OCR
Pose
Attention
PDF
2020-06-01
Structured Multimodal Attentions for TextVQA
Chenyu Gao, Qi Zhu, Peng Wang, Hui Li, Yuliang Liu, Anton van den Hengel, Qi Wu
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Relation
VQA
Attention
QA
PDF
2020-05-30
Attention-Guided Discriminative Region Localization for Bone Age Assessment
Chao Chen, Zhihong Chen1, Xinyu Jin, Lanjuan Li, William Speier, Corey W. Arnold2
arXiv_CV
arXiv_CV
OCR
Pose
Classification
Deep_Learning
Attention
PDF
2020-05-27
Towards the Infeasibility of Membership Inference on Deep Models
Shahbaz Rezaei, Xin Liu
arXiv_CV
arXiv_CV
OCR
Pose
Inference
PDF
2020-05-19
Multi-modal Sensor Fusion-Based Deep Neural Network for End-to-end Autonomous Driving with Scene Understanding
Zhiyu Huang, Chen Lv, Yang Xing, Jingda Wu
arXiv_RO
arXiv_RO
Segmentation
Semantic_Segmentation
OCR
Pose
Deep_Learning
Autonomous
PDF
2020-05-14
Plague Dot Text: Text mining and annotation of outbreak reports of the Third Plague Pandemic
Arlene Casey, Mike Bennett, Richard Tobin, Claire Grover, Iona Walker, Lukas Engelmann, Beatrice Alex
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Relation
PDF
2020-05-14
NAT: Noise-Aware Training for Robust Neural Sequence Labeling
Marcin Namysl, Sven Behnke, Joachim Köhler
arXiv_CL
arXiv_CL
Recognition
OCR
Pose
PDF
2020-05-14
Large Scale Font Independent Urdu Text Recognition System
Atique Ur Rehman, Sibt Ul Hussain
arXiv_CV
arXiv_CV
Recognition
OCR
Classification
Attention
CNN
QA
PDF
2020-05-13
Reasoning with Latent Structure Refinement for Document-Level Relation Extraction
Guoshun Nan, Zhijiang Guo, Ivan Sekulić, Wei Lu
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Inference
PDF
2020-05-10
Deep Learning Based Vehicle Tracking System Using License Plate Detection And Recognition
Lalit Lakshmanan, Yash Vora, Raj Ghate
arXiv_CV
arXiv_CV
Tracking
Recognition
OCR
Optical_Character
Pose
Scene_Text
Deep_Learning
Detection
Object_Detection
PDF
2020-05-10
A Hybrid Swarm and Gravitation based feature selection algorithm for Handwritten Indic Script Classification problem
Ritam Guha, Manosij Ghosh, Pawan Kumar Singh, Ram Sarkar, Mita Nasipuri
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Classification
PDF
2020-05-08
Text-Based Ideal Points
Keyon Vafa, Suresh Naidu, David M. Blei
arXiv_CL
arXiv_CL
Unsupervised
OCR
Speech
Pose
PDF
2020-05-08
Development of a New Image-to-text Conversion System for Pashto, Farsi and Traditional Chinese
Marek Rychlik, Dwight Nwaigwe, Yan Han, Dylan Murphy
arXiv_AI
arXiv_AI
Image_Caption
OCR
Deep_Learning
PDF
2020-05-07
A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition
Steven I Reeves, Dongwook Lee, Anurag Singh, Kunal Verma
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
PDF
2020-05-04
Understanding Scanned Receipts
Eric Melz
arXiv_CL
arXiv_CL
OCR
Knowledge
Detection
GAN
PDF
2020-05-04
The Newspaper Navigator Dataset: Extracting And Analyzing Visual Content from 16 Million Historic Newspaper Pages in Chronicling America
Benjamin Charles Germain Lee, Jaime Mears, Eileen Jakeway, Meghan Ferriter, Chris Adams, Nathan Yarasavage, Deborah Thomas, Kate Zwaard, Daniel S. Weld
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Deep_Learning
Caption
PDF
2020-04-29
Detecting Deep-Fake Videos from Appearance and Behavior
Shruti Agarwal (1), Tarek El-Gaaly (2), Hany Farid (1), Ser-Nam Lim (2) ((1) Univeristy of California, Berkeley, Berkeley, CA, USA, (2) Facebook Research, New York, NY, USA)
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Face
PDF
2020-04-29
Deepfake Video Forensics based on Transfer Learning
Rahul U, Ragul M, Raja Vignesh K, Tejeswinee K
arXiv_CV
arXiv_CV
Transfer_Learning
OCR
Pose
Face
Classification
Detection
Image_Classification
PDF
2020-04-29
MatrriVasha: Bangla Handwritten Compound Character Dataset and Recognition
Jannatul Ferdous, Suvrajit Karmaker, Akm Shahariar Azad Rabby, Syed Akhter Hossain
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Pose
Face
Deep_Learning
PDF
2020-04-26
PTPARL-D: Annotated Corpus of 44 years of Portuguese Parliament debates
Paulo Almeida, Manuel Marques-Pita, Joana Gonçalves-Sá
arXiv_CL
arXiv_CL
OCR
PDF
2020-04-24
Deep Global Registration
Christopher Choy, Wei Dong, Vladlen Koltun
arXiv_CV
arXiv_CV
OCR
3D
Pose_Estimation
Pose
CNN
Prediction
PDF
2020-04-23
Characterising User Content on a Multi-lingual Social Network
Pushkal Agarwal, Kiran Garimella, Sagar Joglekar, Nishanth Sastry, Gareth Tyson
arXiv_CL
arXiv_CL
OCR
Activity
PDF
2020-04-23
A Tool for Facilitating OCR Postediting in Historical Documents
Alberto Poncelas, Mohammad Aboomar, Jan Buts, James Hadley, Andy Way
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
Face
Language_Model
PDF
2020-04-23
Human-Machine Collaboration for Democratizing Data Science
Clément Gautrais, Yann Dauxais, Stefano Teso, Samuel Kolb, Gust Verbruggen, Luc De Raedt
arXiv_AI
arXiv_AI
OCR
Sketch
PDF
2020-04-17
Object Detection and Recognition of Swap-Bodies using Camera mounted on a Vehicle
Ebin Zacharias, Didier Stricker, Martin Teuchler, Kripasindhu Sarkar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Face
Deep_Learning
Detection
Object_Detection
Attention
Autonomous
PDF
2020-04-17
Image Processing Based Scene-Text Detection and Recognition with Tesseract
Ebin Zacharias, Martin Teuchler, Bénédicte Bernier
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Detection
PDF
2020-04-16
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks
Wenwen Yu, Ning Lu, Xianbiao Qi, Ping Gong, Rong Xiao
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
Deep_Learning
Detection
CNN
PDF
2020-04-15
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers
Bernhard Liebl, Manuel Burghardt
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
Relation
Attention
PDF
2020-04-14
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
Matt Deitke, Winson Han, Alvaro Herrasti, Aniruddha Kembhavi, Eric Kolve, Roozbeh Mottaghi, Jordi Salvador, Dustin Schwenk, Eli VanderBilt, Matthew Wallingford, Luca Weihs, Mark Yatskar, Ali Farhadi
arXiv_CV
arXiv_CV
Recognition
OCR
PDF
2020-04-10
Identifying Cultural Differences through Multi-Lingual Wikipedia
Yufei Tian, Tuhin Chakrabarty, Fred Morstatter, Nanyun Peng
arXiv_AI
arXiv_AI
OCR
Relation
PDF
2020-04-07
Operationalizing the legal concept of 'Incitement to Hatred' as an NLP task
Frederike Zufall, Huangpan Zhang, Katharina Kloppenborg, Torsten Zesch
arXiv_CL
arXiv_CL
OCR
Speech
Detection
PDF
2020-04-07
Inspector Gadget: A Data Programming-based Labeling System for Industrial Images
Geon Heo, Yuji Roh, Seonghyeon Hwang, Dayun Lee, Steven Euijong Whang
arXiv_CV
arXiv_CV
OCR
Knowledge
Pose
Face
Classification
CNN
Image_Classification
PDF
2020-04-04
Segmentation for Classification of Screening Pancreatic Neuroendocrine Tumors
Zhuotun Zhu, Yongyi Lu, Wei Shen, Elliot K. Fishman, Alan L. Yuille
arXiv_CV
arXiv_CV
Segmentation
OCR
Knowledge
Quantitative
Classification
Detection
PDF
2020-04-03
Sparse Concept Coded Tetrolet Transform for Unconstrained Odia Character Recognition
Kalyan S Dash, N B Puhan, G Panda
arXiv_CV
arXiv_CV
Image_Caption
Handwriting
Recognition
OCR
Sparse
Pose
PDF
2020-04-01
Towards democratizing music production with AI-Design of Variational Autoencoder-based Rhythm Generator as a DAW plugin
Nao Tokui
arXiv_SD
arXiv_SD
OCR
Pose
Deep_Learning
PDF
2020-03-28
HIN: Hierarchical Inference Network for Document-Level Relation Extraction
Hengzhu Tang, Yanan Cao, Zhenyu Zhang, Jiangxia Cao, Fang Fang, Shi Wang, Pengfei Yin
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Action
Relation
Relation_Extraction
Inference
PDF
2020-03-26
Real-time information retrieval from Identity cards
Niloofar Tavakolian, Azadeh Nazemi, Donal Fitzpatrick
arXiv_CV
arXiv_CV
Face_Detection
Recognition
OCR
Optical_Character
Pose
Scene_Text
Face
Deep_Learning
Detection
Object_Detection
GAN
PDF
2020-03-23
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation
Sharon Fogel (1), Hadar Averbuch-Elor (2), Sarel Cohen, Shai Mazor (1), Roee Litman (1) ((1) Amazon Rekognition Israel, (2) Cornell University)
arXiv_CV
arXiv_CV
Semi_Supervised
Recognition
OCR
Optical_Character
Deep_Learning
GAN
Text_Generation
PDF
2020-03-23
Deep Soft Procrustes for Markerless Volumetric Sensor Alignment
Vladimiros Sterzentsenko, Alexandros Doumanoglou, Spyridon Thermos, Nikolaos Zioulis, Dimitrios Zarpalas, Petros Daras
arXiv_CV
arXiv_CV
Segmentation
OCR
Pose_Estimation
Knowledge
Pose
Classification
PDF
2020-03-18
Constraints in Developing a Complete Bengali Optical Character Recognition System
Abu Saleh Md. Abir, Sanjana Rahman, Samia Ellin, Maisha Farzana, Md Hridoy Manik, Chowdhury Rafeed Rahman
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Review
PDF
2020-03-18
Distributed and Democratized Learning: Philosophy and Research Challenges
Minh N. H. Nguyen, Shashi Raj Pandey, Kyi Thar, Nguyen H. Tran, Mingzhe Chen, Walid Saad, Choong Seon Hong
arXiv_AI
arXiv_AI
OCR
Pose
GAN
PDF
2020-03-15
Multistage Curvilinear Coordinate Transform Based Document Image Dewarping using a Novel Quality Estimator
Tanmoy Dasgupta, Nibaran Das, Mita Nasipuri
arXiv_CV
arXiv_CV
OCR
PDF
2020-03-10
Efficient Intent Detection with Dual Sentence Encoders
Iñigo Casanueva, Tadas Temčinas, Daniela Gerz, Matthew Henderson, Ivan Vulić
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Detection
Few-Shot
Object_Detection
PDF
2020-03-08
Investigating the Decoders of Maximum Likelihood Sequence Models: A Look-ahead Approach
Yu-Siang Wang, Yen-Ling Kuo, Boris Katz
arXiv_CL
arXiv_CL
OCR
Pose
PDF
2020-03-04
Monte Carlo Tree Search for Generating Interactive Data Analysis Interfaces
Yiru Chen, Eugene Wu
arXiv_AI
arXiv_AI
OCR
Pose
Face
PDF
2020-02-21
Curating Social Media Data
Kushal Vaghani
arXiv_CV
arXiv_CV
OCR
Knowledge
Pose
Face
Action
GAN
PDF
2020-02-08
Attacking Optical Character Recognition Systems with Adversarial Watermarks
Lu Chen, Wei Xu
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Adversarial
Pose
Detection
PDF
2020-01-07
Leveraging Prior Knowledge for Protein-Protein Interaction Extraction with Memory Network
Huiwei Zhou, Zhuang Liu, Shixian Ning, Yunlong Yang, Chengkun Lang, Yingyu Lin, Kun Ma
arXiv_CV
arXiv_CV
Embedding
OCR
Memory_Networks
Knowledge
Pose
Action
Relation
Medical
PDF
2019-12-15
Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts
Abhishek Prusty, Sowmya Aitha, Abhishek Trivedi, Ravi Kiran Sarvadevabhatla
arXiv_CV
arXiv_CV
Segmentation
OCR
Pose
CNN
PDF
2019-12-11
Improving Neural Protein-Protein Interaction Extraction with Knowledge Selection
Huiwei Zhou, Xuefei Li, Weihong Yao, Zhuang Liu, Shixian Ning, Chengkun Lang, Lei Du
arXiv_CL
arXiv_CL
Transformer
Embedding
OCR
Knowledge
Pose
Action
Relation
Attention
PDF
2019-12-10
A Feasible Framework for Arbitrary-Shaped Scene Text Recognition
Jinjin Zhang
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Pose
Scene_Text
Deep_Learning
Detection
Attention
Language_Model
PDF
2019-11-26
Procrustes registration of two-dimensional statistical shape models without correspondences
Alma Eguizabal, Peter Schreier, Juergen Schmidt
arXiv_CV
arXiv_CV
OCR
Pose
Contour
PDF
2019-11-26
AuthorGAN: Improving GAN Reproducibility using a Modular GAN Framework
Raunak Sinha, Anush Sankaran, Mayank Vatsa, Richa Singh
arXiv_CV
arXiv_CV
OCR
Adversarial
Pose
Survey
GAN
PDF
2019-11-25
Explaining Neural Networks via Perturbing Important Learned Features
Ashkan Khakzar, Soroosh Baselizadeh, Saurabh Khanduja, Seong Tae Kim, Nassir Navab
arXiv_CV
arXiv_CV
OCR
Pose
Quantitative
PDF
2019-11-25
Women, politics and Twitter: Using machine learning to change the discourse
Lana Cuthbertson, Alex Kearney, Riley Dawson, Ashia Zawaduk, Eve Cuthbertson, Ann Gordon-Tighe, Kory W Mathewson
arXiv_CL
arXiv_CL
OCR
Bert
Quantitative
Classification
PDF
2019-11-25
Cascaded Detail-Preserving Networks for Super-Resolution of Document Images
Zhichao Fu, Yu Kong, Yingbin Zheng, Hao Ye, Wenxin Hu, Jing Yang, Liang He
arXiv_CV
arXiv_CV
Recognition
Super_Resolution
OCR
Pose
PDF
2019-11-20
Hard Choices in Artificial Intelligence: Addressing Normative Uncertainty through Sociotechnical Commitments
Roel Dobbe, Thomas Krendl Gilbert, Yonatan Mintz
arXiv_AI
arXiv_AI
Surveillance
OCR
Pose
PDF
2019-11-15
Handwritten and Machine printed OCR for Geez Numbers Using Artificial Neural Network
Eyob Gebretinsae Beyene
arXiv_CV
arXiv_CV
Recognition
OCR
Classification
PDF
2019-11-15
Experiments in Detecting Persuasion Techniques in the News
Seunghak Yu, Giovanni Da San Martino, Preslav Nakov
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Attention
GAN
PDF
2019-11-14
Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction
Kushagra Mahajan, Monika Sharma, Lovekesh Vig
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
PDF
2019-11-13
Vehicle Re-identification: exploring feature fusion using multi-stream convolutional networks
Icaro O. de Oliveira, Rayson Laroca, David Menotti, Keiko V. O. Fonseca, Rodrigo Minetto
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Re-identification
CNN
PDF
2019-11-11
Recognition of Images of Korean Characters Using Embedded Networks
Sergey A. Ilyuhin, Alexander V. Sheshkus, Vladimir L. Arlazarov
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Attention
PDF
2019-11-08
A Human-in-the-loop Framework to Construct Context-dependent Mathematical Formulations of Fairness
Mohammad Yaghini, Hoda Heidari, Andreas Krause
arXiv_AI
arXiv_AI
OCR
Knowledge
PDF
2019-11-05
Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory
Duc Nguyen, Nhan Tran, Hung Le
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Scene_Text
CNN
PDF
2019-10-23
Accurate 6D Object Pose Estimation by Pose Conditioned Mesh Reconstruction
Pedro Castro, Anil Armagan, Tae-Kyun Kim
arXiv_CV
arXiv_CV
Reconstruction
OCR
3D
Pose_Estimation
Pose
CNN
PDF
2019-10-22
How can AI Automate End-to-End Data Science?
Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray
arXiv_AI
arXiv_AI
OCR
Review
Survey
PDF
2019-10-17
How to do lexical quality estimation of a large OCRed historical Finnish newspaper collection with scarce resources
Kimmo Kettunen
arXiv_CL
arXiv_CL
OCR
PDF
2019-10-15
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images
W. Ronny Huang, Yike Qi, Qianqian Li, Jonathan Degange
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Weakly_Supervised
Optical_Character
Pose
PDF
2019-10-12
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction
Mika Hämäläinen, Simon Hengchen
arXiv_CL
arXiv_CL
Embedding
Unsupervised
Recognition
OCR
Optical_Character
NMT
PDF
2019-10-11
Rosetta: Large scale system for text detection and recognition in images
Fedor Borisyuk, Albert Gordo, Viswanath Sivakumar
arXiv_CV
arXiv_CV