OCR
2023-01-31
The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments
Nailia Mirzakhmedova, Johannes Kiesel, Milad Alshomary, Maximilian Heinrich, Nicolas Handke, Xiaoni Cai, Barriere Valentin, Doratossadat Dastgheib, Omid Ghahroodi, Mohammad Ali Sadraei, Ehsaneddin Asgari, Lea Kawaletz, Henning Wachsmuth, Benno Stein
arXiv_CL
arXiv_CL
OCR
Bert
Classification
Detection
PDF
2023-01-31
Automated Sentiment and Hate Speech Analysis of Facebook Data by Employing Multilingual Transformer Models
Ritumbra Manuvie, Saikat Chatterjee
arXiv_CL
arXiv_CL
Transformer
OCR
Speech
Face
Sentiment
GAN
Language_Model
PDF
2023-01-30
Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for Education
George Boateng, Victor Kumbol, Elsie Effah Kaufmann
arXiv_AI
arXiv_AI
OCR
Pose
QA
PDF
2023-01-29
Vicarious Offense and Noise Audit of Offensive Speech Classifiers
Tharindu Cyril Weerasooriya, Sujan Dutta, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh
arXiv_CL
arXiv_CL
OCR
Speech
PDF
2023-01-23
PRIMEQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development
Avirup Sil, Jaydeep Sen, Bhavani Iyer, Martin Franz, Kshitij Fadnis, Mihaela Bornea, Sara Rosenthal, Scott McCarley, Rong Zhang, Vishwajeet Kumar, Yulong Li, Md Arafat Sultan, Riyaz Bhat, Radu Florian, Salim Roukos
arXiv_CL
arXiv_CL
OCR
Language_Model
QA
PDF
2023-01-23
Noisy Parallel Data Alignment
Ruoyu Xie, Antonios Anastasopoulos
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
PDF
2023-01-15
Collective Privacy Recovery: Data-sharing Coordination via Decentralized Artificial Intelligence
Evangelos Pournaras, Mark Christopher Ballandies, Stefano Bennati, Chien-fei Chen
arXiv_AI
arXiv_AI
OCR
Inference
PDF
2023-01-13
Reworking geometric morphometrics into a methodology of transformation grids
Fred L. Bookstein
arXiv_CV
arXiv_CV
OCR
Pose
Quantitative
GAN
Prediction
PDF
2023-01-13
On the feasibility of attacking Thai LPR systems with adversarial examples
Chissanupong Jiamsuchon, Jakapan Suaboot, Norrathep Rattanavipanon
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Adversarial
Pose
Prediction
PDF
2023-01-12
Rock Guitar Tablature Generation via Natural Language Processing
Josue Casco-Rodriguez
arXiv_SD
arXiv_SD
OCR
Knowledge
Pose
Deep_Learning
PDF
2023-01-12
Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle
Alex Kogan
arXiv_AI
arXiv_AI
Recognition
OCR
Bert
Optical_Character
Pose
Inference
PDF
2023-01-10
Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries
Matyáš Boháček, Marek Hrúz
arXiv_CV
arXiv_CV
Transfer_Learning
Recognition
OCR
Pose
Few-Shot
PDF
2023-01-08
Fully Dynamic Online Selection through Online Contention Resolution Schemes
Vashist Avadhanula, Andrea Celli, Riccardo Colini-Baldeschi, Stefano Leonardi, Matteo Russo
arXiv_AI
arXiv_AI
OCR
Optimization
Adversarial
Action
Matching
PDF
2023-01-08
The State of Human-centered NLP Technology for Fact-checking
Anubrata Das, Houjiang Liu, Venelin Kovatchev, Matthew Lease
arXiv_AI
arXiv_AI
OCR
Review
Pose
PDF
2023-01-08
Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules
Ritesh Chandra, Sadhana Tiwari, Sonali Agarwal, Navjot Singh
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Ontology
Action
Medical
PDF
2023-01-06
Measuring a Priori Voting Power -- Taking Delegations Seriously
Rachael Colley, Théo Delemazure, Hugo Gilbert
arXiv_AI
arXiv_AI
OCR
PDF
2023-01-06
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling
Yilin Wen, Biao Luo, Yuqian Zhao
arXiv_AI
arXiv_AI
Recognition
Reinforcement_Learning
OCR
Optimization
Sparse
Optical_Character
Knowledge
Knowledge_Graph
Pose
Inference
Prediction
PDF
2023-01-05
The political ideology of conversational AI: Converging evidence on ChatGPT's pro-environmental, left-libertarian orientation
Jochen Hartmann, Jasper Schwenzow, Maximilian Witte
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Attention
PDF
2022-12-27
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
Gürkan Soykan, Deniz Yuret, Tevfik Metin Sezgin
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Detection
Relation
Inference
PDF
2022-12-23
Bengali Handwritten Digit Recognition using CNN with Explainable AI
Md Tanvir Rouf Shawon, Raihan Tanvir, Md. Golam Rabiul Alam
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
2022-12-21
Continual Contrastive Finetuning Improves Low-Resource Relation Extraction
Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen, Hoifung Poon
arXiv_CL
arXiv_CL
Embedding
OCR
Represenation_Learning
Knowledge
Self-Supervised
Pose
Contrastive_Learning
Action
Classification
Relation
Relation_Extraction
PDF
2022-12-20
Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
Artidoro Pagnoni, Alexander R. Fabbri, Wojciech Kryściński, Chien-Sheng Wu
arXiv_CL
arXiv_CL
Unsupervised
OCR
Summarization
QA
PDF
2022-12-20
Document-level Relation Extraction with Relation Correlations
Ridong Han, Tao Peng, Benyou Wang, Lu Liu, Xiang Wan
arXiv_CL
arXiv_CL
Embedding
OCR
Knowledge
Pose
Face
Action
Relation
Relation_Extraction
Prediction
PDF
2022-12-19
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding
Haoli Bai, Zhiguang Liu, Xiaojun Meng, Wentao Li, Shuang Liu, Nian Xie, Rongfu Zheng, Liangwei Wang, Lu Hou, Jiansheng Wei, Xin Jiang, Qun Liu
arXiv_CV
arXiv_CV
Transformer
Unsupervised
OCR
Knowledge
Pose
Contrastive_Learning
Action
Matching
PDF
2022-12-19
Transferring General Multimodal Pretrained Models to Text Recognition
Junyang Lin, Xuancheng Ren, Yichang Zhang, Gao Liu, Peng Wang, An Yang, Chang Zhou
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
Pose
Caption
PDF
2022-12-17
Towards Robust Handwritten Text Recognition with On-the-fly User Participation
Ajoy Mondal, Rohit Saluja, C. V. Jawahar
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Prediction
PDF
2022-12-16
LEDCNet: A Lightweight and Efficient Semantic Segmentation Algorithm Using Dual Context Module for Extracting Ground Objects from UAV Aerial Remote Sensing Images
Xiaoxiang Han, Yiman Liu, Gang Liu, Qiaohong Liu
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
Survey
Action
Drone
PDF
2022-12-16
Geometric Rectification of Creased Document Images based on Isometric Mapping
Dong Luo, Pengbo Bo
arXiv_CV
arXiv_CV
Recognition
OCR
3D
Optical_Character
Knowledge
Pose
PDF
2022-12-16
On Safe and Usable Chatbots for Promoting Voter Participation
Bharath Muppasani, Vishal Pallagani, Kausik Lakkaraju, Shuge Lei, Biplav Srivastava, Brett Robertson, Andrea Hickerson, Vignesh Narayanan
arXiv_CL
arXiv_CL
OCR
Action
QA
PDF
2022-12-16
SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering
Siwen Luo, Feiqi Cao, Felipe Nunez, Zean Wen, Josiah Poon, Caren Han
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Scene_Text
Action
Relation
VQA
Attention
QA
PDF
2022-12-11
A systematic literature review on Robotic Process Automation security
Nishith Gajjar, Keyur Rathod, Khushali Jani
arXiv_RO
arXiv_RO
OCR
Review
Face
PDF
2022-12-11
Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images
Hongkuan Zhang, Edward Whittaker, Ikuo Kitagishi
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
PDF
2022-12-09
LADIS: Language Disentanglement for 3D Shape Editing
Ian Huang, Panos Achlioptas, Tianyi Zhang, Sergey Tulyakov, Minhyuk Sung, Leonidas Guibas
arXiv_CV
arXiv_CV
OCR
3D
Pose
Face
Action
PDF
2022-12-09
PACMAN: a framework for pulse oximeter digit detection and reading in a low-resource setting
Chiraphat Boonnag, Wanumaidah Saengmolee, Narongrid Seesawad, Amrest Chinkamol, Saendee Rattanasomrerk, Kanyakorn Veerakanjana, Kamonwan Thanontip, Warissara Limpornchitwilai, Piyalitt Ittichaiwong, Theerawit Wilaiprasitporn
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
Detection
Object_Detection
PDF
2022-12-08
OCR-RTPS: An OCR-based real-time positioning system for the valet parking
Zizhang Wu, Xinyuan Chen, Jizheng Wang, Xiaoquan Wang, Yuanzhu Gan, Muqing Fang, Tianhao Xu
arXiv_CV
arXiv_CV
OCR
Pose
Detection
Autonomous
PDF
2022-12-06
Beyond Object Recognition: A New Benchmark towards Object Concept Learning
Yong-Lu Li, Yue Xu, Xinyu Xu, Xiaohan Mao, Yuan Yao, Siqi Liu, Cewu Lu
arXiv_AI
arXiv_AI
Recognition
OCR
Knowledge
Pose
Deep_Learning
Relation
PDF
2022-12-05
Quantized Wasserstein Procrustes Alignment of Word Embedding Spaces
Prince O Aboagye, Yan Zheng, Michael Yeh, Junpeng Wang, Zhongfang Zhuang, Huiyuan Chen, Liang Wang, Wei Zhang, Jeff Phillips
arXiv_CL
arXiv_CL
Embedding
Unsupervised
Quantization
OCR
Pose
PDF
2022-12-04
Democratizing Machine Translation with OPUS-MT
Jörg Tiedemann, Mikko Aulamo, Daria Bakshandaeva, Michele Boggia, Stig-Arne Grönroos, Tommi Nieminen, Alessandro Raganato, Yves Scherrer, Raul Vazquez, Sami Virpioja
arXiv_CL
arXiv_CL
OCR
PDF
2022-12-02
PGFed: Personalize Each Client's Global Objective for Federated Learning
Jun Luo, Matias Mendieta, Chen Chen, Shandong Wu
arXiv_CV
arXiv_CV
OCR
Regularization
Knowledge
Pose
PDF
2022-11-29
Democratizing Machine Learning for Interdisciplinary Scholars: Report on Organizing the NLP+CSS Online Tutorial Series
Ian Stewart, Katherine Keith
arXiv_CL
arXiv_CL
OCR
Knowledge
Survey
GAN
PDF
2022-11-27
Deep Active Learning for Computer Vision: Past and Future
Rinyoichi Takezoe, Xu Liu, Shunan Mao, Marco Tianyu Chen, Zhanpeng Feng, Shiliang Zhang, Xiaoyu Wang
arXiv_CV
arXiv_CV
OCR
Review
Pose
Attention
PDF
2022-11-25
Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images
Shufan Li, Congxi Lu, Linkai Li, Haoshuai Zhou
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
Detection
Object_Detection
PDF
2022-11-23
Look, Read and Ask: Learning to Ask Questions by Reading Text in Images
Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra
arXiv_CV
arXiv_CV
OCR
Pose
Scene_Text
VQA
Text_Generation
PDF
2022-11-23
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
Kumar Shridhar, Jakub Macina, Mennatallah El-Assady, Tanmay Sinha, Manu Kapur, Mrinmaya Sachan
arXiv_CL
arXiv_CL
Reinforcement_Learning
OCR
Pose
Language_Model
PDF
2022-11-22
Expansive Participatory AI: Supporting Dreaming within Inequitable Institutions
Michael Alan Chang, Shiran Dudy
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-11-22
Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation
Zesen Cheng, Pengchong Qiao, Kehan Li, Siheng Li, Pengxu Wei, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Weakly_Supervised
Relation
Prediction
PDF
2022-11-21
Visual Dexterity: In-hand Dexterous Manipulation from Depth
Tao Chen, Megha Tippur, Siyang Wu, Vikash Kumar, Edward Adelson, Pulkit Agrawal
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
PDF
2022-11-21
Data-Driven Offline Decision-Making via Invariant Representation Learning
Han Qi, Yi Su, Aviral Kumar, Sergey Levine
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Optimization
Represenation_Learning
Action
Prediction
PDF
2022-11-18
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Guangxuan Xiao, Ji Lin, Mickael Seznec, Julien Demouth, Song Han
arXiv_AI
arXiv_AI
Transformer
Quantization
OCR
Pose
Inference
Language_Model
PDF
2022-11-18
Pandering in a Flexible Representative Democracy
Xiaolin Sun, Jacob Masur, Ben Abramowitz, Nicholas Mattei, Zizhan Zheng
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Action
PDF
2022-11-17
Text-Aware Dual Routing Network for Visual Question Answering
Luoqian Jiang, Yifan He, Jian Chen
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
2022-11-16
ChartParser: Automatic Chart Parsing for Print-Impaired
Anukriti Kumar, Tanuja Ganu, Saikat Guha
arXiv_CV
arXiv_CV
OCR
Pose
Quantitative
Deep_Learning
PDF
2022-11-15
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Jorg Bornschein, Alexandre Galashov, Ross Hemsley, Amal Rannen-Triki, Yutian Chen, Arslan Chaudhry, Xu Owen He, Arthur Douillard, Massimo Caccia, Qixuang Feng, Jiajun Shen, Sylvestre-Alvise Rebuffi, Kitty Stacpoole, Diego de las Casas, Will Hawkins, Angeliki Lazaridou, Yee Whye Teh, Andrei A. Rusu, Razvan Pascanu, Marc'Aurelio Ranzato
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Classification
PDF
2022-11-15
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari, Nikhil Singh, Amrith Krishna, Ganesh Ramakrishnan
arXiv_CL
arXiv_CL
OCR
Language_Model
Prediction
PDF
2022-11-15
DeepParliament: A Legal domain Benchmark & Dataset for Parliament Bills Prediction
Ankit Pal
arXiv_CL
arXiv_CL
OCR
RNN
Review
Pose
Classification
Prediction
PDF
2022-11-14
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Wenhao Li, Xiaoyuan Yi, Jinyi Hu, Maosong Sun, Xing Xie
arXiv_CL
arXiv_CL
Transformer
OCR
Sparse
Regularization
Attention
Text_Generation
PDF
2022-11-11
Synthetic Expertise
Ron Fulbright, Grover Walters
arXiv_AI
arXiv_AI
OCR
Knowledge
Review
PDF
2022-11-10
Not Just Plain Text! Fuel Document-Level Relation Extraction with Explicit Syntax Refinement and Subsentence Modeling
Zhichao Duan, Xiuxing Li, Zhenyu Li, Zhuo Wang, Jianyong Wang
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-11-09
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel van Strien, David Ifeoluwa Adelani, et al. (342 additional authors not shown)
arXiv_CL
arXiv_CL
Transformer
OCR
GAN
Language_Model
PDF
2022-10-29
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh S, Anchit Gupta, C. V. Jawahar, Makarand Tapaswi
arXiv_CV
arXiv_CV
Segmentation
Unsupervised
OCR
Self-Supervised
Matching
PDF
2022-10-29
Leveraging Orbital Information and Atomic Feature in Deep Learning Model
Xiangrui Yang
arXiv_AI
arXiv_AI
Embedding
OCR
Represenation_Learning
Pose
Deep_Learning
Prediction
PDF
2022-10-28
DORE: Document Ordered Relation Extraction based on Generative Framework
Qipeng Guo, Yuqing Yang, Hang Yan, Xipeng Qiu, Zheng Zhang
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Language_Model
PDF
2022-10-26
A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail
Zhibo Zhang, Ernesto Damiani, Hussam Al Hamadi, Chan Yeob Yeun, Fatma Taher
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
Pose
Detection
CNN
PDF
2022-10-21
AutoPrognosis 2.0: Democratizing Diagnostic and Prognostic Modeling in Healthcare with Automated Machine Learning
Fergus Imrie, Bogdan Cebere, Eoin F. McKinney, Mihaela van der Schaar
arXiv_AI
arXiv_AI
OCR
Action
Medical
PDF
2022-10-19
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez, David Vazquez, Issam Laradji, Marco Pedersoli, Pau Rodriguez
arXiv_CV
arXiv_CV
Reconstruction
OCR
Quantitative
GAN
PDF
2022-10-18
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Wei Liang, Siyuan Huang
arXiv_AI
arXiv_AI
OCR
3D
Pose
Action
PDF
2022-10-17
Review of the state of the art in autonomous artificial intelligence
Petar Radanliev, David De Roure
arXiv_AI
arXiv_AI
OCR
Review
Autonomous
PDF
2022-10-16
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, Patrick Schramowski, Srivatsa Kundurthy, Katherine Crowson, Ludwig Schmidt, Robert Kaczmarczyk, Jenia Jitsev
arXiv_AI
arXiv_AI
OCR
Zero-Shot
Face
Classification
Detection
PDF
2022-10-15
MenuAI: Restaurant Food Recommendation System via a Transformer-based Deep Learning Model
Xinwei Ju, Frank Po Wen Lo, Jianing Qiu, Peilun Shi, Jiachuan Peng, Benny Lo
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Optical_Character
Knowledge
Pose
Deep_Learning
Recommendation
PDF
2022-10-14
Text Detection Forgot About Document OCR
Krzysztof Olejniczak, Milan Šulc
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
PDF
2022-10-14
Frame Mining: a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds
Minghua Liu, Xuanlin Li, Zhan Ling, Yangyan Li, Hao Su
arXiv_CV
arXiv_CV
Point_Cloud
OCR
3D
Pose
Action
PDF
2022-10-13
Task Grouping for Multilingual Text Recognition
Jing Huang, Kevin J Liang, Rama Kovvuri, Tal Hassner
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
PDF
2022-10-13
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
Evan Crothers, Nathalie Japkowicz, Herna Viktor
arXiv_CL
arXiv_CL
OCR
Review
Pose
Survey
Detection
PDF
2022-10-13
Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors' Reasoning with Deep Reinforcement Learning
Arsene Fansi Tchango, Rishab Goel, Julien Martel, Zhi Wen, Gaetan Marceau Caron, Joumana Ghosn
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Action
Attention
Medical
Prediction
Recommendation
PDF
2022-10-13
On the Explainability of Natural Language Processing Deep Models
Julia El Zini, Mariette Awad
arXiv_CL
arXiv_CL
Embedding
OCR
Pose
Survey
PDF
2022-10-12
On-Premise Artificial Intelligence as a Service for Small and Medium Size Setups
Carolina Fortuna, Din Mušić, Gregor Cerar, Andrej Čampa, Panagiotis Kapsalis, Mihael Mohorčič
arXiv_AI
arXiv_AI
OCR
PDF
2022-10-11
Detecting Propagators of Disinformation on Twitter Using Quantitative Discursive Analysis
Mark M. Bailey
arXiv_CL
arXiv_CL
OCR
Quantitative
Classification
Detection
Attention
Prediction
PDF
2022-10-11
PP-StructureV2: A Stronger Document Analysis System
Chenxia Li, Ruoyu Guo, Jun Zhou, Mengtao An, Yuning Du, Lingfeng Zhu, Yi Liu, Xiaoguang Hu, Dianhai Yu
arXiv_CV
arXiv_CV
Recognition
OCR
Restoration
Knowledge
Pose
Action
Detection
Relation
Object_Detection
Relation_Extraction
Inference
PDF
2022-10-08
ConstGCN: Constrained Transmission-based Graph Convolutional Networks for Document-level Relation Extraction
Ji Qi, Bin Xu, Kaisheng Zeng, Jinxin Liu, Jifan Yu, Qi Gao, Juanzi Li, Lei Hou
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Face
Action
Relation
Relation_Extraction
CNN
Inference
PDF
2022-10-07
Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections
Roberto Arroyo, Javier Yebes, Elena Martínez, Héctor Corrales, Javier Lorenzo
arXiv_CV
arXiv_CV
Enhancement
Recognition
OCR
Optical_Character
Action
Deep_Learning
Detection
Prediction
PDF
2022-10-07
Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq, Naveed Akhtar, Ganna Pogrebna
arXiv_AI
arXiv_AI
OCR
Review
Adversarial
Pose
Survey
Inference
PDF
2022-10-07
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee, Mandar Joshi, Iulia Turc, Hexiang Hu, Fangyu Liu, Julian Eisenschlos, Urvashi Khandelwal, Peter Shaw, Ming-Wei Chang, Kristina Toutanova
arXiv_CV
arXiv_CV
Image_Caption
OCR
Face
Caption
Language_Model
PDF
2022-10-03
EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
Yashowardhan Shinde, Kishore Kulkarni
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Denoising
CNN
PDF
2022-10-02
PCONet: A Convolutional Neural Network Architecture to Detect Polycystic Ovary Syndrome from Ovarian Ultrasound Images
A.K.M. Salman Hosain, Md Humaion Kabir Mehedi, Irteza Enan Kabir
arXiv_CV
arXiv_CV
Transfer_Learning
OCR
Quantitative
CNN
PDF
2022-09-29
AICCA: AI-driven Cloud Classification Atlas
Takuya Kurihana, Elisabeth Moyer, Ian Foster
arXiv_CV
arXiv_CV
Unsupervised
OCR
Classification
GAN
CNN
PDF
2022-09-29
Chandojnanam: A Sanskrit Meter Identification and Utilization System
Hrishikesh Terdalkar, Arnab Bhattacharya
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Face
Matching
PDF
2022-09-28
Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition
Andreas Spruck, Maximiliane Gruber, Anatol Maier, Denise Moussa, Jürgen Seiler, Christian Riess, André Kaup
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
PDF
2022-09-27
3D Rendering Framework for Data Augmentation in Optical Character Recognition
Andreas Spruck, Maximiliane Hawesch, Anatol Maier, Christian Riess, Jürgen Seiler, André Kaup
arXiv_CV
arXiv_CV
Recognition
OCR
3D
Optical_Character
Pose
PDF
2022-09-27
A critical appraisal of equity in conversational AI: Evidence from auditing GPT-3's dialogues with different publics on climate change and Black Lives Matter
Kaiping Chen, Anqi Shao, Jirayu Burapacheep, Yixuan Li
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Deep_Learning
Autonomous
Language_Model
PDF
2022-09-27
Efficient Non-Parametric Optimizer Search for Diverse Tasks
Ruochen Wang, Yuanhao Xiong, Minhao Cheng, Cho-Jui Hsieh
arXiv_AI
arXiv_AI
OCR
Pose
Detection
PDF
2022-09-27
Sentiment is all you need to win US Presidential elections
Sovesh Mohapatra, Somesh Mohapatra
arXiv_CL
arXiv_CL
OCR
Speech
Sentiment
PDF
2022-09-24
Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations
Letian Chen, Sravan Jayanthi, Rohan Paleja, Daniel Martin, Viacheslav Zakharov, Matthew Gombolay
arXiv_RO
arXiv_RO
Reinforcement_Learning
OCR
Knowledge
Pose
Inference
PDF
2022-09-21
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen
arXiv_CV
arXiv_CV
Image_Caption
Transformer
OCR
3D
Knowledge
Pose
Scene_Text
Face
Relation
VQA
Attention
Caption
Prediction
QA
PDF
2022-09-20
Setting the rhythm scene: deep learning-based drum loop generation from arbitrary language cues
Ignacio J. Tripodi
arXiv_CL
arXiv_CL
OCR
Action
Deep_Learning
PDF
2022-09-19
SOCRATES: A Stereo Camera Trap for Monitoring of Biodiversity
Timm Haucke, Hjalmar Kühl, Volker Steinhage
arXiv_CV
arXiv_CV
OCR
3D
Detection
PDF
2022-09-18
HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions
Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Christopher Ré, Matei Zaharia, James Zou
arXiv_AI
arXiv_AI
Recognition
OCR
Speech
Detection
Sentiment
Object_Detection
Speech_Recognition
Prediction
PDF
2022-09-15
Application of Liquid Rank Reputation System for Content Recommendation
Abhishek Saxena (Novosibirsk State University), Anton Kolonin (Novosibirsk State University)
arXiv_AI
arXiv_AI
OCR
Pose
Recommendation
PDF
2022-09-14
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Prediction
PDF
2022-09-13
Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks
Bulla Rajesh, Manav Kamlesh Agrawal, Milan Bhuva, Kisalaya Kishore, Mohammed Javed
arXiv_AI
arXiv_AI
Enhancement
OCR
Adversarial
Pose
GAN
CNN
PDF
2022-09-13
Computer vision based vehicle tracking as a complementary and scalable approach to RFID tagging
Pranav Kant Gaur, Abhilash Bhardwaj, Pritam Shete, Mohini Laghate, Dinesh M Sarode
arXiv_CV
arXiv_CV
Tracking
Enhancement
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
GAN
Prediction
PDF
2022-09-13
OCR for TIFF Compressed Document Images Directly in Compressed Domain Using Text segmentation and Hidden Markov Model
Dikshit Sharma, Mohammed Javed
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
Pose
PDF
2022-09-12
PreSTU: Pre-Training for Scene-Text Understanding
Jihyung Kil, Soravit Changpinyo, Xi Chen, Hexiang Hu, Sebastian Goodman, Wei-Lun Chao, Radu Soricut
arXiv_CV
arXiv_CV
Transformer
OCR
Pose
VQA
QA
PDF
2022-09-12
Lexical Simplification Benchmarks for English, Portuguese, and Spanish
Sanja Stajner, Daniel Ferres, Matthew Shardlow, Kai North, Marcos Zampieri, Horacio Saggion
arXiv_CL
arXiv_CL
OCR
Pose
Attention
PDF
2022-09-08
Levenshtein OCR
Cheng Da, Peng Wang, Cong Yao
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Pose
Scene_Text
Quantitative
Prediction
PDF
2022-09-06
A Masked Bounding-Box Selection Based ResNet Predictor for Text Rotation Prediction
Michael Yang, Yuan Lin, ChiuMan Ho
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Deep_Learning
CNN
Prediction
PDF
2022-08-26
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Kaushal Santosh Bhogale, Abhigyan Raman, Tahir Javed, Sumanth Doddapaneni, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra
arXiv_CL
arXiv_CL
Transfer_Learning
Recognition
OCR
Speech
Self-Supervised
Speech_Recognition
PDF
2022-08-26
AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications
Yusen Zhang, Zhongli Li, Qingyu Zhou, Ziyi Liu, Chao Li, Mina Ma, Yunbo Cao, Hongzhi Liu
arXiv_CV
arXiv_CV
Handwriting
OCR
Review
Pose
Inference
PDF
2022-08-24
Visual Subtitle Feature Enhanced Video Outline Generation
Qi Lv, Ziqiang Cao, Wenrui Xie, Derui Wang, Jingwen Wang, Zhiyong Hu, Tangkun Zhang, Yuan Ba, Yuanhang Li, Min Cao, Wenjie Li, Sujian Li, Guohong Fu
arXiv_CL
arXiv_CL
Segmentation
Video_Caption
OCR
Pose
Attention
Summarization
PDF
2022-08-23
Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents
Andrea Gemelli, Emanuele Vivoli, Simone Marinai
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Pose
Action
PDF
2022-08-21
Performance, Opaqueness, Consequences, and Assumptions: Simple questions for responsible planning of machine learning solutions
Przemyslaw Biecek
arXiv_AI
arXiv_AI
OCR
Pose
Attention
PDF
2022-08-20
An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Aly Mostafa, Omar Mohamed, Ali Ashraf, Ahmed Elbehery, Salma Jamal, Anas Salah, Amr S. Ghoneim
arXiv_CL
arXiv_CL
Transformer
Handwriting
Enhancement
Recognition
OCR
Optimization
Optical_Character
Pose
Image_Enhancement
Action
PDF
2022-08-20
Few-Shot Learning of Accurate Folding Landscape for Protein Structure Prediction
Jun Zhang, Sirui Liu, Mengyun Chen, Haotian Chu, Min Wang, Zidong Wang, Jialiang Yu, Ningxi Ni, Fan Yu, Diqing Chen, Yi Isaac Yang, Boxin Xue, Lijiang Yang, Yuan Liu, Yi Qin Gao
arXiv_AI
arXiv_AI
OCR
Few-Shot
Denoising
Prediction
PDF
2022-08-19
To show or not to show: Redacting sensitive text from videos of electronic displays
Abhishek Mukhopadhyay, Shubham Agarwal, Patrick Dylan Zwick, Pradipta Biswas
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
PDF
2022-08-18
Transcending XAI Algorithm Boundaries through End-User-Inspired Design
Weina Jin, Jianyu Fan, Diane Gromala, Philippe Pasquier, Xiaoxiao Li, Ghassan Hamarneh
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Autonomous
PDF
2022-08-12
Character decomposition to resolve class imbalance problem in Hangul OCR
Geonuk Kim, Jaemin Son, Kanghyu Lee, Jaesik Min
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
2022-08-08
Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features
Hien Thi Ha, Aleš Horák
arXiv_CL
arXiv_CL
OCR
Knowledge
Action
PDF
2022-08-07
Vernacular Search Query Translation with Unsupervised Domain Adaptation
Mandar Kulkarni, Nikesh Garera
arXiv_CL
arXiv_CL
Unsupervised
OCR
Pose
PDF
2022-08-05
GLASS: Global to Local Attention for Scene-Text Spotting
Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha
arXiv_CV
arXiv_CV
Recognition
OCR
Face
Detection
Attention
PDF
2022-08-02
Joint Learning-based Causal Relation Extraction from Biomedical Literature
Dongling Li, Pengchao Wu, Yuehu Dong, Jinghang Gu, Longhua Qian, Guodong Zhou
arXiv_CL
arXiv_CL
OCR
Pose
Action
Detection
Relation
Relation_Extraction
Medical
PDF
2022-07-25
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning
Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai
arXiv_CV
arXiv_CV
Recognition
Reinforcement_Learning
OCR
Pose
Scene_Text
Detection
PDF
2022-07-23
Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild
Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding
arXiv_CV
arXiv_CV
Segmentation
OCR
Knowledge
Pose
PDF
2022-07-20
Generalizable and Robust Deep Learning Algorithm for Atrial Fibrillation Diagnosis Across Ethnicities, Ages and Sexes
Shany Biton, Mohsin Aldhafeeri, Erez Marcusohn, Kenta Tsutsui, Tom Szwagier, Adi Elias, Julien Oster, Jean Marc Sellal, Mahmoud Suleiman, Joachim A. Behar
arXiv_AI
arXiv_AI
OCR
Knowledge
Deep_Learning
Detection
PDF
2022-07-19
OpenFilter: A Framework to Democratize Research Access to Social Media AR Filters
Piera Riccio, Bill Psomas, Francesco Galati, Francisco Escolano, Thomas Hofmann, Nuria Oliver
arXiv_AI
arXiv_AI
OCR
Face
Quantitative
Relation
PDF
2022-07-19
MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views
Haitian Zeng, Xin Yu, Jiaxu Miao, Yi Yang
arXiv_CV
arXiv_CV
Reconstruction
Unsupervised
OCR
Pose
PDF
2022-07-18
Symmetrized Robust Procrustes: Constant-Factor Approximation and Exact Recovery
Tal Amir, Shahar Kovalsky, Nadav Dym
arXiv_CV
arXiv_CV
OCR
Pose
PDF
2022-07-15
Printable Flexible Robots for Remote Learning
Savita V. Kendre, Gus T. Teran, Lauryn Whiteside, Tyler Looney, Ryley Wheelock, Surya Ghai, Markus P. Nemitz
arXiv_RO
arXiv_RO
OCR
3D
Pose
PDF
2022-07-14
DEXTER: An end-to-end system to extract table contents from electronic medical health documents
Nandhinee PR, Harinath Krishnamoorthy, Anil Goyal, Sudarsun Santhiappan
arXiv_CV
arXiv_CV
Transfer_Learning
OCR
Pose
Action
Classification
Detection
Medical
PDF
2022-07-14
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding
Liang Qiao, Hui Jiang, Ying Chen, Can Li, Pengfei Li, Zaisheng Li, Baorui Zou, Dashan Guo, Yingda Xu, Yunlu Xu, Zhanzhan Cheng, Yi Niu
arXiv_CV
arXiv_CV
OCR
Attention
PDF
2022-07-11
GMN: Generative Multi-modal Network for Practical Document Information Extraction
Haoyu Cao, Jiefeng Ma, Antai Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Action
Attention
PDF
2022-07-10
Facilitated machine learning for image-based fruit quality assessment in developing countries
Manuel Knott, Fernando Perez-Cruz, Thijs Defraeye
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
Pose
Classification
CNN
Image_Classification
PDF
2022-07-08
Detection of Furigana Text in Images
Nikolaj Kjøller Bjerregaard, Veronika Cheplygina, Stefan Heinrich
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Action
Detection
Object_Detection
GAN
PDF
2022-07-04
Positive-Negative Equal Contrastive Loss for Semantic Segmentation
Jing Wang, Linfei Xuan, Wenxuan Wang, Tianxiang Zhang, Jiangyun Li
arXiv_CV
arXiv_CV
Transformer
Segmentation
Embedding
Unsupervised
Semantic_Segmentation
OCR
Pose
Contrastive_Learning
PDF
2022-07-04
Distilling Ensemble of Explanations for Weakly-Supervised Pre-Training of Image Segmentation Models
Xuhong Li, Haoyi Xiong, Yi Liu, Dingfu Zhou, Zeyu Chen, Yaqing Wang, Dejing Dou
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
OCR
Pose
Classification
Image_Classification
PDF
2022-07-04
BusiNet -- a Light and Fast Text Detection Network for Business Documents
Oshri Naparstek, Ophir Azulai, Daniel Rotman, Yevgeny Burshtein, Peter Staar, Udi Barzelay
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Adversarial
Detection
PDF
2022-07-02
DeltaZ: An Accessible Compliant Delta Robot Manipulator for Research and Education
Sarvesh Patil, Samuel C. Alvares, Pragna Mannam, Oliver Kroemer, F. Zeynep Temel
arXiv_RO
arXiv_RO
Reinforcement_Learning
OCR
3D
PDF
2022-06-30
Democratizing Ethical Assessment of Natural Language Generation Models
Amin Rasekh, Ian Eisenberg
arXiv_CL
arXiv_CL
OCR
Speech
PDF
2022-06-29
Procrustes Analysis with Deformations: A Closed-Form Solution by Eigenvalue Decomposition
Fang Bai, Adrien Bartoli
arXiv_RO
arXiv_RO
OCR
3D
Regularization
Pose
PDF
2022-06-29
Using Twitter Data to Understand Public Perceptions of Approved versus Off-label Use for COVID-19-related Medications
Yining Hua, Hang Jiang, Shixu Lin, Jie Yang, Joseph M. Plasek, David W. Bates, Li Zhou
arXiv_CL
arXiv_CL
OCR
Pose
Attention
PDF
2022-06-27
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition
Xu Yang, Daoyuan Wu, Xiao Yi, Jimmy H. M. Lee, Tan Lee
arXiv_CV
arXiv_CV
Face_Detection
Recognition
OCR
Optimization
Optical_Character
Pose
Face
Detection
Face_Recognition
PDF
2022-06-27
Differentially Private Condorcet Voting
Zhechen Li, Ao Liu, Lirong Xia, Yongzhi Cao, Hanpin Wang
arXiv_AI
arXiv_AI
OCR
Pose
Relation
PDF
2022-06-26
FAIR-BFL: Flexible and Incentive Redesign for Blockchain-based Federated Learning
Rongxin Xu, Shiva Raj Pokhrel, Qiujun Lan, Gang Li
arXiv_AI
arXiv_AI
OCR
PDF
2022-06-24
Dissecting U-net for Seismic Application: An In-Depth Study on Deep Learning Multiple Removal
Ricard Durall, Ammar Ghanim, Norman Ettrich, Janis Keuper
arXiv_CV
arXiv_CV
OCR
Knowledge
Deep_Learning
PDF
2022-06-21
An Automatic and Efficient BERT Pruning for Edge AI Systems
Shaoyi Huang, Ning Liu, Yueying Liang, Hongwu Peng, Hongjia Li, Dongkuan Xu, Mimi Xie, Caiwen Ding
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Pose
Deep_Learning
Relation
Inference
PDF
2022-06-21
Towards Optimizing OCR for Accessibility
Peya Mowar, Tanuja Ganu, Saikat Guha
arXiv_CV
arXiv_CV
OCR
Speech
PDF
2022-06-21
Broken News: Making Newspapers Accessible to Print-Impaired
Vishal Agarwal, Tanuja Ganu, Saikat Guha
arXiv_CV
arXiv_CV
Segmentation
OCR
Pose
Detection
PDF
2022-06-18
Camera Adaptation for Fundus-Image-Based CVD Risk Estimation
Zhihong Lin, Danli Shi, Donghao Zhang, Xianwen Shang, Mingguang He, Zongyuan Ge
arXiv_CV
arXiv_CV
Transformer
OCR
Knowledge
Pose
Deep_Learning
Attention
PDF
2022-06-14
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu, Chao Wang, Wenqiang Lei, Ziyang Liu, Tat Seng Chua
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
Face
Action
Detection
Object_Detection
Attention
Prediction
PDF
2022-06-12
An Unsupervised Deep-Learning Method for Bone Age Assessment
Hao Zhu, Wan-Jing Nie, Yue-Jie Hou, Qi-Meng Du, Si-Jing Li, Chi-Chun Zhou
arXiv_CV
arXiv_CV
Unsupervised
OCR
Knowledge
Pose
Classification
CNN
PDF
2022-06-11
An Evaluation of OCR on Egocentric Data
Valentin Popescu, Dima Damen, Toby Perrett
arXiv_CV
arXiv_CV
OCR
PDF
2022-06-10
Human-AI Interaction Design in Machine Teaching
Karan Taneja, Harshvardhan Sikka, Ashok Goel
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Face
Action
PDF
2022-06-09
Transformer based Urdu Handwritten Text Optical Character Reader
Mohammad Daniyal Shaiq, Musa Dildar Ahmed Cheema, Ali Kamal
arXiv_AI
arXiv_AI
Transformer
Handwriting
OCR
Optical_Character
Pose
Action
PDF
2022-06-07
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Self-Supervised
Pose
Detection
Object_Detection
Attention
Inference
PDF
2022-06-06
Contrastive Graph Multimodal Model for Text Classification in Videos
Ye Liu, Changchong Lu, Chen Lin, Di Yin, Bo Ren
arXiv_CV
arXiv_CV
Video_Indexing
Recognition
OCR
Text_Classification
Knowledge
Contrastive_Learning
Action
Classification
Relation
PDF
2022-06-05
Two Decades of Bengali Handwritten Digit Recognition: A Survey
A.B.M. Ashikur Rahman, Md. Bakhtiar Hasan, Sabbir Ahmed, Tasnim Ahmed, Md. Hamjajul Ashmafee, Mohammad Ridwan Kabir, Md. Hasanul Kabir
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Review
Survey
Deep_Learning
PDF
2022-06-04
A Superimposed Divide-and-Conquer Image Recognition Method for SEM Images of Nanoparticles on The Surface of Monocrystalline silicon with High Aggregation Degree
Ruiling Xiao, Jiayang Niu
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Face
Contour
PDF
2022-06-03
Beyond Tabula Rasa: Reincarnating Reinforcement Learning
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc G. Bellemare
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Knowledge
Pose
PDF
2022-06-01
Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness
Christoph Auer (1), Michele Dolfi (1), André Carvalho (2), Cesar Berrospi Ramis (1), Peter W. J. Staar (1) ((1) IBM Research, (2) SoftINSA Lda.)
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Knowledge
Face
PDF
2022-06-01
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Self-Supervised
Pose
Language_Model
PDF
2022-05-31
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Stefan Schweter, Luisa März, Katharina Schmid, Erion Çano
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
GAN
Language_Model
PDF
2022-05-30
Easter2.0: Improving convolutional models for handwritten text recognition
Kartik Chaudhary, Raghav Bali
arXiv_AI
arXiv_AI
Transformer
Handwriting
Recognition
OCR
RNN
Pose
Classification
Few-Shot
CNN
PDF
2022-05-27
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
Video_Caption
OCR
Optical_Character
Scene_Text
Classification
Detection
Object_Detection
Caption
Image_Classification
Language_Model
PDF
2022-05-25
Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction
Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-05-25
DisinfoMeme: A Multimodal Dataset for Detecting Meme Intentionally Spreading Out Disinformation
Jingnong Qu, Liunian Harold Li, Jieyu Zhao, Sunipa Dev, Kai-Wei Chang
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Action
GAN
PDF
2022-05-25
Skin Cancer Diagnostics with an All-Inclusive Smartphone Application
Upender Kalwa, Christopher Legner, Taejoon Kong, Santosh Pandey
arXiv_CV
arXiv_CV
Segmentation
OCR
Classification
Detection
Medical
PDF
2022-05-23
Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Tuan Dinh, Jy-yong Sohn, Shashank Rajput, Timothy Ossowski, Yifei Ming, Junjie Hu, Dimitris Papailiopoulos, Kangwook Lee
arXiv_CL
arXiv_CL
Embedding
Unsupervised
OCR
Pose
PDF
2022-05-23
LILA-BOTI: Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition
Md. Ismail Hossain, Mohammed Rakib, Sabbir Mollah, Fuad Rahman, Nabeel Mohammed
arXiv_AI
arXiv_AI
Handwriting
Recognition
OCR
RNN
Optical_Character
Knowledge
CNN
PDF
2022-05-21
Improving Long Tailed Document-Level Relation Extraction via Easy Relation Augmentation and Contrastive Learning
Yangkai Du, Tengfei Ma, Lingfei Wu, Yiming Wu, Xuhong Zhang, Bo Long, Shouling Ji
arXiv_AI
arXiv_AI
OCR
Pose
Contrastive_Learning
Action
Relation
Relation_Extraction
PDF
2022-05-17
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong, Mingyuan Zhang, Liang Pan, Zhongang Cai, Lei Yang, Ziwei Liu
arXiv_CV
arXiv_CV
Transformer
OCR
3D
Zero-Shot
Knowledge
Pose
Quantitative
Language_Model
PDF
2022-05-17
Detection Masking for Improved OCR on Noisy Documents
Daniel Rotman, Ophir Azulai, Inbar Shapira, Yevgeny Burshtein, Udi Barzelay
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Detection
PDF
2022-05-13
An empirical study of CTC based models for OCR of Indian languages
Minesh Mathew, CV Jawahar
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Classification
Prediction
PDF
2022-05-13
The Case for a Legal Compliance API for the Enforcement of the EU's Digital Services Act on Social Media Platforms
Catalina Goanta, Thales Bertaglia, Adriana Iamnitchi
arXiv_AI
arXiv_AI
OCR
Pose
Face
PDF
2022-05-12
AiSocrates: Towards Answering Ethical Quandary Questions
Yejin Bang, Nayeon Lee, Tiezheng Yu, Leila Khalatbari, Yan Xu, Dan Su, Elham J. Barezi, Andrea Madotto, Hayden Kee, Pascale Fung
arXiv_AI
arXiv_AI
OCR
Pose
Few-Shot
Language_Model
PDF
2022-05-11
Pre-trained Language Models as Re-Annotators
Chang Shu
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Contrastive_Learning
Action
Detection
Relation
Relation_Extraction
Language_Model
PDF
2022-05-09
A Novel Augmented Reality Ultrasound Framework Using an RGB-D Camera and a 3D-printed Marker
Yitian Zhou, Gaétan Lelu, Boris Labbé, Guillaume Pasquier, Pierre Le Gargasson, Albert Murienne, Laurent Launay
arXiv_CV
arXiv_CV
Tracking
Point_Cloud
OCR
3D
Pose
Medical
PDF
2022-05-06
Rethinking Fairness: An Interdisciplinary Survey of Critiques of Hegemonic ML Fairness Approaches
Lindsay Weinberg
arXiv_AI
arXiv_AI
OCR
Survey
Action
Classification
GAN
PDF
2022-05-06
Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language
Martin Malmsten, Chris Haffenden, Love Börjeson
arXiv_CL
arXiv_CL
Recognition
OCR
Speech
Speech_Recognition
Language_Model
PDF
2022-05-05
RoboCraft: Learning to See, Simulate, and Shape Elasto-Plastic Objects with Graph Networks
Haochen Shi, Huazhe Xu, Zhiao Huang, Yunzhu Li, Jiajun Wu
arXiv_AI
arXiv_AI
OCR
Pose
Action
PDF
2022-05-05
Text Detection on Technical Drawings for the Digitization of Brown-field Processes
Tobias Schlagenhauf, Markus Netzer, Jan Hillinger
arXiv_CV
arXiv_CV
Recognition
OCR
Knowledge
Detection
Object_Detection
Autonomous
PDF
2022-05-05
OCR Synthetic Benchmark Dataset for Indic Languages
Naresh Saini, Promodh Pinto, Aravinth Bheemaraj, Deepak Kumar, Dhiraj Daga, Saurabh Yadav, Srihari Nagaraj
arXiv_CV
arXiv_CV
OCR
PDF
2022-05-05
Relational Representation Learning in Visually-Rich Documents
Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Representation_Learning
Knowledge
Pose
Contrastive_Learning
Action
Detection
Relation
PDF
2022-05-04
Reproducibility Beyond the Research Community: Experience from NLP Beginners
Shane Storks, Keunwoo Peter Yu, Joyce Chai
arXiv_CL
arXiv_CL
OCR
Attention
PDF
2022-05-04
Few-Shot Document-Level Relation Extraction
Nicholas Popovic, Michael Färber
arXiv_AI
arXiv_AI
OCR
Pose
Action
Relation
Few-Shot
Relation_Extraction
PDF
2022-05-04
Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction
Liyan Xu, Jinho D. Choi
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-05-02
Music Interpretation Analysis. A Multimodal Approach To Score-Informed Resynthesis of Piano Recordings
Federico Simonetta
arXiv_SD
arXiv_SD
OCR
Pose
Action
PDF
2022-04-27
The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild
Spyridon Baxevanakis, Giorgos Kordopatis-Zilos, Panagiotis Galopoulos, Lazaros Apostolidis, Killian Levacher, Ipek B. Schlicht, Denis Teyssou, Ioannis Kompatsiaris, Symeon Papadopoulos
arXiv_CV
arXiv_CV
OCR
Adversarial
Pose
Deep_Learning
Detection
PDF
2022-04-27
Document-Level Relation Extraction with Sentences Importance Estimation and Focusing
Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-04-26
Approach to Predicting News -- A Precise Multi-LSTM Network With BERT
Chia-Lin Chen (1), Pei-Yu Huang (2), Yi-Ting Huang (3), Chun Lin (3) ((1) Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, (2) Management and Digital Innovation, University of London, Singapore, (3) Institute of Information Science, Academia Sinica, Taipei, Taiwan)
arXiv_CL
arXiv_CL
Transformer
Embedding
OCR
Bert
RNN
PDF
2022-04-21
German Parliamentary Corpus
Giuseppe Abrami, Mevlüt Bagci, Leon Hammerla, Alexander Mehler
arXiv_CL
arXiv_CL
OCR
PDF
2022-04-21
A Masked Image Reconstruction Network for Document-level Relation Extraction
Liang Zhang, Yidong Cheng
arXiv_CL
arXiv_CL
Reconstruction
OCR
Pose
Action
Relation
Relation_Extraction
Inference
PDF
2022-04-20
Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations
Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj Doğan, Jingcheng Du, Li Fang, Wang Kai, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, Richard Dufour, Yanis Labrak, Niladri Chatterjee, Kushagri Tandon, Fréjus Laleye, Loïc Rakotoson, Emmanuele Chersoni, Jinghang Gu, Annemarie Friedrich, Subhash Chandra Pujari, Mariia Chizhikova, Naveen Sivadasan, Zhiyong Lu
arXiv_CL
arXiv_CL
Transformer
OCR
Review
Classification
GAN
Medical
PDF
2022-04-17
Does Recommend-Revise Produce Reliable Annotations? An Analysis on Missing Instances in DocRED
Quzhe Huang, Shibo Hao, Yuan Ye, Shengqi Zhu, Yansong Feng, Dongyan Zhao
arXiv_CL
arXiv_CL
OCR
Action
Relation
Relation_Extraction
Recommendation
PDF
2022-04-14
Multi-label topic classification for COVID-19 literature with Bioformer
Li Fang, Kai Wang
arXiv_CL
arXiv_CL
OCR
Bert
Classification
PDF
2022-04-06
Data-Centric Green AI: An Exploratory Empirical Study
Roberto Verdecchia, Luís Cruz, June Sallou, Michelle Lin, James Wickenden, Estelle Hotellier
arXiv_AI
arXiv_AI
OCR
PDF
2022-04-05
Region Rebalance for Long-Tailed Semantic Segmentation
Jiequan Cui, Yuhui Yuan, Zhisheng Zhong, Zhuotao Tian, Han Hu, Stephen Lin, Jiaya Jia
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
PDF
2022-04-03
A Part-of-Speech Tagger for Yiddish: First Steps in Tagging the Yiddish Book Center Corpus
Seth Kulick, Neville Ryant, Beatrice Santorini, Joel Wallenberg
arXiv_CL
arXiv_CL
Embedding
OCR
Knowledge
Speech
Pose
Relation
PDF
2022-04-03
A sequence-to-sequence approach for document-level relation extraction
John Giorgi, Gary D. Bader, Bo Wang
arXiv_CL
arXiv_CL
OCR
Action
Relation
Relation_Extraction
Medical
PDF
2022-04-01
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng, Adrian Wong, Stefan Welker, Krzysztof Choromanski, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
arXiv_AI
arXiv_AI
Image_Caption
OCR
Zero-Shot
Knowledge
Pose
Caption
Language_Model
PDF
2022-04-01
Unitail: Detecting, Reading, and Matching in Retail Scene
Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides
arXiv_CV
arXiv_CV
OCR
Detection
Object_Detection
Matching
PDF
2022-03-31
Digitizing Historical Balance Sheet Data: A Practitioner's Guide
Sergio Correia, Stephan Luck
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
PDF
2022-03-30
Automatic Facial Skin Feature Detection for Everyone
Qian Zheng, Ankur Purwar, Heng Zhao, Guang Liang Lim, Ling Li, Debasish Behera, Qian Wang, Min Tan, Rizhao Cai, Jennifer Werner, Dennis Sng, Maurice van Steensel, Weisi Lin, Alex C Kot
arXiv_CV
arXiv_CV
OCR
Detection
Recommendation
PDF
2022-03-26
A Densely Connected Criss-Cross Attention Network for Document-level Relation Extraction
Liang Zhang, Yidong Cheng
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Attention
PDF
2022-03-25
Quantifying Demonstration Quality for Robot Learning and Generalization
Maram Sakr, Zexi Jesse Li, H. F. Machiel Van der Loos, Dana Kulic, Elizabeth A. Croft
arXiv_RO
arXiv_RO
OCR
Pose
Relation
PDF
2022-03-25
Plagiarism Detection in the Bengali Language: A Text Similarity-Based Approach
Satyajit Ghosh, Aniruddha Ghosh, Bittaswer Ghosh, Abhishek Roy
arXiv_CL
arXiv_CL
OCR
Pose
Action
Detection
PDF
2022-03-25
Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
OCR
Contrastive_Learning
PDF
2022-03-24
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
Chengyang Fang, Gangyan Zeng, Yu Zhou, Daiqing Wu, Can Ma, Dayong Hu, Weiping Wang
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
VQA
Prediction
QA
PDF
2022-03-21
Transformer-based HTR for Historical Documents
Phillip Benjamin Ströbel, Simon Clematide, Martin Volk, Tobias Hodel
arXiv_CV
arXiv_CV
Transformer
Transfer_Learning
OCR
PDF
2022-03-21
Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation
Qingyu Tan, Ruidan He, Lidong Bing, Hwee Tou Ng
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Action
Relation
Relation_Extraction
Attention
PDF
2022-03-20
Who will share Fake-News on Twitter? Psycholinguistic cues in online post histories discriminate between actors in the misinformation ecosystem
Verena Schoenmueller, Simon J. Blanchard, Gita V. Johar
arXiv_CL
arXiv_CL
OCR
Emotion
Classification
Prediction
PDF
2022-03-20
Document Dewarping with Control Points
Guo-Wang Xie, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu
arXiv_CV
arXiv_CV
OCR
Sparse
Pose
Action
PDF
2022-03-15
Revitalize Region Feature for Democratizing Video-Language Pre-training
Guanyu Cai, Yixiao Ge, Alex Jinpeng Wang, Rui Yan, Xudong Lin, Ying Shan, Lianghua He, Xiaohu Qie, Jianping Wu, Mike Zheng Shou
arXiv_CV
arXiv_CV
Transformer
OCR
Sparse
Regularization
Relation
Video_Retrieval
PDF
2022-03-14
CAR: Class-aware Regularizations for Semantic Segmentation
Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Xiangjian He, Linchao Bao
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Representation_Learning
Regularization
Pose
Inference
Prediction
PDF
2022-03-14
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang
arXiv_CL
arXiv_CL
Transformer
Embedding
OCR
Pose
PDF
2022-03-11
Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision
Yufeng Cui, Lichen Zhao, Feng Liang, Yangguang Li, Jing Shao
arXiv_CV
arXiv_CV
Transformer
OCR
Pose
CNN
PDF
2022-03-11
Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection
Siyue Yu, Jimin Xiao, Bingfeng Zhang, Eng Gee Lim
arXiv_CV
arXiv_CV
Enhancement
OCR
Salient
Pose
Contrastive_Learning
Classification
Detection
Object_Detection
Attention
Prediction
PDF
2022-03-08
Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting
Chuhui Xue, Yu Hao, Shijian Lu, Philip Torr, Song Bai
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Weakly_Supervised
Optical_Character
Pose
Scene_Text
Action
Detection
PDF
2022-03-04
OCR quality affects perceived usefulness of historical newspaper clippings -- a user study
Kimmo Kettunen, Heikki Keskustalo, Sanna Kumpulainen, Tuula Pääkkönen, Juha Rautiainen
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Knowledge
Face
PDF
2022-03-03
A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions
Francois St-Hilaire, Dung Do Vu, Antoine Frau, Nathan Burns, Farid Faraji, Joseph Potochny, Stephane Robert, Arnaud Roussel, Selene Zheng, Taylor Glazier, Junfel Vincent Romano, Robert Belfer, Muhammad Shayan, Ariella Smofsky, Tommy Delarosbil, Seulmin Ahn, Simon Eden-Walker, Kritika Sony, Ansona Onyi Ching, Sabina Elkins, Anush Stepanyan, Adela Matajova, Victor Chen, Hossein Sahraei, Robert Larson, Nadia Markova, Andrew Barkett, Laurent Charlin, Yoshua Bengio, Iulian Vlad Serban, Ekaterina Kochmar
arXiv_AI
arXiv_AI
OCR
Action
PDF
2022-03-02
Foundations for Grassroots Democratic Metaverse
Nimrod Talmon, Ehud Shapiro
arXiv_AI
arXiv_AI
OCR
Face
Autonomous
PDF
2022-03-02
TableFormer: Table Structure Understanding with Transformers
Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar
arXiv_CV
arXiv_CV
Transformer
OCR
RNN
Knowledge
Knowledge_Graph
Action
Deep_Learning
Detection
Object_Detection
GAN
PDF
2022-03-02
Centralized Fairness for Redistricting
Seyed A. Esmaeili, Hayley Grape, Brian Brubach
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-03-01
Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection
Yufei Liang, Jiangning Zhang, Shiwei Zhao, Runze Wu, Yong Liu, Shuwen Pan
arXiv_CV
arXiv_CV
Reconstruction
Unsupervised
OCR
Restoration
Pose
Action
Classification
Detection
Relation
GAN
PDF
2022-02-27
OCR Improves Machine Translation for Low-Resource Languages
Oana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán
arXiv_CL
arXiv_CL
OCR
PDF
2022-02-25
OCR-IDL: OCR Annotations for Industry Document Library Dataset
Ali Furkan Biten, Rubèn Tito, Lluis Gomez, Ernest Valveny, Dimosthenis Karatzas
arXiv_CV
arXiv_CV
OCR
Pose
PDF
2022-02-25
Improving Amharic Handwritten Word Recognition Using Auxiliary Task
Mesay Samuel Gondere, Lars Schmidt-Thieme, Durga Prasad Sharma, Abiot Sinamo Boltena
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Classification
Deep_Learning
CNN
PDF
2022-02-24
Design and Characterization of 3D Printed, Open-Source Actuators for Legged Locomotion
Karthik Urs, Challen Enninful Adu, Elliott J. Rouse, Talia Y. Moore
arXiv_RO
arXiv_RO
OCR
3D
Optimization
PDF
2022-02-22
CorefDRE: Document-level Relation Extraction with coreference resolution
Zhongxuan Xue, Rongzhen Li, Qizhu Dai, Zhong Jiang
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Attention
Inference
PDF
2022-02-18
BLPnet: A new DNN model and Bengali OCR engine for Automatic License Plate Recognition
Md. Saif Hassan Onim, Hussain Nyeem, Koushik Roy, Mahmudul Hasan, Abtahi Ishmam, Md. Akiful Hoque Akif, Tareque Bashar Ovi
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Pose
Detection
Attention
PDF
2022-02-18
SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions
Ripon K. Saha, Akira Ura, Sonal Mahajan, Chenguang Zhu, Linyi Li, Yang Hu, Hiroaki Yoshida, Sarfraz Khurshid, Mukul R. Prasad
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-02-18
How to Manage Tiny Machine Learning at Scale: An Industrial Perspective
Haoyu Ren, Darko Anicic, Thomas Runkler
arXiv_AI
arXiv_AI
OCR
Knowledge
Knowledge_Graph
Pose
Ontology
PDF
2022-02-17
CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving
Yinuo Zhao, Kun Wu, Zhiyuan Xu, Zhengping Che, Qi Lu, Jian Tang, Chi Harold Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Optimization
Relation
Attention
Autonomous
PDF
2022-02-16
ADIMA: Abuse Detection In Multilingual Audio
Vikram Gupta, Rini Sharon, Ramit Sawhney, Debdoot Mukherjee
arXiv_CL
arXiv_CL
Recognition
OCR
Zero-Shot
Speech
Pose
Quantitative
Detection
Speech_Recognition
PDF
2022-02-15
Shifting Trends of COVID-19 Tweet Sentiment with Respect to Voting Preferences in the 2020 Election Year of the United States
Megan Doman, Jacob Motley, Hong Qin, Mengjun Xie, Li Yang
arXiv_CL
arXiv_CL
OCR
Relation
Sentiment
PDF
2022-02-13
Omnifont Persian OCR System Using Primitives
Azarakhsh Keipour, Mohammad Eshghi, Sina Mohammadzadeh Ghadikolaei, Negin Mohammadi, Shahab Ensafi
arXiv_AI
arXiv_AI
Recognition
OCR
PDF
2022-02-12
State of AI Ethics Report
Abhishek Gupta (1, 2, 3), Connor Wright (1, 4), Marianna Bergamaschi Ganapini (1, 5), Masa Sweidan (1), Renjie Butalid (1) ((1) Montreal AI Ethics Institute, (2) Microsoft, (3) Green Software Foundation, (4) University of Exeter, (5) Union College)
arXiv_AI
arXiv_AI
OCR
Salient
PDF
2022-02-08
Tube-Balloon Logic for the Exploration of Fluidic Control Elements
Jovanna A. Tracz, Lukas Wille, Dylan Pathiraja, Savita V. Kendre, Ron Pfisterer, Ethan Turett, Gus T. Teran, Christoffer K. Abrahamsson, Samuel E. Root, Won-Kyu Lee, Daniel J. Preston, Haihui Joy Jiang, George M. Whitesides, Markus P. Nemitz
arXiv_RO
arXiv_RO
OCR
Autonomous
PDF
2022-02-06
Human rights, democracy, and the rule of law assurance framework for AI systems: A proposal
David Leslie, Christopher Burr, Mhairi Aitken, Michael Katell, Morgan Briggs, Cami Rincon
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2022-02-03
DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts
Wenzhen Zhu, Negin Sokhandan, Guang Yang, Sujitha Martin, Suchitra Sathyanarayana
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
PDF
2022-02-02
DCSAU-Net: A Deeper and More Compact Split-Attention U-Net for Medical Image Segmentation
Qing Xu, Wenting Duan, Na He
arXiv_CV
arXiv_CV
Segmentation
OCR
Pose
Attention
Medical
PDF
2022-01-28
Detection of fake faces in videos
M. Shamanth, Russel Mathias, Dr Vijayalakshmi MN
arXiv_CV
arXiv_CV
OCR
Adversarial
Face
Deep_Learning
Detection
GAN
PDF
2022-01-27
Human-centered mechanism design with Democratic AI
Raphael Koster, Jan Balaguer, Andrea Tacchetti, Ari Weinstein, Tina Zhu, Oliver Hauser, Duncan Williams, Lucy Campbell-Gillingham, Phoebe Thacker, Matthew Botvinick, Christopher Summerfield
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
PDF
2022-01-26
Continuous Examination by Automatic Quiz Assessment Using Spiral Codes and Image Processing
Fernando Alonso-Fernandez, Josef Bigun
arXiv_CV
arXiv_CV
OCR
PDF
2022-01-26
An Assessment of the Impact of OCR Noise on Language Models
Konstantin Todorov, Giovanni Colavizza
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Optical_Character
Pose
Language_Model
PDF
2022-01-26
The Norwegian Parliamentary Speech Corpus
Per Erik Solberg, Pablo Ortiz
arXiv_SD
arXiv_SD
Recognition
OCR
Speech
Speech_Recognition
PDF
2022-01-25
A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit Recognition
Md. Ferdous Wahid, Md. Fahim Shahriar, Md. Shohanur Islam Sobuj
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Action
Classification
PDF
2022-01-21
Classroom Slide Narration System
Jobin K.V., Ajoy Mondal, C. V. Jawahar
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Recognition
OCR
Optical_Character
Pose
Face
Classification
PDF
2022-01-18
Improve Sentence Alignment by Divide-and-conquer
Wu Zhang
arXiv_CL
arXiv_CL
Embedding
OCR
PDF
2022-01-13
Document-level Relation Extraction with Context Guided Mention Integration and Inter-pair Reasoning
Chao Zhao, Daojian Zeng, Lu Xu, Jianhua Dai
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2022-01-06
DReyeVR: Democratizing driving simulation in virtual reality for behavioural & interaction research
Gustavo Silvera, Abhijat Biswas, Henny Admoni
arXiv_AI
arXiv_AI
Tracking
OCR
Face
Action
Autonomous
PDF
2022-01-02
On the Cross-dataset Generalization for License Plate Recognition
Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
2021-12-23
ELSA: Enhanced Local Self-Attention for Vision Transformer
Jingkai Zhou, Pichao Wang, Fan Wang, Qiong Liu, Hao Li, Rong Jin
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
Pose
Attention
PDF
2021-12-23
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
arXiv_CV
arXiv_CV
Transformer
OCR
Pose
Scene_Text
Detection
VQA
Object_Detection
QA
PDF
2021-12-15
Lesan -- Machine Translation for Low Resource Languages
Asmelash Teka Hadgu, Abel Aregawi, Adam Beaudoin
arXiv_CL
arXiv_CL
Transformer
OCR
PDF
2021-12-15
Tracing Text Provenance via Context-Aware Lexical Substitution
Xi Yang, Jie Zhang, Kejiang Chen, Weiming Zhang, Zehua Ma, Feng Wang, Nenghai Yu
arXiv_CL
arXiv_CL
Transformer
OCR
Bert
Pose
Language_Model
PDF
2021-12-09
BLPnet: A New DNN model for Automatic License Plate Detection with Bengali OCR
Md Saif Hassan Onim, Hussain Nyeem, Koushik Roy, Mahmudul Hasan, Abtahi Ishmam, Md. Akiful Hoque Akif, Tareque Bashar Ovi
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Detection
PDF
2021-12-06
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu, Chenguang Zhu, Shuohang Wang, Siqi Sun, Hao Cheng, Xiaodong Liu, Jianfeng Gao, Pengcheng He, Michael Zeng, Xuedong Huang
arXiv_AI
arXiv_AI
Transformer
OCR
Knowledge
Pose
Attention
Prediction
QA
PDF
2021-12-06
Requirements for Open Political Information: Transparency Beyond Open Data
Andong Luis Li Zhao, Andrew Paley, Rachel Adler, Harper Pack, Sergio Servantez, Alexander Einarsson, Cameron Barrie, Marko Sterbentz, Kristian Hammond
arXiv_AI
arXiv_AI
OCR
Sketch
Knowledge
PDF
2021-12-06
A Survey on Deep learning based Document Image Enhancement
Zahra Anvari, Vassilis Athitsos
arXiv_CV
arXiv_CV
Enhancement
Recognition
OCR
Restoration
Optical_Character
Review
Pose
Image_Enhancement
Survey
Action
Deep_Learning
Denoising
Attention
PDF
2021-12-03
Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution
Sahan Bulathwela, María Pérez-Ortiz, Catherine Holloway, John Shawe-Taylor
arXiv_AI
arXiv_AI
OCR
PDF
2021-12-03
ROCA: Robust CAD Model Retrieval and Alignment from a Single Image
Can Gümeli, Angela Dai, Matthias Nießner
arXiv_CV
arXiv_CV
OCR
3D
Optimization
PDF
2021-12-03
An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images
Zekun Li, Yao-Yi Chiang, Sasan Tavakkol, Basel Shbita, Johannes H. Uhl, Stefan Leyk, Craig A. Knoblock
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
PDF
2021-12-01
On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification
Rutika Moharir, Arun D Prabhu, Sukumar Moharana, Gopi Ramena, Rachit S Munjal
arXiv_CV
arXiv_CV
Image_Caption
OCR
RNN
Scene_Text
Classification
Attention
CNN
Inference
PDF
2021-11-30
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
Sahar Abdelnabi, Rakibul Hasan, Mario Fritz
arXiv_CV
arXiv_CV
Image_Caption
OCR
Pose
Caption
PDF
2021-11-30
Donut: Document Understanding Transformer without OCR
Geewook Kim, Teakgyu Hong, Moonbin Yim, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Optical_Character
Pose
Deep_Learning
PDF
2021-11-30
Automatic Extraction of Medication Names in Tweets as Named Entity Recognition
Carol Anderson, Bo Liu, Anas Abidin, Hoo-Chang Shin, Virginia Adams
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Action
Classification
Medical
Language_Model
Prediction
PDF
2021-11-30
Chemical Identification and Indexing in PubMed Articles via BERT and Text-to-Text Approaches
Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin
arXiv_CL
arXiv_CL
Embedding
Recognition
OCR
Bert
Language_Model
PDF
2021-11-30
Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models
Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin
arXiv_CL
arXiv_CL
OCR
Bert
Action
Classification
Relation
Relation_Extraction
PDF
2021-11-28
Image preprocessing and modified adaptive thresholding for improving OCR
Rohan Lal Kshetry
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2021-11-26
BCH-NLP at BioCreative VII Track 3: medications detection in tweets using transformer networks and multi-task learning
Dongfang Xu, Shan Chen, Timothy Miller
arXiv_CL
arXiv_CL
Transformer
OCR
Text_Classification
Action
Classification
Detection
PDF
2021-11-26
When Creators Meet the Metaverse: A Survey on Computational Arts
Lik-Hang Lee, Zijun Lin, Rui Hu, Zhengya Gong, Abhishek Kumar, Tangyao Li, Sijia Li, Pan Hui
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Survey
PDF
2021-11-25
Unravelling multi-agent ranked delegations
Rachael Colley, Umberto Grandi, Arianna Novaro
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-11-25
Does constituency analysis enhance domain-specific pre-trained BERT models for relation extraction?
Anfu Tang (LISN), Louise Deléger, Robert Bossy, Pierre Zweigenbaum (LISN), Claire Nédellec
arXiv_AI
arXiv_AI
OCR
Bert
Pose
Action
Relation
Relation_Extraction
Prediction
PDF
2021-11-22
Ice hockey player identification via transformers
Kanav Vats, William McNally, Pascale Walters, David A. Clausi, John S. Zelek
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Pose
Action
PDF
2021-11-20
Improving Tagging Consistency and Entity Coverage for Chemical Identification in Full-text Articles
Hyunjae Kim, Mujeen Sung, Wonjin Yoon, Sungjoon Park, Jaewoo Kang
arXiv_CL
arXiv_CL
Recognition
OCR
PDF
2021-11-17
Discriminative Dictionary Learning based on Statistical Methods
G. Madhuri, Atul Negi
arXiv_CV
arXiv_CV
Reconstruction
Inpainting
OCR
Sparse
Review
Classification
Denoising
PDF
2021-11-17
Using Sampling to Estimate and Improve Performance of Automated Scoring Systems with Guarantees
Yaman Kumar Singla, Sriram Krishna, Rajiv Ratn Shah, Changyou Chen
arXiv_CL
arXiv_CL
OCR
Speech
Pose
PDF
2021-11-16
An AI-based Learning Companion Promoting Lifelong Learning Opportunities for All
Maria Perez-Ortiz, Erik Novak, Sahan Bulathwela, John Shawe-Taylor
arXiv_AI
arXiv_AI
OCR
PDF
2021-11-15
DFC: Deep Feature Consistency for Robust Point Cloud Registration
Zhu Xu, Zhengyao Bai, Huijie Liu, Qianjie Lu, Shenglan Fan
arXiv_CV
arXiv_CV
Segmentation
Point_Cloud
OCR
3D
Pose
Classification
Deep_Learning
Matching
PDF
2021-11-12
DriverGym: Democratising Reinforcement Learning for Autonomous Driving
Parth Kothari, Christian Perone, Luca Bergamini, Alexandre Alahi, Peter Ondruska
arXiv_CV
arXiv_CV
Reinforcement_Learning
OCR
Pose
Autonomous
PDF
2021-11-12
Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models
Igor Kulev, Berkay Köprü, Raul Rodriguez-Esteban, Diego Saldana, Yi Huang, Alessandro La Torraca, Elif Ozkirimli
arXiv_CL
arXiv_CL
OCR
Pose
Action
Language_Model
PDF
2021-11-12
A comprehensive study of clustering a class of 2D shapes
Agnieszka Kaliszewska, Monika Syga
arXiv_CV
arXiv_CV
OCR
3D
Pose
Contour
PDF
2021-11-11
CU-UD: text-mining drug and chemical-protein interactions with ensembles of BERT-based models
Mehmet Efruz Karabulut, K. Vijay-Shanker, Yifan Peng
arXiv_AI
arXiv_AI
OCR
Bert
Action
Relation
Language_Model
PDF
2021-11-11
Indian Licence Plate Dataset in the wild
Sanchit Tanwar, Ayush Tiwari, Ritesh Chowdhry
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
Detection
Object_Detection
PDF
2021-11-11
Yaw-Guided Imitation Learning for Autonomous Driving in Urban Environments
Yandong Liu, Chengzhong Xu, Hui Kong
arXiv_RO
arXiv_RO
OCR
Pose
Relation
Attention
Autonomous
PDF
2021-11-10
BagBERT: BERT-based bagging-stacking for multi-topic classification
Loïc Rakotoson, Charles Letaillieur, Sylvain Massip, Fréjus Laleye
arXiv_CL
arXiv_CL
Embedding
OCR
Bert
Knowledge
Pose
Classification
PDF
2021-11-10
Handwritten Digit Recognition Using Improved Bounding Box Recognition Technique
Arkaprabha Basu, M. Sathya
arXiv_CV
arXiv_CV
Recognition
OCR
Optimization
Optical_Character
Prediction
PDF
2021-11-08
Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization
Lei Sun
arXiv_CV
arXiv_CV
Point_Cloud
OCR
3D
Pose
Matching
PDF
2021-11-07
A Word on Machine Ethics: A Response to Jiang et al.
Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-11-07
NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework
Xingcheng Yao, Yanan Zheng, Xiaocong Yang, Zhilin Yang
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Classification
Language_Model
PDF
2021-11-04
Whistleblower protection in the digital age -- why 'anonymous' is not enough. Towards an interdisciplinary view of ethical dilemmas
Bettina Berendt, Stefan Schiffner
arXiv_AI
arXiv_AI
OCR
Face
Relation
GAN
Activity
PDF
2021-11-04
Lexically Aware Semi-Supervised Learning for OCR Post-Correction
Shruti Rijhwani, Daisy Rosenblum, Antonios Anastasopoulos, Graham Neubig
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
Language_Model
PDF
2021-11-03
A PubMedBERT-based Classifier with Data Augmentation Strategy for Detecting Medication Mentions in Tweets
Qing Han, Shubo Tian, Jinfeng Zhang
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
GAN
PDF
2021-11-03
Curriculum Offline Imitation Learning
Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Action
PDF
2021-11-02
Graph Tree Deductive Networks
Seokjun Kim, Jaeeun Jang, Hyeoncheol Kim
arXiv_AI
arXiv_AI
OCR
Relation
PDF
2021-10-31
R-BERT-CNN: Drug-target interactions extraction from biomedical literature
Jehad Aldahdooh, Ziaurrehman Tanoli, Jing Tang
arXiv_AI
arXiv_AI
OCR
Bert
Knowledge
Action
Deep_Learning
Relation
Medical
CNN
Language_Model
PDF
2021-10-28
DocScanner: Robust Document Image Rectification with Progressive Learning
Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, Houqiang Li
arXiv_CV
arXiv_CV
OCR
3D
Regularization
Pose
Quantitative
Inference
PDF
2021-10-25
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
Pose
Attention
PDF
2021-10-25
Ultra Light OCR Competition Technical Report
Shuhan Zhang, Yuxin Zou, Tianhe Wang, Yichao Xiong
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Scene_Text
GAN
PDF
2021-10-22
Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts
Allen Kim, Charuta Pethe, Naoya Inoue, Steve Skiena
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Detection
Relation
Language_Model
PDF
2021-10-21
HENet: Forcing a Network to Think More for Font Recognition
Jingchao Chen, Shiyi Mu, Shugong Xu, Youdong Ding
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Action
Inference
PDF
2021-10-18
Newsalyze: Effective Communication of Person-Targeting Biases in News Articles
Felix Hamborg, Kim Heinser, Anastasia Zhukova, Karsten Donnay, Bela Gipp
arXiv_AI
arXiv_AI
OCR
Review
PDF
2021-10-16
Learning UI Navigation through Demonstrations composed of Macro Actions
Wei Li
arXiv_AI
arXiv_AI
OCR
Pose
Action
Detection
PDF
2021-10-16
PAGnol: An Extra-Large French Generative Model
Julien Launay, E.L. Tommasone, Baptiste Pannier, François Boniface, Amélie Chatelain, Alessandro Cappelli, Iacopo Poli, Djamé Seddah
arXiv_CL
arXiv_CL
OCR
Bert
Summarization
PDF
2021-10-16
BAPGAN: GAN-based Bone Age Progression of Femur and Phalange X-ray Images
Shinji Nakazawa, Changhee Han, Joe Hasei, Ryuichi Nakahara, Toshifumi Ozaki
arXiv_CV
arXiv_CV
Embedding
OCR
Knowledge
Adversarial
Pose
VQA
GAN
CNN
PDF
2021-10-14
Making Document-Level Information Extraction Right for the Right Reasons
Liyan Tang, Dhruv Rajan, Suyash Mohan, Abhijeet Pradhan, R. Nick Bryan, Greg Durrett
arXiv_AI
arXiv_AI
OCR
Action
Relation
Inference
PDF
2021-10-13
An algorithm for a fairer and better voting system
Gabriel-Claudiu Grama
arXiv_AI
arXiv_AI
OCR
PDF
2021-10-13
Optical Character Recognition of 19th Century Classical Commentaries: the Current State of Affairs
Matteo Romanello, Sven Najem-Meyer, Bruce Robertson
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Face
PDF
2021-10-12
On the Security Risks of AutoML
Ren Pang, Zhaohan Xi, Shouling Ji, Xiapu Luo, Ting Wang
arXiv_CV
arXiv_CV
NAS
OCR
Adversarial
Relation
PDF
2021-10-08
Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks
Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong, Ran Xu
arXiv_AI
arXiv_AI
Transformer
OCR
Pose
Action
Recommendation
PDF
2021-10-08
Generational Frameshifts in Technology: Computer Science and Neurosurgery, The VR Use Case
Samuel R. Browd, Maya Sharma, Chetan Sharma
arXiv_AI
arXiv_AI
OCR
Action
PDF
2021-10-08
Towards Sample-efficient Apprenticeship Learning from Suboptimal Demonstration
Letian Chen, Rohan Paleja, Matthew Gombolay
arXiv_RO
arXiv_RO
OCR
Self-Supervised
Pose
Relation
PDF
2021-10-08
Machine Learning Featurizations for AI Hacking of Political Systems
Nathan E Sanders, Bruce Schneier
arXiv_AI
arXiv_AI
OCR
Pose
Action
Deep_Learning
PDF
2021-10-08
On the invertibility of a voice privacy system using embedding alignement
Pierre Champion (MULTISPEECH, LIUM), Thomas Thebaud (LIUM), Gaël Le Lan, Anthony Larcher (LIUM), Denis Jouvet (MULTISPEECH)
arXiv_SD
arXiv_SD
Embedding
Unsupervised
OCR
Pose
PDF
2021-10-07
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng
arXiv_CL
arXiv_CL
Segmentation
Recognition
Video_Caption
OCR
Optical_Character
Knowledge
Speech
Pose
Detection
Speech_Recognition
Caption
PDF
2021-10-04
Rerunning OCR -- A Machine Learning Approach to Quality Assessment and Enhancement Prediction
Pit Schneider
arXiv_AI
arXiv_AI
Enhancement
OCR
Prediction
PDF
2021-10-04
An Experimental Evaluation on Deepfake Detection using Deep Face Recognition
Sreeraj Ramachandran, Aakash Varma Nadimpalli, Ajita Rattani
arXiv_AI
arXiv_AI
Recognition
OCR
Face
Classification
Deep_Learning
Detection
Face_Recognition
CNN
PDF
2021-10-02
Asking questions on handwritten document collections
Minesh Mathew, Lluis Gomez, Dimosthenis Karatzas, CV Jawahar
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Pose
VQA
QA
PDF
2021-09-24
SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Yuxin Xiao, Zecheng Zhang, Yuning Mao, Carl Yang, Jiawei Han
arXiv_AI
arXiv_AI
OCR
Pose
Action
Relation
Relation_Extraction
Inference
Prediction
PDF
2021-09-21
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li, Tengchao Lv, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
RNN
Optical_Character
Pose
Text_Generation
Language_Model
PDF
2021-09-18
Atrial Fibrillation: A Medical and Technological Review
Samayan Bhattacharya, Sk Shahnawaz
arXiv_CV
arXiv_CV
OCR
Review
Detection
Relation
Attention
Medical
PDF
2021-09-15
An influencer-based approach to understanding radical right viral tweets
Laila Sprejer, Helen Margetts, Kleber Oliveira, David O'Sullivan, Bertie Vidgen
arXiv_CL
arXiv_CL
OCR
Pose
Attention
PDF
2021-09-14
Learning Bill Similarity with Annotated and Augmented Corpora of Bills
Jiseon Kim, Elden Griggs, In Song Kim, Alice Oh
arXiv_CL
arXiv_CL
OCR
Bert
Pose
Classification
Relation
PDF
2021-09-14
Optimal To-Do List Gamification for Long Term Planning
Saksham Consul, Jugoslav Stojcheski, Valkyrie Felso, Falk Lieder
arXiv_AI
arXiv_AI
OCR
PDF
2021-09-14
Deep learning-based NLP Data Pipeline for EHR Scanned Document Information Extraction
Enshuo Hsu (1, 3, and 4), Ioannis Malagaris (1), Yong-Fang Kuo (1), Rizwana Sultana (2), Kirk Roberts (3) ((1) Office of Biostatistics, (2) Division of Pulmonary, Critical Care and Sleep Medicine, Department of Internal Medicine, University of Texas Medical Branch, Galveston, Texas, USA. (3) School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA. (4) Center for Outcomes Research, Houston Methodist, Houston, TX, USA.)
arXiv_CV
arXiv_CV
Recognition
OCR
Bert
RNN
Optical_Character
Pose
Action
Deep_Learning
Medical
PDF
2021-09-13
Post-OCR Document Correction with large Ensembles of Character Sequence Models
Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Evangelos Milios, Axel J. Soto
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Pose
PDF
2021-09-13
Surveying the Research on Fake News in Social Media: a Tale of Networks and Language
Giancarlo Ruffo (1), Alfonso Semeraro (1), Anastasia Giachanou (2), Paolo Rosso (3) ((1) Università degli Studi di Torino, (2) Utrecht University, (3) Universitat Politècnica de València)
arXiv_CL
arXiv_CL
OCR
Face
Survey
GAN
PDF
2021-09-13
Tamizhi-Net OCR: Creating A Quality Large Scale Tamil-Sinhala-English Parallel Corpus Using Deep Learning Based Printed Character Recognition
Charangan Vasantharajan, Uthayasanker Thayasivam
arXiv_CL
arXiv_CL
Recognition
OCR
RNN
Pose
Action
Deep_Learning
PDF
2021-09-10
FR-Detect: A Multi-Modal Framework for Early Fake News Detection on Social Media Using Publishers Features
Ali Jarrahi, Leila Safari
arXiv_CL
arXiv_CL
OCR
Pose
Detection
Activity
CNN
PDF
2021-09-07
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
Yuning Du, Chenxia Li, Ruoyu Guo, Cheng Cui, Weiwei Liu, Jun Zhou, Bin Lu, Yehua Yang, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Inference
PDF
2021-08-31
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools
Nils Feldhus, Robert Schwarzenberg, Sebastian Möller
arXiv_CL
arXiv_CL
OCR
Knowledge
PDF
2021-08-30
The Application of Convolutional Neural Networks for Tomographic Reconstruction of Hyperspectral Images
Wei-Chih Huang, Mads Svanborg Peters, Mads Juul Ahlebaek, Mads Toudal Frandsen, René Lynge Eriksen, Bjarke Jørgensen
arXiv_CV
arXiv_CV
Reconstruction
OCR
Pose
CNN
PDF
2021-08-30
3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations
Kangxue Yin, Jun Gao, Maria Shugrina, Sameh Khamis, Sanja Fidler
arXiv_AI
arXiv_AI
Reconstruction
Style_Transfer
OCR
3D
Pose
Quantitative
PDF
2021-08-29
A Multimodal Framework for Video Ads Understanding
Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Speech
Attention
Speech_Recognition
Prediction
PDF
2021-08-27
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving
Arjit Sharma, Sahil Sharma
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Action
Autonomous
PDF
2021-08-26
Mining Contextual Information Beyond Image for Semantic Segmentation
Zhenchao Jin, Tao Gong, Dongdong Yu, Qi Chu, Jian Wang, Changhu Wang, Jie Shao
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Pose
Classification
PDF
2021-08-26
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang, Furu Wei
arXiv_CL
arXiv_CL
OCR
Pose
Deep_Learning
Detection
Prediction
PDF
2021-08-22
External Knowledge Augmented Text Visual Question Answering
Arka Ujjal Dey, Ernest Valveny, Gaurav Harit
arXiv_CV
arXiv_CV
Transformer
OCR
Knowledge
Pose
VQA
QA
PDF
2021-08-22
Self-Regulation for Semantic Segmentation
Zhang Dong, Zhang Hanwang, Tang Jinhui, Hua Xiansheng, Sun Qianru
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
OCR
Classification
PDF
2021-08-20
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Xiaopeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rose
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Relation
VQA
QA
PDF
2021-08-18
End-to-End License Plate Recognition Pipeline for Real-time Low Resource Video Based Applications
Alif Ashrafee, Akib Mohammed Khan, Mohammad Sabik Irbaz, MD Abdullah Al Nasim
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Detection
Inference
PDF
2021-08-18
End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool
arXiv_CV
arXiv_CV
Reinforcement_Learning
OCR
Action
Autonomous
PDF
2021-08-18
Statistical analysis of locally parameterized shapes
Mohsen Taheri, Jörn Schulz
arXiv_CV
arXiv_CV
OCR
Pose
Classification
PDF
2021-08-18
AdapterHub Playground: Simple and Flexible Few-Shot Learning with Adapters
Tilman Beck, Bela Bohlender, Christina Viehmann, Vincent Hane, Yanik Adamson, Jaber Khuri, Jonas Brossmann, Jonas Pfeiffer, Iryna Gurevych
arXiv_CL
arXiv_CL
Transfer_Learning
OCR
Knowledge
Face
Few-Shot
Language_Model
Prediction
PDF
2021-08-17
VisBuddy -- A Smart Wearable Assistant for the Visually Challenged
Ishwarya Sivakumar, Nishaali Meenakshisundaram, Ishwarya Ramesh, Shiloah Elizabeth D, Sunil Retmin Raj C
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Optical_Character
Knowledge
Pose
Face
Action
Deep_Learning
Detection
Object_Detection
Caption
PDF
2021-08-16
An NLP approach to quantify dynamic salience of predefined topics in a text corpus
A. Bock, A. Palladino, S. Smith-Heisters, I. Boardman, E. Pellegrini, E.J. Bienenstock, A. Valenti
arXiv_CL
arXiv_CL
OCR
PDF
2021-08-14
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin
arXiv_CV
arXiv_CV
Recognition
OCR
Action
Detection
PDF
2021-08-10
BROS: A Layout-Aware Pre-trained Language Model for Understanding Documents
Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, Sungrae Park
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Pose
Language_Model
PDF
2021-08-06
Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents
Amit Gupte, Alexey Romanov, Sahitya Mantravadi, Dalitso Banda, Jianjie Liu, Raza Khan, Lakshmanan Ramu Meenal, Benjamin Han, Soundar Srinivasan
arXiv_CL
arXiv_CL
Recognition
OCR
Restoration
Optical_Character
Action
PDF
2021-08-03
Solo-learn: A Library of Self-supervised Methods for Visual Representation Learning
Victor G. Turrisi da Costa, Enrico Fini, Moin Nabi, Nicu Sebe, Elisa Ricci
arXiv_CV
arXiv_CV
OCR
Represenation_Learning
Self-Supervised
PDF
2021-07-30
Foundations of data imbalance and solutions for a data democracy
Ajay Kulkarni, Deri Chong, Feras A. Batarseh
arXiv_AI
arXiv_AI
OCR
Classification
PDF
2021-07-27
PDF-Malware: An Overview on Threats, Detection and Evasion Attacks
Nicolas Fleury, Theo Dubrunquez, Ihsen Alouani
arXiv_AI
arXiv_AI
OCR
Pose
Detection
PDF
2021-07-19
Machine Learning and Deep Learning Methods for Building Intelligent Systems in Medicine and Drug Discovery: A Comprehensive Survey
G Jignesh Chowdary, Suganya G, Premalatha M, Asnath Victy Phamila Y, Karunamurthy K
arXiv_AI
arXiv_AI
OCR
Optimization
Survey
Classification
Deep_Learning
Relation
Medical
Prediction
PDF
2021-07-15
Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining
Guowei Xu, Wenbiao Ding, Weiping Fu, Zhongqin Wu, Zitao Liu
arXiv_AI
arXiv_AI
Recognition
OCR
Text_Classification
Optical_Character
Pose
Classification
PDF
2021-07-13
Scene Text recognition with Full Normalization
Nathan Zachary, Gerald Carl, Russell Elijah, Hessi Roma, Robert Leer, James Amelia
arXiv_CV
arXiv_CV
Recognition
OCR
Scene_Text
PDF
2021-07-12
Hate versus Politics: Detection of Hate against Policy makers in Italian tweets
Armend Duzha, Cristiano Casadei, Michael Tosi, Fabio Celli
arXiv_CL
arXiv_CL
OCR
Speech
Classification
Detection
PDF
2021-07-12
MOOCRep: A Unified Pre-trained Embedding of MOOC Entities
Shalini Pandey, Jaideep Srivastava
arXiv_AI
arXiv_AI
Transformer
Embedding
OCR
Represenation_Learning
Knowledge
Pose
Relation
Language_Model
Prediction
Recommendation
PDF
2021-07-09
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset
Hannah Rose Kirk, Yennie Jun, Paulius Rauba, Gal Wachtel, Ruining Li, Xingjian Bai, Noah Broestl, Martin Doff-Sotta, Aleksandar Shtedritski, Yuki M. Asano
arXiv_CV
arXiv_CV
OCR
Pose
Face
Detection
Caption
PDF
2021-07-05
Vision Xformers: Efficient Attention for Image Classification
Pranav Jeevan, Amit Sethi (Indian Institute of Technology Bombay)
arXiv_AI
arXiv_AI
Transformer
Embedding
OCR
Classification
Attention
CNN
Image_Classification
PDF
2021-07-02
The Optimal Size of an Epistemic Congress
Manon Revel, Tao Lin, Daniel Halpern
arXiv_AI
arXiv_AI
OCR
PDF
2021-07-02
Data Centric Domain Adaptation for Historical Text with OCR Errors
Luisa März, Stefan Schweter, Nina Poerner, Benjamin Roth, Hinrich Schütze
arXiv_CL
arXiv_CL
Embedding
Unsupervised
Recognition
OCR
Pose
PDF
2021-06-29
New Arabic Medical Dataset for Diseases Classification
Jaafar Hammoud, Aleksandra Vatian, Natalia Dobrenko, Nikolai Vedernikov, Anatoly Shalyto, Natalia Gusarova
arXiv_CL
arXiv_CL
OCR
Bert
Classification
Deep_Learning
Medical
PDF
2021-06-27
DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation
Haitao Lin, Zichang Liu, Chilam Cheang, Lingwei Zhang, Yanwei Fu, Xiangyang Xue
arXiv_CV
arXiv_CV
OCR
3D
Pose
Inference
PDF
2021-06-26
The Feasibility and Inevitability of Stealth Attacks
Ivan Y. Tyukin, Desmond J. Higham, Eliyas Woldegeorgis, Alexander N. Gorban
arXiv_AI
arXiv_AI
OCR
Adversarial
Pose
Deep_Learning
PDF
2021-06-22
A Simple and Practical Approach to Improve Misspellings in OCR Text
Junxia Lin (1), Johannes Ledolter (2) ((1) Georgetown University Medical Center, Georgetown University, (2) Tippie College of Business, University of Iowa)
arXiv_CL
arXiv_CL
Unsupervised
OCR
PDF
2021-06-21
An End-to-End Khmer Optical Character Recognition using Sequence-to-Sequence with Attention
Rina Buoy, Sokchea Kor, Nguonly Taing
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Attention
CNN
PDF
2021-06-20
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Jiapeng Wang, Tianwei Wang, Guozhi Tang, Lianwen Jin, Weihong Ma, Kai Ding, Yichao Huang
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Pose
Action
Attention
GAN
Inference
PDF
2021-06-16
Eider: Evidence-enhanced Document-level Relation Extraction
Yiqing Xie, Jiaming Shen, Sha Li, Yuning Mao, Jiawei Han
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Prediction
PDF
2021-06-15
Classification of Documents Extracted from Images with Optical Character Recognition Methods
Omer Aydin
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Optical_Character
Classification
PDF
2021-06-15
Mixed Model OCR Training on Historical Latin Script for Out-of-the-Box Recognition and Finetuning
Christian Reul, Christoph Wick, Maximilian Nöth, Andreas Büttner, Maximilian Wehner, Uwe Springmann
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
PDF
2021-06-14
Pitfalls of Explainable ML: An Industry Perspective
Sahil Verma, Aditya Lahiri, John P. Dickerson, Su-In Lee
arXiv_AI
arXiv_AI
OCR
Prediction
PDF
2021-06-14
EuroCrops: A Pan-European Dataset for Time Series Crop Type Classification
Maja Schneider, Amelie Broszeit, Marco Körner
arXiv_CV
arXiv_CV
OCR
Classification
PDF
2021-06-10
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter
Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Attention
GAN
Inference
Prediction
PDF
2021-06-10
Hard Choices in Artificial Intelligence
Roel Dobbe, Thomas Krendl Gilbert, Yonatan Mintz
arXiv_AI
arXiv_AI
OCR
PDF
2021-06-10
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Ryota Yoshihashi, Tomohiro Tanaka, Kenji Doi, Takumi Fujino, Naoaki Yamashita
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Detection
PDF
2021-06-08
PAM: Understanding Product Images in Cross Product Category Attribute Extraction
Rongmei Lin, Xiang He, Jie Feng, Nasser Zalmout, Yan Liang, Li Xiong, Xin Luna Dong
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Optical_Character
Knowledge
Knowledge_Graph
Pose
Action
VQA
PDF
2021-06-08
Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface
Peng Xu, Wenjie Zi, Hamidreza Shahidi, Ákos Kádár, Keyi Tang, Wei Yang, Jawad Ateeq, Harsh Barot, Meidan Alon, Yanshuai Cao
arXiv_CL
arXiv_CL
OCR
Face
Prediction
PDF
2021-06-08
Classification of Contract-Amendment Relationships
Fuqi Song
arXiv_CL
arXiv_CL
Tracking
Recognition
OCR
Optical_Character
Pose
Classification
Relation
PDF
2021-06-07
Document-level Relation Extraction as Semantic Segmentation
Ningyu Zhang, Xiang Chen, Xin Xie, Shumin Deng, Chuanqi Tan, Mosha Chen, Fei Huang, Luo Si, Huajun Chen
arXiv_CL
arXiv_CL
Transformer
Segmentation
Semantic_Segmentation
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2021-06-05
Denoising Word Embeddings by Averaging in a Shared Space
Avi Caciularu, Ido Dagan, Jacob Goldberger
arXiv_CL
arXiv_CL
Embedding
OCR
Denoising
PDF
2021-06-04
Language Model Metrics and Procrustes Analysis for Improved Vector Transformation of NLP Embeddings
Thomas Conley, Jugal Kalita
arXiv_CL
arXiv_CL
Embedding
OCR
Language_Model
PDF
2021-06-03
Defending Democracy: Using Deep Learning to Identify and Prevent Misinformation
Anusua Trivedi, Alyssa Suhm, Prathamesh Mahankal, Subhiksha Mukuntharaj, Meghana D. Parab, Malvika Mohan, Meredith Berger, Arathi Sethumadhavan, Ashish Jaiman, Rahul Dodhia
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Deep_Learning
Detection
PDF
2021-06-03
Discriminative Reasoning for Document-level Relation Extraction
Wang Xu, Kehai Chen, Tiejun Zhao
arXiv_CL
arXiv_CL
Recognition
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2021-06-02
Detecting Bot-Generated Text by Characterizing Linguistic Accommodation in Human-Bot Interactions
Paras Bhatt, Anthony Rios
arXiv_CL
arXiv_CL
OCR
Speech
Action
Detection
PDF
2021-06-02
End-to-End Information Extraction by Character-Level Embedding and Multi-Stage Attentional U-Net
Tuan-Anh Nguyen Dang, Dat-Thanh Nguyen
arXiv_CV
arXiv_CV
Embedding
OCR
Pose
Action
Deep_Learning
Relation
Attention
PDF
2021-06-01
PanoDR: Spherical Panorama Diminished Reality for Indoor Scenes
V. Gkitsas, V. Sterzentsenko, N. Zioulis, G. Albanis, D. Zarpalas
arXiv_CV
arXiv_CV
Reconstruction
Inpainting
OCR
3D
Pose
Quantitative
PDF
2021-06-01
Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Action
PDF
2021-05-29
Correcting public opinion trends through Bayesian data assimilation
Robin Hendrickx, Rossella Arcucci, Julio Amador Díaz Lopez, Yi-Ke Guo, Mark Kennedy
arXiv_AI
arXiv_AI
OCR
Pose
Survey
PDF
2021-05-26
What data do we need for training an AV motion planner?
Long Chen, Lukas Platinsky, Stefanie Speichert, Blazej Osinski, Oliver Scheel, Yawei Ye, Hugo Grimmett, Luca del Pero, Peter Ondruska
arXiv_CV
arXiv_CV
OCR
Autonomous
PDF
2021-05-25
Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling
Marcin Namysl, Sven Behnke, Joachim Köhler
arXiv_CL
arXiv_CL
Embedding
Recognition
OCR
Optical_Character
Language_Model
PDF
2021-05-25
Affine Transport for Sim-to-Real Domain Adaptation
Anton Mallasto, Karol Arndt, Markus Heinonen, Samuel Kaski, Ville Kyrki
arXiv_RO
arXiv_RO
OCR
PDF
2021-05-23
Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations
Pascal Fischer, Alen Smajic, Alexander Mehler, Giuseppe Abrami
arXiv_AI
arXiv_AI
Transfer_Learning
Recognition
OCR
Optical_Character
Deep_Learning
Detection
Attention
PDF
2021-05-19
End-to-End Unsupervised Document Image Blind Denoising
Mehrdad J Gangeh, Marcin Plata, Hamid Motahari, Nigel P Duffy
arXiv_CV
arXiv_CV
Unsupervised
Recognition
OCR
Optical_Character
Pose
Deep_Learning
Denoising
PDF
2021-05-19
Surprisingly Popular Voting Recovers Rankings, Surprisingly!
Hadi Hosseini, Debmalya Mandal, Nisarg Shah, Kevin Shi
arXiv_AI
arXiv_AI
OCR
Prediction
PDF
2021-05-17
Unknown-box Approximation to Improve Optical Character Recognition Performance
Ayantha Randika, Nilanjan Ray, Xiao Xiao, Allegra Latimer
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2021-05-17
STRIDE : Scene Text Recognition In-Device
Rachit S Munjal, Arun D Prabhu, Nikhil Arora, Sukumar Moharana, Gopi Ramena
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Scene_Text
Attention
Inference
PDF
2021-05-17
EasyFL: A Low-code Federated Learning Platform For Dummies
Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang
arXiv_AI
arXiv_AI
Tracking
OCR
Optimization
Pose
Action
PDF
2021-05-12
Mining Legacy Issues in Open Pit Mining sites: Innovation & Support of Renaturalization and Land Utilization
Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann, Gerhard Heyer
arXiv_CL
arXiv_CL
Recognition
OCR
Text_Classification
Optical_Character
Action
Classification
PDF
2021-05-12
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Scene_Text
Detection
VQA
QA
PDF
2021-05-10
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding
Zilong Wang, Mingjie Zhan, Houxing Ren, Zhaohui Hou, Yuwei Wu, Xingyan Zhang, Ding Liang
arXiv_AI
arXiv_AI
OCR
Optical_Character
Pose
Action
Relation
Relation_Extraction
GAN
PDF
2021-05-10
An end-to-end Optical Character Recognition approach for ultra-low-resolution printed text images
Julian D. Gilbey, Carola-Bibiane Schönlieb
arXiv_CV
arXiv_CV
Recognition
Super_Resolution
OCR
Optical_Character
PDF
2021-05-10
DocReader: Bounding-Box Free Training of a Document Information Extraction Model
Shachar Klaiman, Marius Lehne
arXiv_CV
arXiv_CV
OCR
Action
PDF
2021-05-09
End-to-End Optical Character Recognition for Bengali Handwritten Words
Farisa Benta Safir, Abu Quwsar Ohi, M.F. Mridha, Muhammad Mostafa Monowar, Md. Abdul Hamid
arXiv_CV
arXiv_CV
NAS
Recognition
OCR
RNN
Optical_Character
Review
Pose
CNN
PDF
2021-05-09
High-performance symbolic-numerics via multiple dispatch
Shashi Gowda, Yingbo Ma, Alessandro Cheli, Maja Gwozdz, Viral B. Shah, Christopher Rackauckas
arXiv_CL
arXiv_CL
OCR
Optimization
Knowledge
Face
Action
PDF
2021-05-04
A Survey on End-User Robot Programming
Gopika Ajaykumar, Maureen Steele, Chien-Ming Huang (Johns Hopkins University)
arXiv_RO
arXiv_RO
OCR
Survey
PDF
2021-05-04
Towards Accountability in the Use of Artificial Intelligence for Public Administrations
Michele Loi, Matthias Spielkamp
arXiv_AI
arXiv_AI
OCR
Ontology
GAN
PDF
2021-05-02
BI-REC: Guided Data Analysis for Conversational Business Intelligence
Venkata Vamsikrishna Meduri, Abdul Quamar, Chuan Lei, Vasilis Efthymiou, Fatma Ozcan
arXiv_AI
arXiv_AI
Embedding
OCR
Pose
Face
Action
Prediction
Recommendation
PDF
2021-04-30
Participatory Budgeting with Donations and Diversity Constraints
Jiehua Chen, Martin Lackner, Jan Maly
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-04-30
Word-Level Alignment of Paper Documents with their Electronic Full-Text Counterparts
Mark-Christoph Müller, Sucheta Ghosh, Ulrike Wittig, Maja Rey
arXiv_CL
arXiv_CL
Unsupervised
OCR
Medical
PDF
2021-04-28
Motion-guided Non-local Spatial-Temporal Network for Video Crowd Counting
Haoyue Bai, S.-H. Gary Chan
arXiv_CV
arXiv_CV
OCR
Pose
Relation
PDF
2021-04-27
AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions
Martin Kišš, Karel Beneš, Michal Hradiš
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
PDF
2021-04-23
CapillaryNet: An Automated System to Analyze Microcirculation Videos from Handheld Vital Microscopy
Maged Helmy, Anastasiya Dykyy, Tuyen Trung Truong, Paulo Ferreira, Eric Jul
arXiv_CV
arXiv_CV
OCR
Medical
PDF
2021-04-23
OCRTOC: A Cloud-Based Competition and Benchmark for Robotic Grasping and Manipulation
Ziyuan Liu, Wei Liu, Yuzhe Qin, Fanbo Xiang, Songyan Xin, Maximo A. Roa, Berk Calli, Hao Su, Yu Sun, Ping Tan
arXiv_RO
arXiv_RO
OCR
Pose
GAN
PDF
2021-04-19
Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model
Per E Kummervold, Javier de la Rosa, Freddy Wetjen, Svein Arne Brygfjeld
arXiv_CL
arXiv_CL
Transformer
Recognition
OCR
Bert
Optical_Character
Classification
Language_Model
PDF
2021-04-18
Documenting the English Colossal Clean Crawled Corpus
Jesse Dodge, Maarten Sap, Ana Marasovic, William Agnew, Gabriel Ilharco, Dirk Groeneveld, Matt Gardner
arXiv_AI
arXiv_AI
OCR
Salient
Face
Language_Model
PDF
2021-04-16
Open data for Moroccan license plates for OCR applications: data collection, labeling, and model construction
Abdelkrim Alahyane, Mohamed El Fakir, Saad Benjelloun, Ikram Chairi
arXiv_AI
arXiv_AI
Segmentation
Recognition
OCR
PDF
2021-04-16
TeLCoS: OnDevice Text Localization with Clustering of Script
Rachit S Munjal, Manoj Goyal, Rutika Moharir, Sukumar Moharana
arXiv_AI
arXiv_AI
Recognition
OCR
Knowledge
Pose
Scene_Text
Action
PDF
2021-04-15
Tabletop Object Rearrangement: Team ACRV's Entry to OCRTOC
Zheyu Zhang, Rhys Newbury, Kerry He, Steven Martin, Gavin Suddrey, Jun Kwan, Peter Corke, Akansel Cosgun
arXiv_RO
arXiv_RO
OCR
GAN
PDF
2021-04-13
'Subverting the Jewtocracy': Online Antisemitism Detection Using Multimodal Deep Learning
Mohit Chandra, Dheeraj Pailla, Himanshu Bhatia, Aadilmehdi Sanchawala, Manish Gupta, Manish Shrivastava, Ponnurangam Kumaraguru
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Deep_Learning
Detection
PDF
2021-04-12
Escaping the Big Data Paradigm with Compact Transformers
Ali Hassani, Steven Walton, Nikhil Shah, Abulikemu Abuduweili, Jiachen Li, Humphrey Shi
arXiv_CV
arXiv_CV
Transformer
Embedding
OCR
PDF
2021-04-12
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra
arXiv_CL
arXiv_CL
OCR
NMT
PDF
2021-04-12
Diamond in the rough: Improving image realism by traversing the GAN latent space
Jeffrey Wen, Fabian Benitez-Quiroz, Qianli Feng, Aleix Martinez
arXiv_CV
arXiv_CV
Unsupervised
OCR
Optimization
Adversarial
Quantitative
GAN
PDF
2021-04-10
Highly Efficient Knowledge Graph Embedding Learning with Orthogonal Procrustes Analysis
Xutan Peng, Guanyi Chen, Chenghua Lin, Mark Stevenson
arXiv_AI
arXiv_AI
Embedding
OCR
Knowledge
Knowledge_Graph
Pose
Relation
PDF
2021-04-09
Video-aided Unsupervised Grammar Induction
Songyang Zhang, Linfeng Song, Lifeng Jin, Kun Xu, Dong Yu, Jiebo Luo
arXiv_CV
arXiv_CV
Unsupervised
OCR
Speech
Pose
Face
Action
PDF
2021-04-08
Computation and Bribery of Voting Power in Delegative Simple Games
Gianlorenzo D'Angelo, Esmaeil Delfaraz, Hugo Gilbert
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-04-07
Streaming Self-Training via Domain-Agnostic Unlabeled Images
Zhiqiu Lin, Deva Ramanan, Aayush Bansal
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Recognition
OCR
Knowledge
Face
Classification
Medical
Image_Classification
PDF
2021-04-07
Document Layout Analysis via Dynamic Residual Feature Fusion
Xingjiao Wu, Ziling Hu, Xiangcheng Du, Jing Yang, Liang He
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
PDF
2021-04-05
When Can Liquid Democracy Unveil the Truth?
Ruben Becker, Gianlorenzo D'Angelo, Esmaeil Delfaraz, Hugo Gilbert
arXiv_AI
arXiv_AI
OCR
GAN
PDF
2021-04-05
Procrustean Training for Imbalanced Deep Learning
Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao
arXiv_CV
arXiv_CV
OCR
Knowledge
Pose
Deep_Learning
Prediction
PDF
2021-04-02
Artificial intelligence, human rights, democracy, and the rule of law: a primer
David Leslie, Christopher Burr, Mhairi Aitken, Josh Cowls, Michael Katell, Morgan Briggs
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-03-31
PAUL: Procrustean Autoencoder for Unsupervised Lifting
Chaoyang Wang, Simon Lucey
arXiv_CV
arXiv_CV
Unsupervised
OCR
3D
Pose
Deep_Learning
PDF
2021-03-30
Deep regression on manifolds: a 3D rotation case study
Romain Brégier
arXiv_CV
arXiv_CV
OCR
3D
Pose
Deep_Learning
PDF
2021-03-29
A Multiplexed Network for End-to-End, Multilingual OCR
Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner
arXiv_CV
arXiv_CV
Recognition
OCR
Pose
Detection
PDF
2021-03-29
Personalized Affect-Aware Socially Assistive Robot Tutors Aimed at Fostering Social Grit in Children with Autism
Zhonghao Shi, Manwei Cao, Sophia Pei, Xiaoyang Qiao, Thomas R Groechel, Maja J Matarić
arXiv_RO
arXiv_RO
OCR
Pose
Emotion
PDF
2021-03-23
A News Recommender System Considering Temporal Dynamics and Diversity
Shaina Raza
arXiv_AI
arXiv_AI
OCR
Prediction
Recommendation
PDF
2021-03-22
Fairness Perceptions of Algorithmic Decision-Making: A Systematic Review of the Empirical Literature
Christopher Starke, Janine Baleis, Birte Keller, Frank Marcinkowski
arXiv_AI
arXiv_AI
OCR
Review
Autonomous
PDF
2021-03-19
Congolese Swahili Machine Translation for Humanitarian Response
Alp Öktem, Eric DeLuca, Rodrigue Bashizi, Eric Paquin, Grace Tang
arXiv_CL
arXiv_CL
OCR
QA
PDF
2021-03-18
ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shijian Lu, C.V. Jawahar
arXiv_AI
arXiv_AI
Recognition
OCR
Action
GAN
PDF
2021-03-18
KoDF: A Large-scale Korean DeepFake Detection Dataset
Patrick Kwon, Jaeseong You, Gyuhyeon Nam, Sungwoo Park, Gyeongsu Chae
arXiv_CV
arXiv_CV
OCR
Face
Detection
PDF
2021-03-17
On the Whitney extension problem for near isometries and beyond
Steven B. Damelin
arXiv_CV
arXiv_CV
OCR
Optimization
PDF
2021-03-17
Interpretable Distance Metric Learning for Handwritten Chinese Character Recognition
Boxiang Dong, Aparna S. Varde, Danilo Stevanovic, Jiayin Wang, Liang Zhao
arXiv_AI
arXiv_AI
Handwriting
Recognition
OCR
Optical_Character
Pose
Face
Action
PDF
2021-03-17
What's in My LiDAR Odometry Toolbox?
Pierre Dellenbach, Jean-Emmanuel Deschaud, Bastien Jacquet, François Goulette
arXiv_CV
arXiv_CV
OCR
3D
Review
SLAM
Deep_Learning
GAN
PDF
2021-03-17
Endangered Languages are not Low-Resourced!
Mika Hämäläinen
arXiv_CL
arXiv_CL
OCR
Relation
PDF
2021-03-16
Combining Morphological and Histogram based Text Line Segmentation in the OCR Context
Pit Schneider
arXiv_CV
arXiv_CV
Segmentation
Recognition
OCR
Optical_Character
Pose
PDF
2021-03-15
Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs
Lars Vögtlin, Manuel Drazyk, Vinaychandran Pondenkandath, Michele Alberti, Rolf Ingold
arXiv_AI
arXiv_AI
OCR
Deep_Learning
GAN
PDF
2021-03-13
uTHCD: A New Benchmarking for Tamil Handwritten OCR
Noushath Shaffi, Faizal Hajamohideen
arXiv_CV
arXiv_CV
Recognition
OCR
CNN
PDF
2021-03-11
Characterizing Partisan Political Narratives about COVID-19 on Twitter
Elise Jing, Yong-Yeol Ahn
arXiv_CL
arXiv_CL
OCR
Pose
PDF
2021-03-10
Adversarial Regression Learning for Bone Age Estimation
Youshan Zhang, Brian D. Davison
arXiv_CV
arXiv_CV
Reconstruction
OCR
Adversarial
Pose
PDF
2021-03-09
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering
Aman Jain, Mayank Kothyari, Vishwajeet Kumar, Preethi Jyothi, Ganesh Ramakrishnan, Soumen Chakrabarti
arXiv_CV
arXiv_CV
OCR
Knowledge
Knowledge_Graph
Action
Quantitative
VQA
QA
PDF
2021-03-09
TS-Net: OCR Trained to Switch Between Text Transcription Styles
Jan Kohút, Michal Hradiš
arXiv_CV
arXiv_CV
Embedding
Recognition
OCR
Knowledge
Pose
PDF
2021-03-03
Self-play Learning Strategies for Resource Assignment in Open-RAN Networks
Xiaoyang Wang, Jonathan D Thomas, Robert J Piechocki, Shipra Kapoor, Raul Santos-Rodriguez, Arjun Parekh
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
PDF
2021-03-02
Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020 US Elections on the Basis of Offensive Speech and Stance Detection
Lara Grimminger, Roman Klinger
arXiv_CL
arXiv_CL
OCR
Bert
Speech
Detection
PDF
2021-02-28
Citizen Participation and Machine Learning for a Better Democracy
M. Arana-Catania, F.A. Van Lier, Rob Procter, Nataliya Tkachenko, Yulan He, Arkaitz Zubiaga, Maria Liakata
arXiv_CL
arXiv_CL
OCR
PDF
2021-02-27
A Simple But Effective Approach to n-shot Task-Oriented Dialogue Augmentation
Taha Aksu, Nancy F. Chen, Min-Yen Kan, Zhengyuan Liu
arXiv_CL
arXiv_CL
Tracking
OCR
Pose
PDF
2021-02-26
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Zero-Shot
Action_Recognition
Action
Classification
Caption
PDF
2021-02-23
Fair Set Selection: Meritocracy and Social Welfare
Thomas Kleine Buening, Meirav Segal, Debabrota Basu, Christos Dimitrakakis
arXiv_AI
arXiv_AI
OCR
Pose
PDF
2021-02-20
Deep Structured Feature Networks for Table Detection and Tabular Data Extraction from Scanned Financial Document Images
Siwen Luo, Mengting Wu, Yiwen Gong, Wanying Zhou, Josiah Poon
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
Optical_Character
Pose
Action
Detection
CNN
PDF
2021-02-18
FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks
Lingjiao Chen, Matei Zaharia, James Zou
arXiv_AI
arXiv_AI
Recognition
OCR
Pose
Scene_Text
Classification
Image_Classification
Prediction
Matching
PDF
2021-02-17
SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition
Denis Coquenet, Clément Chatelain, Thierry Paquet
arXiv_CV
arXiv_CV
Segmentation
Handwriting
Recognition
OCR
Optical_Character
Pose
CNN
PDF
2021-02-17
Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning
Byungchan Ko, Jungseul Ok
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Regularization
Pose
PDF
2021-02-11
An End-to-end Model for Entity-level Relation Extraction using Multi-instance Learning
Markus Eberts, Adrian Ulges
arXiv_CL
arXiv_CL
OCR
Action
Relation
Relation_Extraction
PDF
2021-02-11
Representation Matters: Offline Pretraining for Sequential Decision Making
Mengjiao Yang, Ofir Nachum
arXiv_AI
arXiv_AI
Unsupervised
Reinforcement_Learning
OCR
Optimization
Prediction
PDF
2021-02-09
Bootstrapping Relation Extractors using Syntactic Search by Examples
Matan Eyal, Asaf Amrami, Hillel Taub-Tabib, Yoav Goldberg
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
PDF
2021-02-06
The Arc of the Data Scientific Universe
David Leslie
arXiv_AI
arXiv_AI
OCR
Bert
GAN
PDF
2021-02-01
Neural OCR Post-Hoc Correction of Historical Corpora
Lijun Lyu, Maria Koutraki, Martin Krickl, Besnik Fetahu
arXiv_CL
arXiv_CL
Segmentation
Recognition
OCR
RNN
Optical_Character
Pose
Face
Attention
CNN
PDF
2021-01-30
Epistocracy Algorithm: A Novel Hyper-heuristic Optimization Strategy for Solving Complex Optimization Problems
Seyed Ziae Mousavi Mojab, Seyedmohammad Shams, Hamid Soltanian-Zadeh, Farshad Fotouhi
arXiv_AI
arXiv_AI
OCR
Optimization
Knowledge
Pose
PDF
2021-01-29
General-Purpose OCR Paragraph Identification by Graph Convolution Networks
Renshen Wang, Yasuhisa Fujii, Ashok C. Popat
arXiv_CV
arXiv_CV
OCR
Pose
PDF
2021-01-28
Exploring Cross-Image Pixel Contrast for Semantic Segmentation
Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc Van Gool
arXiv_CV
arXiv_CV
Segmentation
Embedding
Unsupervised
Semantic_Segmentation
OCR
Optimization
Represenation_Learning
Pose
Relation
Attention
PDF
2021-01-26
El Volumen Louder Por Favor: Code-switching in Task-oriented Semantic Parsing
Arash Einolghozati, Abhinav Arora, Lorena Sainz-Maza Lecanda, Anuj Kumar, Sonal Gupta
arXiv_AI
arXiv_AI
OCR
Zero-Shot
Pose
Few-Shot
Language_Model
PDF
2021-01-22
Censorship of Online Encyclopedias: Implications for NLP Models
Eddie Yang, Margaret E. Roberts
arXiv_AI
arXiv_AI
Embedding
OCR
Action
Attention
PDF
2021-01-15
Affordance-based Reinforcement Learning for Urban Driving
Tanmay Agarwal, Hitesh Arora, Jeff Schneider
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Autonomous
Prediction
PDF
2021-01-14
Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection
Ben Chen, Bin Chen, Dehong Gao, Qijin Chen, Chengfu Huo, Xiaonan Meng, Weijun Ren, Yang Zhou
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Knowledge
Adversarial
Pose
Quantitative
Detection
Language_Model
PDF
2021-01-09
An Unsupervised Normalization Algorithm for Noisy Text: A Case Study for Information Retrieval and Stance Detection
Anurag Roy, Shalmoli Ghosh, Kripabandhu Ghosh, Saptarshi Ghosh
arXiv_AI
arXiv_AI
Unsupervised
OCR
Pose
Action
Classification
Detection
PDF
2021-01-07
Robust Text CAPTCHAs Using Adversarial Examples
Rulin Shao, Zhouxing Shi, Jinfeng Yi, Pin-Yu Chen, Cho-Jui Hsieh
arXiv_CV
arXiv_CV
OCR
Adversarial
Pose
PDF
2021-01-06
On-Device Document Classification using multimodal features
Sugam Garg, Harichandana, Sumit Kumar
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Classification
Inference
PDF
2021-01-04
Where Do Deep Fakes Look? Synthetic Face Detection via Gaze Tracking
Ilke Demir, Umur A. Ciftci
arXiv_AI
arXiv_AI
Face_Detection
Tracking
OCR
Pose
Face
Detection
Attention
PDF
2020-12-31
Improving Learning Experience in MOOCs with Educational Content Linking
Shang-Wen Li
arXiv_AI
arXiv_AI
OCR
Knowledge
Pose
Face
Survey
GAN
PDF
2020-12-29
Present-Biased Optimization
Fedor V. Fomin, Pierre Fraigniaud, Petr A. Golovach
arXiv_AI
arXiv_AI
OCR
Optimization
Pose
Action
GAN
PDF
2020-12-28
Advanced Machine Learning Techniques for Fake News Detection: A Systematic Mapping Study
Michal Choras, Konstantinos Demestichas, Agata Gielczyk, Alvaro Herrero, Pawel Ksieniewicz, Konstantina Remoundou, Daniel Urda, Michal Wozniak
arXiv_CL
arXiv_CL
OCR
Knowledge
Pose
Detection
GAN
PDF
2020-12-28
From Point to Space: 3D Moving Human Pose Estimation Using Commodity WiFi
Yiming Wang, Lingchao Guo, Zhaoming Lu, Xiangming Wen, Shuang Zhou, Wanyu Meng
arXiv_CV
arXiv_CV
OCR
3D
Pose_Estimation
Pose
PDF
2020-12-23
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
Zuoyu Yan, Xiaode Zhang, Liangcai Gao, Ke Yuan, Zhi Tang
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Pose
Face
Action
Attention
CNN
PDF
2020-12-22
Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019
Nacira Abbas, Kholoud Alghamdi, Mortaza Alinam, Francesca Alloatti, Glenda Amaral, Claudia d'Amato, Luigi Asprino, Martin Beno, Felix Bensmann, Russa Biswas, Ling Cai, Riley Capshaw, Valentina Anita Carriero, Irene Celino, Amine Dadoun, Stefano De Giorgis, Harm Delva, John Domingue, Michel Dumontier, Vincent Emonet, Marieke van Erp, Paola Espinoza Arias, Omaima Fallatah, Sebastián Ferrada, Marc Gallofré Ocaña, Michalis Georgiou, Genet Asefa Gesese, Frances Gillis-Webber, Francesca Giovannetti, Marìa Granados Buey, Ismail Harrando, Ivan Heibi, Vitor Horta, Laurine Huber, Federico Igne, Mohamad Yaser Jaradeh, Neha Keshan, Aneta Koleva, Bilal Koteich, Kabul Kurniawan, Mengya Liu, Chuangtao Ma, Lientje Maas, Martin Mansfield, Fabio Mariani, Eleonora Marzi, Sepideh Mesbah, et al. (27 additional authors not shown)
arXiv_AI
arXiv_AI
OCR
Knowledge
Knowledge_Graph
PDF
2020-12-21
Document-Level Relation Extraction with Reconstruction
Wang Xu, Kehai Chen, Tiejun Zhao
arXiv_CL
arXiv_CL
Reconstruction
OCR
Pose
Action
Classification
Relation
Relation_Extraction
Attention
Inference
PDF
2020-12-19
Self-Supervision based Task-Specific Image Collection Summarization
Anurag Singh, Deepak Kumar Sharma, Sudhir Kumar Sharma, Joel J. P. C. Rodrigues
arXiv_CV
arXiv_CV
Embedding
OCR
Adversarial
Pose
Quantitative
Classification
Deep_Learning
GAN
Summarization
Inference
PDF
2020-12-18
Understood in Translation, Transformers for Domain Understanding
Dimitrios Christofidellis, Matteo Manica, Leonidas Georgopoulos, Hans Vandierendonck
arXiv_CL
arXiv_CL
Transformer
Unsupervised
OCR
RNN
Knowledge
Knowledge_Graph
Pose
Action
PDF
2020-12-17
Named Entity Recognition in the Legal Domain using a Pointer Generator Network
Stavroula Skylaki, Ali Oskooei, Omar Bari, Nadja Herger, Zac Kriegman (Thomson Reuters Labs)
arXiv_CL
arXiv_CL
Recognition
OCR
Action
PDF
2020-12-15
Amazon SageMaker Automatic Model Tuning: Scalable Black-box Optimization
Valerio Perrone, Huibin Shen, Aida Zolic, Iaroslav Shcherbatyi, Amr Ahmed, Tanya Bansal, Michele Donini, Fela Winkelmolen, Rodolphe Jenatton, Jean Baptiste Faddoul, Barbara Pogorzelska, Miroslav Miladinovic, Krishnaram Kenthapadi, Matthias Seeger, Cédric Archambeau
arXiv_AI
arXiv_AI
OCR
Optimization
Regularization
Pose
PDF
2020-12-15
Indonesian ID Card Extractor Using Optical Character Recognition and Natural Language Post-Processing
Firhan Maulana Rusli, Kevin Akbar Adhiguna, Hendy Irawan
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Action
PDF
2020-12-15
FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition Systems
Lu Chen, Jiao Sun, Wei Xu
arXiv_CV
arXiv_CV
Recognition
OCR
Optimization
Optical_Character
Adversarial
Pose
PDF
2020-12-14
Discovering Airline-Specific Business Intelligence from Online Passenger Reviews: An Unsupervised Text Analytics Approach
Sharan Srinivas, Surya Ramachandiran
arXiv_AI
arXiv_AI
Unsupervised
OCR
Review
Pose
Sentiment
Prediction
PDF
2020-12-14
Vartani Spellcheck -- Automatic Context-Sensitive Spelling Correction of OCR-generated Hindi Text Using BERT and Levenshtein Distance
Aditya Pal, Abhijit Mustafi
arXiv_AI
arXiv_AI
Transformer
Recognition
OCR
Bert
Optical_Character
Pose
Detection
Language_Model
PDF
2020-12-11
Interdisciplinary Approaches to Understanding Artificial Intelligence's Impact on Society
Suresh Venkatasubramanian, Nadya Bliss, Helen Nissenbaum, Melanie Moses
arXiv_AI
arXiv_AI
Surveillance
OCR
Attention
PDF
2020-12-09
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
OCR
Optical_Character
VQA
Attention
Caption
QA
PDF
2020-12-08
EvoCraft: A New Challenge for Open-Endedness
Djordje Grbic, Rasmus Berg Palm, Elias Najarro, Claire Glanois, Sebastian Risi
arXiv_AI
arXiv_AI
OCR
Pose
Face
PDF
2020-12-08
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florencio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo
arXiv_CV
arXiv_CV
Image_Caption
OCR
Represenation_Learning
Pose
Scene_Text
Relation
VQA
Caption
Language_Model
Prediction
QA
Matching
PDF
2020-12-07
How To Solve Moral Conundrums with Computability Theory
Min Baek
arXiv_AI
arXiv_AI
OCR
Survey
PDF
2020-12-07
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Zhaokai Wang, Renda Bao, Qi Wu, Si Liu
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Embedding
Recognition
OCR
Optical_Character
Pose
NMT
Caption
PDF
2020-12-02
Analyzing Stylistic Variation across Different Political Regimes
Liviu P. Dinu, Ana-Sabina Uban
arXiv_CL
arXiv_CL
OCR
Pose
Classification
PDF
2020-11-30
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation
Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu
arXiv_CV
arXiv_CV
Reconstruction
OCR
3D
Pose
PDF
2020-11-29
Intrinsic Decomposition of Document Images In-the-Wild
Sagnik Das, Hassan Ahmed Sial, Ke Ma, Ramon Baldrich, Maria Vanrell, Dimitris Samaras
arXiv_CV
arXiv_CV
OCR
Self-Supervised
Pose
Deep_Learning
PDF
2020-11-28
OpenKBP: The open-access knowledge-based planning grand challenge
Aaron Babier, Binghao Zhang, Rafid Mahmood, Kevin L. Moore, Thomas G. Purdie, Andrea L. McNiven, Timothy C. Y. Chan
arXiv_CV
arXiv_CV
OCR
3D
Knowledge
Pose
Contour
Prediction
PDF
2020-11-27
A Survey of Deep Learning Approaches for OCR and Document Understanding
Nishant Subramani, Alexandre Matton, Malcolm Greaves, Adrian Lam
arXiv_CV
arXiv_CV
OCR
Review
Survey
Deep_Learning
PDF
2020-11-25
A Panoramic Survey of Natural Language Processing in the Arab World
Kareem Darwish, Nizar Habash, Mourad Abbas, Hend Al-Khalifa, Hussein T. Al-Natsheh, Samhaa R. El-Beltagy, Houda Bouamor, Karim Bouzoubaa, Violetta Cavalli-Sforza, Wassim El-Hajj, Mustafa Jarrar, Hamdy Mubarak
arXiv_CL
arXiv_CL
Recognition
OCR
Optical_Character
Speech
Survey
Sentiment
Speech_Recognition
PDF
2020-11-22
Locally Linear Embedding and its Variants: Tutorial and Survey
Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley
arXiv_CV
arXiv_CV
Reconstruction
Embedding
OCR
Bert
Survey
PDF
2020-11-21
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning
Baohua Sun, Michael Lin, Hao Sha, Lin Yang
arXiv_CV
arXiv_CV
Image_Caption
Recognition
OCR
Optical_Character
Pose
Detection
Caption
PDF
2020-11-20
On-Device Text Image Super Resolution
Dhruval Jain, Arun D Prabhu, Gopi Ramena, Manoj Goyal, Debi Prasanna Mohanty, Sukumar Moharana, Naresh Purre
arXiv_CV
arXiv_CV
Super_Resolution
OCR
Pose
Action
CNN
Inference
PDF
2020-11-18
Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Fuqi Song, Éric de la Clergerie
arXiv_AI
arXiv_AI
Recognition
OCR
Optical_Character
Knowledge
Pose
PDF
2020-11-17
PassGoodPool: Joint Passengers and Goods Fleet Management with Reinforcement Learning aided Pricing, Matching, and Route Planning
Kaushik Manchella, Marina Haliem, Vaneet Aggarwal, Bharat Bhargava
arXiv_AI
arXiv_AI
Reinforcement_Learning
OCR
Pose
Inference
Matching
PDF
2020-11-11
Classification Of Sleep-Wake State In A Ballistocardiogram System Based On Deep Learning
Nemath Ahmed, Aashit Singh, Srivyshnav KS, Gulshan Kumar, Gaurav Parchani, Vibhor Saran
arXiv_AI
arXiv_AI
OCR
Pose
Action
Classification
Deep_Learning
Prediction
PDF
2020-11-10
OCR Post Correction for Endangered Language Texts
Shruti Rijhwani, Antonios Anastasopoulos, Graham Neubig
arXiv_CL
arXiv_CL
Recognition
OCR
Pose
PDF
2020-11-10
On-Device Language Identification of Text in Images using Diacritic Characters
Shubham Vatsal, Nikhil Arora, Gopi Ramena, Sukumar Moharana, Dhruval Jain, Naresh Purre, Rachit S Munjal
arXiv_CV
arXiv_CV
Recognition
OCR
Optical_Character
Pose
Detection
Object_Detection
Inference
PDF
2020-11-08
Denoising Relation Extraction from Document-level Distant Supervision
Chaojun Xiao, Yuan Yao, Ruobing Xie, Xu Han, Zhiyuan Liu, Maosong Sun, Fen Lin, Leyu Lin
arXiv_CL
arXiv_CL
OCR
Pose
Action
Relation
Relation_Extraction
Denoising
PDF
2020-11-06
An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish
Quan Duong, Mika Hämäläinen, Simon Hengchen
arXiv_CL
arXiv_CL
Unsupervised
Recognition
OCR
Optical_Character
Action
NMT
PDF
2020-11-06
OP-IMS @ DIACR-Ita: Back to the Roots: SGNS+OP+CD still rocks Semantic Change Detection
Jens Kaiser, Dominik Schlechtweg, Sabine Schulte im Walde
arXiv_CL
arXiv_CL
OCR
Detection
PDF
2020-11-04
Handwriting Classification for the Analysis of Art-Historical Documents
Christian Bartz, Hendrik Rätz, Christoph Meinel
arXiv_CV
arXiv_CV
Handwriting
Recognition
OCR
Text_Classification
Knowledge
Pose
Classification
Deep_Learning