Transformer
2022-06-27
Prompting Decision Transformer for Few-Shot Policy Generalization
Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua B. Tenenbaum, Chuang Gan
arXiv_CV
arXiv_CV
Transformer
Reinforcement_Learning
Pose
Few-Shot
PDF
2022-06-27
LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
Florent Bartoccioni, Éloi Zablocki, Andrei Bursuc, Patrick Pérez, Matthieu Cord, Karteek Alahari
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Action
Attention
Autonomous
Prediction
PDF
2022-06-27
Analyzing Encoded Concepts in Transformer Language Models
Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu
arXiv_CL
arXiv_CL
Transformer
Pose
Face
Relation
Language_Model
PDF
2022-06-27
Linguistic Correlation Analysis: Discovering Salient Neurons in deepNLP models
Nadir Durrani, Fahim Dalvi, Hassan Sajjad
arXiv_CL
arXiv_CL
Transformer
Transfer_Learning
Salient
Knowledge
Quantitative
Relation
Attention
PDF
2022-06-27
Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments
Millon Madhur Das, Punyajoy Saha, Mithun Das
arXiv_CL
arXiv_CL
Transformer
Speech
Classification
Detection
Prediction
PDF
2022-06-27
Context-Aware Transformers For Spinal Cancer Detection and Radiological Grading
Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman
arXiv_CV
arXiv_CV
Transformer
Pose
Detection
Medical
Prediction
PDF
2022-06-27
Kernel Attention Transformer for Histopathology Whole Slide Image Classification
Yushan Zheng, Jun Li, Jun Shi, Fengying Xie, Zhiguo Jiang
arXiv_CV
arXiv_CV
Transformer
Embedding
Pose
Classification
Attention
Image_Classification
PDF
2022-06-27
PST: Plant Segmentation Transformer Enhanced Phenotyping of MLS Oilseed Rape Point Cloud
Ruiming Du, Zhihong Ma, Pengyao Xie, Haiyan Cen, Yong He
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Point_Cloud
Pose
Deep_Learning
Attention
GAN
PDF
2022-06-27
Video2StyleGAN: Encoding Video in Latent Space for Manipulation
Jiyang Yu, Jingen Liu, Jing Huang, Wei Zhang, Tao Mei
arXiv_CV
arXiv_CV
Transformer
Facial_Landmark
3D
Sparse
Pose
Face
Quantitative
GAN
PDF
2022-06-26
AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry Estimation
Nimet Kaygusuz, Oscar Mendez, Richard Bowden
arXiv_CV
arXiv_CV
Transformer
Pose_Estimation
Pose
Deep_Learning
Prediction
PDF
2022-06-26
Vision Transformer for Contrastive Clustering
Hua-Bao Ling, Bowen Zhu, Dong Huang, Ding-Hua Chen, Chang-Dong Wang, Jian-Huang Lai
arXiv_CV
arXiv_CV
Transformer
Represenation_Learning
Knowledge
Self-Supervised
Contrastive_Learning
CNN
PDF
2022-06-26
Data Augmentation for Dementia Detection in Spoken Language
Anna Hlédiková, Dominika Woszczyk, Alican Acman, Soteris Demetriou, Björn Schuller
arXiv_CL
arXiv_CL
Transformer
Sparse
Speech
Deep_Learning
Detection
PDF
2022-06-26
Semantic Role Aware Correlation Transformer for Text to Video Retrieval
Burak Satar, Hongyuan Zhu, Xavier Bresson, Joo Hwee Lim
arXiv_CV
arXiv_CV
Transformer
Embedding
Pose
Relation
Attention
Video_Retrieval
Matching
PDF
2022-06-26
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim
arXiv_CV
arXiv_CV
Transformer
Embedding
Pose
Relation
Attention
Video_Retrieval
PDF
2022-06-26
On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Raviraj Joshi, Subodh Kumar
arXiv_CL
arXiv_CL
Transformer
Recognition
RNN
Speech
Attention
Speech_Recognition
Text_Generation
PDF
2022-06-26
Multiple Instance Learning with Mixed Supervision in Gleason Grading
Hao Bian, Zhuchen Shao, Yang Chen, Yifeng Wang, Haoqian Wang, Jian Zhang, Yongbing Zhang
arXiv_CV
arXiv_CV
Transformer
Pose
Deep_Learning
Prediction
PDF
2022-06-26
Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Qingcheng Zeng, Dading Chong, Peilin Zhou, Jie Yang
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Classification
Deep_Learning
Attention
Speech_Recognition
PDF
2022-06-25
Modeling Oceanic Variables with Dynamic Graph Neural Networks
Caio F. D. Netto, Marcel R. de Barros, Jefferson F. Coelho, Lucas P. de Freitas, Felipe M. Moreno, Marlon S. Mathias, Marcelo Dottori, Fábio G. Cozman, Anna H. R. Costa, Edson S. Gomi, Eduardo A. Tannuri
arXiv_AI
arXiv_AI
Transformer
Bert
RNN
Knowledge
Face
Relation
PDF
2022-06-25
Protoformer: Embedding Prototypes for Transformers
Ashkan Farhangi, Ning Sui, Nan Hua, Haiyan Bai, Arthur Huang, Zhishan Guo
arXiv_CL
arXiv_CL
Transformer
Embedding
Text_Classification
Pose
Classification
PDF
2022-06-25
Evaluation of Semantic Answer Similarity Metrics
Farida Mustafazade, Peter Ebbinghaus
arXiv_AI
arXiv_AI
Transformer
Bert
Knowledge
Pose
Relation
Prediction
QA
PDF
2022-06-25
Distilling a Pretrained Language Model to a Multilingual ASR Model
Kwanghee Choi, Hyung-Min Park
arXiv_AI
arXiv_AI
Transformer
Recognition
Knowledge
Speech
Pose
Speech_Recognition
Language_Model
PDF
2022-06-25
SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Dexiang Hong, Xiaoqi Ma, Xinyao Wang, Congcong Li, Yufei Wang, Longyin Wen
arXiv_CV
arXiv_CV
Transformer
Pose
Boundary_Detection
Classification
Detection
Optical_Flow
PDF
2022-06-25
Adversarial Self-Attention for Language Understanding
Hongqiu Wu, Hai Zhao
arXiv_CL
arXiv_CL
Transformer
Adversarial
Pose
Attention
Language_Model
PDF
2022-06-25
CV 3315 Is All You Need : Semantic Segmentation Competition
Akide Liu, Zihan Wang
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Review
PDF
2022-06-25
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie
arXiv_AI
arXiv_AI
Transformer
Embedding
Represenation_Learning
Speech
Self-Supervised
Pose
Emotion
Contrastive_Learning
Prediction
PDF
2022-06-25
ConcreteGraph: A Data Augmentation Method Leveraging the Properties of Concept Relatedness Estimation
Yueen Ma, Zixing Song, Chirui Chang, Yue Yu, Irwin King
arXiv_CL
arXiv_CL
Transformer
Pose
Action
PDF
2022-06-24
DetIE: Multilingual Open Information Extraction Inspired by Object Detection
Michael Vasilkovsky, Anton Alekseev, Valentin Malykh, Ilya Shenbin, Elena Tutubalina, Dmitriy Salikhov, Mikhail Stepnov, Andrey Chertok, Sergey Nikolenko
arXiv_CL
arXiv_CL
Transformer
Zero-Shot
Pose
Action
Detection
Object_Detection
Inference
Prediction
Matching
PDF
2022-06-24
Bag of Tricks for Long-Tail Visual Recognition of Animal Species in Camera Trap Images
Fagner Cunha, Eulanda M. dos Santos, Juan G. Colonna
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Deep_Learning
PDF
2022-06-24
QAGAN: Adversarial Approach To Learning Domain Invariant Language Features
Shubham Shrivastava, Kaiyue Wang
arXiv_CL
arXiv_CL
Transformer
Embedding
Adversarial
GAN
Language_Model
Prediction
QA
PDF
2022-06-24
Defending Backdoor Attacks on Vision Transformer via Patch Processing
Khoa D. Doan, Yingjie Lao, Peng Yang, Ping Li
arXiv_CV
arXiv_CV
Transformer
Knowledge
Adversarial
Pose
CNN
PDF
2022-06-24
Megapixel Image Generation with Step-Unrolled Denoising Autoencoders
Alex F. McKinney, Chris G. Willcocks
arXiv_CV
arXiv_CV
Transformer
Quantization
Inpainting
Pose
Denoising
Attention
GAN
PDF
2022-06-24
Text and author-level political inference using heterogeneous knowledge representations
Samuel Caetano da Silva, Ivandre Paraboni
arXiv_CL
arXiv_CL
Transformer
Bert
Knowledge
Inference
Language_Model
PDF
2022-06-24
Capture Salient Historical Information: A Fast and Accurate Non-Autoregressive Model for Multi-turn Spoken Language Understanding
Lizhi Cheng, Weijia jia, Wenmian Yang
arXiv_CL
arXiv_CL
Transformer
Salient
Pose
Attention
Inference
Prediction
PDF
2022-06-24
Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-Resolution
Bo Yan, Leilei Cao, Fengliang Qi, Hongbin Wang
arXiv_CV
arXiv_CV
Transformer
Super_Resolution
Pose
Attention
Medical
PDF
2022-06-24
Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
Jiajun Deng, Xurong Xie, Tianzi Wang, Mingyu Cui, Boyang Xue, Zengrui Jin, Mengzhe Geng, Guinan Li, Xunying Liu, Helen Meng
arXiv_SD
arXiv_SD
Transformer
Unsupervised
Recognition
RNN
Speech
Pose
Speech_Recognition
Language_Model
PDF
2022-06-24
BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping
Gasser Elbanna, Neil Scheidwasser-Clow, Mikolaj Kegler, Pierre Beckmann, Karl El Hajal, Milos Cernak
arXiv_SD
arXiv_SD
Transformer
Embedding
Scene_Classification
Speech
Self-Supervised
Pose
Classification
Detection
CNN
PDF
2022-06-24
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation
Leilei Cao, Zhuang Li, Bo Yan, Feng Zhang, Fengliang Qi, Yuchen Hu, Hongbin Wang
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Detection
Object_Detection
Inference
PDF
2022-06-24
A multi-model-based deep learning framework for short text multiclass classification with the imbalanced and extremely small data set
Jiajun Tong, Zhixiao Wang, Xiaobin Rui
arXiv_CL
arXiv_CL
Transformer
Bert
Text_Classification
RNN
Pose
Classification
Deep_Learning
PDF
2022-06-23
Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs
Yi-Lun Liao, Tess Smidt
arXiv_AI
arXiv_AI
Transformer
3D
Pose
Attention
Prediction
PDF
2022-06-23
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
Jinmiao Huang, Waseem Gharbieh, Qianhui Wan, Han Suk Shim, Chul Lee
arXiv_CL
arXiv_CL
Transformer
RNN
Pose
Action
Attention
PDF
2022-06-23
Agriculture-Vision Challenge 2022 -- The Runner-Up Solution for Agricultural Pattern Recognition via Transformer-based Models
Zhicheng Yang, Jui-Hsin Lai, Jun Zhou, Hang Zhou, Chen Du, Zhongcheng Lai
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
PDF
2022-06-23
Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space
Jinghuan Shang, Srijan Das, Michael S. Ryoo
arXiv_CV
arXiv_CV
Transformer
Recognition
3D
Pose
Action_Recognition
Action
Classification
CNN
Image_Classification
PDF
2022-06-23
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta, Stephen Tian, Yunzhi Zhang, Jiajun Wu, Roberto Martín-Martín, Li Fei-Fei
arXiv_CV
arXiv_CV
Transformer
Knowledge
Video_Prediction
Attention
Inference
Prediction
PDF
2022-06-23
Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency
Weijie Ma, Ye Zhu, Ruimao Zhang, Jie Yang, Yiwen Hu, Zhen Li, Li Xiang
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Classification
Attention
Image_Classification
Prediction
PDF
2022-06-23
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira, Tal Peer, Timo Gerkmann
arXiv_SD
arXiv_SD
Transformer
Enhancement
Speech
PDF
2022-06-23
Towards Green ASR: Lossless 4-bit Quantization of a Hybrid TDNN System on the 300-hr Switchboard Corpus
Junhao Xu, Shoukang Hu, Xunying Liu, Helen Meng
arXiv_SD
arXiv_SD
Transformer
Quantization
Recognition
Speech
Pose
Speech_Recognition
Language_Model
PDF
2022-06-23
ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?
Vivek Kumar Singh, Paul O Reilly, Jacqueline James, Manuel Salto Tellez, Perry Maxwell
arXiv_CV
arXiv_CV
Transformer
Segmentation
PDF
2022-06-22
Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Shangchen Zhou, Kelvin C.K. Chan, Chongyi Li, Chen Change Loy
arXiv_CV
arXiv_CV
Transformer
Restoration
Pose
Face
Prediction
PDF
2022-06-22
Behavior Transformers: Cloning $k$ modes with one stone
Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto
arXiv_AI
arXiv_AI
Transformer
Action
Detection
Object_Detection
Prediction
PDF
2022-06-22
Towards Unsupervised Content Disentanglement in Sentence Representations via Syntactic Roles
Ghazi Felhi, Joseph Le Roux, Djamé Seddah
arXiv_AI
arXiv_AI
Transformer
Unsupervised
Pose
Action
Attention
PDF
2022-06-22
Answer Fast: Accelerating BERT on the Tensor Streaming Processor
Ibrahim Ahmed, Sahil Parmar, Matthew Boyd, Michael Beidler, Kris Kang, Bill Liu, Kyle Roach, John Kim, Dennis Abts
arXiv_CL
arXiv_CL
Transformer
Recognition
Bert
Speech
Speech_Recognition
Inference
PDF
2022-06-22
Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer
Lalithkumar Seenivasan, Mobarakol Islam, Adithya Krishna, Hongliang Ren
arXiv_AI
arXiv_AI
Transformer
Recognition
Bert
Action
Classification
VQA
Medical
QA
PDF
2022-06-22
Polar Parametrization for Vision-based Surround-View 3D Detection
Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Chang Huang, Wenyu Liu
arXiv_CV
arXiv_CV
Transformer
Tracking
3D
Optimization
Pose
Detection
Prediction
PDF
2022-06-22
SpA-Former: Transformer image shadow detection and removal via spatial attention
Xiao Feng Zhang, Chao Chen Gu, Shan Ying Zhu
arXiv_CV
arXiv_CV
Transformer
Pose
Detection
Attention
PDF
2022-06-22
S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous Driving
Weihuang Chen, Fangfang Wang, Hongbin Sun
arXiv_CV
arXiv_CV
Transformer
RNN
Memory_Networks
Pose
Action
Autonomous
Prediction
PDF
2022-06-22
Feature Re-calibration based MIL for Whole Slide Image Classification
Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Meejeong Kim, Hyun Jung Sung, Sang Hyun Park
arXiv_CV
arXiv_CV
Transformer
Weakly_Supervised
Pose
Classification
Attention
Image_Classification
PDF
2022-06-22
NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022
Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon See
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Inference
Prediction
PDF
2022-06-22
Parallel Pre-trained Transformers for Synthetic Data-based Instance Segmentation
Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin Pan
arXiv_CV
arXiv_CV
Transformer
Segmentation
Optimization
Pose
PDF
2022-06-22
SVoRT: Iterative Transformer for Slice-to-Volume Registration in Fetal Brain MRI
Junshen Xu, Daniel Moyer, P. Ellen Grant, Polina Golland, Juan Eugenio Iglesias, Elfar Adalsteinsson
arXiv_CV
arXiv_CV
Transformer
Reconstruction
3D
Pose
Attention
PDF
2022-06-22
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu
arXiv_CV
arXiv_CV
Transformer
Zero-Shot
Knowledge
GAN
Language_Model
PDF
2022-06-22
Generative Pretraining for Black-Box Optimization
Siddarth Krishnamoorthy, Satvik Mehul Mashkaria, Aditya Grover
arXiv_AI
arXiv_AI
Transformer
Optimization
Pose
PDF
2022-06-21
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz Khan
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Classification
Detection
Attention
PDF
2022-06-21
Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic Consistency
Jie Yang, Ruimao Zhang, Chaoqun Wang, Zhen Li, Xiang Wan, Lingyan Zhang
arXiv_CV
arXiv_CV
Transformer
Segmentation
Regularization
Pose
Relation
Attention
GAN
Medical
Prediction
PDF
2022-06-21
Scaling up Kernels in 3D CNNs
Yukang Chen, Jianhui Liu, Xiaojuan Qi, Xiangyu Zhang, Jian Sun, Jiaya Jia
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
3D
Optimization
Detection
Object_Detection
CNN
PDF
2022-06-21
Vicinity Vision Transformer
Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran Zhong
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Image_Classification
PDF
2022-06-21
Faster Diffusion Cardiac MRI with Deep Learning-based breath hold reduction
Michael Tanzer, Pedro Ferreira, Andrew Scott, Zohya Khalique, Maria Dwornik, Dudley Pennell, Guang Yang, Daniel Rueckert, Sonia Nielles-Vallespin
arXiv_CV
arXiv_CV
Transformer
Adversarial
Pose
Deep_Learning
GAN
PDF
2022-06-21
Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery
Yoshitomo Matsubara, Naoya Chiba, Ryo Igarashi, Tatsunori Taniai, Yoshitaka Ushiku
arXiv_AI
arXiv_AI
Transformer
Review
Pose
PDF
2022-06-21
Neural Transformers for Intraductal Papillary Mucosal Neoplasms Classification in MRI images
Federica Proietto Salanitri, Giovanni Bellitto, Simone Palazzo, Ismail Irmakci, Michael B. Wallace, Candice W. Bolan, Megan Engels, Sanne Hoogenboom, Marco Aldinucci, Ulas Bagci, Daniela Giordano, Concetto Spampinato
arXiv_AI
arXiv_AI
Transformer
Surveillance
Classification
Deep_Learning
Detection
Medical
CNN
PDF
2022-06-21
An Automatic and Efficient BERT Pruning for Edge AI Systems
Shaoyi Huang, Ning Liu, Yueying Liang, Hongwu Peng, Hongjia Li, Dongkuan Xu, Mimi Xie, Caiwen Ding
arXiv_AI
arXiv_AI
Transformer
OCR
Bert
Pose
Deep_Learning
Relation
Inference
PDF
2022-06-21
Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching
Nicola Messina, Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Pose
Caption
Inference
Matching
PDF
2022-06-21
CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework
Xiaofeng Li, Bin Ren, Xipeng Shen, Yanzhi Wang
arXiv_AI
arXiv_AI
Transformer
Bert
Optimization
Autonomous
PDF
2022-06-21
SVG Vector Font Generation for Chinese Characters with Transformer
Haruka Aoki, Kiyoharu Aizawa
arXiv_CV
arXiv_CV
Transformer
Pose
PDF
2022-06-21
KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Xuanhan Wang, Jingkuan Song, Xiaojia Chen, Lechao Cheng, Lianli Gao, Heng Tao Shen
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Detection
Relation
Object_Detection
CNN
Prediction
PDF
2022-06-21
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon, Daxin Jiang
arXiv_CL
arXiv_CL
Transformer
Pose
PDF
2022-06-21
Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View Mammograms
Xuxin Chen, Ke Zhang, Neman Abdoli, Patrik W. Gilley, Ximin Wang, Hong Liu, Bin Zheng, Yuchen Qiu
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Relation
Medical
CNN
PDF
2022-06-21
One-stage Action Detection Transformer
Lijun Li, Li'an Zhuo, Bang Zhang
arXiv_AI
arXiv_AI
Transformer
Pose
Action
Detection
PDF
2022-06-21
Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation
Yuehai Chen, Jing Yang, Badong Chen, Shaoyi Du
arXiv_CV
arXiv_CV
Transformer
Sparse
Pose
Relation
Attention
Prediction
PDF
2022-06-20
Global Context Vision Transformers
Ali Hatamizadeh, Hongxu Yin, Jan Kautz, Pavlo Molchanov
arXiv_AI
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Pose
Action
Classification
Detection
Object_Detection
Attention
Image_Classification
PDF
2022-06-20
ORFD: A Dataset and Benchmark for Off-Road Freespace Detection
Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai
arXiv_CV
arXiv_CV
Transformer
Point_Cloud
Knowledge
Pose
Deep_Learning
Detection
Attention
Autonomous
PDF
2022-06-20
DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment
Haoning Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Relation
VQA
Attention
QA
PDF
2022-06-20
M&M Mix: A Multimodal Multiview Transformer Ensemble
Xuehan Xiong, Anurag Arnab, Arsha Nagrani, Cordelia Schmid
arXiv_CV
arXiv_CV
Transformer
Recognition
Action_Recognition
Action
PDF
2022-06-20
Semantic Labeling of High Resolution Images Using EfficientUNets and Transformers
Hasan AlMarzouqi, Lyes Saad Saoud
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Pose
Action
Relation
CNN
PDF
2022-06-20
Cross-Modal Transformer GAN: A Brain Structure-Function Deep Fusing Framework for Alzheimer's Disease
Junren Pan, Shuqiang Wang
arXiv_CV
arXiv_CV
Transformer
Adversarial
Pose
Classification
Attention
GAN
PDF
2022-06-20
SPBERTQA: A Two-Stage Question Answering System Based on Sentence Transformers for Medical Texts
Nhung Thi-Hong Nguyen, Phuong Phan-Dieu Ha, Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
Attention
Medical
QA
PDF
2022-06-20
nuQmm: Quantized MatMul for Efficient Inference of Large-Scale Generative Language Models
Gunho Park, Baeseong Park, Se Jung Kwon, Byeongwook Kim, Youngjoo Lee, Dongsoo Lee
arXiv_CL
arXiv_CL
Transformer
Quantization
Self-Supervised
Pose
Inference
Language_Model
PDF
2022-06-20
Capturing and Inferring Dense Full-Body Human-Scene Contact
Chun-Hao P. Huang, Hongwei Yi, Markus Höschle, Matvey Safroshkin, Tsvetelina Alexiadis, Senya Polikovsky, Daniel Scharstein, Michael J. Black
arXiv_CV
arXiv_CV
Transformer
3D
Knowledge
Pose
Action
Detection
Relation
PDF
2022-06-19
Resource-Efficient Separation Transformer
Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin
arXiv_SD
arXiv_SD
Transformer
RNN
Speech
Attention
Inference
PDF
2022-06-19
Traffic-Twitter Transformer: A Nature Language Processing-joined Framework For Network-wide Traffic Forecasting
Meng-Ju Tsai, Zhiyong Cui, Hao (Frank) Yang, Yinhai Wang
arXiv_AI
arXiv_AI
Transformer
Pose
Relation
Prediction
PDF
2022-06-19
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Minguk Kang, Joonghyuk Shin, Jaesik Park
arXiv_CV
arXiv_CV
Transformer
Regularization
Adversarial
GAN
PDF
2022-06-19
Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Jenthe Thienpondt, Kris Demuynck
arXiv_SD
arXiv_SD
Transformer
Transfer_Learning
Enhancement
Recognition
Speech
Pose
Speech_Recognition
PDF
2022-06-19
Learning Multiscale Transformer Models for Sequence Generation
Bei Li, Tong Zheng, Yi Jing, Chengbo Jiao, Tong Xiao, Jingbo Zhu
arXiv_CL
arXiv_CL
Transformer
Knowledge
Pose
Relation
Attention
PDF
2022-06-19
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong Liu, Dacheng Tao
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Quantitative
Classification
Detection
Image_Classification
PDF
2022-06-18
SAViR-T: Spatially Attentive Visual Reasoning with Transformers
Pritish Sahu, Kalliopi Basioti, Vladimir Pavlovic
arXiv_CV
arXiv_CV
Transformer
Relation
PDF
2022-06-18
Can Language Models Capture Graph Semantics? From Graphs to Language Model and Vice-Versa
Tarun Garg, Kaushik Roy, Amit Sheth
arXiv_CL
arXiv_CL
Transformer
Knowledge
Knowledge_Graph
Deep_Learning
Relation
Attention
Language_Model
PDF
2022-06-18
Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods
Valeriya Goloviznina, Evgeny Kotelnikov
arXiv_CL
arXiv_CL
Transformer
Bert
Summarization
Text_Generation
Language_Model
PDF
2022-06-18
Argumentative Text Generation in Economic Domain
Irina Fishcheva, Dmitriy Osadchiy, Klavdiya Bochenina, Evgeny Kotelnikov
arXiv_CL
arXiv_CL
Transformer
Bert
Text_Generation
Language_Model
PDF
2022-06-18
Replacing Labeled Real-image Datasets with Auto-generated Contours
Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio Yokota
arXiv_AI
arXiv_AI
Transformer
Contour
PDF
2022-06-18
VReBERT: A Simple and Flexible Transformer for Visual Relationship Detection
Yu Cui, Moshiur Farazi
arXiv_AI
arXiv_AI
Transformer
Bert
Zero-Shot
Pose
Detection
Relation
Visual_Relation
Prediction
PDF
2022-06-18
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Mike Goodwin, Arvindh Krishnaswamy
arXiv_SD
arXiv_SD
Transformer
Pose
Action
Attention
PDF
2022-06-18
CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks
Tejas Srinivasan, Ting-Yun Chang, Leticia Leonor Pinto Alva, Georgios Chochlakis, Mohammad Rostami, Jesse Thomason
arXiv_AI
arXiv_AI
Transformer
Knowledge
Language_Model
PDF
2022-06-17
TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp Segmentation
Nikhil Kumar Tomar, Annie Shergill, Brandon Rieders, Ulas Bagci, Debesh Jha
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Deep_Learning
Detection
Attention
PDF
2022-06-17
Transformer Neural Networks Attending to Both Sequence and Structure for Protein Prediction Tasks
Anowarul Kabir, Amarda Shehu
arXiv_AI
arXiv_AI
Transformer
Pose
Prediction
PDF
2022-06-17
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Detection
Attention
PDF
2022-06-17
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu, Christopher Clark, Rowan Zellers, Roozbeh Mottaghi, Aniruddha Kembhavi
arXiv_CV
arXiv_CV
Transformer
Pose_Estimation
Pose
Detection
VQA
Object_Detection
Caption
QA
PDF
2022-06-17
Adapting the Linearised Laplace Model Evidence for Modern Deep Learning
Javier Antorán, David Janz, James Urquhart Allingham, Erik Daxberger, Riccardo Barbano, Eric Nalisnick, José Miguel Hernández-Lobato
arXiv_AI
arXiv_AI
Transformer
Deep_Learning
Attention
Recommendation
PDF
2022-06-17
SimA: Simple Softmax-free Attention for Vision Transformers
Soroush Abbasi Koohpayegani, Hamed Pirsiavash
arXiv_CV
arXiv_CV
Transformer
Attention
PDF
2022-06-17
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo
arXiv_CV
arXiv_CV
Transformer
Transfer_Learning
Reinforcement_Learning
Pose
Attention
PDF
2022-06-17
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Xiao Dong, Xunlin Zhan, Yunchao Wei, Xiaoyong Wei, Yaowei Wang, Minlong Lu, Xiaochun Cao, Xiaodan Liang
arXiv_CV
arXiv_CV
Transformer
Knowledge
Self-Supervised
Pose
Relation
Recommendation
PDF
2022-06-17
Holistic Transformer: A Joint Neural Network for Trajectory Prediction and Decision-Making of Autonomous Vehicles
Hongyu Hu, Qi Wang, Zhengguang Zhang, Zhengyi Li, Zhenhai Gao
arXiv_RO
arXiv_RO
Transformer
Sparse
Knowledge
Pose
Relation
Attention
Autonomous
Prediction
PDF
2022-06-17
BITS Pilani at HinglishEval: Quality Evaluation for Code-Mixed Hinglish Text Using Transformers
Shaz Furniturewala, Vijay Kumari, Amulya Ratna Dash, Hriday Kedia, Yashvardhan Sharma
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
PDF
2022-06-17
Local Slot Attention for Vision-and-Language Navigation
Yifeng Zhuang, Qiang Sun, Yanwei Fu, Lifeng Chen, Xiangyang Sue
arXiv_CV
arXiv_CV
Transformer
Segmentation
Bert
Pose
Attention
PDF
2022-06-17
Bootstrapped Transformer for Offline Reinforcement Learning
Kerong Wang, Hanye Zhao, Xufang Luo, Kan Ren, Weinan Zhang, Dongsheng Li
arXiv_RO
arXiv_RO
Transformer
Reinforcement_Learning
Pose
Attention
PDF
2022-06-17
Multi-Contextual Predictions with Vision Transformer for Video Anomaly Detection
Joo-Yeon Lee, Woo-Jeoung Nam, Seong-Whan Lee
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Pose
Detection
Prediction
PDF
2022-06-17
Rectify ViT Shortcut Learning by Visual Saliency
Chong Ma, Lin Zhao, Yuzhong Chen, David Weizhong Liu, Xi Jiang, Tuo Zhang, Xintao Hu, Dinggang Shen, Dajiang Zhu, Tianming Liu
arXiv_CV
arXiv_CV
Transformer
Salient
Knowledge
Pose
Deep_Learning
Attention
Medical
PDF
2022-06-16
Backdoor Attacks on Vision Transformers
Akshayvarun Subramanya, Aniruddha Saha, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Attention
PDF
2022-06-16
Simultaneous Bone and Shadow Segmentation Network using Task Correspondence Consistency
Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Face
PDF
2022-06-16
GAAMA 2.0: An Integrated System that Answers Boolean and Extractive Question
Scott McCarley, Mihaela Bornea, Sara Rosenthal, Anthony Ferritto, Md Arafat Sultan, Avirup Sil, Radu Florian
arXiv_CL
arXiv_CL
Transformer
QA
PDF
2022-06-16
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes
Rui Zhu, Zhengqin Li, Janarbek Matai, Fatih Porikli, Manmohan Chandraker
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Attention
PDF
2022-06-16
CS-UM6P at SemEval-2022 Task 6: Transformer-based Models for Intended Sarcasm Detection in English and Arabic
Abdelkader El Mahdaouy, Abdellah El Mekki, Kabil Essefar, Abderrahman Skiredj, Ismail Berrada
arXiv_CL
arXiv_CL
Transformer
Pose
Deep_Learning
Detection
Sentiment
Language_Model
PDF
2022-06-16
OmniMAE: Single Model Masked Pretraining on Images and Videos
Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra
arXiv_AI
arXiv_AI
Transformer
PDF
2022-06-16
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan
arXiv_SD
arXiv_SD
Transformer
Embedding
Recognition
Speech
Pose
Speech_Recognition
Inference
Language_Model
PDF
2022-06-16
GoodBye WaveNet -- A Language Model for Raw Audio with Context of 1/2 Million Samples
Prateek Verma
arXiv_SD
arXiv_SD
Transformer
RNN
Pose
Language_Model
PDF
2022-06-16
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency
Viraj Prabhu, Sriram Yenamandra, Aaditya Singh, Judy Hoffman
arXiv_CV
arXiv_CV
Transformer
Recognition
Self-Supervised
Pose
Attention
CNN
PDF
2022-06-16
Online Segmentation of LiDAR Sequences: Dataset and Algorithm
Romain Loiseau, Mathieu Aubry, Loïc Landrieu
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
3D
Pose
Action
Autonomous
PDF
2022-06-16
TransDrift: Modeling Word-Embedding Drift using Transformer
Nishtha Madaan, Prateek Chaudhury, Nishant Kumar, Srikanta Bedathur
arXiv_CL
arXiv_CL
Transformer
Embedding
Pose
Classification
Prediction
PDF
2022-06-16
Patch-level Representation Learning for Self-supervised Vision Transformers
Sukmin Yun, Hankook Lee, Jaehyung Kim, Jinwoo Shin
arXiv_AI
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Represenation_Learning
Self-Supervised
Detection
Relation
Object_Detection
Attention
CNN
Prediction
PDF
2022-06-16
Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Lianyang Ma, Yu Yao, Tao Liang, Tongliang Liu
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Sentiment
PDF
2022-06-16
Multimodal Dialogue State Tracking
Hung Le, Nancy F. Chen, Steven C.H. Hoi
arXiv_AI
arXiv_AI
Transformer
Tracking
Video_Caption
Knowledge
Self-Supervised
Pose
Prediction
PDF
2022-06-16
Text normalization for endangered languages: the case of Ligurian
Stefano Lusito, Edoardo Ferrante, Jean Maillard
arXiv_CL
arXiv_CL
Transformer
PDF
2022-06-15
What makes domain generalization hard?
Spandan Madan, Li You, Mengmi Zhang, Hanspeter Pfister, Gabriel Kreiman
arXiv_AI
arXiv_AI
Transformer
Recognition
3D
Pose
Attention
PDF
2022-06-15
Masked Siamese ConvNets
Li Jing, Jiachen Zhu, Yann LeCun
arXiv_AI
arXiv_AI
Transformer
Embedding
Represenation_Learning
Knowledge
Self-Supervised
Pose
Classification
Detection
Object_Detection
Image_Classification
PDF
2022-06-15
A Simple Data Mixing Prior for Improving Self-Supervised Learning
Sucheng Ren, Huiyu Wang, Zhengqi Gao, Shengfeng He, Alan Yuille, Yuyin Zhou, Cihang Xie
arXiv_CV
arXiv_CV
Transformer
Recognition
Represenation_Learning
Self-Supervised
Pose
PDF
2022-06-15
Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022
Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson
arXiv_CV
arXiv_CV
Transformer
Pose
PDF
2022-06-15
AVATAR: Unconstrained Audiovisual Speech Recognition
Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid
arXiv_CV
arXiv_CV
Transformer
Recognition
Speech
Pose
Action
Attention
Speech_Recognition
PDF
2022-06-15
Transformer-based Automatic Speech Recognition of Formal and Colloquial Czech in MALACH Project
Jan Lehečka, Josef V. Psutka, Josef Psutka
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Self-Supervised
Speech_Recognition
Language_Model
PDF
2022-06-15
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng Hua
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Relation
Attention
CNN
Image_Classification
PDF
2022-06-15
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Detection
VQA
Object_Detection
Attention
Caption
QA
PDF
2022-06-15
Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech
Jan Lehečka, Jan Švec, Aleš Pražák, Josef V. Psutka
arXiv_CL
arXiv_CL
Transformer
Recognition
Zero-Shot
Speech
Speech_Recognition
PDF
2022-06-15
How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical Images
Yiqing Shen, Bingxin Zhou, Xinye Xiong, Ruitian Gao, Yu Guang Wang
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Deep_Learning
Relation
Medical
CNN
Prediction
PDF
2022-06-15
AMR Alignment: Paying Attention to Cross-Attention
Pere-Lluís Huguet Cabot, Abelardo Carlos Martínez Lorenzo, Roberto Navigli
arXiv_CL
arXiv_CL
Transformer
Attention
PDF
2022-06-15
Forecasting of depth and ego-motion with transformers and self-supervision
Houssem Boulahbal, Adrian Voicila, Andrew Comport
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Attention
Inference
PDF
2022-06-15
Estimating Confidence of Predictions of Individual Classifiers and Their Ensembles for the Genre Classification Task
Mikhail Lepekhin, Serge Sharoff
arXiv_CL
arXiv_CL
Transformer
Bert
Text_Classification
Classification
Prediction
PDF
2022-06-15
NatiQ: An End-to-end Text-to-Speech System for Arabic
Ahmed Abdelali, Nadir Durrani, Cenk Demiroglu, Fahim Dalvi, Hamdy Mubarak, Kareem Darwish
arXiv_CL
arXiv_CL
Transformer
RNN
Speech
Attention
GAN
PDF
2022-06-15
XMorpher: Full Transformer for Deformable Medical Image Registration via Cross Attention
Jiacheng Shi, Yuting He, Youyong Kong, Jean-Louis Coatrieux, Huazhong Shu, Guanyu Yang, Shuo Li
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Deep_Learning
Attention
Medical
PDF
2022-06-15
A Survey : Neural Networks for AMR-to-Text
Hongyu Hao, Guangtong Li, Zhiming Hu, Huafeng Wang
arXiv_CL
arXiv_CL
Transformer
Reconstruction
Optimization
Pose
Survey
Language_Model
PDF
2022-06-15
A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing
Benedikt Winter, Clemens Winter, Johannes Schilling, André Bardow
arXiv_CL
arXiv_CL
Transformer
Knowledge
Activity
Prediction
PDF
2022-06-15
VCT: A Video Compression Transformer
Fabian Mentzer, George Toderici, David Minnen, Sung-Jin Hwang, Sergi Caelles, Mario Lucic, Eirikur Agustsson
arXiv_CV
arXiv_CV
Transformer
Prediction
PDF
2022-06-15
Streaming non-autoregressive model for any-to-many voice conversion
Ziyi Chen, Haoran Miao, Pengyuan Zhang
arXiv_SD
arXiv_SD
Transformer
Recognition
Speech
Pose
Speech_Recognition
PDF
2022-06-15
Rethinking Generalization in Few-Shot Classification
Markus Hiller, Rongkai Ma, Mehrtash Harandi, Tom Drummond
arXiv_CV
arXiv_CV
Transformer
Embedding
Unsupervised
Optimization
Pose
Classification
Few-Shot
Inference
PDF
2022-06-15
Born for Auto-Tagging: Faster and better with new objective functions
Chiung-ju Liu, Huang-Ting Shieh
arXiv_CL
arXiv_CL
Transformer
Pose
Action
Recommendation
PDF
2022-06-15
A Projection-Based K-space Transformer Network for Undersampled Radial MRI Reconstruction with Limited Training Subjects
Chang Gao, Shu-Fu Shih, J. Paul Finn, Xiaodong Zhong
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Pose
Deep_Learning
Inference
PDF
2022-06-14
Surgical Phase Recognition in Laparoscopic Cholecystectomy
Yunfan Li, Vinayak Shenoy, Prateek Prasanna, I.V. Ramakrishnan, Haibin Ling, Himanshu Gupta
arXiv_CV
arXiv_CV
Transformer
Segmentation
Recognition
Pose
Action
Inference
PDF
2022-06-14
Codec at SemEval-2022 Task 5: Multi-Modal Multi-Transformer Misogynous Meme Classification Framework
Ahmed Mahran, Carlo Alessandro Borella, Konstantinos Perifanos
arXiv_AI
arXiv_AI
Transformer
Embedding
Knowledge
Classification
PDF
2022-06-14
It's Time for Artistic Correspondence in Music and Video
Didac Suris, Carl Vondrick, Bryan Russell, Justin Salamon
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Pose
PDF
2022-06-14
K-Space Transformer for Fast MRIReconstruction with Implicit Representation
Ziheng Zhao, Tianjiao Zhang, Weidi Xie, Yanfeng Wang, Ya Zhang
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Sparse
Pose
PDF
2022-06-14
Stand-Alone Inter-Frame Attention in Video Models
Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Jiebo Luo, Tao Mei
arXiv_CV
arXiv_CV
Transformer
Video_Caption
3D
Deep_Learning
Attention
Prediction
PDF
2022-06-14
Comprehending and Ordering Semantics for Image Captioning
Yehao Li, Yingwei Pan, Ting Yao, Tao Mei
arXiv_CL
arXiv_CL
Image_Caption
Transformer
Pose
Detection
Object_Detection
Caption
PDF
2022-06-14
Object Scene Representation Transformer
Mehdi S. M. Sajjadi, Daniel Duckworth, Aravindh Mahendran, Sjoerd van Steenkiste, Filip Pavetić, Mario Lučić, Leonidas J. Guibas, Klaus Greff, Thomas Kipf
arXiv_CV
arXiv_CV
Transformer
Unsupervised
3D
Represenation_Learning
PDF
2022-06-14
Efficient Decoder-free Object Detection with Transformers
Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen (Tencent Youtu Lab)
arXiv_CV
arXiv_CV
Transformer
Pose
Detection
Object_Detection
Inference
Prediction
PDF
2022-06-14
Peripheral Vision Transformer
Juhong Min, Yucheng Zhao, Chong Luo, Minsu Cho
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Classification
Contour
Attention
Image_Classification
PDF
2022-06-14
Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINO
Javier Rando, Nasib Naimi, Thomas Baumann, Max Mathys
arXiv_CV
arXiv_CV
Transformer
Adversarial
Self-Supervised
Classification
PDF
2022-06-14
Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis
Rania Briq, Chuhang Zou, Leonid Pishchulin, Chris Broaddus, Juergen Gall
arXiv_CV
arXiv_CV
Transformer
Pose
Action
PDF
2022-06-14
TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer
Jiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, Wanli Ouyang
arXiv_CV
arXiv_CV
Transformer
Pose
PDF
2022-06-14
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Action
Attention
PDF
2022-06-14
Exploring evolution-based & -free protein language models as protein function predictors
Mingyang Hu, Fajie Yuan, Kevin K. Yang, Fusong Ju, Jin Su, Hui Wang, Fei Yang, Qiuyang Ding
arXiv_AI
arXiv_AI
Transformer
3D
Language_Model
Prediction
PDF
2022-06-13
Multimodal Learning with Transformers: A Survey
Peng Xu, Xiatian Zhu, David A. Clifton
arXiv_CV
arXiv_CV
Transformer
Review
Survey
PDF
2022-06-13
Compositional Mixture Representations for Vision and Text
Stephan Alaniz, Marco Federici, Zeynep Akata
arXiv_CV
arXiv_CV
Transformer
Weakly_Supervised
Represenation_Learning
Detection
Object_Detection
PDF
2022-06-13
Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation
Wouter Van Gansbeke, Simon Vandenhende, Luc Van Gool
arXiv_CV
arXiv_CV
Transformer
Segmentation
Unsupervised
Semantic_Segmentation
Pose
PDF
2022-06-13
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson
arXiv_CV
arXiv_CV
Transformer
Recognition
Video_Caption
Pose
Action_Recognition
Action
Relation
PDF
2022-06-13
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei
arXiv_AI
arXiv_AI
Transformer
NAS
Recognition
3D
Pose
Attention
CNN
PDF
2022-06-13
Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
arXiv_AI
arXiv_AI
Transformer
Pose
Action
Detection
Object_Detection
Prediction
PDF
2022-06-13
Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation
Yingwei Pan, Yehao Li, Yiheng Zhang, Qi Cai, Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei
arXiv_CV
arXiv_CV
Transformer
Reinforcement_Learning
3D
Action
PDF
2022-06-13
RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans
Pengxin Yu, Haoyue Zhang, Han Kang, Wen Tang, Corey W. Arnold, Rongguo Zhang
arXiv_CV
arXiv_CV
Transformer
Super_Resolution
Pose
Deep_Learning
Attention
Medical
CNN
PDF
2022-06-13
Transformer Lesion Tracker
Wen Tang, Han Kang, Haoyue Zhang, Pengxin Yu, Corey W. Arnold, Rongguo Zhang
arXiv_CV
arXiv_CV
Transformer
Tracking
Sparse
Knowledge
Pose
Action
Attention
Matching
PDF
2022-06-13
Rank Diminishing in Deep Neural Networks
Ruili Feng, Kecheng Zheng, Yukun Huang, Deli Zhao, Michael Jordan, Zheng-Jun Zha
arXiv_AI
arXiv_AI
Transformer
Classification
PDF
2022-06-13
On the Learning of Non-Autoregressive Transformers
Fei Huang, Tianhua Tao, Hao Zhou, Lei Li, Minlie Huang
arXiv_CL
arXiv_CL
Transformer
Pose
Relation
Text_Generation
PDF
2022-06-12
SeATrans: Learning Segmentation-Assisted diagnosis model via Transforme
Junde Wu, Huihui Fang, Fangxin Shang, Dalu Yang, Zhaowei Wang, Jing Gao, Yehui Yang, Yanwu Xu
arXiv_CV
arXiv_CV
Transformer
Segmentation
Embedding
Knowledge
Pose
Action
Deep_Learning
Relation
PDF
2022-06-12
Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction
Lihuan Li, Maurice Pagnucco, Yang Song
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Autonomous
Prediction
PDF
2022-06-11
DRAformer: Differentially Reconstructed Attention Transformer for Time-Series Forecasting
Benhan Li, Shengdong Du, Tianrui Li, Jie Hu, Zhen Jia
arXiv_AI
arXiv_AI
Transformer
Pose
Relation
Attention
PDF
2022-06-11
Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventional
Mingchuan Tian, Guangway Teng, Yipeng Bao
arXiv_AI
arXiv_AI
Transformer
Recognition
Pose
Relation
PDF
2022-06-11
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner, Ethan Manilow, Jesse Engel
arXiv_SD
arXiv_SD
Transformer
Reconstruction
Adversarial
Denoising
GAN
Activity
PDF
2022-06-11
Transformer-based Self-Supervised Fish Segmentation in Underwater Videos
Alzayat Saleh, Marcus Sheaves, Dean Jerry, Mostafa Rahimi Azghadi
arXiv_CV
arXiv_CV
Transformer
Segmentation
Represenation_Learning
Self-Supervised
Pose
Quantitative
PDF
2022-06-11
A Benchmark for Compositional Visual Reasoning
Aimen Zerroug, Mohit Vaishnav, Julien Colin, Sebastian Musslick, Thomas Serre
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
Relation
Visual_Relation
CNN
PDF
2022-06-10
Generalizable Neural Radiance Fields for Novel View Synthesis with Transformer
Dan Wang, Xinrui Cui, Septimiu Salcudean, Z. Jane Wang
arXiv_CV
arXiv_CV
Transformer
3D
Pose
Relation
Attention
PDF
2022-06-10
Exploring Feature Self-relation for Self-supervised Transformer
Zhong-Yu Li, Shanghua Gao, Ming-Ming Cheng
arXiv_CV
arXiv_CV
Transformer
Embedding
Self-Supervised
Relation
Attention
CNN
PDF
2022-06-10
Saccade Mechanisms for Image Classification, Object Detection and Tracking
Saurabh Farkya, Zachary Daniels, Aswin Nadamuni Raghavan, David Zhang, Michael Piacentino
arXiv_CV
arXiv_CV
Transformer
Tracking
Object_Tracking
Pose
Classification
Detection
Object_Detection
Attention
CNN
Image_Classification
PDF
2022-06-10
Position Labels for Self-Supervised Vision Transformer
Zhemin Zhang, Xun Gong, Jinyi Wu
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Pose
PDF
2022-06-10
NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng zhao
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Classification
PDF
2022-06-10
Borrowing or Codeswitching? Annotating for Finer-Grained Distinctions in Language Mixing
Elena Alvarez Mellado, Constantine Lignos
arXiv_CL
arXiv_CL
Transformer
Language_Model
PDF
2022-06-10
MAREO: Memory- and Attention- based visual REasOning
Mohit Vaishnav, Thomas Serre
arXiv_AI
arXiv_AI
Transformer
Relation
Attention
PDF
2022-06-10
NAGphormer: Neighborhood Aggregation Graph Transformer for Node Classification in Large Graphs
Jinsong Chen, Kaiyuan Gao, Gaichao Li, Kun He
arXiv_AI
arXiv_AI
Transformer
Pose
Classification
Attention
PDF
2022-06-10
Learning to Estimate Shapley Values with Vision Transformers
Ian Covert, Chanwoo Kim, Su-In Lee
arXiv_CV
arXiv_CV
Transformer
Attention
Prediction
PDF
2022-06-09
Building Spatio-temporal Transformers for Egocentric 3D Pose Estimation
Jinman Park, Kimathi Kaai, Saad Hossain, Norikatsu Sumi, Sirisha Rambhatla, Paul Fieguth
arXiv_AI
arXiv_AI
Transformer
3D
Pose_Estimation
Pose
Attention
CNN
PDF
2022-06-09
Neural Prompt Search
Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu
arXiv_AI
arXiv_AI
Transformer
NAS
Pose
Few-Shot
PDF
2022-06-09
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
Guocheng Qian, Yuchen Li, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard Ghanem
arXiv_AI
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Point_Cloud
3D
Optimization
Pose
Classification
Inference
PDF
2022-06-09
GateHUB: Gated History Unit with Background Suppression for Online Action Detection
Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Detection
Attention
Optical_Flow
Prediction
PDF
2022-06-09
Spatial Entropy Regularization for Vision Transformers
Elia Peruzzo, Enver Sangineto, Yahui Liu, Marco De Nadai, Wei Bi, Bruno Lepri, Nicu Sebe
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Regularization
Self-Supervised
Pose
Attention
PDF
2022-06-09
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Santilli, Andreas Stuhlmüller, Andrew Dai, Andrew La, Andrew Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, et al. (392 additional authors not shown)
arXiv_AI
arXiv_AI
Transformer
Sparse
Knowledge
Quantitative
Language_Model
PDF
2022-06-09
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer
Shaoyu Chen, Tianheng Cheng, Xinggang Wang, Wenming Meng, Qian Zhang, Wenyu Liu
arXiv_CV
arXiv_CV
Transformer
Segmentation
Represenation_Learning
Pose
Autonomous
Inference
PDF
2022-06-09
Transformer based Urdu Handwritten Text Optical Character Reader
Mohammad Daniyal Shaiq, Musa Dildar Ahmed Cheema, Ali Kamal
arXiv_AI
arXiv_AI
Transformer
Handwriting
OCR
Optical_Character
Pose
Action
PDF
2022-06-09
Revisiting End-to-End Speech-to-Text Translation From Scratch
Biao Zhang, Barry Haddow, Rico Sennrich
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Pose
Attention
Speech_Recognition
PDF
2022-06-09
SparseFormer: Attention-based Depth Completion Network
Frederik Warburg, Michael Ramamonjisoa, Manuel López-Antequera
arXiv_CV
arXiv_CV
Transformer
3D
Sparse
SLAM
Attention
PDF
2022-06-09
Efficient Human Pose Estimation via 3D Event Point Cloud
Jiaan Chen, Hao Shi, Yaozu Ye, Kailun Yang, Lei Sun, Kaiwei Wang
arXiv_CV
arXiv_CV
Transformer
Point_Cloud
3D
Pose_Estimation
Pose
Deep_Learning
Detection
PDF
2022-06-09
cycle text2face: cycle text-to-face gan via transformers
Faezeh Gholamrezaie, Mohammad Manthouri
arXiv_AI
arXiv_AI
Transformer
Face
GAN
PDF
2022-06-09
Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer
Doyup Lee, Chiheon Kim, Saehoon Kim, Minsu Cho, Wook-Shin Han
arXiv_CV
arXiv_CV
Transformer
Pose
PDF
2022-06-09
VITA: Video Instance Segmentation via Object Token Association
Miran Heo, Sukjun Hwang, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Detection
Relation
Object_Detection
PDF
2022-06-09
Topic-Aware Evaluation and Transformer Methods for Topic-Controllable Summarization
Tatiana Passali, Grigorios Tsoumakas
arXiv_CL
arXiv_CL
Transformer
Embedding
Pose
Summarization
PDF
2022-06-09
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang, Arturs Backurs, Sébastien Bubeck, Ronen Eldan, Suriya Gunasekar, Tal Wagner
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
Attention
CNN
PDF
2022-06-09
SwinCheX: Multi-label classification on chest X-ray images with transformers
Sina Taslimi, Soroush Taslimi, Nima Fathi, Mohammadreza Salehi, Mohammad Hossein Rohban
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
PDF
2022-06-09
OOD Augmentation May Be at Odds with Open-Set Recognition
Mohammad Azizmalayeri, Mohammad Hossein Rohban
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Classification
Image_Classification
PDF
2022-06-08
Few-shot Question Generation for Personalized Feedback in Intelligent Tutoring Systems
Devang Kulshreshtha, Muhammad Shayan, Robert Belfer, Siva Reddy, Iulian Vlad Serban, Ekaterina Kochmar
arXiv_CL
arXiv_CL
Transformer
Few-Shot
PDF
2022-06-08
VN-Transformer: Rotation-Equivariant Attention for Vector Neurons
Serge Assaad, Carlton Downey, Rami Al-Rfou, Nigamaa Nayakanti, Ben Sapp
arXiv_CV
arXiv_CV
Transformer
3D
Classification
Attention
Inference
PDF
2022-06-08
CASS: Cross Architectural Self-Supervision for Medical Image Analysis
Pranav Singh, Elena Sizikova, Jacopo Cirrone
arXiv_AI
arXiv_AI
Transformer
Self-Supervised
Deep_Learning
Medical
CNN
PDF
2022-06-08
Few-Shot Audio-Visual Learning of Environment Acoustics
Sagnik Majumder, Changan Chen, Ziad Al-Halah, Kristen Grauman
arXiv_CV
arXiv_CV
Transformer
3D
Sparse
Few-Shot
Attention
Prediction
PDF
2022-06-08
Patch-based Object-centric Transformers for Efficient Video Generation
Wilson Yan, Ryo Okumura, Stephen James, Pieter Abbeel
arXiv_CV
arXiv_CV
Transformer
Video_Prediction
Prediction
PDF
2022-06-08
Syntactic Inductive Biases for Deep Learning Methods
Yikang Shen
arXiv_AI
arXiv_AI
Transformer
Pose
Deep_Learning
Relation
PDF
2022-06-08
Set Interdependence Transformer: Set-to-Sequence Neural Networks for Permutation Learning and Structure Prediction
Mateusz Jurewicz, Leon Derczynski
arXiv_CL
arXiv_CL
Transformer
Optimization
Pose
Action
Relation
Attention
Prediction
PDF
2022-06-08
1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task
Zhiyong Wang, Ge Zhang, Nineli Lashkarashvili
arXiv_CL
arXiv_CL
Transformer
Embedding
Pose
Matching
PDF
2022-06-08
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Puyang Zhang, Kaihao Zhang, Wenhan Luo, Changsheng Li, Guoren Wang
arXiv_CV
arXiv_CV
Transformer
Restoration
Pose
Face
Action
Quantitative
Attention
PDF
2022-06-08
UHD Image Deblurring via Multi-scale Cubic-Mixer
Zhuoran Zheng, Xiuyi Jia
arXiv_CV
arXiv_CV
Transformer
Pose
Attention
PDF
2022-06-07
How to Dissect a Muppet: The Structure of Transformer Embedding Spaces
Timothee Mickus, Denis Paperno, Mathieu Constant
arXiv_CL
arXiv_CL
Transformer
Embedding
Quantitative
Attention
PDF
2022-06-07
Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso
arXiv_CV
arXiv_CV
Transformer
Segmentation
Unsupervised
Adversarial
Pose
Detection
Denoising
GAN
Medical
Inference
PDF
2022-06-07
Can CNNs Be More Robust Than Transformers?
Zeyu Wang, Yutong Bai, Yuyin Zhou, Cihang Xie
arXiv_CV
arXiv_CV
Transformer
Recognition
Attention
CNN
PDF
2022-06-07
Tutel: Adaptive Mixture-of-Experts at Scale
Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong
arXiv_CV
arXiv_CV
Transformer
Sparse
Pose
Face
Deep_Learning
Detection
Object_Detection
Inference
PDF
2022-06-07
RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction
Yuan Liang, Zhuoxuan Jiang, Di Yin, Bo Ren
arXiv_CL
arXiv_CL
Transformer
Pose
Action
Relation
Attention
Prediction
PDF
2022-06-07
Parotid Gland MRI Segmentation Based on Swin-Unet and Multimodal Images
Yin Dai, Zi'an Xu, Fayu Liu, Siqi Li, Sheng Liu, Lifu Shi, Jun Fu
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Deep_Learning
CNN
PDF
2022-06-07
Fooling Explanations in Text Classifiers
Adam Ivankay, Ivan Girardi, Chiara Marchiori, Pascal Frossard
arXiv_CL
arXiv_CL
Transformer
Text_Classification
Knowledge
Classification
Relation
Prediction
PDF
2022-06-07
Wavelet Prior Attention Learning in Axial Inpainting Network
Chenjie Cao, Chengrong Wang, Yuntao Zhang, Yanwei Fu
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Inpainting
Knowledge
Pose
Quantitative
Attention
PDF
2022-06-07
Dual Swin-Transformer based Mutual Interactive Network for RGB-D Salient Object Detection
Chao Zeng, Sam Kwong
arXiv_CV
arXiv_CV
Transformer
Salient
Pose
Action
Detection
Object_Detection
Attention
Prediction
PDF
2022-06-07
OCHADAI at SemEval-2022 Task 2: Adversarial Training for Multilingual Idiomaticity Detection
Lis Kanashiro Pereira, Ichiro Kobayashi
arXiv_CL
arXiv_CL
Transformer
Bert
Zero-Shot
Knowledge
Adversarial
Pose
Detection
Language_Model
PDF
2022-06-07
Transformer-based Personalized Attention Mechanism for Medical Images with Clinical Records
Yusuke Takagi, Noriaki Hashimoto, Hiroki Masuda, Hiroaki Miyoshi, Koichi Ohshima, Hidekata Hontani, Ichiro Takeuchi
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Relation
Attention
Medical
PDF
2022-06-07
DiMS: Distilling Multiple Steps of Iterative Non-Autoregressive Transformers
Sajad Norouzi, Rasa Hosseinzadeh, Felipe Perez, Maksims Volkovs
arXiv_CL
arXiv_CL
Transformer
Enhancement
Knowledge
Inference
PDF
2022-06-07
Structured Context Transformer for Generic Event Boundary Detection
Congcong Li, Xinyao Wang, Dexiang Hong, Yufei Wang, Libo Zhang, Tiejian Luo, Longyin Wen
arXiv_CV
arXiv_CV
Transformer
Pose
Boundary_Detection
Detection
CNN
PDF
2022-06-07
DETR++: Taming Your Multi-Scale Detection Transformer
Chi Zhang, Lijuan Liu, Xiaoxue Zang, Frederick Liu, Hao Zhang, Xinying Song, Jindong Chen
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Classification
Detection
Object_Detection
Attention
CNN
PDF
2022-06-06
A Bird's-Eye Tutorial of Graph Attention Architectures
Kaustubh D. Dhole, Carl Yang
arXiv_AI
arXiv_AI
Transformer
Attention
Recommendation
PDF
2022-06-06
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li, Hao Zhang, Huaizhe xu, Shilong Liu, Lei Zhang, Lionel M. Ni, Heung-Yeung Shum
arXiv_CV
arXiv_CV
Transformer
Segmentation
Embedding
Semantic_Segmentation
Detection
Object_Detection
Denoising
Prediction
PDF
2022-06-06
Multi-Behavior Sequential Recommendation with Temporal Graph Transformer
Lianghao Xia, Chao Huang, Yong Xu, Jian Pei
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
Action
Relation
Attention
Recommendation
PDF
2022-06-06
Separable Self-attention for Mobile Vision Transformers
Sachin Mehta, Mohammad Rastegari
arXiv_AI
arXiv_AI
Transformer
Pose
Classification
Detection
Object_Detection
Attention
CNN
PDF
2022-06-06
Learning with Capsules: A Survey
Fabio De Sousa Ribeiro, Kevin Duarte, Miles Everett, Georgios Leontidis, Mubarak Shah
arXiv_CV
arXiv_CV
Transformer
Represenation_Learning
Pose
Survey
Deep_Learning
Relation
Attention
Medical
CNN
Inference
PDF
2022-06-06
Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal Mahmood
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Self-Supervised
Action
Prediction
PDF
2022-06-06
A computational psycholinguistic evaluation of the syntactic abilities of Galician BERT models at the interface of dependency resolution and training time
Iria de-Dios-Flores, Marcos Garcia
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
Face
Prediction
PDF
2022-06-06
mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation
Yao Zhang, Nanjun He, Jiawei Yang, Yuexiang Li, Dong Wei, Yawen Huang, Yang Zhang, Zhiqiang He, Yefeng Zheng
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Relation
Medical
CNN
PDF
2022-06-06
Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos Of Team Sports
Bharath Comandur
arXiv_CV
arXiv_CV
Transformer
Embedding
Pose
Re-identification
CNN
PDF
2022-06-06
MASNet:Improve Performance of Siamese Networks with Mutual-attention for Remote Sensing Change Detection Tasks
Hongbin Zhou, Yupeng Ren, Qiankun Li, Jun Yin, Yonggang Lin
arXiv_AI
arXiv_AI
Transformer
Action
Detection
Attention
CNN
PDF
2022-06-05
Performance Comparison of Simple Transformer and Res-CNN-BiLSTM for Cyberbullying Classification
Raunak Joshi, Abhishek Gupta
arXiv_CL
arXiv_CL
Transformer
Embedding
Text_Classification
RNN
Classification
Deep_Learning
PDF
2022-06-05
Recurrent Video Restoration Transformer with Guided Deformable Attention
Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van Gool
arXiv_CV
arXiv_CV
Transformer
Super_Resolution
Restoration
Pose
Denoising
Attention
PDF
2022-06-05
Federated Adversarial Training with Transformers
Ahmed Aldahdooh, Wassim Hamidouche, Olivier Déforges
arXiv_CV
arXiv_CV
Transformer
Knowledge
Adversarial
Pose
Classification
CNN
PDF
2022-06-05
Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu Chang
arXiv_CV
arXiv_CV
Transformer
Embedding
Knowledge
Speech
VQA
QA
PDF
2022-06-05
Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders
Xiang Kong, Adithya Renduchintala, James Cross, Yuqing Tang, Jiatao Gu, Xian Li
arXiv_CL
arXiv_CL
Transformer
Pose
Inference
PDF
2022-06-04
Learning Speaker-specific Lip-to-Speech Generation
Munender Varshney, Ravindra Yadav, Vinay P. Namboodiri, Rajesh M Hegde
arXiv_CV
arXiv_CV
Transformer
Embedding
Speech
Quantitative
Relation
PDF
2022-06-04
Actuarial Applications of Natural Language Processing Using Transformers: Case Studies for Using Text Features in an Actuarial Context
Andreas Troxler (AT Analytics), Jürg Schelldorfer (Swiss Re)
arXiv_CL
arXiv_CL
Transformer
Transfer_Learning
Classification
Prediction
PDF
2022-06-04
CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasks
Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jingrong Jiang, Qianjin Guo, Linghan Zheng
arXiv_AI
arXiv_AI
Transformer
Unsupervised
Detection
Relation
Attention
CNN
Inference
PDF
2022-06-04
Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
Mingjie Li, Wenjia Cai, Karin Verspoor, Shirui Pan, Xiaodan Liang, Xiaojun Chang
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Action
Relation
Medical
Inference
PDF
2022-06-04
Video-based Human-Object Interaction Detection from Tubelet Tokens
Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, Wei Shen
arXiv_CV
arXiv_CV
Transformer
Action
Detection
Attention
PDF
2022-06-04
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Zhewei Yao, Reza Yazdani Aminabadi, Minjia Zhang, Xiaoxia Wu, Conglong Li, Yuxiong He
arXiv_CL
arXiv_CL
Transformer
Quantization
Bert
Knowledge
Attention
Inference
Language_Model
PDF
2022-06-04
Extreme Compression for Pre-trained Transformers Made Simple and Efficient
Xiaoxia Wu, Zhewei Yao, Minjia Zhang, Conglong Li, Yuxiong He
arXiv_CL
arXiv_CL
Transformer
Quantization
Bert
Knowledge
Pose
PDF
2022-06-03
Uncertainty Estimation in Machine Learning
Valentin Arkov
arXiv_AI
arXiv_AI
Transformer
Survey
Prediction
PDF
2022-06-03
Anomaly detection in surveillance videos using transformer based attention model
Kapil Deshpande, Narinder Singh Punn, Sanjay Kumar Sonbhadra, Sonali Agarwal
arXiv_CV
arXiv_CV
Transformer
Surveillance
Weakly_Supervised
Pose
Detection
Attention
PDF
2022-06-03
YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detection
Xiao Ruiqiang
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Deep_Learning
Detection
Attention
Inference
PDF
2022-06-03
Transformer-Based Self-Supervised Learning for Emotion Recognition
Juan Vazquez-Rodriguez (M-PSI), Grégoire Lefebvre, Julien Cumin, James L. Crowley (M-PSI)
arXiv_AI
arXiv_AI
Transformer
Recognition
Self-Supervised
Pose
Emotion
Attention
PDF
2022-06-03
Exploring Transformers for Behavioural Biometrics: A Case Study in Gait Recognition
Paula Delgado-Santos, Ruben Tolosana, Richard Guest, Farzin Deravi, Ruben Vera-Rodriguez
arXiv_CV
arXiv_CV
Transformer
Recognition
Gait_Recognition
RNN
Knowledge
Pose
Deep_Learning
Attention
CNN
PDF
2022-06-03
Fair Classification via Transformer Neural Networks: Case Study of an Educational Domain
Modar Sulaiman, Kallol Roy
arXiv_AI
arXiv_AI
Transformer
Knowledge
Classification
PDF
2022-06-03
Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation
Yanglan Ou, Ye Yuan, Xiaolei Huang, Stephen T.C. Wong, John Volpi, James Z. Wang, Kelvin Wong
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Action
Relation
Medical
Inference
PDF
2022-06-02
MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems
Keyur Faldu, Amit Sheth, Prashant Kikani, Darshan Patel
arXiv_CL
arXiv_CL
Transformer
Bert
Adversarial
Relation
PDF
2022-06-02
Entangled Residual Mappings
Mathias Lechner, Ramin Hasani, Zahra Babaiee, Radu Grosu, Daniela Rus, Thomas A. Henzinger, Sepp Hochreiter
arXiv_AI
arXiv_AI
Transformer
Sparse
Represenation_Learning
RNN
Relation
Attention
CNN
PDF
2022-06-02
EfficientFormer: Vision Transformers at MobileNet Speed
Yanyu Li, Geng Yuan, Yang Wen, Eric Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian Ren
arXiv_CV
arXiv_CV
Transformer
NAS
Attention
CNN
Inference
PDF
2022-06-02
Optimizing Relevance Maps of Vision Transformers Improves Robustness
Hila Chefer, Idan Schwartz, Lior Wolf
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Pose
Classification
PDF
2022-06-02
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li, Junyu Chen, Yucheng Tang, Bennett A. Landman, S. Kevin Zhou
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Segmentation
Enhancement
Recognition
Review
Deep_Learning
Detection
GAN
Medical
CNN
PDF
2022-06-02
VL-BEiT: Generative Vision-Language Pretraining
Hangbo Bao, Wenhui Wang, Li Dong, Furu Wei
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Classification
VQA
Image_Classification
Language_Model
Prediction
PDF
2022-06-02
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger, Robert Platt, Christopher Amato
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
RNN
Pose
Attention
PDF
2022-06-02
CVM-Cervix: A Hybrid Cervical Pap-Smear Image Classification Framework Using CNN, Visual Transformer and Multilayer Perceptron
Wanli Liu, Chen Li, Ning Xu, Tao Jiang, Md Mamunur Rahaman, Hongzan Sun, Xiangchen Wu, Weiming Hu, Haoyuan Chen, Changhao Sun, Yudong Yao, Marcin Grzegorzek
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Classification
Deep_Learning
CNN
Image_Classification
PDF
2022-06-02
SparseDet: Towards End-to-End 3D Object Detection
Jianhong Han, Zhaoyi Wan, Zhe Liu, Jie Feng, Bingfeng Zhou
arXiv_AI
arXiv_AI
Transformer
Point_Cloud
3D
Sparse
Pose
Classification
Detection
Object_Detection
PDF
2022-06-02
The ParlaSent-BCS dataset of sentiment-annotated parliamentary debates from Bosnia-Herzegovina, Croatia, and Serbia
Michal Mochtak, Peter Rupnik, Nikola Ljubešič
arXiv_CL
arXiv_CL
Transformer
Review
Classification
Detection
Sentiment
PDF
2022-06-02
Modeling Image Composition for Complex Scene Generation
Zuopeng Yang, Daqing Liu, Chaoyue Wang, Jie Yang, Dacheng Tao
arXiv_CV
arXiv_CV
Transformer
Scene_Generation
Pose
Quantitative
Relation
Few-Shot
Attention
PDF
2022-06-02
KPGT: Knowledge-Guided Pre-training of Graph Transformer for Molecular Property Prediction
Han Li, Dan Zhao, Jianyang Zeng
arXiv_AI
arXiv_AI
Transformer
Represenation_Learning
Knowledge
Self-Supervised
Pose
Deep_Learning
Attention
Prediction
PDF
2022-06-02
MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet
Nan Wang, Shaohui Lin, Xiaoxiao Li, Ke Li, Yunhang Shen, Yue Gao, Lizhuang Ma
arXiv_CV
arXiv_CV
Transformer
Segmentation
3D
Pose
Action
Attention
Medical
Inference
PDF
2022-06-02
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Sehoon Kim, Amir Gholami, Albert Shaw, Nicholas Lee, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Kurt Keutzer
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Pose
Attention
Speech_Recognition
CNN
Language_Model
PDF
2022-06-02
BayesFormer: Transformer with Uncertainty Estimation
Karthik Abinav Sankararaman, Sinong Wang, Han Fang
arXiv_AI
arXiv_AI
Transformer
Pose
Classification
Inference
Language_Model
PDF
2022-06-02
XBound-Former: Toward Cross-scale Boundary Modeling in Transformers
Jiacheng Wang, Fei Chen, Yuxi Ma, Liansheng Wang, Zhaodong Fei, Jianwei Shuai, Xiangdong Tang, Qichao Zhou, Jing Qin
arXiv_AI
arXiv_AI
Transformer
Segmentation
Knowledge
Pose
Quantitative
Attention
PDF
2022-06-01
Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction
Jun Chen, Ming Hu, Boyang Li, Mohamed Elhoseiny
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Segmentation
Semantic_Segmentation
Self-Supervised
Pose
Classification
Detection
Object_Detection
Image_Classification
PDF
2022-06-01
A Multi-Policy Framework for Deep Learning-Based Fake News Detection
João Vitorino, Tiago Dias, Tiago Fonseca, Nuno Oliveira, Isabel Praça
arXiv_CL
arXiv_CL
Transformer
Bert
RNN
Pose
Deep_Learning
Detection
PDF
2022-06-01
Dynamic Linear Transformer for 3D Biomedical Image Segmentation
Zheyuan Zhang, Ulas Bagci
arXiv_CV
arXiv_CV
Transformer
Segmentation
3D
Pose
Attention
Medical
PDF
2022-06-01
Extreme Floorplan Reconstruction by Structure-Hallucinating Transformer Cascades
Sepidehsadat Hosseini, Yasutaka Furukawa
arXiv_AI
arXiv_AI
Transformer
Reconstruction
Pose
Quantitative
CNN
PDF
2022-06-01
Unifying Voxel-based Representation with Transformer for 3D Object Detection
Yanwei Li, Yilun Chen, Xiaojuan Qi, Zeming Li, Jian Sun, Jiaya Jia
arXiv_CV
arXiv_CV
Transformer
Point_Cloud
3D
Knowledge
Pose
Action
Detection
Object_Detection
PDF
2022-06-01
CLIP4IDC: CLIP for Image Difference Captioning
Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Caption
Image_Classification
PDF
2022-06-01
Dynaformer: A Deep Learning Model for Ageing-aware Battery Discharge Prediction
Luca Biggio, Tommaso Bendinelli, Chetan Kulkarni, Olga Fink
arXiv_AI
arXiv_AI
Transformer
Pose
Deep_Learning
Prediction
PDF
2022-06-01
The Fully Convolutional Transformer for Medical Image Segmentation
Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, Dirk Husmeier
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Segmentation
Pose
Medical
CNN
PDF
2022-06-01
Romantic-Computing
Elizabeth Horishny
arXiv_CL
arXiv_CL
Transformer
RNN
Face
Text_Generation
PDF
2022-06-01
Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
Guglielmo Camporese, Elena Izzo, Lamberto Ballan
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Pose
Relation
PDF
2022-06-01
A comparative study between vision transformers and CNNs in digital pathology
Luca Deininger, Bernhard Stimpel, Anil Yuce, Samaneh Abbasi-Sureshjani, Simon Schönenberger, Paolo Ocampo, Konstanty Korski, Fabien Gaire
arXiv_CV
arXiv_CV
Transformer
Sparse
Self-Supervised
Classification
Detection
CNN
Prediction
PDF
2022-06-01
CellCentroidFormer: Combining Self-attention and Convolution for Cell Detection
Royden Wagner, Karl Rohr
arXiv_CV
arXiv_CV
Transformer
Transfer_Learning
Pose
Deep_Learning
Detection
Attention
CNN
PDF
2022-06-01
On Layer Normalizations and Residual Connections in Transformers
Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki
arXiv_CL
arXiv_CL
Transformer
Pose
Text_Generation
PDF
2022-06-01
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang
arXiv_CV
arXiv_CV
Transformer
Recognition
OCR
Self-Supervised
Pose
Language_Model
PDF
2022-06-01
Vision GNN: An Image is Worth Graph of Nodes
Kai Han, Yunhe Wang, Jianyuan Guo, Yehui Tang, Enhua Wu
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Deep_Learning
Detection
Object_Detection
CNN
PDF
2022-06-01
Visual Transformer for Object Detection
Michael Yang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Pose
Action
Detection
Object_Detection
Attention
Caption
CNN
PDF
2022-06-01
Fair Comparison between Efficient Attentions
Jiuk Hong, Chaehyeon Lee, Soyoun Bang, Heechul Jung
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Prediction
PDF
2022-06-01
Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token Alignment
Jinhong Deng, Xiaoyue Zhang, Wen Li, Lixin Duan
arXiv_CV
arXiv_CV
Transformer
Embedding
Adversarial
Pose
Detection
Relation
Object_Detection
Attention
PDF
2022-06-01
THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption
Tianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li
arXiv_CL
arXiv_CL
Transformer
Pose
Medical
Inference
Language_Model
PDF
2022-06-01
Differentiable Soft-Masked Attention
Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Attention
PDF
2022-06-01
Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation
Leyla Khaleghi, Joshua Marshall, Ali Etemad
arXiv_CV
arXiv_CV
Transformer
Embedding
3D
Pose_Estimation
Pose
Action
Attention
CNN
PDF
2022-05-31
VALHALLA: Visual Hallucination for Machine Translation
Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu (Richard) Chen, Rogerio Feris, David Cox, Nuno Vasconcelos
arXiv_CV
arXiv_CV
Transformer
Attention
Inference
Prediction
PDF
2022-05-31
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving
Kashyap Chitta, Aditya Prakash, Bernhard Jaeger, Zehao Yu, Katrin Renz, Andreas Geiger
arXiv_AI
arXiv_AI
Transformer
Pose
Detection
Object_Detection
Attention
Autonomous
PDF
2022-05-31
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy, Ziwei Liu
arXiv_CV
arXiv_CV
Transformer
Human_Parsing
Pose
Quantitative
Prediction
PDF
2022-05-31
You Can't Count on Luck: Why Decision Transformers Fail in Stochastic Environments
Keiran Paster, Sheila McIlraith, Jimmy Ba
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Pose
Action
Prediction
PDF
2022-05-31
Inferring 3D change detection from bitemporal optical images
Valerio Marsocci, Virginia Coletta, Roberta Ravanelli, Simone Scardapane, Mattia Crespi
arXiv_CV
arXiv_CV
Transformer
3D
Pose
Deep_Learning
Detection
CNN
Inference
PDF
2022-05-31
Surface Analysis with Vision Transformers
Simon Dahan, Logan Z. J. Williams, Abdulah Fawaz, Daniel Rueckert, Emma C. Robinson
arXiv_CV
arXiv_CV
Transformer
Pose
Face
Attention
CNN
Prediction
PDF
2022-05-31
GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity
Iknoor Singh, Yue Li, Melissa Thong, Carolina Scarton
arXiv_AI
arXiv_AI
Transformer
Pose
PDF
2022-05-31
SymFormer: End-to-end symbolic regression using transformer-based architecture
Vastl, Martin, Kulhánek, Jonáš, Kubalík, Jiří, Derner, Erik, Babuška, Robert
arXiv_CV
arXiv_CV
Transformer
3D
Sparse
Pose_Estimation
Pose
Attention
PDF
2022-05-31
Transformers for Multi-Object Tracking on Point Clouds
Felicia Ruppel, Florian Faion, Claudius Gläser, Klaus Dietmayer
arXiv_CV
arXiv_CV
Transformer
Tracking
Point_Cloud
Object_Tracking
Pose
Detection
Object_Detection
Attention
Prediction
PDF
2022-05-31
Multilingual Transformers for Product Matching -- Experiments and a New Benchmark in Polish
Michał Mo{ż}d{ż}onek, Anna Wróblewska, Sergiy Tkachuk, Szymon Łukasik
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
Matching
PDF
2022-05-31
ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation
Pramit Dutta, Ganesh Sistu, Senthil Yogamani, Edgar Galván, John McDonald
arXiv_CV
arXiv_CV
Transformer
Segmentation
CNN
Autonomous
PDF
2022-05-31
Weakly-supervised Action Transition Learning for Stochastic Human Motion Prediction
Wei Mao, Miaomiao Liu, Mathieu Salzmann
arXiv_AI
arXiv_AI
Transformer
RNN
Pose
Action
Prediction
PDF
2022-05-31
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts
Bingqian Lin, Yi Zhu, Zicong Chen, Xiwen Liang, Jianzhuang Liu, Xiaodan Liang
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
Action
Prediction
PDF
2022-05-31
Joint Spatial-Temporal and Appearance Modeling with Transformer for Multiple Object Tracking
Peng Dai, Yiqiang Feng, Renliang Weng, Changshui Zhang
arXiv_CV
arXiv_CV
Transformer
Tracking
Object_Tracking
Pose
Deep_Learning
Detection
Relation
Attention
PDF
2022-05-31
Learning to Represent Programs with Code Hierarchies
Minh Nguyen, Nghi D. Q. Bui
arXiv_AI
arXiv_AI
Transformer
Pose
Classification
Detection
CNN
Prediction
PDF
2022-05-30
Few-Shot Diffusion Models
Giorgio Giannone, Didrik Nielsen, Ole Winther
arXiv_CV
arXiv_CV
Transformer
Few-Shot
Denoising
Inference
PDF
2022-05-30
HeatER: An Efficient and Unified Network for Human Reconstruction via Heatmap-based TransformER
Ce Zheng, Matias Mendieta, Taojiannan Yang, Chen Chen
arXiv_AI
arXiv_AI
Transformer
Reconstruction
3D
Pose_Estimation
Pose
Attention
PDF
2022-05-30
Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small Datasets
Leandro M. de Lima, Renato A. Krohling
arXiv_CV
arXiv_CV
Transformer
Pose
CNN
PDF
2022-05-30
TubeFormer-DeepLab: Video Mask Transformer
Dahun Kim, Jun Xie, Huiyu Wang, Siyuan Qiao, Qihang Yu, Hong-Seok Kim, Hartwig Adam, In So Kweon, Liang-Chieh Chen
arXiv_CV
arXiv_CV
Transformer
Segmentation
PDF
2022-05-30
Can Transformer be Too Compositional? Analysing Idiom Processing in Neural Machine Translation
Verna Dankers, Christopher G. Lucas, Ivan Titov
arXiv_CL
arXiv_CL
Transformer
Action
NMT
Attention
PDF
2022-05-30
Zero-Shot and Few-Shot Learning for Lung Cancer Multi-Label Classification using Vision Transformer
Fu-Ming Guo, Yingfang Fan
arXiv_AI
arXiv_AI
Transformer
Zero-Shot
Classification
Few-Shot
PDF
2022-05-30
Multi-Game Decision Transformers
Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
PDF
2022-05-30
Self-Supervised Pre-training of Vision Transformers for Dense Prediction Tasks
Jaonary Rabarisoa, Velentin Belissen, Florian Chabot, Quoc-Cuong Pham
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Segmentation
Semantic_Segmentation
Self-Supervised
Pose
Prediction
PDF
2022-05-30
Robotic grasp detection based on Transformer
Mingshuai Dong, Xiuli Yu
arXiv_RO
arXiv_RO
Transformer
Pose
Action
Detection
CNN
PDF
2022-05-30
Transformer with Tree-order Encoding for Neural Program Generation
Klaudia-Doris Thellmann, Bernhard Stadler, Ricardo Usbeck, Jens Lehmann
arXiv_CL
arXiv_CL
Transformer
RNN
Attention
PDF
2022-05-30
Chefs' Random Tables: Non-Trigonometric Random Features
Valerii Likhosherstov, Krzysztof Choromanski, Avinava Dubey, Frederick Liu, Tamas Sarlos, Adrian Weller
arXiv_AI
arXiv_AI
Transformer
Knowledge
Speech
Classification
PDF
2022-05-30
CompleteDT: Point Cloud Completion with Dense Augment Inference Transformers
Jun Li, Shangwei Guo, Zhengchao Lai, Xiantong Meng, Shaokun Han
arXiv_CV
arXiv_CV
Transformer
Point_Cloud
Pose
Relation
Attention
Inference
PDF
2022-05-30
GMML is All you Need
Sara Atito, Muhammad Awais, Josef Kittler
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Pose
PDF
2022-05-30
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Xiaosong Zhang, Yunjie Tian, Wei Huang, Qixiang Ye, Qi Dai, Lingxi Xie, Qi Tian
arXiv_CV
arXiv_CV
Transformer
Segmentation
Transfer_Learning
Self-Supervised
Pose
Detection
Attention
PDF
2022-05-30
Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving
Peixuan Li, Jieyu Jin
arXiv_CV
arXiv_CV
Transformer
Tracking
3D
Object_Tracking
Pose
Detection
Relation
Object_Detection
Attention
Autonomous
PDF
2022-05-30
Easter2.0: Improving convolutional models for handwritten text recognition
Kartik Chaudhary, Raghav Bali
arXiv_AI
arXiv_AI
Transformer
Handwriting
Recognition
OCR
RNN
Pose
Classification
Few-Shot
CNN
PDF
2022-05-30
Illumination Adaptive Transformer
Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Pose
Detection
Object_Detection
PDF
2022-05-30
Rites de Passage: Elucidating Displacement to Emplacement of Refugees
Aparup Khatua, Wolfgang Nejdl
arXiv_CL
arXiv_CL
Transformer
Recognition
Bert
RNN
Pose
Language_Model
PDF
2022-05-30
Anti-virus Autobots: Predicting More Infectious Virus Variants for Pandemic Prevention through Deep Learning
Glenda Tan Hui En, Koay Tze Erhn, Shen Bingquan
arXiv_AI
arXiv_AI
Transformer
Pose
Deep_Learning
PDF
2022-05-30
Exposing Fine-grained Adversarial Vulnerability of Face Anti-spoofing Models
Songlin Yang, Wei Wang, Chenye Xu, Bo Peng, Jing Dong
arXiv_CV
arXiv_CV
Transformer
Adversarial
Pose
Face
Classification
PDF
2022-05-30
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Aniket Didolkar, Kshitij Gupta, Anirudh Goyal, Alex Lamb, Nan Rosemary Ke, Yoshua Bengio
arXiv_AI
arXiv_AI
Transformer
RNN
Pose
Attention
PDF
2022-05-29
EfficientViT: Enhanced Linear Attention for High-Resolution Low-Computation Visual Recognition
Han Cai, Chuang Gan, Song Han
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Recognition
Pose
Action
Detection
Object_Detection
Attention
CNN
PDF
2022-05-29
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong, Ming Ding, Wendi Zheng, Xinghan Liu, Jie Tang
arXiv_CV
arXiv_CV
Transformer
Pose
PDF
2022-05-29
L3Cube-MahaNLP: Marathi Natural Language Processing Datasets, Models, and Library
Raviraj Joshi
arXiv_CL
arXiv_CL
Transformer
Unsupervised
Recognition
Bert
Speech
Detection
Sentiment
Language_Model
PDF
2022-05-29
Modeling Beats and Downbeats with a Time-Frequency Transformer
Yun-Ning Hung, Ju-Chiang Wang, Xuchen Song, Wei-Tsung Lu, Minz Won
arXiv_SD
arXiv_SD
Transformer
Tracking
Pose
Attention
CNN
PDF
2022-05-29
To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions
Ju-Chiang Wang, Yun-Ning Hung, Jordan B. L. Smith
arXiv_SD
arXiv_SD
Transformer
Pose
Deep_Learning
Detection
PDF
2022-05-29
SFE-AI at SemEval-2022 Task 11: Low-Resource Named Entity Recognition using Large Pre-trained Language Models
Changyu Hou, Jun Wang, Yixuan Qiao, Peng Jiang, Peng Gao, Guotong Xie, Qizhi Lin, Xiaopeng Wang, Xiandi Jiang, Benqi Wang, Qifeng Xiao
arXiv_CL
arXiv_CL
Transformer
Recognition
Pose
Language_Model
PDF
2022-05-29
COFS: Controllable Furniture layout Synthesis
Wamiq Reyaz Para, Paul Guerrero, Niloy Mitra, Peter Wonka
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Quantitative
Language_Model
PDF
2022-05-29
Learning Locality and Isotropy in Dialogue Modeling
Han Wu, Haochen Tan, Mingjie Zhan, Gangming Zhao, Shaoqing Lu, Ding Liang, Linqi Song
arXiv_CL
arXiv_CL
Transformer
Pose
Language_Model
PDF
2022-05-29
3D-C2FT: Coarse-to-fine Transformer for Multi-view 3D Reconstruction
Leslie Ching Ow Tiong, Dick Sigmund, Andrew Beng Jin Teoh
arXiv_AI
arXiv_AI
Transformer
Reconstruction
3D
Pose
Face
Relation
Attention
PDF
2022-05-29
ComplexGen: CAD Reconstruction by B-Rep Chain Complex Generation
Haoxiang Guo, Shilin Liu, Hao Pan, Yang Liu, Xin Tong, Baining Guo
arXiv_AI
arXiv_AI
Transformer
Reconstruction
Point_Cloud
Optimization
Sparse
Pose
Face
Detection
Relation
PDF
2022-05-28
Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization
Puyuan Liu, Chenyang Huang, Lili Mou
arXiv_CL
arXiv_CL
Transformer
Unsupervised
Pose
Summarization
Inference
PDF
2022-05-28
MDMLP: Image Classification from Scratch on Small Datasets with MLP
Tian Lv, Chongyang Bai, Chaojie Wang
arXiv_AI
arXiv_AI
Transformer
Classification
Attention
Image_Classification
PDF
2022-05-28
Variational Transformer: A Framework Beyond the Trade-off between Accuracy and Diversity for Image Captioning
Longzhen Yang, Shaohua Shang, Yihang Liu, Yitao Peng, Lianghua He
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Pose
Relation
Caption
PDF
2022-05-28
A Closer Look at Self-supervised Lightweight Vision Transformers
Shaoru Wang, Jin Gao, Zeming Li, Jian Sun, Weiming Hu
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Classification
PDF
2022-05-28
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li
arXiv_AI
arXiv_AI
Transformer
Reconstruction
Segmentation
Point_Cloud
3D
Represenation_Learning
Self-Supervised
Pose
Classification
Detection
Few-Shot
Object_Detection
Attention
PDF
2022-05-28
WaveMix-Lite: A Resource-efficient Neural Network for Image Analysis
Pranav Jeevan, Kavitha Viswanathan, Amit Sethi
arXiv_AI
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Classification
CNN
Image_Classification
PDF
2022-05-28
Multi-Task Learning with Multi-query Transformer for Dense Prediction
Yangyang Xu, Xiangtai Li, Haobo Yuan, Yibo Yang, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao
arXiv_CV
arXiv_CV
Transformer
Pose
Relation
Attention
Prediction
PDF
2022-05-28
Object-wise Masked Autoencoders for Fast Pre-training
Jiantao Wu, Shentong Mo
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Self-Supervised
Classification
Relation
Attention
Image_Classification
PDF
2022-05-28
RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo
Changjiang Cai, Pan Ji, Yi Xu
arXiv_CV
arXiv_CV
Transformer
Pose
CNN
PDF
2022-05-28
WT-MVSNet: Window-based Transformers for Multi-view Stereo
Jinli Liao, Yikang Ding, Yoli Shavit, Dihe Huang, Shihao Ren, Jia Guo, Wensen Feng, Kai Zhang
arXiv_CV
arXiv_CV
Transformer
3D
Regularization
Pose
Action
Matching
PDF
2022-05-27
TURJUMAN: A Public Toolkit for Neural Arabic Machine Translation
El Moatez Billah Nagoudi, AbdelRahim Elmadany, Muhammad Abdul-Mageed
arXiv_AI
arXiv_AI
Transformer
PDF
2022-05-27
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu, Ce Liu, Lijuan Wang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Recognition
Video_Caption
OCR
Optical_Character
Scene_Text
Classification
Detection
Object_Detection
Caption
Image_Classification
Language_Model
PDF
2022-05-27
Patching Leaks in the Charformer for Efficient Character-Level Generation
Lukas Edman, Antonio Toral, Gertjan van Noord
arXiv_CL
arXiv_CL
Transformer
NMT
PDF
2022-05-27
Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos
Gautam Singh, Yi-Fu Wu, Sungjin Ahn
arXiv_CV
arXiv_CV
Transformer
Unsupervised
Pose
PDF
2022-05-27
Future Transformer for Long-term Action Anticipation
Dayoung Gong, Joonseok Lee, Manjin Kim, Seong Jong Ha, Minsu Cho
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Relation
Attention
Inference
PDF
2022-05-27
What Dense Graph Do You Need for Self-Attention?
Yuxing Wang, Chu-Tak Lee, Qipeng Guo, Zhangyue Yin, Yunhua Zhou, Xuanjing Huang, Xipeng Qiu
arXiv_AI
arXiv_AI
Transformer
Sparse
Pose
Action
Attention
PDF
2022-05-27
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Siyuan Li, Di Wu, Fang Wu, Zelin Zang, Kai Wang, Lei Shang, Baigui Sun, Hao Li, Stan.Z.Li
arXiv_AI
arXiv_AI
Transformer
Knowledge
Self-Supervised
Pose
Action
PDF
2022-05-27
3DILG: Irregular Latent Grids for 3D Generative Modeling
Biao Zhang, Matthias Nießner, Peter Wonka
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Point_Cloud
3D
Sparse
Pose
PDF
2022-05-27
X-ViT: High Performance Linear Vision Transformer without Softmax
Jeonggeun Song, Heung-Chang Lee
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Image_Classification
Prediction
PDF
2022-05-27
NLU for Game-based Learning in Real: Initial Evaluations
Eda Okur, Saurav Sahay, Lama Nachman
arXiv_CL
arXiv_CL
Transformer
Recognition
Pose
Action
PDF
2022-05-27
Understanding Long Programming Languages with Structure-Aware Sparse Attention
Tingting Liu, Chengyu Wang, Cen Chen, Ming Gao, Aoying Zhou
arXiv_AI
arXiv_AI
Transformer
Bert
Sparse
Relation
Attention
Language_Model
PDF
2022-05-27
FedFormer: Contextual Federation with Attention in Reinforcement Learning
Liam Hebert, Lukasz Golab, Pascal Poupart, Robin Cohen
arXiv_AI
arXiv_AI
Transformer
Embedding
Reinforcement_Learning
Pose
Relation
Attention
PDF
2022-05-26
Transformer for Partial Differential Equations' Operator Learning
Zijie Li, Kazem Meidani, Amir Barati Farimani
arXiv_AI
arXiv_AI
Transformer
Pose
Deep_Learning
Relation
Attention
CNN
PDF
2022-05-26
Revealing the Dark Secrets of Masked Image Modeling
Zhenda Xie, Zigang Geng, Jingcheng Hu, Zheng Zhang, Han Hu, Yue Cao
arXiv_AI
arXiv_AI
Transformer
Tracking
Object_Tracking
Pose_Estimation
Pose
Classification
Attention
PDF
2022-05-26
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Action_Recognition
Action
PDF
2022-05-26
Dynamically Relative Position Encoding-Based Transformer for Automatic Code Edit
Shiyi Qi, Yaoxian Li, Cuiyun Gao, Xiaohong Su, Shuzheng Gao, Zibin Zheng, Chuanyi Liu
arXiv_CL
arXiv_CL
Transformer
Pose
NMT
Deep_Learning
Detection
Attention
PDF
2022-05-26
Green Hierarchical Vision Transformer for Masked Image Modeling
Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki
arXiv_CV
arXiv_CV
Transformer
Classification
Detection
Object_Detection
Attention
PDF
2022-05-26
Are Transformers Effective for Time Series Forecasting?
Ailing Zeng, Muxi Chen, Lei Zhang, Qiang Xu
arXiv_AI
arXiv_AI
Transformer
Pose
Action
Detection
Relation
Relation_Extraction
Attention
Prediction
PDF
2022-05-26
SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation
Ziyi Wang, Yongming Rao, Xumin Yu, Jie Zhou, Jiwen Lu
arXiv_AI
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Point_Cloud
3D
Knowledge
Pose
Quantitative
Attention
PDF
2022-05-26
Efficient U-Transformer with Boundary-Aware Loss for Action Segmentation
Dazhao Du, Bing Su, Yu Li, Zhongang Qi, Lingyu Si, Ying Shan
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Action
Classification
Attention
PDF
2022-05-26
Your Transformer May Not be as Powerful as You Expect
Shengjie Luo, Shanda Li, Shuxin Zheng, Tie-Yan Liu, Liwei Wang, Di He
arXiv_CL
arXiv_CL
Transformer
Attention
PDF
2022-05-26
The Document Vectors Using Cosine Similarity Revisited
Zhang Bingyu, Nikolay Arefyev
arXiv_AI
arXiv_AI
Transformer
Bert
Review
Pose
Sentiment
PDF
2022-05-26
Towards Learning Universal Hyperparameter Optimizers with Transformers
Yutian Chen, Xingyou Song, Chansoo Lee, Zi Wang, Qiuyi Zhang, David Dohan, Kazuya Kawakami, Greg Kochanski, Arnaud Doucet, Marc'aurelio Ranzato, Sagi Perel, Nando de Freitas
arXiv_AI
arXiv_AI
Transformer
Optimization
Face
Prediction
PDF
2022-05-26
Cross-Architecture Self-supervised Video Representation Learning
Sheng Guo, Zihua Xiong, Yujie Zhong, Limin Wang, Xiaobo Guo, Bing Han, Weilin Huang
arXiv_CV
arXiv_CV
Transformer
Recognition
3D
Represenation_Learning
Self-Supervised
Contrastive_Learning
Action_Recognition
Action
Video_Retrieval
PDF
2022-05-26
VIDI: A Video Dataset of Incidents
Duygu Sesver, Alp Eren Gençoğlu, Çağrı Emre Yıldız, Zehra Günindi, Faeze Habibi, Ziya Ata Yazıcı, Hazım Kemal Ekenel
arXiv_CV
arXiv_CV
Transformer
Classification
Detection
PDF
2022-05-26
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Bruno Sauvalle, Arnaud de La Fortelle
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Segmentation
Unsupervised
Represenation_Learning
Detection
Object_Detection
Attention
PDF
2022-05-26
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification
Nan Zhang, Jianzong Wang, Zhenhou Hong, Chendong Zhao, Xiaoyang Qu, Jing Xiao
arXiv_SD
arXiv_SD
Transformer
Embedding
Speech
Pose
Attention
PDF
2022-05-26
Fast Vision Transformers with HiLo Attention
Zizheng Pan, Jianfei Cai, Bohan Zhuang
arXiv_AI
arXiv_AI
Transformer
Segmentation
Pose
Classification
Detection
Relation
Attention
Image_Classification
PDF
2022-05-26
AI for Porosity and Permeability Prediction from Geologic Core X-Ray Micro-Tomography
Zangir Iklassov, Dmitrii Medvedev, Otabek Nazarov
arXiv_AI
arXiv_AI
Transformer
Self-Supervised
Deep_Learning
Prediction
PDF
2022-05-26
SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned Distribution Perturbation
Yuan Hu, Lei Chen, Zhibin Wang, Hao Li
arXiv_CV
arXiv_CV
Transformer
Optimization
RNN
Pose
Face
Inference
Prediction
PDF
2022-05-26
MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning
Jihao Liu, Xin Huang, Yu Liu, Hongsheng Li
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Represenation_Learning
Pose
PDF
2022-05-25
BiT: Robustly Binarized Multi-distilled Transformer
Zechun Liu, Barlas Oguz, Aasish Pappu, Lin Xiao, Scott Yih, Meng Li, Raghuraman Krishnamoorthi, Yashar Mehdad
arXiv_CL
arXiv_CL
Transformer
Bert
Optimization
PDF
2022-05-25
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling
Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
Attention
Language_Model
PDF
2022-05-25
Inception Transformer
Chenyang Si, Weihao Yu, Pan Zhou, Yichen Zhou, Xinchao Wang, Shuicheng Yan
arXiv_AI
arXiv_AI
Transformer
Segmentation
Pose
Classification
Detection
Attention
Image_Classification
PDF
2022-05-25
Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors
Liyan Tang, Tanya Goyal, Alexander R. Fabbri, Philippe Laban, Jiacheng Xu, Semih Yahvuz, Wojciech Kryściński, Justin F. Rousseau, Greg Durrett
arXiv_AI
arXiv_AI
Transformer
Detection
Object_Detection
Summarization
PDF
2022-05-25
A Comparative Study of Gastric Histopathology Sub-size Image Classification: from Linear Regression to Visual Transformer
Weiming Hu, Haoyuan Chen, Wanli Liu, Xiaoyan Li, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek, Chen Li
arXiv_CV
arXiv_CV
Transformer
Classification
Deep_Learning
Detection
CNN
Image_Classification
PDF
2022-05-25
AO2-DETR: Arbitrary-Oriented Object Detection Transformer
Linhui Dai, Hong Liu, Hao Tang, Zhiwei Wu, Pinhao Song
arXiv_CV
arXiv_CV
Transformer
Pose
Detection
Object_Detection
Attention
Prediction
Matching
PDF
2022-05-25
Eliciting Transferability in Multi-task Learning with Task-level Mixture-of-Experts
Qinyuan Ye, Juan Zha, Xiang Ren
arXiv_CL
arXiv_CL
Transformer
Knowledge
Pose
Classification
PDF
2022-05-25
Lifelong Learning Natural Language Processing Approach for Multilingual Data Classification
Jędrzej Kozal, Michał Leś, Paweł Zyblewski, Paweł Ksieniewicz, Michał Woźniak
arXiv_CL
arXiv_CL
Transformer
Bert
Knowledge
Pose
Classification
Deep_Learning
Detection
PDF
2022-05-25
MoCoViT: Mobile Convolutional Vision Transformer
Hailong Ma, Xin Xia, Xing Wang, Xuefeng Xiao, Jiashi Li, Min Zheng
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Detection
Object_Detection
Attention
CNN
PDF
2022-05-25
Location-free Human Pose Estimation
Xixia Xu, Yingguo Gao, Ke Yan, Xue Lin, Qi Zou
arXiv_CV
arXiv_CV
Transformer
Pose_Estimation
Pose
Classification
Relation
PDF
2022-05-25
VTP: Volumetric Transformer for Multi-view Multi-person 3D Pose Estimation
Yuxing Chen, Renshu Gu, Ouhan Huang, Gangyong Jia
arXiv_CV
arXiv_CV
Transformer
Embedding
3D
Sparse
Pose_Estimation
Pose
Relation
Attention
CNN
PDF
2022-05-25
RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning
Soumya Sanyal, Zeyi Liao, Xiang Ren
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
Language_Model
PDF
2022-05-25
Breaking the Chain of Gradient Leakage in Vision Transformers
Yahui Liu, Bin Ren, Yue Song, Wei Bi, Nicu Sebe, Wei Wang
arXiv_CV
arXiv_CV
Transformer
Embedding
Pose
PDF
2022-05-25
Eye-gaze-guided Vision Transformer for Rectifying Shortcut Learning
Chong Ma, Lin Zhao, Yuzhong Chen, Lu Zhang, Zhenxiang Xiao, Haixing Dai, David Liu, Zihao Wu, Zhengliang Liu, Sheng Wang, Jiaxing Gao, Changhe Li, Xi Jiang, Tuo Zhang, Qian Wang, Dinggang Shen, Dajiang Zhu, Tianming Liu
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Relation
Attention
Medical
PDF
2022-05-24
AdaMix: Mixture-of-Adapter for Parameter-efficient Tuning of Large Language Models
Yaqing Wang, Subhabrata Mukherjee, Xiaodong Liu, Jing Gao, Ahmed Hassan Awadallah, Jianfeng Gao
arXiv_AI
arXiv_AI
Transformer
Sparse
Pose
Few-Shot
Language_Model
PDF
2022-05-24
FLUTE: Figurative Language Understanding and Textual Explanations
Tuhin Chakrabarty, Arkadiy Saakyan, Debanjan Ghosh, Smaranda Muresan
arXiv_CL
arXiv_CL
Transformer
Relation
Inference
Language_Model
PDF
2022-05-24
Garden-Path Traversal within GPT-2
William Jurayj, William Rudman, Carsten Eickhoff
arXiv_CL
arXiv_CL
Transformer
Bert
Language_Model
PDF
2022-05-24
FreDo: Frequency Domain-based Long-Term Time Series Forecasting
Fan-Keng Sun, Duane S. Boning
arXiv_AI
arXiv_AI
Transformer
Pose
PDF
2022-05-24
History Compression via Language Models in Reinforcement Learning
Fabian Paischer, Thomas Adler, Vihang Patil, Angela Bitto-Nemling, Markus Holzleitner, Sebastian Lehner, Hamid Eghbal-zadeh, Sepp Hochreiter
arXiv_CL
arXiv_CL
Transformer
Embedding
Reinforcement_Learning
Pose
Language_Model
PDF
2022-05-24
TALM: Tool Augmented Language Models
Aaron Parisi, Yao Zhao, Noah Fiedel
arXiv_AI
arXiv_AI
Transformer
Knowledge
Inference
Language_Model
QA
PDF
2022-05-24
ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions
Difan Liu, Sandesh Shetty, Tobias Hinz, Matthew Fisher, Richard Zhang, Taesung Park, Evangelos Kalogerakis
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Action
Quantitative
Attention
PDF
2022-05-24
PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised Poetry Generation
Aitor Ormazabal, Mikel Artetxe, Manex Agirrezabal, Aitor Soroa, Eneko Agirre
arXiv_AI
arXiv_AI
Transformer
Unsupervised
Pose
Inference
Language_Model
PDF
2022-05-24
Privacy-Preserving Image Classification Using Vision Transformer
Zheng Qi, AprilPyone MaungMaung, Yuma Kinoshita, Hitoshi Kiya
arXiv_CV
arXiv_CV
Transformer
Embedding
Pose
Classification
Image_Classification
PDF
2022-05-24
RetroMAE: Pre-training Retrieval-oriented Transformers via Masked Auto-Encoder
Zheng Liu, Yingxia Shao
arXiv_CL
arXiv_CL
Transformer
Reconstruction
Embedding
Bert
Pose
PDF
2022-05-24
Analysing the Greek Parliament Records with Emotion Classification
John Pavlopoulos, Vanessa Lislevand
arXiv_CL
arXiv_CL
Transformer
Speech
Emotion
Classification
Sentiment
Language_Model
PDF
2022-05-24
Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition
Yuting Yang, Binbin Du, Yuke Li
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Speech_Recognition
Inference
Language_Model
PDF
2022-05-24
Community Question Answering Entity Linking via Leveraging Auxiliary Data
Yuhan Li, Wei Shen, Jianbo Gao, Yadong Wang
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
QA
PDF
2022-05-24
Unsupervised Difference Learning for Noisy Rigid Image Alignment
Yu-Xuan Chen, Dagan Feng, Hong-Bin Shen
arXiv_CV
arXiv_CV
Transformer
Unsupervised
Quantitative
PDF
2022-05-24
Symbolic Expression Transformer: A Computer Vision Approach for Symbolic Regression
Jiachen Li, Ye Yuan, Hong-Bin Shen
arXiv_AI
arXiv_AI
Image_Caption
Transformer
Pose
Caption
PDF
2022-05-24
Meta Policy Learning for Cold-Start Conversational Recommendation
Zhendong Chu, Hongning Wang, Yun Xiao, Bo Long, Lingfei Wu
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Pose
Recommendation
PDF
2022-05-24
UMSNet: An Universal Multi-sensor Network for Human Activity Recognition
Jialiang Wang, Haotian Wei, Yi Wang, Shu Yang, Chi Li
arXiv_AI
arXiv_AI
Transformer
Recognition
Pose
Classification
Relation
Activity
PDF
2022-05-24
BabyBear: Cheap inference triage for expensive language models
Leila Khalili, Yao You, John Bohannon
arXiv_CL
arXiv_CL
Transformer
Recognition
Classification
Deep_Learning
Inference
Language_Model
Prediction
PDF
2022-05-24
PERT: A New Solution to Pinyin to Character Conversion Task
Jinghui Xiao, Qun Liu, Xin Jiang, Yuanfeng Xiong, Haiteng Wu, Zhe Zhang
arXiv_AI
arXiv_AI
Transformer
RNN
Language_Model
PDF
2022-05-24
SCVRL: Shuffled Contrastive Video Representation Learning
Michael Dorkenwald, Fanyi Xiao, Biagio Brattoli, Joseph Tighe, Davide Modolo
arXiv_CV
arXiv_CV
Transformer
Represenation_Learning
Self-Supervised
Pose
Contrastive_Learning
PDF
2022-05-24
Workflow Discovery from Dialogues in the Low Data Regime
Amine El Hattami, Stefania Raimondo, Issam Laradji, David Vazquez, Pau Rodriguez, Chris Pal
arXiv_CL
arXiv_CL
Transformer
Tracking
Zero-Shot
Pose
Action
Few-Shot
GAN
PDF
2022-05-23
FlexiBERT: Are Current Transformer Architectures too Homogeneous and Rigid?
Shikhar Tuli, Bhishma Dedhia, Shreshth Tuli, Niraj K. Jha
arXiv_CL
arXiv_CL
Transformer
NAS
Embedding
Bert
Optimization
Pose
Language_Model
PDF
2022-05-23
TransforMatcher: Match-to-Match Attention for Semantic Correspondence
Seungwook Kim, Juhong Min, Minsu Cho
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Relation
Attention
Matching
PDF
2022-05-23
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer
Javier Ferrando, Gerard I. Gállego, Belen Alastruey, Carlos Escolano, Marta R. Costa-jussà
arXiv_CL
arXiv_CL
Transformer
Pose
NMT
Prediction
PDF
2022-05-23
Simple Recurrence Improves Masked Language Models
Tao Lei, Ran Tian, Jasmijn Bastings, Ankur P. Parikh
arXiv_AI
arXiv_AI
Transformer
Bert
Optimization
Language_Model
PDF
2022-05-23
HyperTree Proof Search for Neural Theorem Proving
Guillaume Lample, Marie-Anne Lachaux, Thibaut Lavril, Xavier Martinet, Amaury Hayat, Gabriel Ebner, Aurélien Rodriguez, Timothée Lacroix
arXiv_AI
arXiv_AI
Transformer
Pose
PDF
2022-05-23
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi
arXiv_CV
arXiv_CV
Transformer
GAN
Language_Model
PDF
2022-05-23
A Question-Answer Driven Approach to Reveal Affirmative Interpretations from Verbal Negations
Md Mosharaf Hossain, Luke Holman, Anusha Kakileti, Tiffany Iris Kao, Nathan Raul Brito, Aaron Abraham Mathews, Eduardo Blanco
arXiv_CL
arXiv_CL
Transformer
Classification
Inference
PDF
2022-05-23
Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers
Luis Espinosa-Anke, Alexander Shvets, Alireza Mohammadshahi, James Henderson, Leo Wanner
arXiv_CL
arXiv_CL
Transformer
Recognition
Bert
Action
PDF
2022-05-23
Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities
Benjamin Hsu, Graham Horwood
arXiv_CL
arXiv_CL
Transformer
Represenation_Learning
Contrastive_Learning
Classification
Inference
PDF
2022-05-23
Super Vision Transformer
Mingbao Lin, Mengzhao Chen, Yuxin Zhang, Ke Li, Yunhang Shen, Chunhua Shen, Rongrong Ji
arXiv_CV
arXiv_CV
Transformer
Recognition
Inference
PDF
2022-05-23
Outliers Dimensions that Disrupt Transformers Are Driven by Frequency
Giovanni Puccetti, Anna Rogers, Aleksandr Drozd, Felice Dell'Orletta
arXiv_AI
arXiv_AI
Transformer
Embedding
Bert
Relation
Attention
Language_Model
PDF
2022-05-23
Use of Transformer-Based Models for Word-Level Transliteration of the Book of the Dean of Lismore
Edward Gow-Smith, Mark McConville, William Gillies, Jade Scott, Roibeard Ó Maolalaigh
arXiv_CL
arXiv_CL
Transformer
PDF
2022-05-23
ScholarBERT: Bigger is Not Always Better
Zhi Hong, Aswathy Ajith, Gregory Pauloski, Eamon Duede, Carl Malamud, Roger Magoulas, Kyle Chard, Ian Foster
arXiv_CL
arXiv_CL
Transformer
Bert
Language_Model
PDF
2022-05-23
Sample Efficient Approaches for Idiomaticity Detection
Dylan Phelps, Xuan-Rui Fan, Edward Gow-Smith, Harish Tayyar Madabushi, Carolina Scarton, Aline Villavicencio
arXiv_CL
arXiv_CL
Transformer
Embedding
Bert
Classification
Detection
Few-Shot
Language_Model
PDF
2022-05-23
SelfReformer: Self-Refined Network with Transformer for Salient Object Detection
Yi Ke Yun, Weisi Lin
arXiv_CV
arXiv_CV
Transformer
Super_Resolution
Salient
Pose
Detection
Object_Detection
Prediction
PDF
2022-05-23
DistilCamemBERT: a distillation of the French model CamemBERT
Cyrile Delestre, Abibatou Amar
arXiv_CL
arXiv_CL
Transformer
Bert
PDF
2022-05-23
MonoFormer: Towards Generalization of self-supervised monocular depth estimation with Transformers
Jinwoo Bae, Sungho Moon, Sunghoon Im
arXiv_AI
arXiv_AI
Transformer
Self-Supervised
Pose
PDF
2022-05-23
BanglaNLG: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Abhik Bhattacharjee, Tahmid Hasan, Wasi Uddin Ahmad, Rifat Shahriyar
arXiv_CL
arXiv_CL
Transformer
Text_Generation
Language_Model
PDF
2022-05-23
AdaptivePaste: Code Adaptation through Learning Semantics-aware Variable Usage Representations
Xiaoyu Liu, Jinu Jang, Neel Sundaresan, Miltiadis Allamanis, Alexey Svyatkovskiy
arXiv_CL
arXiv_CL
Transformer
PDF
2022-05-22
Dynamic Query Selection for Fast Visual Perceiver
Corentin Dancette, Matthieu Cord
arXiv_CV
arXiv_CV
Transformer
Attention
CNN
Inference
Matching
PDF
2022-05-22
Relphormer: Relational Graph Transformer for Knowledge Graph Representation
Zhen Bi, Siyuan Cheng, Ningyu Zhang, Xiaozhuan Liang, Feiyu Xiong, Huajun Chen
arXiv_CL
arXiv_CL
Transformer
Represenation_Learning
Knowledge
Knowledge_Graph
Pose
Relation
Attention
Prediction
PDF
2022-05-22
Knowledge Distillation via the Target-aware Transformer
Sihao Lin, Hongwei Xie, Bing Wang, Kaicheng Yu, Xiaojun Chang, Xiaodan Liang, Gang Wang
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Matching
PDF
2022-05-22
A Domain-adaptive Pre-training Approach for Language Bias Detection in News
Jan-David Krieger, Timo Spinde, Terry Ruas, Juhi Kulshrestha, Bela Gipp
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
Face
Detection
PDF
2022-05-22
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts
Shanya Sharma, Manan Dey, Koustuv Sinha
arXiv_CL
arXiv_CL
Transformer
Pose
Inference
PDF
2022-05-22
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang, Tianqi Liu, Jialu Liu, Adam D. Lelkes, Cong Yu, Jiawei Han
arXiv_CL
arXiv_CL
Transformer
Text_Classification
Knowledge
Pose
Classification
Inference
PDF
2022-05-21
Life after BERT: What do Other Muppets Understand about Language?
Vladislav Lialin, Kevin Zhao, Namrata Shivagunde, Anna Rumshisky
arXiv_CL
arXiv_CL
Transformer
Bert
Zero-Shot
PDF
2022-05-21
Scalable and Efficient Training of Large Convolutional Neural Networks with Differential Privacy
Zhiqi Bu, Jialin Mao, Shiyun Xu
arXiv_CV
arXiv_CV
Transformer
Optimization
Pose
Classification
CNN
PDF
2022-05-21
Transformer based Generative Adversarial Network for Liver Segmentation
Ugur Demir, Zheyuan Zhang, Bin Wang, Matthew Antalek, Elif Keles, Debesh Jha, Amir Borhani, Daniela Ladner, Ulas Bagci
arXiv_CV
arXiv_CV
Transformer
Segmentation
Adversarial
Pose
Attention
GAN
Medical
CNN
PDF
2022-05-21
Vision Transformers in 2022: An Update on Tiny ImageNet
Ethan Huynh
arXiv_CV
arXiv_CV
Transformer
Transfer_Learning
Attention
PDF
2022-05-21
Transformer-based out-of-distribution detection for clinically safe segmentation
Mark S Graham, Petru-Daniel Tudosiu, Paul Wright, Walter Hugo Lopez Pinaya, U Jean-Marie, Yee Mah, James Teo, Rolf H Jäger, David Werring, Parashkev Nachev, Sebastien Ourselin, M Jorge Cardoso
arXiv_CV
arXiv_CV
Transformer
Segmentation
3D
Knowledge
Pose
Detection
Relation
GAN
Prediction
PDF
2022-05-21
DProQ: A Gated-Graph Transformer for Protein Complex Structure Assessment
Xiao Chen, Alex Morehead, Jian Liu, Jianlin Cheng
arXiv_AI
arXiv_AI
Transformer
3D
Knowledge
Medical
Prediction
PDF
2022-05-21
Calibration of Natural Language Understanding Models with Venn--ABERS Predictors
Patrizio Giovannotti
arXiv_CL
arXiv_CL
Transformer
Pose
Prediction
PDF
2022-05-21
Boosting Camouflaged Object Detection with Dual-Task Interactive Transformer
Zhengyi Liu, Zhili Zhang, Wei Wu
arXiv_CV
arXiv_CV
Transformer
Pose
Boundary_Detection
Detection
Object_Detection
Attention
PDF
2022-05-21
HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking
Yanzhao Zhang, Dingkun Long, Guangwei Xu, Pengjun Xie
arXiv_CL
arXiv_CL
Transformer
Bert
Optimization
Language_Model
PDF
2022-05-21
Robot Person Following in Uniform Crowd Environment
Adarsh Ghimire, Xiaoxiong Zhang, Sajid Javed, Jorge Dias, Naoufel Werghi
arXiv_CV
arXiv_CV
Transformer
Tracking
Quantitative
PDF
2022-05-21
Visualizing CoAtNet Predictions for Aiding Melanoma Detection
Daniel Kvak
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Detection
Attention
CNN
Prediction
PDF
2022-05-21
Deeper vs Wider: A Revisit of Transformer Configuration
Fuzhao Xue, Jianghai Chen, Aixin Sun, Xiaozhe Ren, Zangwei Zheng, Xiaoxin He, Xin Jiang, Yang You
arXiv_AI
arXiv_AI
Transformer
Bert
Pose
PDF
2022-05-21
DKG: A Descriptive Knowledge Graph for Explaining Relationships between Entities
Jie Huang, Kerui Zhu, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-mei Hwu
arXiv_AI
arXiv_AI
Transformer
Knowledge
Knowledge_Graph
Self-Supervised
Pose
Relation
Prediction
PDF
2022-05-20
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection
Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti
arXiv_CL
arXiv_CL
Transformer
Bert
Pose
QA
PDF
2022-05-20
Temporally Precise Action Spotting in Soccer Videos Using Dense Detection Anchors
João V. B. Soares, Avijit Shah, Topojoy Biswas
arXiv_CV
arXiv_CV
Transformer
Action
Detection
PDF
2022-05-20
Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts
Felix Drinkall, Stefan Zohren, Janet B. Pierrehumbert
arXiv_CL
arXiv_CL
Transformer
Embedding
Tracking
Unsupervised
Classification
Language_Model
PDF
2022-05-20
Lossless Acceleration for Seq2seq Generation with Aggressive Decoding
Tao Ge, Heming Xia, Xin Sun, Si-Qing Chen, Furu Wei
arXiv_CL
arXiv_CL
Transformer
Pose
Summarization
PDF
2022-05-20
Self-supervised 3D anatomy segmentation using self-distilled masked image transformer
Jue Jiang, Neelam Tyagi, Kathryn Tringale, Christopher Crane, Harini Veeraraghavan
arXiv_CV
arXiv_CV
Transformer
Segmentation
3D
Self-Supervised
GAN
Medical
CNN
Prediction
PDF
2022-05-20
Heterformer: A Transformer Architecture for Node Representation Learning on Heterogeneous Text-Rich Networks
Bowen Jin, Yu Zhang, Qi Zhu, Jiawei Han
arXiv_CL
arXiv_CL
Transformer
Enhancement
Represenation_Learning
Pose
Classification
Attention
Language_Model
Prediction
PDF
2022-05-20
Learning to Count Anything: Reference-less Class-agnostic Counting with Weak Supervision
Michael Hobley, Victor Prisacariu
arXiv_CV
arXiv_CV
Transformer
Recognition
Knowledge
Self-Supervised
PDF
2022-05-20
Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging
Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van Gool
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Pose
PDF
2022-05-20
MSTRIQ: No Reference Image Quality Assessment Based on Swin Transformer with Multi-Stage Fusion
Jing Wang, Haotian Fa, Xiaoxia Hou, Yitian Xu, Tao Li, Xuechao Lu, Lean Fu
arXiv_CV
arXiv_CV
Transformer
Pose
QA
PDF
2022-05-20
Visual Concepts Tokenization
Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng
arXiv_CV
arXiv_CV
Transformer
Unsupervised
Represenation_Learning
Pose
Attention
PDF
2022-05-20
Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality
Xiang Li, Wenhai Wang, Lingfeng Yang, Jian Yang
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Pose
Detection
Object_Detection
Attention
GAN
PDF
2022-05-20
Exploring Extreme Parameter Compression for Pre-trained Language Models
Yuxin Ren, Benyou Wang, Lifeng Shang, Xin Jiang, Qun Liu
arXiv_CL
arXiv_CL
Transformer
Reconstruction
Embedding
Bert
Knowledge
Pose
Attention
Inference
Language_Model
PDF
2022-05-20
Translating Hanja historical documents to understandable Korean and English
Juhee Son, Jiho Jin, Haneul Yoo, JinYeong Bak, Kyunghyun Cho, Alice Oh
arXiv_AI
arXiv_AI
Transformer
Pose
GAN
PDF
2022-05-20
Mask-guided Vision Transformer for Few-Shot Learning
Yuzhong Chen, Zhenxiang Xiao, Lin Zhao, Lu Zhang, Haixing Dai, David Weizhong Liu, Zihao Wu, Changhe Li, Tuo Zhang, Changying Li, Dajiang Zhu, Tianming Liu, Xi Jiang
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Classification
Deep_Learning
Detection
Few-Shot
Attention
PDF
2022-05-20
A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers
Yuzhong Chen, Yu Du, Zhenxiang Xiao, Lin Zhao, Lu Zhang, David Weizhong Liu, Dajiang Zhu, Tuo Zhang, Xintao Hu, Tianming Liu, Xi Jiang
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Relation
Prediction
PDF
2022-05-20
A Correlation Information-based Spatiotemporal Network for Traffic Flow Forecasting
Weiguo Zhu, Yongqi Sun, Xintong Yi, Yan Wang
arXiv_AI
arXiv_AI
Transformer
Pose
Relation
Attention
PDF
2022-05-20
Deep transfer learning for image classification: a survey
Jo Plested, Tom Gedeon
arXiv_AI
arXiv_AI
Transformer
Transfer_Learning
Knowledge
Review
Survey
Classification
Relation
CNN
Image_Classification
PDF
2022-05-19
Explainable Graph Theory-Based Identification of Meter-Transformer Mapping
Bilal Saleem, Yang Weng
arXiv_AI
arXiv_AI
Transformer
Embedding
Pose
PDF
2022-05-19
Towards Unified Keyframe Propagation Models
Patrick Esser, Peter Michael, Soumyadip Sengupta
arXiv_CV
arXiv_CV
Transformer
Inpainting
Action
Attention
PDF
2022-05-19
VNT-Net: Rotational Invariant Vector Neuron Transformers
Hedi Zisling, Andrei Sharf
arXiv_CV
arXiv_CV
Transformer
Segmentation
Point_Cloud
3D
Pose
Action
Classification
Attention
PDF
2022-05-19
ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD
Moustafa Al-Hajj, Mustafa Jarrar
arXiv_AI
arXiv_AI
Transformer
Bert
Ontology
Classification
PDF
2022-05-19
A graph-transformer for whole slide image classification
Yi Zheng, Rushin H. Gindra, Emily J. Green, Eric J. Burks, Margrit Betke, Jennifer E. Beane, Vijaya B. Kolachalama
arXiv_CV
arXiv_CV
Transformer
Salient
Contrastive_Learning
Classification
Deep_Learning
Image_Classification
PDF
2022-05-19
Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models
Joseph McDonald, Baolin Li, Nathan Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
arXiv_AI
arXiv_AI
Transformer
Inference
Language_Model
Recommendation
PDF
2022-05-19
Acceptability Judgements via Examining the Topology of Attention Maps
Daniil Cherniavskii, Eduard Tulchinskii, Vladislav Mikhailov, Irina Proskurina, Laida Kushnareva, Ekaterina Artemova, Serguei Barannikov, Irina Piontkovskaya, Dmitri Piontkovski, Evgeny Burnaev
arXiv_CL
arXiv_CL
Transformer
Bert
Knowledge
Attention
PDF
2022-05-19
Masked Image Modeling with Denoising Contrast
Kun Yi, Yixiao Ge, Xiaotong Li, Shusheng Yang, Dian Li, Jianping Wu, Ying Shan, Xiaohu Qie
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Represenation_Learning
Self-Supervised
Contrastive_Learning
Classification
Detection
Object_Detection
Denoising
Image_Classification
Prediction
PDF
2022-05-19
Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Xiaosong Zhang, Feng Liu, Zhiliang Peng, Zonghao Guo, Fang Wan, Xiangyang Ji, Qixiang Ye
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Detection
Few-Shot
Object_Detection
PDF
2022-05-19
TRT-ViT: TensorRT-oriented Vision Transformer
Xin Xia, Jiashi Li, Jie Wu, Xing Wang, Mingkai Wang, Xuefeng Xiao, Min Zheng, Rui Wang
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Classification
Detection
Object_Detection
Image_Classification
Inference
PDF
2022-05-19
Insights on Neural Representations for End-to-End Speech Recognition
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain
arXiv_CL
arXiv_CL
Transformer
Recognition
RNN
Speech
Relation
Speech_Recognition
PDF
2022-05-19
Cross-Enhancement Transformer for Action Segmentation
Jiahui Wang, Zhenyou Wang, Shanna Zhuang, Hui Wang
arXiv_CV
arXiv_CV
Transformer
Segmentation
Enhancement
Recognition
Pose
Action
Attention
CNN
PDF
2022-05-19
Transformers as Neural Augmentors: Class Conditional Sentence Generation via Variational Bayes
M. Şafak Bilici, Mehmet Fatih Amasyali
arXiv_CL
arXiv_CL
Transformer
Pose
Language_Model
PDF
2022-05-19
BabyNet: Residual Transformer Module for Birth Weight Prediction on Fetal Ultrasound Video
Szymon Płotka, Michał K. Grzeszczyk, Robert Brawura-Biskupski-Samaha, Paweł Gutaj, Michał Lipa, Tomasz Trzciński, Arkadiusz Sitek
arXiv_CV
arXiv_CV
Transformer
3D
Pose
Prediction
PDF
2022-05-19
TransTab: Learning Transferable Tabular Transformers Across Tables
Zifeng Wang, Jimeng Sun
arXiv_AI
arXiv_AI
Transformer
Embedding
Transfer_Learning
Self-Supervised
Pose
PDF
2022-05-19
Training Vision-Language Transformers from Captions Alone
Liangke Gui, Qiuyuan Huang, Alex Hauptmann, Yonatan Bisk, Jianfeng Gao
arXiv_CV
arXiv_CV
Transformer
Classification
Caption
Prediction
PDF
2022-05-18
On the Limits of Evaluating Embodied Agent Model Generalization Using Validation Sets
Hyounghun Kim, Aishwarya Padmakumar, Di Jin, Mohit Bansal, Dilek Hakkani-Tur
arXiv_AI
arXiv_AI
Transformer
Pose
Action
PDF
2022-05-18
Modeling Multi-hop Question Answering as Single Sequence Prediction
Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou, Nitish Shirish Keskar, Caiming Xiong
arXiv_CL