Attention
-
Adaptive Step Size Learning with Applications to Velocity Aided Inertial Navigation System
Barak Or, Itzik Klein
arXiv_RO
Pose
Attention
Autonomous
PDF
-
Mushroom image recognition and distance generation based on attention-mechanism model and genetic information
Wenbin Liao, Jiewen Xiao, Chengbo Zhao, Yonggong Han, ZhiJie Geng, Jianxin Wang, Yihua Yang
arXiv_CV
Embedding
Recognition
Knowledge
Pose
Classification
Attention
PDF
-
Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection
Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy, Amir H. Poorjam, Deepak Mittal, Maneesh Singh
arXiv_SD
Transfer_Learning
Representation_Learning
Speech
Pose
Detection
Attention
CNN
PDF
-
Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos
Yuchen Wang, Zhongyu Li, Xiangxiang Cui, Liangliang Zhang, Xiang Luo, Meng Yang, Shi Chang
arXiv_CV
Recognition
Pose
Classification
Deep_Learning
Detection
Attention
Video_Classification
PDF
-
Automatic identification of segmentation errors for radiotherapy using geometric learning
Edward G. A. Henderson, Andrew F. Green, Marcel van Herk, Eliana M. Vasquez Osorio
arXiv_CV
Segmentation
Unsupervised
Transfer_Learning
3D
Self-Supervised
Pose
Contour
Attention
GAN
CNN
QA
PDF
-
LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
Florent Bartoccioni, Éloi Zablocki, Andrei Bursuc, Patrick Pérez, Matthieu Cord, Karteek Alahari
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Action
Attention
Autonomous
Prediction
PDF
-
Linguistic Correlation Analysis: Discovering Salient Neurons in deepNLP models
Nadir Durrani, Fahim Dalvi, Hassan Sajjad
arXiv_CL
Transformer
Transfer_Learning
Salient
Knowledge
Quantitative
Relation
Attention
PDF
-
Few-Shot Stance Detection via Target-Aware Prompt Distillation
Yan Jiang, Jinhua Gao, Huawei Shen, Xueqi Cheng
arXiv_CL
Knowledge
Pose
Detection
Relation
Few-Shot
Attention
Language_Model
PDF
-
Self-supervised Learning in Remote Sensing: A Review
Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham, Lichao Mou, Xiao Xiang Zhu
arXiv_CV
Review
Self-Supervised
Action
Deep_Learning
Attention
PDF
-
Kernel Attention Transformer for Histopathology Whole Slide Image Classification
Yushan Zheng, Jun Li, Jun Shi, Fengying Xie, Zhiguo Jiang
arXiv_CV
Transformer
Embedding
Pose
Classification
Attention
Image_Classification
PDF
-
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Chuwei Luo, Guozhi Tang, Qi Zheng, Cong Yao, Lianwen Jin, Chenliang Li, Yang Xue, Luo Si
arXiv_CV
Representation_Learning
Pose
Action
Classification
Attention
Language_Model
QA
PDF
-
A two-stage full-band speech enhancement model with effective spectral compression mapping
Zhongshu Hou, Qinwen Hu, Kai Chen, Jing Lu
arXiv_SD
Enhancement
Speech
Pose
Face
Attention
PDF
-
Lesion-Aware Contrastive Representation Learning for Histopathology Whole Slide Images Analysis
Jun Li, Yushan Zheng, Kun Wu, Jun Shi, Fengying Xie, Zhiguo Jiang
arXiv_CV
Image_Caption
Representation_Learning
Self-Supervised
Pose
Contrastive_Learning
Classification
Attention
PDF
-
PST: Plant Segmentation Transformer Enhanced Phenotyping of MLS Oilseed Rape Point Cloud
Ruiming Du, Zhihong Ma, Pengyao Xie, Haiyan Cen, Yong He
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Point_Cloud
Pose
Deep_Learning
Attention
GAN
PDF
-
Improving the Training Recipe for a Robust Conformer-based Hybrid Model
Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Ralf Schlüter, Hermann Ney
arXiv_CL
Recognition
Speech
Pose
Attention
Speech_Recognition
PDF
-
Video Activity Localisation with Uncertainties in Temporal Boundary
Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu
arXiv_CV
Relation
Attention
Activity
Matching
PDF
-
Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory Exchange
Guodong Shen, Yuqi Ouyang, Victor Sanchez
arXiv_CV
Reconstruction
RNN
Action
Detection
Attention
CNN
Prediction
PDF
-
Batch-Ensemble Stochastic Neural Networks for Out-of-Distribution Detection
Xiongjie Chen, Yunpeng Li, Yongxin Yang
arXiv_AI
Pose
Detection
Attention
PDF
-
Image Aesthetics Assessment Using Graph Attention Network
Koustav Ghosal, Aljosa Smolic
arXiv_CV
Pose
Relation
Attention
CNN
PDF
-
Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Yusheng Tian, Jingyu Li, Tan Lee
arXiv_SD
Embedding
Pose
Attention
PDF
-
Semantic Role Aware Correlation Transformer for Text to Video Retrieval
Burak Satar, Hongyuan Zhu, Xavier Bresson, Joo Hwee Lim
arXiv_CV
Transformer
Embedding
Pose
Relation
Attention
Video_Retrieval
Matching
PDF
-
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim
arXiv_CV
Transformer
Embedding
Pose
Relation
Attention
Video_Retrieval
PDF
-
On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Raviraj Joshi, Subodh Kumar
arXiv_CL
Transformer
Recognition
RNN
Speech
Attention
Speech_Recognition
Text_Generation
PDF
-
Knowledge Distillation with Representative Teacher Keys Based on Attention Mechanism for Image Classification Model Compression
Jun-Teng Yang, Sheng-Che Kao, Scott C.-H. Huang
arXiv_AI
Knowledge
Pose
Classification
Attention
Image_Classification
PDF
-
Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective
Qingcheng Zeng, Dading Chong, Peilin Zhou, Jie Yang
arXiv_CL
Transformer
Recognition
Speech
Classification
Deep_Learning
Attention
Speech_Recognition
PDF
-
HyGNN: Drug-Drug Interaction Prediction via Hypergraph Neural Network
Khaled Mohammed Saifuddin, Bri Bumgardnerr, Farhan Tanvir, Esra Akbas
arXiv_AI
Pose
Action
Attention
Prediction
PDF
-
Anatomy-Guided Weakly-Supervised Abnormality Localization in Chest X-rays
Ke Yu, Shantanu Ghosh, Zhexiong Liu, Christopher Deible, Kayhan Batmanghelich
arXiv_CV
Pose
Quantitative
Classification
Detection
Attention
Medical
PDF
-
Adversarial Self-Attention for Language Understanding
Hongqiu Wu, Hai Zhao
arXiv_CL
Transformer
Adversarial
Pose
Attention
Language_Model
PDF
-
From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question Answering
Zihao Zhu
arXiv_CV
Knowledge
Pose
VQA
Attention
QA
PDF
-
Probing Causes of Hallucinations in Neural Machine Translations
Jianhao Yan, Fandong Meng, Jie Zhou
arXiv_CL
Embedding
Pose
NMT
Attention
PDF
-
Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRI
Hao Guan, Ling Yue, Pew-Thian Yap, Andrea Bozoki, Mingxia Liu
arXiv_CV
Knowledge
Pose
Classification
Deep_Learning
Attention
Medical
Prediction
PDF
-
Megapixel Image Generation with Step-Unrolled Denoising Autoencoders
Alex F. McKinney, Chris G. Willcocks
arXiv_CV
Transformer
Quantization
Inpainting
Pose
Denoising
Attention
GAN
PDF
-
Capture Salient Historical Information: A Fast and Accurate Non-Autoregressive Model for Multi-turn Spoken Language Understanding
Lizhi Cheng, Weijia Jia, Wenmian Yang
arXiv_CL
Transformer
Salient
Pose
Attention
Inference
Prediction
PDF
-
Excavating RoI Attention for Underwater Object Detection
Xutao Liang, Pinhao Song
arXiv_CV
Pose
Deep_Learning
Detection
Object_Detection
Attention
PDF
-
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Cheng Tan, Zhangyang Gao, Siyuan Li, Yongjie Xu, Stan Z. Li
arXiv_CV
Regularization
Pose
Relation
Attention
Prediction
PDF
-
Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-Resolution
Bo Yan, Leilei Cao, Fengliang Qi, Hongbin Wang
arXiv_CV
Transformer
Super_Resolution
Pose
Attention
Medical
PDF
-
Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs
Yi-Lun Liao, Tess Smidt
arXiv_AI
Transformer
3D
Pose
Attention
Prediction
PDF
-
Never trust, always verify : a roadmap for Trustworthy AI?
Lionel Nganyewou Tidjon, Foutse Khomh
arXiv_AI
Review
Pose
Action
Attention
GAN
Autonomous
PDF
-
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
Jinmiao Huang, Waseem Gharbieh, Qianhui Wan, Han Suk Shim, Chul Lee
arXiv_CL
Transformer
RNN
Pose
Action
Attention
PDF
-
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta, Stephen Tian, Yunzhi Zhang, Jiajun Wu, Roberto Martín-Martín, Li Fei-Fei
arXiv_CV
Transformer
Knowledge
Video_Prediction
Attention
Inference
Prediction
PDF
-
CoSP: Co-supervised pretraining of pocket and ligand
Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li
arXiv_AI
Embedding
3D
Knowledge
Pose
Contrastive_Learning
Action
Attention
GAN
Prediction
Matching
PDF
-
Intelligent Request Strategy Design in Recommender System
Xufeng Qian, Yue Xu, Fuyu Lv, Shengyu Zhang, Ziwen Jiang, Qingwen Liu, Xiaoyi Zeng, Tat-Seng Chua, Fei Wu
arXiv_AI
Pose
Attention
Inference
Recommendation
Matching
PDF
-
Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency
Weijie Ma, Ye Zhu, Ruimao Zhang, Jie Yang, Yiwen Hu, Zhen Li, Li Xiang
arXiv_CV
Transformer
Recognition
Pose
Classification
Attention
Image_Classification
Prediction
PDF
-
YOLOSA: Object detection based on 2D local feature superimposed self-attention
Weisheng Li, Lin Huang
arXiv_CV
Pose
Detection
Object_Detection
Attention
Inference
PDF
-
Measuring the Feasibility of Analogical Transfer using Complexity
Pierre-Alexandre Murena
arXiv_AI
Unsupervised
Pose
Relation
Attention
PDF
-
Learning To Generate Scene Graph from Head to Tail
Chaofan Zheng, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song, Lianli Gao
arXiv_CV
Pose
Action
Attention
PDF
-
A Survey on Learnable Evolutionary Algorithms for Scalable Multiobjective Optimization
Songbai Liu
arXiv_AI
Optimization
Pose
Survey
Attention
PDF
-
Dynamic Scene Deblurring Base on Continuous Cross-Layer Attention Transmission
Xia Hua, Junxiong Fei, Mingxin Li, ZeZheng Li, Yu Shi, JiangGuo Liu, Hanyu Hong
arXiv_CV
Pose
Attention
CNN
PDF
-
Towards Better User Studies in Computer Graphics and Vision
Zoya Bylinskii, Laura Herman, Aaron Hertzmann, Stefanie Hutka, Yile Zhang
arXiv_CV
Knowledge
Review
Survey
Action
Attention
Recommendation
PDF
-
Monocular Spherical Depth Estimation with Explicitly Connected Weak Layout Cues
Nikolaos Zioulis, Federico Alvarez, Dimitrios Zarpalas, Petros Daras
arXiv_CV
Reconstruction
3D
Attention
PDF
-
Vision- and tactile-based continuous multimodal intention and attention recognition for safer physical human-robot interaction
Christopher Yee Wong, Lucas Vergez, Wael Suleiman
arXiv_RO
Recognition
Pose
Action
Attention
PDF
-
Depth-aware Glass Surface Detection with Cross-modal Context Mining
Jiaying Lin, Yuen Hei Yeung, Rynson W.H. Lau
arXiv_CV
Face_Detection
3D
Pose
Face
Detection
Attention
Drone
Autonomous
PDF
-
Towards Unsupervised Content Disentanglement in Sentence Representations via Syntactic Roles
Ghazi Felhi, Joseph Le Roux, Djamé Seddah
arXiv_AI
Transformer
Unsupervised
Pose
Action
Attention
PDF
-
CNN-based fully automatic wrist cartilage volume quantification in MR Image
Nikita Vladimirov, Ekaterina Brui, Anatoliy Levchuk, Vladimir Fokin, Aleksandr Efimtcev, David Bendahan
arXiv_CV
Segmentation
3D
Detection
Relation
Attention
CNN
PDF
-
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Raviraj Joshi, Anupam Singh
arXiv_SD
Recognition
Speech
Pose
Deep_Learning
Attention
Speech_Recognition
PDF
-
Prototypical Contrastive Language Image Pretraining
Delong Chen, Zhao Wu, Fan Liu, Zaiquan Yang, Yixiang Huang, Yiping Bao, Erjin Zhou
arXiv_CV
Zero-Shot
Knowledge
Pose
Classification
Attention
Caption
PDF
-
Toward An Optimal Selection of Dialogue Strategies: A Target-Driven Approach for Intelligent Outbound Robots
Ruifeng Qian, Shijie Li, Mengjiao Bao, Huan Chen, Yu Che
arXiv_AI
Attention
PDF
-
SpA-Former: Transformer image shadow detection and removal via spatial attention
Xiao Feng Zhang, Chao Chen Gu, Shan Ying Zhu
arXiv_CV
Transformer
Pose
Detection
Attention
PDF
-
Feature Re-calibration based MIL for Whole Slide Image Classification
Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Meejeong Kim, Hyun Jung Sung, Sang Hyun Park
arXiv_CV
Transformer
Weakly_Supervised
Pose
Classification
Attention
Image_Classification
PDF
-
No Attention is Needed: Grouped Spatial-temporal Shift for Simple and Efficient Video Restorers
Dasong Li, Xiaoyu Shi, Yi Zhang, Xiaogang Wang, Hongwei Qin, Hongsheng Li
arXiv_CV
Restoration
Pose
Denoising
Attention
Optical_Flow
PDF
-
SVoRT: Iterative Transformer for Slice-to-Volume Registration in Fetal Brain MRI
Junshen Xu, Daniel Moyer, P. Ellen Grant, Polina Golland, Juan Eugenio Iglesias, Elfar Adalsteinsson
arXiv_CV
Transformer
Reconstruction
3D
Pose
Attention
PDF
-
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz Khan
arXiv_CV
Transformer
Segmentation
Pose
Classification
Detection
Attention
PDF
-
Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements
Leonard E. van Dyck, Sebastian J. Denzler, Walter R. Gruber
arXiv_CV
Face_Detection
Tracking
Recognition
Salient
Pose
Face
Deep_Learning
Detection
Attention
CNN
PDF
-
Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic Consistency
Jie Yang, Ruimao Zhang, Chaoqun Wang, Zhen Li, Xiang Wan, Lingyan Zhang
arXiv_CV
Transformer
Segmentation
Regularization
Pose
Relation
Attention
GAN
Medical
Prediction
PDF
-
Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth
Nitin Bansal, Pan Ji, Junsong Yuan, Yi Xu
arXiv_CV
Segmentation
Semantic_Segmentation
Pose
Attention
Inference
Prediction
PDF
-
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine
Jiayi Weng, Min Lin, Shengyi Huang, Bo Liu, Denys Makoviichuk, Viktor Makoviychuk, Zichen Liu, Yufan Song, Ting Luo, Yukun Jiang, Zhongwen Xu, Shuicheng Yan
arXiv_AI
Reinforcement_Learning
Attention
PDF
-
Vicinity Vision Transformer
Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran Zhong
arXiv_CV
Transformer
Pose
Classification
Attention
Image_Classification
PDF
-
Rethinking Audio-visual Synchronization for Active Speaker Detection
Abudukelimu Wuerkaixi, You Zhang, Zhiyao Duan, Changshui Zhang
arXiv_AI
Pose
Contrastive_Learning
Detection
Attention
PDF
-
An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition Application
Youlong Ding, Xueyang Wu, Zhitao Li, Zeheng Wu, Shengqi Tan, Qian Xu, Weike Pan, Qiang Yang
arXiv_CV
Transfer_Learning
Recognition
Pose
Face
Face_Recognition
Attention
Prediction
PDF
-
WrapperFL: A Model Agnostic Plug-in for Industrial Federated Learning
Xueyang Wu, Shengqi Tan, Qian Xu, Qiang Yang
arXiv_AI
Pose
Face
Attention
PDF
-
Position-prior Clustering-based Self-attention Module for Knee Cartilage Segmentation
Dong Liang, Jun Liu, Kuanquan Wang, Gongning Luo, Wei Wang, Shuo Li
arXiv_CV
Segmentation
Pose
Attention
Medical
CNN
PDF
-
Attention-driven Active Vision for Efficient Reconstruction of Plants and Targeted Plant Parts
Akshay K. Burusa, Eldert J. van Henten, Gert Kootstra
arXiv_CV
Reconstruction
3D
Pose
Attention
PDF
-
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Gang Li, Heliang Zheng, Daqing Liu, Bing Su, Changwen Zheng
arXiv_CV
Image_Caption
Segmentation
Semantic_Segmentation
Recognition
Self-Supervised
Relation
Attention
Language_Model
PDF
-
TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks
Rui-Jie Zhu, Qihang Zhao, Tianjing Zhang, Haoyu Deng, Yule Duan, Malu Zhang, Liang-Jian Deng
arXiv_AI
Gesture
Pose
Action
Classification
Deep_Learning
Relation
Attention
CNN
PDF
-
A Multi-grained based Attention Network for Semi-supervised Sound Event Detection
Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He
arXiv_SD
Detection
Attention
PDF
-
Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation
Shuaicheng Li, Feng Zhang, Rui-Wei Zhao, Rui Feng, Kunlin Yang, Lingbo Liu, Jun Hou
arXiv_CV
Pose
Action
Detection
Relation
Attention
Activity
PDF
-
Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation
Yuehai Chen, Jing Yang, Badong Chen, Shaoyi Du
arXiv_CV
Transformer
Sparse
Pose
Relation
Attention
Prediction
PDF
-
Bypass Network for Semantics Driven Image Paragraph Captioning
Qi Zheng, Chaoyue Wang, Dadong Wang
arXiv_CV
Pose
Attention
Caption
PDF
-
Global Context Vision Transformers
Ali Hatamizadeh, Hongxu Yin, Jan Kautz, Pavlo Molchanov
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Pose
Action
Classification
Detection
Object_Detection
Attention
Image_Classification
PDF
-
ORFD: A Dataset and Benchmark for Off-Road Freespace Detection
Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai
arXiv_CV
Transformer
Point_Cloud
Knowledge
Pose
Deep_Learning
Detection
Attention
Autonomous
PDF
-
WiFi-based Spatiotemporal Human Action Perception
Yanling Hao, Zhiyuan Shi, Yuanwei Liu
arXiv_CV
Recognition
3D
Pose
Action
Quantitative
Attention
Activity
PDF
-
DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment
Haoning Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin
arXiv_CV
Transformer
Pose
Action
Relation
VQA
Attention
QA
PDF
-
A Distributional Approach for Soft Clustering Comparison and Evaluation
Andrea Campagner, Davide Ciucci, Thierry Denœux
arXiv_AI
Pose
Attention
PDF
-
SMT-DTA: Improving Drug-Target Affinity Prediction with Semi-supervised Multi-task Training
Qizhi Pei, Lijun Wu, Jinhua Zhu, Yingce Xia, Shufang Xia, Tao Qin, Haiguang Liu, Tie-Yan Liu
arXiv_AI
Representation_Learning
Pose
Action
Deep_Learning
Attention
Language_Model
Prediction
PDF
-
Cross-Modal Transformer GAN: A Brain Structure-Function Deep Fusing Framework for Alzheimer's Disease
Junren Pan, Shuqiang Wang
arXiv_CV
Transformer
Adversarial
Pose
Classification
Attention
GAN
PDF
-
Remote Sensing Image Classification using Transfer Learning and Attention Based Deep Neural Network
Lam Pham, Khoa Tran, Dat Ngo, Jasmin Lampert, Alexander Schindler
arXiv_CV
Transfer_Learning
Scene_Classification
Pose
Classification
Deep_Learning
Detection
Object_Detection
Attention
Image_Classification
PDF
-
MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot Segmentation
Ehtesham Iqbal, Sirojbek Safarov, Seongdeok Bang
arXiv_CV
Segmentation
Pose
Relation
Few-Shot
Attention
Prediction
PDF
-
SJ-HD^2R: Selective Joint High Dynamic Range and Denoising Imaging for Dynamic Scenes
Wei Li, Shuai Xiao, Tianhong Dai, Shanxin Yuan, Tao Wang, Cheng Li, Fenglong Song
arXiv_CV
Pose
Denoising
Attention
PDF
-
S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?
Shuang Luo, Yinchuan Li, Jiahui Li, Kun Kuang, Furui Liu, Yunfeng Shao, Chao Wu
arXiv_AI
Reinforcement_Learning
Sparse
Pose
Attention
Inference
PDF
-
A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!
Chenglizhao Chen, Mengke Song, Wenfeng Song, Li Guo, Muwei Jian
arXiv_CV
Salient
Review
Survey
Quantitative
Detection
Attention
PDF
-
SPBERTQA: A Two-Stage Question Answering System Based on Sentence Transformers for Medical Texts
Nhung Thi-Hong Nguyen, Phuong Phan-Dieu Ha, Luan Thanh Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
arXiv_CL
Transformer
Bert
Pose
Attention
Medical
QA
PDF
-
From Multi-agent to Multi-robot: A Scalable Training and Evaluation Platform for Multi-robot Reinforcement Learning
Zhiuxan Liang, Jiannong Cao, Shan Jiang, Divya Saxena, Jinlin Chen, Huafeng Xu
arXiv_AI
Reinforcement_Learning
Action
Attention
PDF
-
Resource-Efficient Separation Transformer
Cem Subakan, Mirco Ravanelli, Samuele Cornell, Frédéric Lepoutre, François Grondin
arXiv_SD
Transformer
RNN
Speech
Attention
Inference
PDF
-
3D Object Detection for Autonomous Driving: A Review and New Outlooks
Jiageng Mao, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li
arXiv_AI
3D
Review
Survey
Detection
Object_Detection
Attention
Autonomous
Prediction
PDF
-
All you need is feedback: Communication with block attention feedback codes
Emre Ozfatura, Yulin Shao, Alberto Perotti, Branislav Popovic, Deniz Gunduz
arXiv_AI
Deep_Learning
Attention
PDF
-
Towards Generalizable Person Re-identification with a Bi-stream Generative Model
Xin Xu, Wei Liu, Zheng Wang, Ruiming Hu, Qi Tian
arXiv_CV
Person_Re-identification
Pose
Attention
Re-identification
PDF
-
Learning Multiscale Transformer Models for Sequence Generation
Bei Li, Tong Zheng, Yi Jing, Chengbo Jiao, Tong Xiao, Jingbo Zhu
arXiv_CL
Transformer
Knowledge
Pose
Relation
Attention
PDF
-
Can Language Models Capture Graph Semantics? From Graphs to Language Model and Vice-Versa
Tarun Garg, Kaushik Roy, Amit Sheth
arXiv_CL
Transformer
Knowledge
Knowledge_Graph
Deep_Learning
Relation
Attention
Language_Model
PDF
-
Camera Adaptation for Fundus-Image-Based CVD Risk Estimation
Zhihong Lin, Danli Shi, Donghao Zhang, Xianwen Shang, Mingguang He, Zongyuan Ge
arXiv_CV
OCR
Knowledge
Pose
Deep_Learning
Attention
PDF
-
3D unsupervised anomaly detection and localization through virtual multi-view projection and reconstruction: Clinical validation on low-dose chest computed tomography
Kyung-Su Kim, Seong Je Oh, Ju Hwan Lee, Myung Jin Chung
arXiv_CV
Reconstruction
Unsupervised
Recognition
3D
Restoration
Pose
Deep_Learning
Detection
Attention
PDF
-
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
Jaehyuk Heo, YongGi Jeong, Sunwoo Kim, Jaehee Kim, Pilsung Kang
arXiv_CV
Segmentation
Embedding
Semantic_Segmentation
Video_Caption
Attention
Caption
PDF
-
A Double-Graph Based Framework for Frame Semantic Parsing
Ce Zheng, Xudong Chen, Runxin Xu, Baobao Chang
arXiv_CL
Knowledge
Knowledge_Graph
Pose
Action
Classification
Relation
Attention
PDF
-
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Mike Goodwin, Arvindh Krishnaswamy
arXiv_SD
Transformer
Pose
Action
Attention
PDF
-
Attention-based Dynamic Subspace Learners for Medical Image Analysis
Sukesh Adiga V, Jose Dolz, Herve Lombaert
arXiv_CV
Segmentation
Embedding
Weakly_Supervised
Image_Retrieval
Pose
Classification
Attention
Medical
Recommendation
PDF
-
TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp Segmentation
Nikhil Kumar Tomar, Annie Shergill, Brandon Rieders, Ulas Bagci, Debesh Jha
arXiv_CV
Transformer
Segmentation
Pose
Deep_Learning
Detection
Attention
PDF
-
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen
arXiv_CV
Transformer
Segmentation
Pose
Detection
Attention
PDF
-
Cross-task Attention Mechanism for Dense Multi-task Learning
Ivan Lopes, Tuan-Hung Vu, Raoul de Charette
arXiv_CV
Segmentation
Unsupervised
Semantic_Segmentation
Representation_Learning
Pose
Face
Relation
Attention
PDF
-
Adapting the Linearised Laplace Model Evidence for Modern Deep Learning
Javier Antorán, David Janz, James Urquhart Allingham, Erik Daxberger, Riccardo Barbano, Eric Nalisnick, José Miguel Hernández-Lobato
arXiv_AI
Transformer
Deep_Learning
Attention
Recommendation
PDF
-
SimA: Simple Softmax-free Attention for Vision Transformers
Soroush Abbasi Koohpayegani, Hamed Pirsiavash
arXiv_CV
Transformer
Attention
PDF
-
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo
arXiv_CV
Transformer
Transfer_Learning
Reinforcement_Learning
Pose
Attention
PDF
-
Multimodal Attention-based Deep Learning for Alzheimer's Disease Diagnosis
Michal Golovanevsky, Carsten Eickhoff, Ritambhara Singh
arXiv_CV
Action
Classification
Deep_Learning
Attention
Medical
PDF
-
Holistic Transformer: A Joint Neural Network for Trajectory Prediction and Decision-Making of Autonomous Vehicles
Hongyu Hu, Qi Wang, Zhengguang Zhang, Zhengyi Li, Zhenhai Gao
arXiv_RO
Transformer
Sparse
Knowledge
Pose
Relation
Attention
Autonomous
Prediction
PDF
-
CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images
Weiwei Cui, Yaqi Wang, Qianni Zhang, Huiyu Zhou, Dan Song, Xingyong Zuo, Gangyong Jia, Liaoyuan Zeng
arXiv_CV
Segmentation
3D
Restoration
Knowledge
Deep_Learning
Attention
PDF
-
A Quantitative and Qualitative Analysis of Suicide Ideation Detection using Deep Learning
Siqu Long, Rina Cabral, Josiah Poon, Soyeon Caren Han
arXiv_CL
Embedding
Text_Classification
RNN
Quantitative
Classification
Deep_Learning
Detection
Attention
Language_Model
Prediction
PDF
-
Local Slot Attention for Vision-and-Language Navigation
Yifeng Zhuang, Qiang Sun, Yanwei Fu, Lifeng Chen, Xiangyang Sue
arXiv_CV
Transformer
Segmentation
Bert
Pose
Attention
PDF
-
Improving Diversity of Multiple Trajectory Prediction based on Map-adaptive Lane Loss
Sanmin Kim, Hyeongseok Jeon, Junwon Choi, Dongsuk Kum
arXiv_CV
Pose
Quantitative
Attention
Autonomous
Prediction
PDF
-
Learning Using Privileged Information for Zero-Shot Action Recognition
Zhiyi Gao, Wanqing Li, Zihui Guo, Bin Yu, Yonghong Hou
arXiv_CV
Recognition
Zero-Shot
Pose
Action_Recognition
Action
Attention
PDF
-
Medical Dialogue Response Generation with Pivotal Information Recalling
Yu Zhao, Yunxin Li, Yuxiang Wu, Baotian Hu, Qingcai Chen, Xiaolong Wang, Yuxin Ding, Min Zhang
arXiv_AI
Knowledge
Pose
Relation
Attention
Medical
Language_Model
PDF
-
Automatic Correction of Human Translations
Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero
arXiv_CL
Attention
PDF
-
Bootstrapped Transformer for Offline Reinforcement Learning
Kerong Wang, Hanye Zhao, Xufang Luo, Kan Ren, Weinan Zhang, Dongsheng Li
arXiv_RO
Transformer
Reinforcement_Learning
Pose
Attention
PDF
-
Rectify ViT Shortcut Learning by Visual Saliency
Chong Ma, Lin Zhao, Yuzhong Chen, David Weizhong Liu, Xi Jiang, Tuo Zhang, Xintao Hu, Dinggang Shen, Dajiang Zhu, Tianming Liu
arXiv_CV
Transformer
Salient
Knowledge
Pose
Deep_Learning
Attention
Medical
PDF
-
Bio-inspired Intelligence with Applications to Robotics: A Survey
Junfei Li, Zhe Xu, Danjie Zhu, Kevin Dong, Tao Yan, Zhu Zeng, Simon X. Yang
arXiv_RO
Tracking
Knowledge
Review
Survey
Attention
Autonomous
PDF
-
What do navigation agents learn about their environment?
Kshitij Dwivedi, Gemma Roig, Aniruddha Kembhavi, Roozbeh Mottaghi
arXiv_CV
Action
Deep_Learning
Attention
PDF
-
Backdoor Attacks on Vision Transformers
Akshayvarun Subramanya, Aniruddha Saha, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash
arXiv_CV
Transformer
Knowledge
Pose
Attention
PDF
-
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes
Rui Zhu, Zhengqin Li, Janarbek Matai, Fatih Porikli, Manmohan Chandraker
arXiv_CV
Transformer
Pose
Action
Attention
PDF
-
Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media
Abdelkader El Mahdaouy, Abdellah El Mekki, Ahmed Oumar, Hajar Mousannif, Ismail Berrada
arXiv_CL
Bert
Speech
Attention
Language_Model
PDF
-
Know your audience: specializing grounded language models with the game of Dixit
Aaditya K. Singh, David Ding, Andrew Saxe, Felix Hill, Andrew K. Lampinen
arXiv_AI
Zero-Shot
Pose
Attention
Language_Model
PDF
-
Adversarial Patch Attacks and Defences in Vision-Based Tasks: A Survey
Abhijith Sharma, Yijun Bian, Phil Munz, Apurva Narayan
arXiv_CV
Adversarial
Survey
Deep_Learning
Detection
Attention
PDF
-
Event-related data conditioning for acoustic event classification
Yuanbo Hou, Dick Botteldooren
arXiv_SD
Pose
Classification
Attention
PDF
-
Multi scale Feature Extraction and Fusion for Online Knowledge Distillation
Panpan Zou, Yinglei Teng, Tao Niu
arXiv_CV
Knowledge
Pose
Action
Attention
Prediction
PDF
-
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency
Viraj Prabhu, Sriram Yenamandra, Aaditya Singh, Judy Hoffman
arXiv_CV
Transformer
Recognition
Self-Supervised
Pose
Attention
CNN
PDF
-
Channel Importance Matters in Few-Shot Image Classification
Xu Luo, Jing Xu, Zenglin Xu
arXiv_CV
Image_Caption
Pose
Classification
Few-Shot
Attention
CNN
Image_Classification
PDF
-
Reinforcement Learning-enhanced Shared-account Cross-domain Sequential Recommendation
Lei Guo, Jinyu Zhang, Tong Chen, Xinhua Wang, Hongzhi Yin
arXiv_AI
Reinforcement_Learning
RNN
Pose
Action
Attention
Recommendation
PDF
-
U-PET: MRI-based Dementia Detection with Joint Generation of Synthetic FDG-PET Images
Marcel Kollovieh, Matthias Keicher, Stephan Wunderlich, Hendrik Burwinkel, Thomas Wendler, Nassir Navab
arXiv_CV
Pose
Classification
Detection
Attention
PDF
-
Time Interval-enhanced Graph Neural Network for Shared-account Cross-domain Sequential Recommendation
Lei Guo, Jinyu Zhang, Li Tang, Tong Chen, Lei Zhu, Hongzhi Yin
arXiv_AI
Representation_Learning
RNN
Knowledge
Pose
Relation
Attention
Recommendation
PDF
-
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History
Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
arXiv_SD
Embedding
Speech
Self-Supervised
Pose
Attention
PDF
-
DIALOG-22 RuATD Generated Text Detection
Narek Maloyan, Bulat Nutfullin, Eugene Ilyushin
arXiv_CL
Pose
Classification
Detection
Object_Detection
Attention
Text_Generation
PDF
-
Multi-View Imputation and Cross-Attention Network Based on Incomplete Longitudinal and Multi-Modal Data for Alzheimer's Disease Prediction
Meiyan Huang, Tao Wang, Xiumei Chen, Xiaoling Zhang, Shuoling Zhou, Qianjin Feng
arXiv_CV
Adversarial
Pose
Classification
Attention
Prediction
PDF
-
Patch-level Representation Learning for Self-supervised Vision Transformers
Sukmin Yun, Hankook Lee, Jaehyung Kim, Jinwoo Shin
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Representation_Learning
Self-Supervised
Detection
Relation
Object_Detection
Attention
CNN
Prediction
PDF
-
Image Captioning based on Feature Refinement and Reflective Decoding
Ghadah Alabduljabbar, Hafida Benhidour, Said Kerrache
arXiv_CV
Image_Caption
Salient
Pose
Action
Deep_Learning
Attention
Caption
PDF
-
PeQuENet: Perceptual Quality Enhancement of Compressed Video with Adaptation- and Attention-based Network
Saiping Zhang, Luis Herranz, Marta Mrak, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang
arXiv_CV
Quantization
Enhancement
Adversarial
Pose
Relation
Attention
GAN
PDF
-
What makes domain generalization hard?
Spandan Madan, Li You, Mengmi Zhang, Hanspeter Pfister, Gabriel Kreiman
arXiv_AI
arXiv_AI
Transformer
Recognition
3D
Pose
Attention
PDF
-
AVATAR: Unconstrained Audiovisual Speech Recognition
Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid
arXiv_CV
arXiv_CV
Transformer
Recognition
Speech
Pose
Action
Attention
Speech_Recognition
PDF
-
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng Hua
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Relation
Attention
CNN
Image_Classification
PDF
-
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Detection
VQA
Object_Detection
Attention
Caption
QA
PDF
-
AMR Alignment: Paying Attention to Cross-Attention
Pere-Lluís Huguet Cabot, Abelardo Carlos Martínez Lorenzo, Roberto Navigli
arXiv_CL
arXiv_CL
Transformer
Attention
PDF
-
A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions
Sheng Zhou, Hongjia Xu, Zhuonan Zheng, Jiawei Chen, Zhao Li, Jiajun Bu, Jia Wu, Xin Wang, Wenwu Zhu, Martin Ester
arXiv_AI
arXiv_AI
Unsupervised
Represenation_Learning
Pose
Survey
Action
Deep_Learning
Attention
PDF
-
Forecasting of depth and ego-motion with transformers and self-supervision
Houssem Boulahbal, Adrian Voicila, Andrew Comport
arXiv_CV
arXiv_CV
Transformer
Self-Supervised
Attention
Inference
PDF
-
Self-Supervised Implicit Attention: Guided Attention by The Model Itself
Jinyi Wu, Xun Gong, Zhemin Zhang
arXiv_CV
arXiv_CV
Self-Supervised
Pose
Classification
Attention
CNN
Image_Classification
Inference
PDF
-
Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological Images
Amirreza Mahbod, Rahim Entezari, Isabella Ellinger, Olga Saukh
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Attention
Medical
Inference
PDF
-
NatiQ: An End-to-end Text-to-Speech System for Arabic
Ahmed Abdelali, Nadir Durrani, Cenk Demiroglu, Fahim Dalvi, Hamdy Mubarak, Kareem Darwish
arXiv_CL
arXiv_CL
Transformer
RNN
Speech
Attention
GAN
PDF
-
MonoGround: Detecting Monocular 3D Objects from the Ground
Zequn Qin, Xi Li
arXiv_CV
arXiv_CV
3D
Pose
Detection
Object_Detection
Attention
Inference
PDF
-
Lattice Convolutional Networks for Learning Ground States of Quantum Many-Body Systems
Cong Fu, Xuan Zhang, Huixin Zhang, Hongyi Ling, Shenglong Xu, Shuiwang Ji
arXiv_AI
arXiv_AI
Pose
Deep_Learning
Attention
CNN
PDF
-
XMorpher: Full Transformer for Deformable Medical Image Registration via Cross Attention
Jiacheng Shi, Yuting He, Youyong Kong, Jean-Louis Coatrieux, Huazhong Shu, Guanyu Yang, Shuo Li
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Deep_Learning
Attention
Medical
PDF
-
S²-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic Segmentation
Mohammed A. M. Elhassan, Chenhui Yang, Chenxi Huang, Tewodros Legesse Munea, Xin Hong
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Pose
Attention
PDF
-
Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang
arXiv_AI
arXiv_AI
Recognition
Speech
Contrastive_Learning
Detection
Attention
PDF
-
Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial Noises
Minkyu Choi, Yizhen Zhang, Kuan Han, Xiaokai Wang, Zhongming Liu
arXiv_CV
arXiv_CV
Recognition
RNN
Salient
Adversarial
Pose
Attention
CNN
PDF
-
Defending Observation Attacks in Deep Reinforcement Learning via Detection and Denoising
Zikang Xiong, Joe Eappen, He Zhu, Suresh Jagannathan
arXiv_RO
arXiv_RO
Reinforcement_Learning
Adversarial
Pose
Detection
Denoising
Attention
PDF
-
TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose Estimation
Mohammad Rezaei, Razieh Rastgoo, Vassilis Athitsos
arXiv_CV
arXiv_CV
3D
Pose_Estimation
Knowledge
Pose
Attention
Prediction
PDF
-
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention
Quanzeng You, Jiang Wang, Peng Chu, Andre Abrantes, Zicheng Liu
arXiv_CV
arXiv_CV
Segmentation
Pose
Quantitative
Attention
Prediction
PDF
-
FETILDA: An Effective Framework For Fin-tuned Embeddings For Long Financial Text Documents
Bolun "Namir" Xia, Vipula D. Rawte, Mohammed J. Zaki, Aparna Gupta
arXiv_CL
arXiv_CL
Embedding
Pose
Deep_Learning
Attention
Language_Model
Prediction
PDF
-
Stand-Alone Inter-Frame Attention in Video Models
Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Jiebo Luo, Tao Mei
arXiv_CV
arXiv_CV
Transformer
Video_Caption
3D
Deep_Learning
Attention
Prediction
PDF
-
A Multi-task Framework for Infrared Small Target Detection and Segmentation
Yuhang Chen, Liyuan Li, Xin Liu, Xiaofeng Su, Fansheng Chen
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Pose
Detection
Object_Detection
Attention
Inference
PDF
-
RDU: A Region-based Approach to Form-style Document Understanding
Fengbin Zhu, Chao Wang, Wenqiang Lei, Ziyang Liu, Tat Seng Chua
arXiv_CL
arXiv_CL
Recognition
OCR
Bert
Optical_Character
Pose
Face
Action
Detection
Object_Detection
Attention
Prediction
PDF
-
Peripheral Vision Transformer
Juhong Min, Yucheng Zhao, Chong Luo, Minsu Cho
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Classification
Contour
Attention
Image_Classification
PDF
-
How to Agree to Disagree: Managing Ontological Perspectives using Standpoint Logic
Lucía Gómez Álvarez, Sebastian Rudolph, Hannes Strass
arXiv_AI
arXiv_AI
Knowledge
Pose
Ontology
Relation
Attention
GAN
PDF
-
Learning Dense Features for Point Cloud Registration Using Graph Attention Network
Lai Dang Quoc Vinh, Sarvar Hussain Nengroo, Hojun Jin
arXiv_CV
arXiv_CV
Reconstruction
Tracking
Point_Cloud
Detection
Relation
Object_Detection
Attention
Matching
PDF
-
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction
Andreas Triantafyllopoulos, Meishu Song, Zijiang Yang, Xin Jing, Björn W. Schuller
arXiv_SD
arXiv_SD
Emotion
Few-Shot
Attention
Prediction
PDF
-
Learning Best Combination for Efficient N:M Sparsity
Yuxin Zhang, Mingbao Lin, Zhihang Lin, Yiting Luo, Ke Li, Fei Chao, Yongjian Wu, Rongrong Ji
arXiv_CV
arXiv_CV
Attention
PDF
-
Semantic-Discriminative Mixup for Generalizable Sensor-based Cross-domain Activity Recognition
Wang Lu, Jindong Wang, Yiqiang Chen, Sinno Jialin Pan, Chunyu Hu, Xin Qin
arXiv_AI
arXiv_AI
Transfer_Learning
Recognition
Pose
Classification
Attention
Activity
PDF
-
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
Action
Attention
PDF
-
Generalizable Method for Face Anti-Spoofing with Semi-Supervised Learning
Nikolay Sergievskiy, Roman Vlasov, Roman Trusov
arXiv_CV
arXiv_CV
Unsupervised
Pose
Face
Classification
Attention
PDF
-
GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation
Wenhao Li, Hong Liu, Tianyu Guo, Hao Tang, Runwei Ding
arXiv_CV
arXiv_CV
3D
Represenation_Learning
Pose_Estimation
Knowledge
Pose
Action
Attention
CNN
PDF
-
Language Models are General-Purpose Interfaces
Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
arXiv_CL
arXiv_CL
Zero-Shot
Pose
Face
Few-Shot
Attention
Language_Model
PDF
-
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei
arXiv_AI
arXiv_AI
Transformer
NAS
Recognition
3D
Pose
Attention
CNN
PDF
-
RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans
Pengxin Yu, Haoyue Zhang, Han Kang, Wen Tang, Corey W. Arnold, Rongguo Zhang
arXiv_CV
arXiv_CV
Transformer
Super_Resolution
Pose
Deep_Learning
Attention
Medical
CNN
PDF
-
Transformer Lesion Tracker
Wen Tang, Han Kang, Haoyue Zhang, Pengxin Yu, Corey W. Arnold, Rongguo Zhang
arXiv_CV
arXiv_CV
Transformer
Tracking
Sparse
Knowledge
Pose
Action
Attention
Matching
PDF
-
AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields
Takuhiro Kaneko
arXiv_AI
arXiv_AI
Unsupervised
3D
Represenation_Learning
Adversarial
Pose
Face
Relation
Attention
GAN
PDF
-
TriMix: Virtual embeddings and self-consistency for self-supervised learning
Tariq Bdair, Hossam Abdelhamid, Nassir Navab, Shadi Albarqouni
arXiv_AI
arXiv_AI
Embedding
Represenation_Learning
Self-Supervised
Pose
Attention
Medical
PDF
-
Efficient Human-in-the-loop System for Guiding DNNs Attention
Yi He, Xi Yang, Chia-Ming Chang, Haoran Xie, Takeo Igarashi
arXiv_CV
arXiv_CV
Segmentation
Pose
Classification
Deep_Learning
Attention
Image_Classification
PDF
-
A Database for Perceived Quality Assessment of User-Generated VR Videos
Yuming Fang, Yiru Yao, Xiangjie Sui, Kede Ma
arXiv_CV
arXiv_CV
Salient
Detection
Attention
PDF
-
Revisiting Whole-Slide Image Pyramids for Cancer Prognosis via Dual-Stream Networks
Pei Liu, Bo Fu, Feng Ye, Rui Yang, Bin Xu, Luping Ji
arXiv_CV
arXiv_CV
Pose
Relation
Attention
PDF
-
Human-Following and -guiding in Crowded Environments using Semantic Deep-Reinforcement-Learning for Mobile Service Robots
Linh Kästner, Bassel Fatloun, Zhengcheng Shen, Daniel Gawrisch, Jens Lambrecht
arXiv_RO
arXiv_RO
Pose
Action
Attention
PDF
-
Consistent Attack: Universal Adversarial Perturbation on Embodied Vision Navigation
You Qiaoben, Chengyang Ying, Xinning Zhou, Hang Su, Jun Zhu, Bo Zhang
arXiv_CV
arXiv_CV
Adversarial
Pose
Attention
PDF
-
Multimodal Fake News Detection with Adaptive Unimodal Representation Aggregation
Qichao Ying, Yangming Zhou, Zhenxing Qian, Dan Zeng, Shiming Ge
arXiv_CV
arXiv_CV
Pose
Action
Classification
Detection
Attention
Prediction
PDF
-
Human Mobility Prediction with Causal and Spatial-constrained Multi-task Network
Zongyuan Huang, Shengyuan Xu, Menghan Wang, Hansi Wu, Yanyan Xu, Yaohui Jin
arXiv_AI
arXiv_AI
RNN
Pose
Attention
Activity
Prediction
PDF
-
RL-EA: A Reinforcement Learning-Based Evolutionary Algorithm Framework for Electromagnetic Detection Satellite Scheduling Problem
Yanjie Song, Luona Wei, Qing Yang, Jian Wu, Lining Xing, Yingwu Chen
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Action
Detection
Attention
PDF
-
DRNet: Decomposition and Reconstruction Network for Remote Physiological Measurement
Yuhang Dong, Gongping Yang, Yilong Yin
arXiv_CV
arXiv_CV
Reconstruction
Pose
Face
Attention
PDF
-
Indirect-Instant Attention Optimization for Crowd Counting in Dense Scenes
Suyu Han, Guodong Wang, Donghua Liu
arXiv_CV
arXiv_CV
Optimization
Action
Relation
Attention
PDF
-
Parameter Convex Neural Networks
Jingcheng Zhou, Wei Wei, Xing Li, Bowen Pang, Zhiming Zheng
arXiv_AI
arXiv_AI
Gradient_Descent
Optimization
Pose
Deep_Learning
Attention
CNN
Recommendation
PDF
-
Defending Adversarial Examples by Negative Correlation Ensemble
Wenjian Luo, Hongwei Zhang, Linghao Kong, Zhijian Chen, Ke Tang
arXiv_AI
arXiv_AI
Adversarial
Pose
Deep_Learning
Relation
Attention
Prediction
PDF
-
Toward Real-world Single Image Deraining: A New Benchmark and Beyond
Wei Li, Qiming Zhang, Jing Zhang, Zhen Huang, Xinmei Tian, Dacheng Tao
arXiv_CV
arXiv_CV
Transfer_Learning
Restoration
Pose
Attention
PDF
-
DRAformer: Differentially Reconstructed Attention Transformer for Time-Series Forecasting
Benhan Li, Shengdong Du, Tianrui Li, Jie Hu, Zhen Jia
arXiv_AI
arXiv_AI
Transformer
Pose
Relation
Attention
PDF
-
Rethinking the Defense Against Free-rider Attack From the Perspective of Model Weight Evolving Frequency
Jinyin Chen, Mingjun Li, Tao Liu, Haibin Zheng, Yao Cheng, Changting Lin
arXiv_AI
arXiv_AI
Pose
Attention
PDF
-
High-Definition Map Generation Technologies For Autonomous Driving: A Review
Zhibin Bao, Sabir Hossain, Haoxiang Lang, Xianke Lin
arXiv_RO
arXiv_RO
Segmentation
3D
Review
Pose
Detection
Object_Detection
Attention
GAN
Autonomous
PDF
-
Generalizable Neural Radiance Fields for Novel View Synthesis with Transformer
Dan Wang, Xinrui Cui, Septimiu Salcudean, Z. Jane Wang
arXiv_CV
arXiv_CV
Transformer
3D
Pose
Relation
Attention
PDF
-
ProActive: Self-Attentive Temporal Point Process Flows for Activity Sequences
Vinayak Gupta, Srikanta Bedathur
arXiv_CV
arXiv_CV
Recognition
Optimization
Action
Detection
Attention
Activity
Prediction
PDF
-
Exploring Feature Self-relation for Self-supervised Transformer
Zhong-Yu Li, Shanghua Gao, Ming-Ming Cheng
arXiv_CV
arXiv_CV
Transformer
Embedding
Self-Supervised
Relation
Attention
CNN
PDF
-
Saccade Mechanisms for Image Classification, Object Detection and Tracking
Saurabh Farkya, Zachary Daniels, Aswin Nadamuni Raghavan, David Zhang, Michael Piacentino
arXiv_CV
arXiv_CV
Transformer
Tracking
Object_Tracking
Pose
Classification
Detection
Object_Detection
Attention
CNN
Image_Classification
PDF
-
An Enactivist-Inspired Mathematical Model of Cognition
Vadim Weinstein, Basak Sakcak, Steven M. LaValle
arXiv_AI
arXiv_AI
Pose
Attention
GAN
PDF
-
Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models
Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel
arXiv_CV
arXiv_CV
Weakly_Supervised
Pose
Denoising
Attention
PDF
-
Unsupervised and Few-shot Parsing from Pretrained Language Models
Zhiyuan Zeng, Deyi Xiong
arXiv_CL
arXiv_CL
Segmentation
Unsupervised
Knowledge
Pose
Few-Shot
Attention
Language_Model
PDF
-
Evolutionary Echo State Network: evolving reservoirs in the Fourier space
Sebastian Basterrech, Gerardo Rubino
arXiv_AI
arXiv_AI
Pose
Attention
PDF
-
Deep Multi-view Semi-supervised Clustering with Sample Pairwise Constraints
Rui Chen, Yongqiang Tang, Wensheng Zhang, Wenlong Feng
arXiv_CV
arXiv_CV
Reconstruction
Optimization
Pose
Attention
Prediction
PDF
-
MAREO: Memory- and Attention- based visual REasOning
Mohit Vaishnav, Thomas Serre
arXiv_AI
arXiv_AI
Transformer
Relation
Attention
PDF
-
Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality
Ming Yin, Wenjing Chen, Mengdi Wang, Yu-Xiang Wang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Action
Attention
PDF
-
PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen Categories
Yuchen Rao, Yinyu Nie, Angela Dai
arXiv_CV
arXiv_CV
Reconstruction
3D
Pose
Attention
PDF
-
NAGphormer: Neighborhood Aggregation Graph Transformer for Node Classification in Large Graphs
Jinsong Chen, Kaiyuan Gao, Gaichao Li, Kun He
arXiv_AI
arXiv_AI
Transformer
Pose
Classification
Attention
PDF
-
Learning to Estimate Shapley Values with Vision Transformers
Ian Covert, Chanwoo Kim, Su-In Lee
arXiv_CV
arXiv_CV
Transformer
Attention
Prediction
PDF
-
Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model
Fabian Deuser, Konrad Habel, Philipp J. Rösch, Norbert Oswald
arXiv_CV
arXiv_CV
Classification
VQA
Attention
PDF
-
AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism
Yang Zhao, Xuan Lin, Wenqiang Xu, Maozong Zheng, Zhengyong Liu, Zhou Zhao
arXiv_CV
arXiv_CV
Pose
Detection
Relation
Attention
PDF
-
Deep Leakage from Model in Federated Learning
Zihao Zhao, Mengen Luo, Wenbo Ding
arXiv_AI
arXiv_AI
Pose
Attention
PDF
-
Emoji-based Fine-grained Attention Network for Sentiment Analysis in the Microblog Comments
Deng Yang, Liu Kejian, Yang Cheng, Feng Yuanyuan, Li Weihao
arXiv_CL
arXiv_CL
Embedding
Bert
Sentiment_Classification
RNN
Pose
Emotion
Action
Classification
Sentiment
Attention
PDF
-
Symbolic image detection using scene and knowledge graphs
Nasrin Kalanat, Adriana Kovashka
arXiv_CV
arXiv_CV
Knowledge
Knowledge_Graph
Pose
Classification
Detection
Relation
Attention
PDF
-
R4D: Utilizing Reference Objects for Long-Range Distance Estimation
Yingwei Li, Tiffany Chen, Maya Kabkab, Ruichi Yu, Longlong Jing, Yurong You, Hang Zhao
arXiv_CV
arXiv_CV
Pose
Attention
Autonomous
Prediction
PDF
-
Superresolution and Segmentation of OCT scans using Multi-Stage adversarial Guided Attention Training
Paria Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Dabouei, Ali Rezai, Nasser M. Nasrabadi
arXiv_CV
arXiv_CV
Segmentation
Adversarial
Pose
Relation
Attention
GAN
PDF
-
Building Spatio-temporal Transformers for Egocentric 3D Pose Estimation
Jinman Park, Kimathi Kaai, Saad Hossain, Norikatsu Sumi, Sirisha Rambhatla, Paul Fieguth
arXiv_AI
arXiv_AI
Transformer
3D
Pose_Estimation
Pose
Attention
CNN
PDF
-
An Empirical Study on Disentanglement of Negative-free Contrastive Learning
Jinkun Cao, Ruiqian Nai, Qing Yang, Jialei Huang, Yang Gao
arXiv_AI
arXiv_AI
Represenation_Learning
Self-Supervised
Pose
Contrastive_Learning
Attention
PDF
-
GateHUB: Gated History Unit with Background Suppression for Online Action Detection
Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Detection
Attention
Optical_Flow
Prediction
PDF
-
Extreme Masking for Learning Instance and Distributed Visual Representations
Zhirong Wu, Zihang Lai, Xiao Sun, Stephen Lin
arXiv_CV
arXiv_CV
Attention
PDF
-
AGConv: Adaptive Graph Convolution on 3D Point Clouds
Mingqiang Wei, Zeyong Wei, Haoran Zhou, Fei Hu, Huajian Si, Zhilei Chen, Zhe Zhu, Jingbo Qiu, Xuefeng Yan, Yanwen Guo, Jun Wang, Jing Qin
arXiv_CV
arXiv_CV
Segmentation
Point_Cloud
3D
Pose
Action
Classification
Deep_Learning
Relation
Denoising
Attention
PDF
-
Simple Cues Lead to a Strong Multi-Object Tracker
Jenny Seidenschwarz, Guillem Braso, Ismail Elezi, Laura Leal-Taixe
arXiv_CV
arXiv_CV
Tracking
Object_Tracking
Detection
Attention
Matching
PDF
-
Spatial Entropy Regularization for Vision Transformers
Elia Peruzzo, Enver Sangineto, Yahui Liu, Marco De Nadai, Wei Bi, Bruno Lepri, Nicu Sebe
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Regularization
Self-Supervised
Pose
Attention
PDF
-
GASP: Gated Attention For Saliency Prediction
Fares Abawi, Tom Weber, Stefan Wermter
arXiv_AI
arXiv_AI
Salient
Attention
Prediction
PDF
-
Revisiting End-to-End Speech-to-Text Translation From Scratch
Biao Zhang, Barry Haddow, Rico Sennrich
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Pose
Attention
Speech_Recognition
PDF
-
SparseFormer: Attention-based Depth Completion Network
Frederik Warburg, Michael Ramamonjisoa, Manuel López-Antequera
arXiv_CV
arXiv_CV
Transformer
3D
Sparse
SLAM
Attention
PDF
-
BSM loss: A superior way in modeling aleatory uncertainty of fine-grained classification
Shuang Ge, Kehong Yuan, Maokun Han, Desheng Sun, Huabin Zhang, Qiongyu Ye
arXiv_CV
arXiv_CV
Pose
Classification
Relation
Attention
Medical
Inference
PDF
-
Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-Identification
Xiaohong Wang, Chaoqi Li, Xiangcai Ma
arXiv_CV
arXiv_CV
Person_Re-identification
Enhancement
Recognition
Pose
Attention
Re-identification
PDF
-
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes
Kim Youwang, Kim Ji-Yeon, Tae-Hyun Oh
arXiv_AI
arXiv_AI
Embedding
3D
Optimization
Pose
Attention
Recommendation
PDF
-
Graph Attention Multi-Layer Perceptron
Wentao Zhang, Ziqi Yin, Zeang Sheng, Yang Li, Wen Ouyang, Xiaosen Li, Yangyu Tao, Zhi Yang, Bin Cui
arXiv_AI
arXiv_AI
Sparse
Knowledge
Pose
Relation
Attention
PDF
-
Audio-video fusion strategies for active speaker detection in meetings
Lionel Pibre, Francisco Madrigal, Cyrille Equoy, Frédéric Lerasle, Thomas Pellegrini, Julien Pinquier, Isabelle Ferrané
arXiv_SD
arXiv_SD
Unsupervised
3D
Gesture
Pose
Face
Action
Detection
Attention
Activity
CNN
Optical_Flow
PDF
-
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang, Arturs Backurs, Sébastien Bubeck, Ronen Eldan, Suriya Gunasekar, Tal Wagner
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
Attention
CNN
PDF
-
Indoor Depth Completion with Boundary Consistency and Self-Attention
Yu-Kai Huang, Tsung-Han Wu, Yueh-Cheng Liu, Winston H. Hsu
arXiv_CV
arXiv_CV
Enhancement
Recognition
Inpainting
3D
Restoration
Pose
Face
Attention
PDF
-
VN-Transformer: Rotation-Equivariant Attention for Vector Neurons
Serge Assaad, Carlton Downey, Rami Al-Rfou, Nigamaa Nayakanti, Ben Sapp
arXiv_CV
arXiv_CV
Transformer
3D
Classification
Attention
Inference
PDF
-
DRHDR: A Dual branch Residual Network for Multi-Bracket High Dynamic Range Imaging
Juan Marín-Vega, Michael Sloth, Peter Schneider-Kamp, Richard Röttger
arXiv_CV
arXiv_CV
Pose
Attention
CNN
PDF
-
Learning Ego 3D Representation as Ray Tracing
Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang
arXiv_CV
arXiv_CV
Segmentation
3D
Sparse
Represenation_Learning
Detection
Object_Detection
Attention
PDF
-
ReCo: A Dataset for Residential Community Layout Planning
Xi Chen, Yun Xiong, Siqi Wang, Haofen Wang, Tao Sheng, Yao Zhang, Yu Ye
arXiv_AI
arXiv_AI
Recognition
Adversarial
Deep_Learning
Attention
GAN
PDF
-
Few-Shot Audio-Visual Learning of Environment Acoustics
Sagnik Majumder, Changan Chen, Ziad Al-Halah, Kristen Grauman
arXiv_CV
arXiv_CV
Transformer
3D
Sparse
Few-Shot
Attention
Prediction
PDF
-
Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object Detection
Mohsen Vadidar, Ali Kariminezhad, Christian Mayr, Laurent Kloeker, Lutz Eckstein
arXiv_CV
arXiv_CV
Pose
Detection
Object_Detection
Attention
CNN
PDF
-
Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window
Qiuli Wang, Xin Tan, Chen Liu
arXiv_CV
arXiv_CV
Pose
Classification
Deep_Learning
Attention
Medical
Prediction
PDF
-
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu
arXiv_CV
arXiv_CV
Segmentation
3D
Pose
Action
Denoising
Attention
PDF
-
Set Interdependence Transformer: Set-to-Sequence Neural Networks for Permutation Learning and Structure Prediction
Mateusz Jurewicz, Leon Derczynski
arXiv_CL
arXiv_CL
Transformer
Optimization
Pose
Action
Relation
Attention
Prediction
PDF
-
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Puyang Zhang, Kaihao Zhang, Wenhan Luo, Changsheng Li, Guoren Wang
arXiv_CV
arXiv_CV
Transformer
Restoration
Pose
Face
Action
Quantitative
Attention
PDF
-
A Unified Model for Multi-class Anomaly Detection
Zhiyuan You, Lei Cui, Yujun Shen, Kai Yang, Xin Lu, Yu Zheng, Xinyi Le
arXiv_CV
arXiv_CV
Reconstruction
Embedding
Unsupervised
Pose
Detection
Attention
CNN
PDF
-
UHD Image Deblurring via Multi-scale Cubic-Mixer
Zhuoran Zheng, Xiuyi Jia
arXiv_CV
arXiv_CV
Transformer
Pose
Attention
PDF
-
Joint Adversarial Learning for Cross-domain Fair Classification
Yueqing Liang, Canyu Chen, Tian Tian, Kai Shu
arXiv_AI
arXiv_AI
Adversarial
Pose
Classification
Attention
Prediction
PDF
-
Spatial Cross-Attention Improves Self-Supervised Visual Representation Learning
Mehdi Seyfi, Amin Banitalebi-Dehkordi, Yong Zhang
arXiv_AI
arXiv_AI
Unsupervised
Represenation_Learning
Knowledge
Self-Supervised
Pose
Classification
Detection
Relation
Object_Detection
Attention
Inference
PDF
-
Code-DKT: A Code-based Knowledge Tracing Model for Programming Tasks
Yang Shi, Min Chi, Tiffany Barnes, Thomas Price
arXiv_AI
arXiv_AI
Knowledge
Pose
Attention
Prediction
PDF
-
How to Dissect a Muppet: The Structure of Transformer Embedding Spaces
Timothee Mickus, Denis Paperno, Mathieu Constant
arXiv_CL
arXiv_CL
Transformer
Embedding
Quantitative
Attention
PDF
-
Can CNNs Be More Robust Than Transformers?
Zeyu Wang, Yutong Bai, Yuyin Zhou, Cihang Xie
arXiv_CV
arXiv_CV
Transformer
Recognition
Attention
CNN
PDF
-
RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction
Yuan Liang, Zhuoxuan Jiang, Di Yin, Bo Ren
arXiv_CL
arXiv_CL
Transformer
Pose
Action
Relation
Attention
Prediction
PDF
-
IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach
Haoyuan Chen, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Weiming Hu, Yixin Li, Wanli Liu, Changhao Sun, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek
arXiv_CV
arXiv_CV
Weakly_Supervised
Pose
Action
Classification
Deep_Learning
Attention
Activity
CNN
Image_Classification
PDF
-
Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution
Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
arXiv_CV
arXiv_CV
Super_Resolution
Optimization
Pose
Quantitative
Relation
Attention
PDF
-
DeepTPI: Test Point Insertion with Deep Reinforcement Learning
Zhengyuan Shi, Min Li, Sadaf Khan, Liuzheng Wang, Naixing Wang, Yu Huang, Qiang Xu
arXiv_AI
arXiv_AI
Embedding
Enhancement
Reinforcement_Learning
Pose
Action
Attention
PDF
-
Learning Attention-based Representations from Multiple Patterns for Relation Prediction in Knowledge Graphs
Vítor Lourenço, Aline Paes
arXiv_AI
arXiv_AI
Embedding
Knowledge
Knowledge_Graph
Pose
Relation
Attention
Prediction
PDF
-
Wavelet Prior Attention Learning in Axial Inpainting Network
Chenjie Cao, Chengrong Wang, Yuntao Zhang, Yanwei Fu
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Inpainting
Knowledge
Pose
Quantitative
Attention
PDF
-
Dual Swin-Transformer based Mutual Interactive Network for RGB-D Salient Object Detection
Chao Zeng, Sam Kwong
arXiv_CV
arXiv_CV
Transformer
Salient
Pose
Action
Detection
Object_Detection
Attention
Prediction
PDF
-
Explainable Artificial Intelligence for Internet of Things: A Survey
Ibrahim Kok, Feyza Yildirim Okay, Ozgecan Muyanli, Suat Ozdemir
arXiv_AI
arXiv_AI
Review
Survey
Attention
PDF
-
A Privacy-Preserving Subgraph-Level Federated Graph Neural Network via Differential Privacy
Yeqing Qiu, Chenyu Huang, Jianzong Wang, Zhangcheng Huang, Jing Xiao
arXiv_AI
arXiv_AI
Pose
Attention
Recommendation
PDF
-
Recent Advances for Quantum Neural Networks in Generative Learning
Jinkai Tian, Xiaoyu Sun, Yuxuan Du, Shanshan Zhao, Qing Liu, Kaining Zhang, Wei Yi, Wanrong Huang, Chaoyue Wang, Xingyao Wu, Min-Hsiu Hsieh, Tongliang Liu, Wenjing Yang, Dacheng Tao
arXiv_CV
arXiv_CV
Review
Adversarial
Pose
Relation
Attention
GAN
PDF
-
Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detection
Hongsheng Li, Guangming Zhu, Wu Zhen, Lan Ni, Peiyi Shen, Liang Zhang, Ning Wang, Cong Hua
arXiv_CV
arXiv_CV
Recognition
Optimization
Pose
Action_Recognition
Action
Detection
Relation
Attention
PDF
-
Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT Scans
Jiansheng Fang, Jingwen Wang, Anwei Li, Yuguang Yan, Yonghe Hou, Chao Song, Hongbo Liu, Jiang Liu
arXiv_CV
arXiv_CV
3D
Pose
Attention
GAN
Prediction
Recommendation
PDF
-
TriBYOL: Triplet BYOL for Self-Supervised Representation Learning
Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama
arXiv_AI
arXiv_AI
Represenation_Learning
Self-Supervised
Pose
Attention
PDF
-
Transformer-based Personalized Attention Mechanism for Medical Images with Clinical Records
Yusuke Takagi, Noriaki Hashimoto, Hiroki Masuda, Hiroaki Miyoshi, Koichi Ohshima, Hidekata Hontani, Ichiro Takeuchi
arXiv_CV
arXiv_CV
Transformer
Knowledge
Pose
Relation
Attention
Medical
PDF
-
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma
arXiv_CV
arXiv_CV
Recognition
OCR
RNN
Optical_Character
Self-Supervised
Pose
Detection
Object_Detection
Attention
Inference
PDF
-
DETR++: Taming Your Multi-Scale Detection Transformer
Chi Zhang, Lijuan Liu, Xiaoxue Zang, Frederick Liu, Hao Zhang, Xinying Song, Jindong Chen
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Classification
Detection
Object_Detection
Attention
CNN
PDF
-
HMRNet: High and Multi-Resolution Network with Bidirectional Feature Calibration for Brain Structure Segmentation in Radiotherapy
Hao Fu, Guotai Wang, Wenhui Lei, Wei Xu, Qianfei Zhao, Shichuan Zhang, Kang Li, Shaoting Zhang
arXiv_CV
arXiv_CV
Segmentation
Pose
Attention
PDF
-
Efficient entity-based reinforcement learning
Vince Jankovics, Michael Garcia Ortiz, Eduardo Alonso
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
PDF
-
A Bird's-Eye Tutorial of Graph Attention Architectures
Kaustubh D. Dhole, Carl Yang
arXiv_AI
arXiv_AI
Transformer
Attention
Recommendation
PDF
-
Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images
Tom Ron, Michal Weiler-Sagie, Tamir Hazan
arXiv_CV
arXiv_CV
Recognition
Optimization
Salient
Pose
Attention
Medical
CNN
Prediction
PDF
-
Multi-Behavior Sequential Recommendation with Temporal Graph Transformer
Lianghao Xia, Chao Huang, Yong Xu, Jian Pei
arXiv_AI
arXiv_AI
Transformer
Knowledge
Pose
Action
Relation
Attention
Recommendation
PDF
-
Separable Self-attention for Mobile Vision Transformers
Sachin Mehta, Mohammad Rastegari
arXiv_AI
arXiv_AI
Transformer
Pose
Classification
Detection
Object_Detection
Attention
CNN
PDF
-
Learning with Capsules: A Survey
Fabio De Sousa Ribeiro, Kevin Duarte, Miles Everett, Georgios Leontidis, Mubarak Shah
arXiv_CV
arXiv_CV
Transformer
Represenation_Learning
Pose
Survey
Deep_Learning
Relation
Attention
Medical
CNN
Inference
PDF
-
Learning Generalized Wireless MAC Communication Protocols via Abstraction
Luciano Miuccio, Salvatore Riolo, Sumudu Samarakoon, Daniela Panno, Mehdi Bennis
arXiv_AI
arXiv_AI
Reinforcement_Learning
Optimization
Pose
Action
Attention
PDF
-
Real-World Image Super-Resolution by Exclusionary Dual-Learning
Hao Li, Jinghui Qin, Zhijing Yang, Pengxu Wei, Jinshan Pan, Liang Lin, Yukai Shi
arXiv_CV
arXiv_CV
Super_Resolution
Restoration
Optimization
Pose
Deep_Learning
Relation
Attention
PDF
-
Towards Practical Differential Privacy in Data Analysis: Understanding the Effect of Epsilon on Utility in Private ERM
Yuzhe Li, Yong Liu, Bo Li, Weiping Wang, Nan Liu
arXiv_AI
arXiv_AI
Pose
Relation
Attention
GAN
PDF
-
Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object Detection
Chaoqi Chen, Jiongcheng Li, Hong-Yu Zhou, Xiaoguang Han, Yue Huang, Xinghao Ding, Yizhou Yu
arXiv_CV
arXiv_CV
Knowledge
Pose
Action
Detection
Relation
Object_Detection
Attention
PDF
-
Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data
Shohreh Deldari, Hao Xue, Aaqib Saeed, Jiayuan He, Daniel V. Smith, Flora D. Salim
arXiv_CV
arXiv_CV
Represenation_Learning
Review
Speech
Self-Supervised
Attention
PDF
-
MASNet: Improve Performance of Siamese Networks with Mutual-attention for Remote Sensing Change Detection Tasks
Hongbin Zhou, Yupeng Ren, Qiankun Li, Jun Yin, Yonggang Lin
arXiv_AI
arXiv_AI
Transformer
Action
Detection
Attention
CNN
PDF
-
Tagged-MRI2Audio with Attention Guided Heterogeneous Translator
Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Jiachen Zhuo, Maureen Stone, Georges El Fakhri, Jonghye Woo
arXiv_CV
arXiv_CV
Adversarial
Speech
Deep_Learning
Relation
Attention
GAN
CNN
PDF
-
SealID: Saimaa ringed seal re-identification dataset
Ekaterina Nepovinnykh, Tuomas Eerola, Vincent Biard, Piia Mutka, Marja Niemi, Heikki Kälviäinen, Mervi Kunnasranta
arXiv_CV
arXiv_CV
Pose
Attention
Re-identification
PDF
-
3D Convolutional with Attention for Action Recognition
Labina Shrestha, Shikha Dubey, Farrukh Olimov, Muhammad Aasim Rafique, Moongu Jeon
arXiv_CV
arXiv_CV
Recognition
3D
RNN
Pose
Action_Recognition
Action
Attention
CNN
Optical_Flow
PDF
-
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation
Vishal Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe
arXiv_CV
arXiv_CV
Recognition
Pose
Emotion
Action
Attention
PDF
-
A Simple Meta-learning Paradigm for Zero-shot Intent Classification with Mixture Attention Mechanism
Han Liu, Siyang Zhao, Xiaotong Zhang, Feng Zhang, Junjie Sun, Hong Yu, Xianchao Zhang
arXiv_CL
arXiv_CL
Zero-Shot
Pose
Classification
Attention
PDF
-
MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving
Stepan Konev, Kirill Brodt, Artsiom Sanakoyeu
arXiv_CV
arXiv_CV
Pose
Attention
CNN
Autonomous
Prediction
PDF
-
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye
arXiv_CL
arXiv_CL
Knowledge
Speech
Pose
Attention
PDF
-
Recurrent Video Restoration Transformer with Guided Deformable Attention
Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van Gool
arXiv_CV
arXiv_CV
Transformer
Super_Resolution
Restoration
Pose
Denoising
Attention
PDF
-
MPANet: Multi-Patch Attention For Infrared Small Target object Detection
Ao Wang, Wei Li, Xin Wu, Zhanchao Huang, Ran Tao
arXiv_CV
arXiv_CV
Pose
Detection
Object_Detection
Attention
CNN
PDF
-
PIDNet: A Real-time Semantic Segmentation Network Inspired from PID Controller
Jiacong Xu, Zixiang Xiong, Shankar P. Bhattacharyya
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Pose
Attention
CNN
Inference
PDF
-
CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasks
Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jingrong Jiang, Qianjin Guo, Linghan Zheng
arXiv_AI
arXiv_AI
Transformer
Unsupervised
Detection
Relation
Attention
CNN
Inference
PDF
-
MACC: Cross-Layer Multi-Agent Congestion Control with Deep Reinforcement Learning
Jianing Bai, Tianhao Zhang, Guangming Xie
arXiv_AI
arXiv_AI
Reinforcement_Learning
Pose
Attention
PDF
-
From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen
arXiv_CV
arXiv_CV
Pose
VQA
Attention
QA
PDF
-
Video-based Human-Object Interaction Detection from Tubelet Tokens
Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, Wei Shen
arXiv_CV
arXiv_CV
Transformer
Action
Detection
Attention
PDF
-
Recurrent Image Registration using Mutual Attention based Network
Jian-Qing Zheng, Ziyang Wang, Baoru Huang, Ngee Han Lim, Tonia Vincent, Bartlomiej W. Papiez
arXiv_CV
arXiv_CV
3D
Pose
Face
Deep_Learning
Attention
GAN
Medical
Inference
PDF
-
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Zhewei Yao, Reza Yazdani Aminabadi, Minjia Zhang, Xiaoxia Wu, Conglong Li, Yuxiong He
arXiv_CL
arXiv_CL
Transformer
Quantization
Bert
Knowledge
Attention
Inference
Language_Model
PDF
-
EAANet: Efficient Attention Augmented Convolutional Networks
Runqing Zhang, Tianshu Zhu
arXiv_CV
arXiv_CV
Salient
Pose
Attention
CNN
PDF
-
QAGCN: A Graph Convolutional Network-based Multi-Relation Question Answering System
Ruijie Wang, Luca Rossetto, Michael Cochez, Abraham Bernstein
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Knowledge
Knowledge_Graph
Pose
Relation
Attention
CNN
QA
PDF
-
Additive MIL: Intrinsic Interpretability for Pathology
Syed Ashar Javed, Dinkar Juyal, Harshith Padigela, Amaro Taylor-Weiner, Limin Yu, Aaditya Prakash
arXiv_CV
arXiv_CV
Pose
Attention
PDF
-
Radar Guided Dynamic Visual Attention for Resource-Efficient RGB Object Detection
Hemant Kumawat, Saibal Mukhopadhyay
arXiv_AI
arXiv_AI
Point_Cloud
Pose
Deep_Learning
Detection
Object_Detection
Attention
Autonomous
PDF
-
Egocentric Video-Language Pretraining
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou
arXiv_AI
arXiv_AI
Recognition
Pose
Contrastive_Learning
Action_Recognition
Action
Classification
Attention
PDF
-
Employing Socially Interactive Agents for Robotic Neurorehabilitation Training
Rhythm Arora, Matteo Lavit Nicora, Pooja Prajod, Daniele Panzeri, Elisabeth André, Patrick Gebhard, Matteo Malosio
arXiv_AI
arXiv_AI
Face
Classification
Attention
PDF
-
Anomaly detection in surveillance videos using transformer based attention model
Kapil Deshpande, Narinder Singh Punn, Sanjay Kumar Sonbhadra, Sonali Agarwal
arXiv_CV
arXiv_CV
Transformer
Surveillance
Weakly_Supervised
Pose
Detection
Attention
PDF
-
GraphDistNet: A Graph-based Collision-distance Estimator for Gradient-based Trajectory
Yeseung Kim, Jinwoo Kim, Daehyung Park
arXiv_RO
arXiv_RO
Optimization
Attention
CNN
PDF
-
YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detection
Xiao Ruiqiang
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Deep_Learning
Detection
Attention
Inference
PDF
-
Transformer-Based Self-Supervised Learning for Emotion Recognition
Juan Vazquez-Rodriguez (M-PSI), Grégoire Lefebvre, Julien Cumin, James L. Crowley (M-PSI)
arXiv_AI
arXiv_AI
Transformer
Recognition
Self-Supervised
Pose
Emotion
Attention
PDF
-
Exploring Transformers for Behavioural Biometrics: A Case Study in Gait Recognition
Paula Delgado-Santos, Ruben Tolosana, Richard Guest, Farzin Deravi, Ruben Vera-Rodriguez
arXiv_CV
arXiv_CV
Transformer
Recognition
Gait_Recognition
RNN
Knowledge
Pose
Deep_Learning
Attention
CNN
PDF
-
Slot Order Matters for Compositional Scene Understanding
Patrick Emami, Pan He, Sanjay Ranka, Anand Rangarajan
arXiv_CV
arXiv_CV
Relation
Attention
Inference
PDF
-
Adversarial Attacks on Human Vision
Victor A. Mateescu, Ivan V. Bajić
arXiv_CV
arXiv_CV
Salient
Review
Adversarial
Attention
PDF
-
MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data
Yilun Zhao, Yunxiang Li, Chenying Li, Rui Zhang
arXiv_AI
arXiv_AI
Attention
QA
PDF
-
H-EMD: A Hierarchical Earth Mover's Distance Method for Instance Segmentation
Peixian Liang, Yizhe Zhang, Yifan Ding, Jianxu Chen, Chinedu S. Madukoma, Tim Weninger, Joshua D. Shrout, Danny Z. Chen
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
3D
Optimization
Pose
Action
Deep_Learning
Attention
Medical
PDF
-
Entangled Residual Mappings
Mathias Lechner, Ramin Hasani, Zahra Babaiee, Radu Grosu, Daniela Rus, Thomas A. Henzinger, Sepp Hochreiter
arXiv_AI
arXiv_AI
Transformer
Sparse
Representation_Learning
RNN
Relation
Attention
CNN
PDF
-
EfficientFormer: Vision Transformers at MobileNet Speed
Yanyu Li, Geng Yuan, Yang Wen, Eric Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian Ren
arXiv_CV
arXiv_CV
Transformer
NAS
Attention
CNN
Inference
PDF
-
A Dual-fusion Semantic Segmentation Framework With GAN For SAR Images
Donghui Li, Jia Liu, Fang Liu, Wenhua Zhang, Andi Zhang, Wenfei Gao, Jiao Shi
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Pose
Deep_Learning
Attention
GAN
PDF
-
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger, Robert Platt, Christopher Amato
arXiv_AI
arXiv_AI
Transformer
Reinforcement_Learning
RNN
Pose
Attention
PDF
-
Detecting the Severity of Major Depressive Disorder from Speech: A Novel HARD-Training Methodology
Edward L. Campbell, Judith Dineley, Pauline Conde, Faith Matcham, Femke Lamers, Sara Siddi, Laura Docio-Fernandez, Carmen Garcia-Mateo, Nicholas Cummins, the RADAR-CNS Consortium
arXiv_SD
arXiv_SD
Speech
Pose
Classification
Detection
Attention
Prediction
PDF
-
Adversarial Laser Spot: Robust and Covert Physical Adversarial Attack to DNNs
Chengyin Hu
arXiv_AI
arXiv_AI
Adversarial
Pose
Attention
PDF
-
Structured Two-stream Attention Network for Video Question Answering
Lianli Gao, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei, Heng Tao Shen
arXiv_CV
arXiv_CV
Pose
Action
VQA
Attention
QA
PDF
-
Long-tailed Recognition by Learning from Latent Categories
Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
arXiv_CV
arXiv_CV
Recognition
Pose
Attention
PDF
-
Unified Recurrence Modeling for Video Action Anticipation
Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz
arXiv_CV
arXiv_CV
Pose
Action
Attention
Inference
Prediction
PDF
-
Modeling Image Composition for Complex Scene Generation
Zuopeng Yang, Daqing Liu, Chaoyue Wang, Jie Yang, Dacheng Tao
arXiv_CV
arXiv_CV
Transformer
Scene_Generation
Pose
Quantitative
Relation
Few-Shot
Attention
PDF
-
A General Framework for the Representation of Function and Affordance: A Cognitive, Causal, and Grounded Approach, and a Step Toward AGI
Seng-Beng Ho
arXiv_AI
arXiv_AI
Sparse
Attention
PDF
-
KPGT: Knowledge-Guided Pre-training of Graph Transformer for Molecular Property Prediction
Han Li, Dan Zhao, Jianyang Zeng
arXiv_AI
arXiv_AI
Transformer
Representation_Learning
Knowledge
Self-Supervised
Pose
Deep_Learning
Attention
Prediction
PDF
-
MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet
Nan Wang, Shaohui Lin, Xiaoxiao Li, Ke Li, Yunhang Shen, Yue Gao, Lizhuang Ma
arXiv_CV
arXiv_CV
Transformer
Segmentation
3D
Pose
Action
Attention
Medical
Inference
PDF
-
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Sehoon Kim, Amir Gholami, Albert Shaw, Nicholas Lee, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Kurt Keutzer
arXiv_CL
arXiv_CL
Transformer
Recognition
Speech
Pose
Attention
Speech_Recognition
CNN
Language_Model
PDF
-
Watch Out for the Safety-Threatening Actors: Proactively Mitigating Safety Hazards
Saurabh Jha, Shengkun Cui, Zbigniew Kalbarczyk, Ravishankar K. Iyer
arXiv_RO
arXiv_RO
Pose
Attention
Autonomous
PDF
-
RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning
Hao Chen, Guangkai Yang, Junge Zhang, Qiyue Yin, Kaiqi Huang
arXiv_AI
arXiv_AI
Reinforcement_Learning
Zero-Shot
Pose
Face
Action
Relation
Attention
PDF
-
XBound-Former: Toward Cross-scale Boundary Modeling in Transformers
Jiacheng Wang, Fei Chen, Yuxi Ma, Liansheng Wang, Zhaodong Fei, Jianwei Shuai, Xiangdong Tang, Qichao Zhou, Jing Qin
arXiv_AI
arXiv_AI
Transformer
Segmentation
Knowledge
Pose
Quantitative
Attention
PDF
-
Dynamic Linear Transformer for 3D Biomedical Image Segmentation
Zheyuan Zhang, Ulas Bagci
arXiv_CV
arXiv_CV
Transformer
Segmentation
3D
Pose
Attention
Medical
PDF
-
What Changed? Investigating Debiasing Methods using Causal Mediation Analysis
Sullam Jeoung, Jana Diesner
arXiv_AI
arXiv_AI
Pose
Detection
Attention
Language_Model
Prediction
PDF
-
Higher-Order Attention Networks
Mustafa Hajij, Ghada Zamzmi, Theodore Papamarkou, Nina Miolane, Aldo Guzmán-Sáenz, Karthikeyan Natesan Ramamurthy
arXiv_CV
arXiv_CV
Relation
Attention
PDF
-
Impact of loss function in Deep Learning methods for accurate retinal vessel segmentation
Daniela Herrera, Gilberto Ochoa-Ruiz, Miguel Gonzalez-Mendoza, Christian Mata
arXiv_CV
arXiv_CV
Segmentation
Deep_Learning
Attention
PDF
-
Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines
Camilo Fosco, Emilie Josephs, Alex Andonian, Allen Lee, Xi Wang, Aude Oliva
arXiv_CV
arXiv_CV
Recognition
Pose
Detection
Attention
PDF
-
What a Creole Wants, What a Creole Needs
Heather Lent, Kelechi Ogueji, Miryam de Lhoneux, Orevaoghene Ahia, Anders Søgaard
arXiv_CL
arXiv_CL
Survey
Attention
PDF
-
Evaluating Gaussian Grasp Maps for Generative Grasping Models
William Prew, Toby P. Breckon, Magnus Bordewich, Ulrik Beierholm
arXiv_CV
arXiv_CV
Transfer_Learning
Pose
Attention
Inference
PDF
-
Strongly Augmented Contrastive Clustering
Xiaozhi Deng, Dong Huang, Ding-Hua Chen, Chang-Dong Wang, Jian-Huang Lai
arXiv_CV
arXiv_CV
Unsupervised
Representation_Learning
Pose
Contrastive_Learning
Attention
PDF
-
Augmenting Message Passing by Retrieving Similar Graphs
Dingmin Wang, Shengchao Liu, Hanchen Wang, Linfeng Song, Jian Tang, Song Le, Bernardo Cuenca Grau, Qi Liu
arXiv_AI
arXiv_AI
Representation_Learning
Pose
Action
Classification
Attention
PDF
-
CellCentroidFormer: Combining Self-attention and Convolution for Cell Detection
Royden Wagner, Karl Rohr
arXiv_CV
arXiv_CV
Transformer
Transfer_Learning
Pose
Deep_Learning
Detection
Attention
CNN
PDF
-
Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices
Bahri Batuhan Bilecen, Alparslan Fisne, Mustafa Ayazoglu
arXiv_CV
arXiv_CV
Super_Resolution
Pose
Relation
Attention
PDF
-
Automatic Bounding Box Annotation with Small Training Data Sets for Industrial Manufacturing
Manuela Geiß, Raphael Wagner, Martin Baresch, Josef Steiner, Michael Zwick
arXiv_CV
arXiv_CV
Deep_Learning
Detection
Object_Detection
Attention
PDF
-
Visual Transformer for Object Detection
Michael Yang
arXiv_CV
arXiv_CV
Image_Caption
Transformer
Pose
Action
Detection
Object_Detection
Attention
Caption
CNN
PDF
-
Fair Comparison between Efficient Attentions
Jiuk Hong, Chaehyeon Lee, Soyoun Bang, Heechul Jung
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Prediction
PDF
-
Lower and Upper Bounds for Numbers of Linear Regions of Graph Convolutional Networks
Hao Chen, Yu Guang Wang, Huan Xiong
arXiv_AI
arXiv_AI
Attention
CNN
PDF
-
Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token Alignment
Jinhong Deng, Xiaoyue Zhang, Wen Li, Lixin Duan
arXiv_CV
arXiv_CV
Transformer
Embedding
Adversarial
Pose
Detection
Relation
Object_Detection
Attention
PDF
-
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation
Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su
arXiv_SD
arXiv_SD
Speech
Pose
Attention
PDF
-
Differentiable Soft-Masked Attention
Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Attention
PDF
-
Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation
Leyla Khaleghi, Joshua Marshall, Ali Etemad
arXiv_CV
arXiv_CV
Transformer
Embedding
3D
Pose_Estimation
Pose
Action
Attention
CNN
PDF
-
Design and Simulation of an Autonomous Quantum Flying Robot Vehicle: An IBM Quantum Experience
Sudev Pradhan, Anshuman Padhi, Bikash Kumar Behera
arXiv_RO
arXiv_RO
Pose
Attention
Autonomous
PDF
-
VALHALLA: Visual Hallucination for Machine Translation
Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu (Richard) Chen, Rogerio Feris, David Cox, Nuno Vasconcelos
arXiv_CV
arXiv_CV
Transformer
Attention
Inference
Prediction
PDF
-
Neural Retriever and Go Beyond: A Thesis Proposal
Man Luo
arXiv_CL
arXiv_CL
Knowledge
Pose
Face
Attention
Matching
PDF
-
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving
Kashyap Chitta, Aditya Prakash, Bernhard Jaeger, Zehao Yu, Katrin Renz, Andreas Geiger
arXiv_AI
arXiv_AI
Transformer
Pose
Detection
Object_Detection
Attention
Autonomous
PDF
-
Indirect Point Cloud Registration: Aligning Distance Fields using a Pseudo Third Point Set
Yijun Yuan, Andreas Nuechter
arXiv_RO
arXiv_RO
Reconstruction
Point_Cloud
3D
Pose
Deep_Learning
Attention
PDF
-
Two-Dimensional Quantum Material Identification via Self-Attention and Soft-labeling in Deep Learning
Xuan Bac Nguyen, Apoorva Bisht, Hugh Churchill, Khoa Luu
arXiv_AI
arXiv_AI
Segmentation
Pose
Deep_Learning
Detection
Attention
PDF
-
Skeleton-based Action Recognition via Temporal-Channel Aggregation
Shengqin Wang, Yongji Zhang, Fenglin Wei, Kai Wang, Minghao Zhao, Yu Jiang
arXiv_CV
arXiv_CV
Recognition
Knowledge
Pose
Action_Recognition
Action
Attention
CNN
PDF
-
A robust and lightweight deep attention multiple instance learning algorithm for predicting genetic alterations
Bangwei Guo, Xingyu Li, Miaomiao Yang, Hong Zhang, Xu Steven Xu
arXiv_AI
arXiv_AI
Pose
Attention
CNN
PDF
-
A Review of Mobile Mapping Systems: From Sensors to Applications
Mostafa Elhashash, Hessah Albanwan, Rongjun Qin
arXiv_CV
arXiv_CV
Review
Attention
PDF
-
Surface Analysis with Vision Transformers
Simon Dahan, Logan Z. J. Williams, Abdulah Fawaz, Daniel Rueckert, Emma C. Robinson
arXiv_CV
arXiv_CV
Transformer
Pose
Face
Attention
CNN
Prediction
PDF
-
SymFormer: End-to-end symbolic regression using transformer-based architecture
Martin Vastl, Jonáš Kulhánek, Jiří Kubalík, Erik Derner, Robert Babuška
arXiv_CV
arXiv_CV
Transformer
3D
Sparse
Pose_Estimation
Pose
Attention
PDF
-
Omni-Granular Ego-Semantic Propagation for Self-Supervised Graph Representation Learning
Ling Yang, Shenda Hong
arXiv_AI
arXiv_AI
Unsupervised
Representation_Learning
Self-Supervised
Pose
Classification
Attention
PDF
-
Transformers for Multi-Object Tracking on Point Clouds
Felicia Ruppel, Florian Faion, Claudius Gläser, Klaus Dietmayer
arXiv_CV
arXiv_CV
Transformer
Tracking
Point_Cloud
Object_Tracking
Pose
Detection
Object_Detection
Attention
Prediction
PDF
-
Progressive Multi-scale Consistent Network for Multi-class Fundus Lesion Segmentation
Along He, Kai Wang, Tao Li, Wang Bo, Hong Kang, Huazhu Fu
arXiv_CV
arXiv_CV
Segmentation
Pose
Action
Attention
Prediction
PDF
-
Automatic Relation-aware Graph Network Proliferation
Shaofei Cai, Liang Li, Xinzhe Han, Jiebo Luo, Zheng-Jun Zha, Qingming Huang
arXiv_AI
arXiv_AI
NAS
Pose
Relation
Attention
PDF
-
Individual health-disease phase diagrams for disease prevention based on machine learning
Kazuki Nakamura, Eiichiro Uchino, Noriaki Sato, Ayano Araki, Kei Terayama, Ryosuke Kojima, Koichi Murashita, Ken Itoh, Tatsuya Mikami, Yoshinori Tamada, Yasushi Okuno
arXiv_AI
arXiv_AI
Detection
Relation
Attention
Prediction
PDF
-
An Effective Fusion Method to Enhance the Robustness of CNN
Yating Ma, Zhichao Lian
arXiv_CV
arXiv_CV
Pose
Classification
Denoising
Attention
CNN
Image_Classification
PDF
-
Hierarchical Spherical CNNs with Lifting-based Adaptive Wavelets for Pooling and Unpooling
Mingxing Xu, Chenglin Li, Wenrui Dai, Siheng Chen, Junni Zou, Pascal Frossard, Hongkai Xiong
arXiv_AI
arXiv_AI
Pose
Relation
Attention
CNN
PDF
-
A Multi-level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference
Shu'ang Li, Xuming Hu, Li Lin, Aiwei Liu, Lijie Wen, Philip S. Yu
arXiv_CL
arXiv_CL
Text_Classification
Pose
Contrastive_Learning
Classification
Relation
Attention
Inference
Prediction
PDF
-
itKD: Interchange Transfer-based Knowledge Distillation for 3D Object Detection
Hyeon Cho, Junyong Choi, Geonwoo Baek, Wonjun Hwang
arXiv_CV
arXiv_CV
Reconstruction
Point_Cloud
3D
Regularization
Knowledge
Pose
Deep_Learning
Detection
Object_Detection
Attention
Matching
PDF
-
A Knowledge-Enhanced Adversarial Model for Cross-lingual Structured Sentiment Analysis
Qi Zhang, Jie Zhou, Qin Chen, Qingchun Bai, Jun Xiao, Liang He
arXiv_CL
arXiv_CL
Embedding
Unsupervised
Knowledge
Adversarial
Pose
Sentiment
Attention
PDF
-
Joint Spatial-Temporal and Appearance Modeling with Transformer for Multiple Object Tracking
Peng Dai, Yiqiang Feng, Renliang Weng, Changshui Zhang
arXiv_CV
arXiv_CV
Transformer
Tracking
Object_Tracking
Pose
Deep_Learning
Detection
Relation
Attention
PDF
-
Analyzing Modality Robustness in Multimodal Sentiment Analysis
Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria
arXiv_CL
arXiv_CL
Pose
Sentiment
Attention
PDF
-
HeatER: An Efficient and Unified Network for Human Reconstruction via Heatmap-based TransformER
Ce Zheng, Matias Mendieta, Taojiannan Yang, Chen Chen
arXiv_AI
arXiv_AI
Transformer
Reconstruction
3D
Pose_Estimation
Pose
Attention
PDF
-
Continual Object Detection: A review of definitions, strategies, and challenges
Angelo G. Menezes, Gustavo de Moura, Cézanne Alves, André C. P. L. F. de Carvalho
arXiv_CV
arXiv_CV
Review
Pose
Classification
Detection
Object_Detection
Attention
Autonomous
PDF
-
Can Transformer be Too Compositional? Analysing Idiom Processing in Neural Machine Translation
Verna Dankers, Christopher G. Lucas, Ivan Titov
arXiv_CL
arXiv_CL
Transformer
Action
NMT
Attention
PDF
-
GAN-based Medical Image Small Region Forgery Detection via a Two-Stage Cascade Framework
Jianyi Zhang, Xuanxi Huang, Yaqi Liu, Yuyang Han, Zixiao Xiang
arXiv_CV
arXiv_CV
Enhancement
Adversarial
Pose
Classification
Detection
Object_Detection
Attention
GAN
Medical
PDF
-
Detecting fake news by enhanced text representation with multi-EDU-structure awareness
Yuhang Wang, Li Wang, Yanjie Yang, Yilin Zhang
arXiv_AI
arXiv_AI
Pose
Detection
Relation
Attention
PDF
-
Transformer with Tree-order Encoding for Neural Program Generation
Klaudia-Doris Thellmann, Bernhard Stadler, Ricardo Usbeck, Jens Lehmann
arXiv_CL
arXiv_CL
Transformer
RNN
Attention
PDF
-
CompleteDT: Point Cloud Completion with Dense Augment Inference Transformers
Jun Li, Shangwei Guo, Zhengchao Lai, Xiantong Meng, Shaokun Han
arXiv_CV
arXiv_CV
Transformer
Point_Cloud
Pose
Relation
Attention
Inference
PDF
-
HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Xiaosong Zhang, Yunjie Tian, Wei Huang, Qixiang Ye, Qi Dai, Lingxi Xie, Qi Tian
arXiv_CV
arXiv_CV
Transformer
Segmentation
Transfer_Learning
Self-Supervised
Pose
Detection
Attention
PDF
-
Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving
Peixuan Li, Jieyu Jin
arXiv_CV
arXiv_CV
Transformer
Tracking
3D
Object_Tracking
Pose
Detection
Relation
Object_Detection
Attention
Autonomous
PDF
-
Benchmarking Unsupervised Anomaly Detection and Localization
Ye Zheng, Xiang Wang, Yu Qi, Wei Li, Liwei Wu
arXiv_CV
arXiv_CV
Unsupervised
3D
Pose
Detection
Attention
Inference
PDF
-
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu
arXiv_SD
arXiv_SD
Pose
Attention
PDF
-
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Aniket Didolkar, Kshitij Gupta, Anirudh Goyal, Alex Lamb, Nan Rosemary Ke, Yoshua Bengio
arXiv_AI
arXiv_AI
Transformer
RNN
Pose
Attention
PDF
-
UPB at SemEval-2022 Task 5: Enhancing UNITER with Image Sentiment and Graph Convolutional Networks for Multimedia Automatic Misogyny Identification
Andrei Paraschiv, Mihai Dascalu, Dumitru-Clementin Cercel
arXiv_AI
arXiv_AI
Speech
Pose
Classification
Detection
Sentiment
Attention
Caption
CNN
PDF
-
EfficientViT: Enhanced Linear Attention for High-Resolution Low-Computation Visual Recognition
Han Cai, Chuang Gan, Song Han
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Recognition
Pose
Action
Detection
Object_Detection
Attention
CNN
PDF
-
Modeling Beats and Downbeats with a Time-Frequency Transformer
Yun-Ning Hung, Ju-Chiang Wang, Xuchen Song, Wei-Tsung Lu, Minz Won
arXiv_SD
arXiv_SD
Transformer
Tracking
Pose
Attention
CNN
PDF
-
Context-based Virtual Adversarial Training for Text Classification with Noisy Labels
Do-Myoung Lee, Yeachan Kim, Chang-gyun Seo
arXiv_CL
arXiv_CL
Text_Classification
Adversarial
Pose
Classification
Attention
PDF
-
Micro-Expression Recognition Based on Attribute Information Embedding and Cross-modal Contrastive Learning
Yanxin Song, Jianzong Wang, Tianbo Wu, Zhangcheng Huang, Jing Xiao
arXiv_AI
arXiv_AI
Embedding
Recognition
3D
Bert
Pose
Contrastive_Learning
Action
Attention
PDF
-
Cervical Glandular Cell Detection from Whole Slide Image with Out-Of-Distribution Data
Ziquan Wei, Shenghua Cheng, Xiuli Liu, Shaoqun Zeng
arXiv_CV
arXiv_CV
Knowledge
Pose
Deep_Learning
Detection
Object_Detection
Attention
Prediction
PDF
-
A General Multiple Data Augmentation Based Framework for Training Deep Neural Networks
Binyan Hu, Yu Sun, A. K. Qin
arXiv_CV
arXiv_CV
Knowledge
Pose
Classification
Attention
Image_Classification
Inference
PDF
-
Masked Distillation with Receptive Tokens
Tao Huang, Yuan Zhang, Shan You, Fei Wang, Chen Qian, Jian Cao, Chang Xu
arXiv_CV
arXiv_CV
Reconstruction
Segmentation
Embedding
Semantic_Segmentation
Detection
Object_Detection
Attention
Prediction
PDF
-
3D-C2FT: Coarse-to-fine Transformer for Multi-view 3D Reconstruction
Leslie Ching Ow Tiong, Dick Sigmund, Andrew Beng Jin Teoh
arXiv_AI
arXiv_AI
Transformer
Reconstruction
3D
Pose
Face
Relation
Attention
PDF
-
Mean Field inference of CRFs based on GAT
LingHong Xing, XiangXiang Ma, GuangSheng Luo
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
Pose
Attention
Inference
PDF
-
A Character-Level Length-Control Algorithm for Non-Autoregressive Sentence Summarization
Puyuan Liu, Xiang Zhang, Lili Mou
arXiv_CL
arXiv_CL
Pose
Classification
Attention
Summarization
PDF
-
Contributor-Aware Defenses Against Adversarial Backdoor Attacks
Glenn Dawson, Muhammad Umer, Robi Polikar
arXiv_AI
arXiv_AI
Knowledge
Adversarial
Pose
Classification
Detection
Relation
Attention
Image_Classification
Inference
PDF
-
MDMLP: Image Classification from Scratch on Small Datasets with MLP
Tian Lv, Chongyang Bai, Chaojie Wang
arXiv_AI
arXiv_AI
Transformer
Classification
Attention
Image_Classification
PDF
-
BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset
Mohammad Faiyaz Khan, S.M. Sadiq-Ur-Rahman Shifath, Md Saiful Islam
arXiv_CL
arXiv_CL
Image_Caption
Pose
Quantitative
Attention
Caption
PDF
-
A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks
Yu Pan, Zeyong Su, Ao Liu, Jingquan Wang, Nannan Li, Zenglin Xu
arXiv_AI
arXiv_AI
Pose
Attention
CNN
PDF
-
Feature Pyramid Attention based Residual Neural Network for Environmental Sound Classification
Liguang Zhou, Yuhongze Zhou, Xiaonan Qi, Junjie Hu, Tin Lun Lam, Yangsheng Xu
arXiv_SD
arXiv_SD
Pose
Classification
Relation
Attention
CNN
PDF
-
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li
arXiv_AI
arXiv_AI
Transformer
Reconstruction
Segmentation
Point_Cloud
3D
Representation_Learning
Self-Supervised
Pose
Classification
Detection
Few-Shot
Object_Detection
Attention
PDF
-
Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction
Jiaxin Yu, Deqing Yang, Shuyu Tian
arXiv_CL
arXiv_CL
Pose
Action
Classification
Relation
Relation_Extraction
Attention
PDF
-
Multi-Task Learning with Multi-query Transformer for Dense Prediction
Yangyang Xu, Xiangtai Li, Haobo Yuan, Yibo Yang, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao
arXiv_CV
arXiv_CV
Transformer
Pose
Relation
Attention
Prediction
PDF
-
Object-wise Masked Autoencoders for Fast Pre-training
Jiantao Wu, Shentong Mo
arXiv_CV
arXiv_CV
Transformer
Reconstruction
Self-Supervised
Classification
Relation
Attention
Image_Classification
PDF
-
Speech Augmentation Based Unsupervised Learning for Keyword Spotting
Jian Luo, Jianzong Wang, Ning Cheng, Haobin Tang, Jing Xiao
arXiv_SD
arXiv_SD
Unsupervised
Speech
Pose
Classification
Attention
PDF
-
Multimodal Fake News Detection via CLIP-Guided Learning
Yangming Zhou, Qichao Ying, Zhenxing Qian, Sheng Li, Xinpeng Zhang
arXiv_CV
arXiv_CV
Bert
Pose
Detection
Attention
PDF
-
Deep Embedded Clustering with Distribution Consistency Preservation for Attributed Networks
Yimei Zheng, Caiyan Jia, Jian Yu, Xuanya Li
arXiv_AI
arXiv_AI
Pose
Attention
PDF
-
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei, Han Hu, Zhenda Xie, Zheng Zhang, Yue Cao, Jianmin Bao, Dong Chen, Baining Guo
arXiv_CV
arXiv_CV
Optimization
Self-Supervised
Contrastive_Learning
Classification
Attention
Image_Classification
PDF
-
Image Harmonization with Region-wise Contrastive Learning
Jingtang Liang, Chi-Man Pun
arXiv_CV
arXiv_CV
Reconstruction
Enhancement
Pose
Contrastive_Learning
Attention
PDF
-
Future Transformer for Long-term Action Anticipation
Dayoung Gong, Joonseok Lee, Manjin Kim, Seong Jong Ha, Minsu Cho
arXiv_CV
arXiv_CV
Transformer
Pose
Action
Relation
Attention
Inference
PDF
-
What Dense Graph Do You Need for Self-Attention?
Yuxing Wang, Chu-Tak Lee, Qipeng Guo, Zhangyue Yin, Yunhua Zhou, Xuanjing Huang, Xipeng Qiu
arXiv_AI
arXiv_AI
Transformer
Sparse
Pose
Action
Attention
PDF
-
Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation
Bin Lu, Xiaoying Gan, Lina Yang, Weinan Zhang, Luoyi Fu, Xinbing Wang
arXiv_AI
arXiv_AI
Knowledge
Pose
Classification
Few-Shot
Attention
PDF
-
Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation
Deeksha Varshney, Akshara Prabhakar, Asif Ekbal
arXiv_AI
arXiv_AI
Knowledge
Pose
Attention
PDF
-
EvenNet: Ignoring Odd-Hop Neighbors Improves Robustness of Graph Neural Networks
Runlin Lei, Zhen Wang, Yaliang Li, Bolin Ding, Zhewei Wei
arXiv_AI
arXiv_AI
Pose
Face
Classification
Attention
PDF
-
Textural-Structural Joint Learning for No-Reference Super-Resolution Image Quality Assessment
Yuqing Liu, Qi Jia, Shanshe Wang, Siwei Ma, Wen Gao
arXiv_CV
arXiv_CV
Super_Resolution
Pose
Relation
Attention
Prediction
QA
PDF
-
X-ViT: High Performance Linear Vision Transformer without Softmax
Jeonggeun Song, Heung-Chang Lee
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Image_Classification
Prediction
PDF
-
Text-Based Automatic Personality Prediction Using KGrAt-Net; A Knowledge Graph Attention Network Classifier
Majid Ramezani, Mohammad-Reza Feizi-Derakhshi, Mohammad-Ali Balafar
arXiv_CL
arXiv_CL
Embedding
Knowledge
Knowledge_Graph
Pose
Classification
Relation
Attention
GAN
Prediction
PDF
-
Attention Awareness Multiple Instance Neural Network
Jingjun Yi, Beichen Zhou
arXiv_CV
arXiv_CV
Recognition
Pose
Classification
Attention
PDF
-
Image Reconstruction of Multi Branch Feature Multiplexing Fusion Network with Mixed Multi-layer Attention
Yuxi Cai, Huicheng Lai
arXiv_CV
arXiv_CV
Reconstruction
Super_Resolution
Restoration
Pose
Attention
CNN
PDF
-
Understanding Long Programming Languages with Structure-Aware Sparse Attention
Tingting Liu, Chengyu Wang, Cen Chen, Ming Gao, Aoying Zhou
arXiv_AI
arXiv_AI
Transformer
Bert
Sparse
Relation
Attention
Language_Model
PDF
-
FedFormer: Contextual Federation with Attention in Reinforcement Learning
Liam Hebert, Lukasz Golab, Pascal Poupart, Robin Cohen
arXiv_AI
arXiv_AI
Transformer
Embedding
Reinforcement_Learning
Pose
Relation
Attention
PDF
-
Reinforcement Learning Approach for Mapping Applications to Dataflow-Based Coarse-Grained Reconfigurable Array
Andre Xian Ming Chang, Parth Khopkar, Bashar Romanous, Abhishek Chaurasia, Patrick Estep, Skyler Windh, Doug Vanesko, Sheik Dawood Beer Mohideen, Eugenio Culurciello
arXiv_AI
arXiv_AI
Embedding
Reinforcement_Learning
Optimization
Sparse
Knowledge
Pose
Attention
PDF
-
Transformer for Partial Differential Equations' Operator Learning
Zijie Li, Kazem Meidani, Amir Barati Farimani
arXiv_AI
arXiv_AI
Transformer
Pose
Deep_Learning
Relation
Attention
CNN
PDF
-
Fairness in Recommendation: A Survey
Yunqi Li, Hanxiong Chen, Shuyuan Xu, Yingqiang Ge, Juntao Tan, Yongfeng Zhang
arXiv_AI
arXiv_AI
Survey
Action
Classification
Attention
GAN
Recommendation
PDF
-
Revealing the Dark Secrets of Masked Image Modeling
Zhenda Xie, Zigang Geng, Jingcheng Hu, Zheng Zhang, Han Hu, Yue Cao
arXiv_AI
arXiv_AI
Transformer
Tracking
Object_Tracking
Pose_Estimation
Pose
Classification
Attention
PDF
-
Dynamically Relative Position Encoding-Based Transformer for Automatic Code Edit
Shiyi Qi, Yaoxian Li, Cuiyun Gao, Xiaohong Su, Shuzheng Gao, Zibin Zheng, Chuanyi Liu
arXiv_CL
arXiv_CL
Transformer
Pose
NMT
Deep_Learning
Detection
Attention
PDF
-
Green Hierarchical Vision Transformer for Masked Image Modeling
Lang Huang, Shan You, Mingkai Zheng, Fei Wang, Chen Qian, Toshihiko Yamasaki
arXiv_CV
arXiv_CV
Transformer
Classification
Detection
Object_Detection
Attention
PDF
-
Are Transformers Effective for Time Series Forecasting?
Ailing Zeng, Muxi Chen, Lei Zhang, Qiang Xu
arXiv_AI
arXiv_AI
Transformer
Pose
Action
Detection
Relation
Relation_Extraction
Attention
Prediction
PDF
-
SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation
Ziyi Wang, Yongming Rao, Xumin Yu, Jie Zhou, Jiwen Lu
arXiv_AI
arXiv_AI
Transformer
Segmentation
Semantic_Segmentation
Point_Cloud
3D
Knowledge
Pose
Quantitative
Attention
PDF
-
Learning to Reconstruct Missing Data from Spatiotemporal Graphs with Sparse Observations
Ivan Marisca, Andrea Cini, Cesare Alippi
arXiv_AI
arXiv_AI
Reconstruction
Sparse
Pose
Attention
Prediction
PDF
-
Efficient U-Transformer with Boundary-Aware Loss for Action Segmentation
Dazhao Du, Bing Su, Yu Li, Zhongang Qi, Lingyu Si, Ying Shan
arXiv_CV
arXiv_CV
Transformer
Segmentation
Pose
Action
Classification
Attention
PDF
-
On stochastic stabilization via non-smooth control Lyapunov functions
Pavel Osinenko, Grigory Yaremenko, Georgiy Malaniya
arXiv_RO
arXiv_RO
Regularization
Action
Attention
PDF
-
Your Transformer May Not be as Powerful as You Expect
Shengjie Luo, Shanda Li, Shuxin Zheng, Tie-Yan Liu, Liwei Wang, Di He
arXiv_CL
Transformer
Attention
PDF
-
Target-aware Abstractive Related Work Generation with Contrastive Learning
Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Rui Yan, Xin Gao, Xiangliang Zhang
arXiv_CL
Optimization
Pose
Contrastive_Learning
Relation
Attention
PDF
-
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Qiu-Shi Zhu, Jie Zhang, Zi-Qiang Zhang, Li-Rong Dai
arXiv_SD
Enhancement
Recognition
Speech
Self-Supervised
Pose
Attention
Speech_Recognition
PDF
-
Objects Matter: Learning Object Relation Graph for Robust Camera Relocalization
Chengyu Qiao, Zhiyu Xiang, Xinglu Wang
arXiv_CV
Pose
Deep_Learning
Relation
Attention
PDF
-
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Bruno Sauvalle, Arnaud de La Fortelle
arXiv_CV
Transformer
Reconstruction
Segmentation
Unsupervised
Represenation_Learning
Detection
Object_Detection
Attention
PDF
-
MemeTector: Enforcing deep focus for meme detection
Christos Koutlis, Manos Schinas, Symeon Papadopoulos
arXiv_CV
Speech
Pose
Detection
Attention
Prediction
PDF
-
Denial-of-Service Attacks on Learned Image Compression
Kang Liu, Di Wu, Yiru Wang, Dan Feng, Benjamin Tan, Siddharth Garg
arXiv_AI
Reconstruction
Adversarial
Pose
Deep_Learning
Attention
Image_Compression
PDF
-
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification
Nan Zhang, Jianzong Wang, Zhenhou Hong, Chendong Zhao, Xiaoyang Qu, Jing Xiao
arXiv_SD
Transformer
Embedding
Speech
Pose
Attention
PDF
-
Interpretable travel distance on the county-wise COVID-19 by sequence to sequence with attention
Ting Tian, Yukang Jiang, Huajun Xie, Xueqin Wang, Hailiang Guo
arXiv_AI
Relation
Attention
Prediction
PDF
-
Fast Vision Transformers with HiLo Attention
Zizheng Pan, Jianfei Cai, Bohan Zhuang
arXiv_AI
Transformer
Segmentation
Pose
Classification
Detection
Relation
Attention
Image_Classification
PDF
-
Decoupled Pyramid Correlation Network for Liver Tumor Segmentation from CT images
Yao Zhang, Jiawei Yang, Yang Liu, Jiang Tian, Siyun Wang, Cheng Zhong, Zhongchao Shi, Yang Zhang, Zhiqiang He
arXiv_CV
Segmentation
Pose
Face
Relation
Attention
Medical
CNN
PDF
-
Other Roles Matter! Enhancing Role-Oriented Dialogue Summarization via Role Interactions
Haitao Lin, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong
arXiv_CL
Pose
Action
Attention
Summarization
PDF
-
Transferable Adversarial Attack based on Integrated Gradients
Yi Huang, Adams Wai-Kin Kong
arXiv_CV
Adversarial
Pose
Face
Attention
PDF
-
Exploring Map-based Features for Efficient Attention-based Vehicle Motion Prediction
Carlos Gómez-Huélamo, Marcos V. Conde, Miguel Ortiz
arXiv_CV
RNN
Attention
Autonomous
Prediction
PDF
-
TSEM: Temporally Weighted Spatiotemporal Explainable Neural Network for Multivariate Time Series
Anh-Duy Pham, Anastassia Kuestenmacher, Paul G. Ploeger
arXiv_AI
RNN
Deep_Learning
Attention
PDF
-
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling
Kaitao Song, Yichong Leng, Xu Tan, Yicheng Zou, Tao Qin, Dongsheng Li
arXiv_CL
Transformer
Bert
Pose
Attention
Language_Model
PDF
-
Inception Transformer
Chenyang Si, Weihao Yu, Pan Zhou, Yichen Zhou, Xinchao Wang, Shuicheng Yan
arXiv_AI
Transformer
Segmentation
Pose
Classification
Detection
Attention
Image_Classification
PDF
-
Domain Adaptation for Object Detection using SE Adaptors and Center Loss
Sushruth Nagesh, Shreyas Rajesh, Asfiya Baig, Savitha Srinivasan
arXiv_CV
Unsupervised
Regularization
Knowledge
Detection
Object_Detection
Attention
PDF
-
On the solvability of weakly linear systems of fuzzy relation equations
Stefan Stanimirovic, Ivana Micic
arXiv_AI
Relation
Attention
PDF
-
You Need to Read Again: Multi-granularity Perception Network for Moment Retrieval in Videos
Xin Sun, Xuan Wang, Jialin Gao, Qiong Liu, Xi Zhou
arXiv_AI
Pose
Action
Attention
Caption
Activity
PDF
-
Structure Unbiased Adversarial Model for Medical Image Segmentation
Tianyang Zhang, Shaoming Zheng, Jun Cheng, Xi Jia, Joseph Bartlett, Huazhu Fu, Zhaowen Qiu, Jiang Liu, Jinming Duan
arXiv_CV
Segmentation
Recognition
Adversarial
Pose
Attention
Medical
Prediction
PDF
-
AO2-DETR: Arbitrary-Oriented Object Detection Transformer
Linhui Dai, Hong Liu, Hao Tang, Zhiwei Wu, Pinhao Song
arXiv_CV
Transformer
Pose
Detection
Object_Detection
Attention
Prediction
Matching
PDF
-
Evaluating Inclusivity, Equity, and Accessibility of NLP Technology: A Case Study for Indian Languages
Simran Khanuja, Sebastian Ruder, Partha Talukdar
arXiv_CL
Pose
Attention
PDF
-
MoCoViT: Mobile Convolutional Vision Transformer
Hailong Ma, Xin Xia, Xing Wang, Xuefeng Xiao, Jiashi Li, Min Zheng
arXiv_CV
Transformer
Pose
Classification
Detection
Object_Detection
Attention
CNN
PDF
-
Guiding Visual Question Answering with Attention Priors
Thao Minh Le, Vuong Le, Sunil Gupta, Svetha Venkatesh, Truyen Tran
arXiv_CV
Enhancement
Sparse
Knowledge
Pose
VQA
Attention
Inference
QA
PDF
-
VTP: Volumetric Transformer for Multi-view Multi-person 3D Pose Estimation
Yuxing Chen, Renshu Gu, Ouhan Huang, Gangyong Jia
arXiv_CV
Transformer
Embedding
3D
Sparse
Pose_Estimation
Pose
Relation
Attention
CNN
PDF
-
Memorization in NLP Fine-tuning Methods
Fatemehsadat Mireshghallah, Archit Uniyal, Tianhao Wang, David Evans, Taylor Berg-Kirkpatrick
arXiv_CL
Action
Attention
Inference
Language_Model
PDF
-
Leveraging Locality in Abstractive Text Summarization
Yixin Liu, Ansong Ni, Linyong Nan, Budhaditya Deb, Chenguang Zhu, Ahmed H. Awadallah, Dragomir Radev
arXiv_CL
Attention
Summarization
PDF
-
Eye-gaze-guided Vision Transformer for Rectifying Shortcut Learning
Chong Ma, Lin Zhao, Yuzhong Chen, Lu Zhang, Zhenxiang Xiao, Haixing Dai, David Liu, Zihao Wu, Zhengliang Liu, Sheng Wang, Jiaxing Gao, Changhe Li, Xi Jiang, Tuo Zhang, Qian Wang, Dinggang Shen, Dajiang Zhu, Tianming Liu
arXiv_CV
Transformer
Knowledge
Pose
Relation
Attention
Medical
PDF
-
A Lightweight NMS-free Framework for Real-time Visual Fault Detection System of Freight Trains
Guodong Sun, Yang Zhou, Huilin Pan, Bo Wu, Ye Hu, Yang Zhang
arXiv_CV
Enhancement
Pose
Action
Classification
Detection
Object_Detection
Attention
CNN
PDF
-
Region-aware Knowledge Distillation for Efficient Image-to-Image Translation
Linfeng Zhang, Xin Chen, Runpei Dong, Kaisheng Ma
arXiv_CV
Knowledge
Adversarial
Pose
Contrastive_Learning
Classification
Attention
GAN
Image_Classification
PDF
-
Recipe2Vec: Multi-modal Recipe Representation Learning with Graph Neural Networks
Yijun Tian, Chuxu Zhang, Zhichun Guo, Yihong Ma, Ronald Metoyer, Nitesh V. Chawla
arXiv_CL
Embedding
Represenation_Learning
Adversarial
Pose
Classification
Relation
Attention
Prediction
PDF
-
MaskEval: Weighted MLM-Based Evaluation for Text Summarization and Simplification
Yu Lu Liu, Rachel Bawden, Thomas Scialom, Benoît Sagot, Jackie Chi Kit Cheung
arXiv_CL
Relation
Attention
Summarization
Language_Model
PDF
-
Face2Text revisited: Improved data set and baseline results
Marc Tanti, Shaun Abdilla, Adrian Muscat, Claudia Borg, Reuben A. Farrugia, Albert Gatt
arXiv_CV
Image_Caption
Transfer_Learning
RNN
Face
Attention
PDF
-
Adaptive multilingual speech recognition with pretrained models
Ngoc-Quan Pham, Alex Waibel, Jan Niehues
arXiv_CL
Unsupervised
Recognition
Knowledge
Speech
Attention
Speech_Recognition
PDF
-
OnePose: One-Shot Object Pose Estimation without CAD Models
Jiaming Sun, Zihao Wang, Siyu Zhang, Xingyi He, Hongcheng Zhao, Guofeng Zhang, Xiaowei Zhou
arXiv_CV
3D
Sparse
Pose_Estimation
Pose
Attention
Matching
PDF
-
ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions
Difan Liu, Sandesh Shetty, Tobias Hinz, Matthew Fisher, Richard Zhang, Taesung Park, Evangelos Kalogerakis
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Action
Quantitative
Attention
PDF
-
Aerial Vision-and-Dialog Navigation
Yue Fan, Winson Chen, Tongzhou Jiang, Chun Zhou, Yi Zhang, Xin Eric Wang
arXiv_AI
Pose
Attention
Drone
PDF
-
Learning for Expressive Task-Related Sentence Representations
Xueying Bai, Jinghuan Shang, Yifan Sun, Niranjan Balasubramanian
arXiv_AI
Regularization
Pose
Classification
Attention
Language_Model
Prediction
PDF
-
Context Attention Network for Skeleton Extraction
Zixuan Huang, Yunfeng Wang, Zhiwen Chen, Xin Gao, Ruili Feng, Xiaobo Li
arXiv_CV
Pose
Action
Attention
PDF
-
VLCDoC: Vision-Language Contrastive Pre-Training Model for Cross-Modal Document Classification
Souhail Bakkali, Zuheng Ming, Mickael Coustaty, Marçal Rusiñol, Oriol Ramos Terrades
arXiv_CV
Pose
Action
Classification
Relation
Attention
PDF
-
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, Hehong Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou
arXiv_CV
Image_Caption
Zero-Shot
VQA
Attention
Caption
PDF
-
A Wireless-Vision Dataset for Privacy Preserving Human Activity Recognition
Yanling Hao, Zhiyuan Shi, Yuanwei Liu
arXiv_CV
Segmentation
Recognition
Pose
Action
Deep_Learning
Attention
Activity
PDF
-
Attentional Mixtures of Soft Prompt Tuning for Parameter-efficient Multi-task Knowledge Sharing
Akari Asai, Mohammadreza Salehi, Matthew E. Peters, Hannaneh Hajishirzi
arXiv_CL
Knowledge
Attention
Language_Model
PDF
-
GraSens: A Gabor Residual Anti-aliasing Sensing Framework for Action Recognition using WiFi
Yanling Hao, Zhiyuan Shi, Xidong Mu, Yuanwei Liu
arXiv_CV
Recognition
Pose
Action_Recognition
Action
Attention
PDF
-
Image Trinarization Using a Partial Differential Equations: A Novel Approach to Automatic Sperm Image Analysis
B. A. Jacobs
arXiv_CV
Segmentation
Pose
Attention
PDF
-
Collaborative 3D Object Detection for Automatic Vehicle Systems via Learnable Communications
Junyong Wang, Yuan Zeng, Yi Gong
arXiv_AI
Point_Cloud
3D
Pose
Detection
Object_Detection
Attention
Autonomous
PDF
-
CDFKD-MFS: Collaborative Data-free Knowledge Distillation via Multi-level Feature Sharing
Zhiwei Hao, Yong Luo, Zhi Wang, Han Hu, Jianping An
arXiv_AI
Knowledge
Adversarial
Pose
Attention
Prediction
PDF
-
DistillAdapt: Source-Free Active Visual Domain Adaptation
Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha
arXiv_CV
Segmentation
Classification
Detection
Attention
PDF
-
AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition
Mingzhe Sui, Hanting Li, Zhaoqing Zhu, Feng Zhao
arXiv_AI
Recognition
3D
Salient
Knowledge
Pose
Face
Deep_Learning
Attention
CNN
PDF
-
LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping
Andrzej Reinke, Matteo Palieri, Benjamin Morrell, Yun Chang, Kamak Ebadi, Luca Carlone, Ali-akbar Agha-mohammadi
arXiv_RO
Point_Cloud
3D
Pose
Attention
Autonomous
PDF
-
Graph Neural Networks Intersect Probabilistic Graphical Models: A Survey
Chenqing Hua
arXiv_AI
Survey
Relation
Attention
Inference
Prediction
PDF
-
Single-View View Synthesis in the Wild with Learned Adaptive Multiplane Images
Yuxuan Han, Ruicheng Wang, Jiaolong Yang
arXiv_CV
Inpainting
3D
Pose
Action
Attention
Prediction
PDF
-
On the Role of Bidirectionality in Language Model Pre-Training
Mikel Artetxe, Jingfei Du, Naman Goyal, Luke Zettlemoyer, Ves Stoyanov
arXiv_AI
Bert
Zero-Shot
Pose
Attention
Language_Model
Prediction
PDF
-
Semi-Parametric Deep Neural Networks in Linear Time and Memory
Richa Rastogi, Yuntian Deng, Ian Lee, Mert R. Sabuncu, Volodymyr Kuleshov
arXiv_CV
Pose
Deep_Learning
Attention
Inference
PDF
-
High-Order Pooling for Graph Neural Networks with Tensor Decomposition
Chenqing Hua, Guillaume Rabusseau, Jian Tang
arXiv_AI
Pose
Action
Classification
Attention
PDF
-
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization
Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W Black, Ana Marasovic
arXiv_CV
Image_Caption
Action
VQA
Attention
Text_Generation
Caption
Language_Model
QA
PDF
-
TransforMatcher: Match-to-Match Attention for Semantic Correspondence
Seungwook Kim, Juhong Min, Minsu Cho
arXiv_CV
Transformer
Pose
Action
Relation
Attention
Matching
PDF
-
Conditional Supervised Contrastive Learning for Fair Text Classification
Jianfeng Chi, William Shand, Yaodong Yu, Kai-Wei Chang, Han Zhao, Yuan Tian
arXiv_AI
Text_Classification
Represenation_Learning
Pose
Contrastive_Learning
Classification
Attention
PDF
-
SiPRNet: End-to-End Learning for Single-Shot Phase Retrieval
Qiuliang Ye, Li-Wen Wang, Daniel P.K. Lun
arXiv_CV
Reconstruction
Optimization
Pose
Deep_Learning
Attention
CNN
PDF
-
Enhanced Prototypical Learning for Unsupervised Domain Adaptation in LiDAR Semantic Segmentation
Eojindl Yi, Juyoung Yang, Junmo Kim
arXiv_CV
Reconstruction
Segmentation
Unsupervised
Semantic_Segmentation
3D
Pose
Attention
Inference
PDF
-
Multi-Temporal Spatial-Spectral Comparison Network for Hyperspectral Anomalous Change Detection
Meiqi Hu, Chen Wu, Bo Du
arXiv_CV