Video_Classification
Video_Classification
-
Augmenting Ego-Vehicle for Traffic Near-Miss and Accident Classification Dataset using Manipulating Conditional Style Translation
Hilmil Pradana, Minh-Son Dao, Koji Zettsu
arXiv_CV
arXiv_CV
Unsupervised
3D
Pose
Classification
CNN
Video_Classification
PDF
-
Truncate-Split-Contrast: A Framework for Learning from Mislabeled Videos
Wang Zixiao, Weng Junwu, Yuan Chun, Wang Jue
arXiv_CV
arXiv_CV
Pose
Contrastive_Learning
Classification
Detection
Relation
Video_Classification
PDF
-
Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Shen Yan, Tao Zhu, Zirui Wang, Yuan Cao, Mi Zhang, Soham Ghosh, Yonghui Wu, Jiahui Yu
arXiv_CV
arXiv_CV
Embedding
Video_Caption
Zero-Shot
Classification
VQA
Attention
Caption
Activity
QA
Video_Retrieval
Video_Classification
PDF
-
Evaluation of FEM and MLFEM AI-explainers in Image Classification tasks with reference-based and no-reference metrics
A. Zhukov, J. Benois-Pineau, R. Giot
arXiv_CV
arXiv_CV
Pose
Classification
Relation
Image_Classification
Video_Classification
PDF
-
A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition
Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang
arXiv_CV
arXiv_CV
Recognition
Optimization
Represenation_Learning
Regularization
Knowledge
Pose
Face
Classification
Video_Classification
PDF
-
Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Hao Tang, Lei Ding, Songsong Wu, Bin Ren, Nicu Sebe, Paolo Rota
arXiv_CV
arXiv_CV
Unsupervised
RNN
Pose
Action
Classification
CNN
Video_Classification
PDF
-
Transfer-learning for video classification: Video Swin Transformer on multiple domains
Daniel Oliveira, David Martins de Matos
arXiv_AI
arXiv_AI
Transformer
Transfer_Learning
Action
Classification
CNN
Video_Classification
PDF
-
Linear Video Transformer with Feature Fixation
Kaiyue Lu, Zexiang Liu, Jianyuan Wang, Weixuan Sun, Zhen Qin, Dong Li, Xuyang Shen, Hui Deng, Xiaodong Han, Yuchao Dai, Yiran Zhong
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Video_Classification
PDF
-
Overlooked Video Classification in Weakly Supervised Video Anomaly Detection
Weijun Tan, Qi Yao, Jingfeng Liu
arXiv_CV
arXiv_CV
Bert
Weakly_Supervised
RNN
Classification
Detection
Video_Classification
PDF
-
S4ND: Modeling Images and Videos as Multidimensional Signals Using State Spaces
Eric Nguyen, Karan Goel, Albert Gu, Gordon W. Downs, Preey Shah, Tri Dao, Stephen A. Baccus, Christopher Ré
arXiv_CV
arXiv_CV
Transformer
Zero-Shot
Pose
Classification
Attention
Activity
CNN
Video_Classification
PDF
-
TAD: A Large-Scale Benchmark for Traffic Accidents Detection from Video Surveillance
Yajun Xu, Chuwen Huang, Yibing Nan, Shiguo Lian
arXiv_CV
arXiv_CV
Surveillance
Pose
Classification
Detection
Object_Detection
Image_Classification
Autonomous
Prediction
Video_Classification
PDF
-
FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification
Pu Jin, Lichao Mou, Yuansheng Hua, Gui-Song Xia, Xiao Xiang Zhu
arXiv_CV
arXiv_CV
Recognition
Pose
Action_Recognition
Action
Classification
Relation
Drone
Video_Classification
PDF
-
Traffic Congestion Prediction using Deep Convolutional Neural Networks: A Color-coding Approach
Mirza Fuad Adnan, Nadim Ahmed, Imrez Ishraque, Md. Sifath Al Amin, Md. Sumit Hasan
arXiv_CV
arXiv_CV
Pose
Classification
Detection
CNN
Prediction
Video_Classification
PDF
-
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition
Farrukh Rahman, Ömer Mubarek, Zsolt Kira
arXiv_CV
arXiv_CV
Transformer
Recognition
Classification
Image_Classification
Recommendation
Video_Classification
PDF
-
UAV-CROWD: Violent and non-violent crowd activity simulator from the perspective of UAV
Mahieyin Rahmun, Tonmoay Deb, Shahriar Ali Bijoy, Mayamin Hamid Raha
arXiv_CV
arXiv_CV
Surveillance
Segmentation
Semantic_Segmentation
Pose
Action
Classification
Activity
Video_Classification
PDF
-
Motion Sensitive Contrastive Learning for Self-supervised Video Representation
Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang
arXiv_CV
arXiv_CV
Video_Caption
3D
Represenation_Learning
Self-Supervised
Pose
Contrastive_Learning
Classification
Optical_Flow
Video_Retrieval
Video_Classification
PDF
-
Two-Stream Transformer Architecture for Long Video Understanding
Edward Fish, Jon Weinbren, Andrew Gilbert
arXiv_CV
arXiv_CV
Transformer
Recognition
Video_Caption
Pose
Action_Recognition
Action
Classification
Attention
Video_Classification
PDF
-
Visually explaining 3D-CNN predictions for video classification with an adaptive occlusion sensitivity analysis
Tomoki Uchiyama, Naoya Sogi, Koichiro Niinuma, Kazuhiro Fukui
arXiv_CV
arXiv_CV
3D
Pose
Classification
CNN
Image_Classification
Optical_Flow
Prediction
Video_Classification
PDF
-
$textbf{P$^2$A}$: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos
Jiang Bian, Qingzhong Wang, Haoyi Xiong, Jun Huang, Chen Liu, Xuhong Li, Jun Cheng, Jun Zhao, Feixiang Lu, Dejing Dou
arXiv_CV
arXiv_CV
Transformer
Recognition
Action_Localization
Action_Recognition
Action
Classification
Deep_Learning
Detection
Video_Classification
PDF
-
Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning
Arslan Syed, Eman A. Aldhahri, Muhammad Munawar Iqbal, Abid Ali, Ammar Muthanna, Harun Jamil, Faisal Jamil
arXiv_AI
arXiv_AI
Recognition
3D
Knowledge
Pose
Face
Action_Recognition
Action
Classification
Deep_Learning
CNN
Video_Classification
PDF
-
On Higher Adversarial Susceptibility of Contrastive Self-Supervised Learning
Rohit Gupta, Naveed Akhtar, Ajmal Mian, Mubarak Shah
arXiv_CV
arXiv_CV
Adversarial
Self-Supervised
Classification
Video_Classification
PDF
-
Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments
Khoi D. Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, Rang Nguyen
arXiv_CV
arXiv_CV
Knowledge
Classification
Few-Shot
Video_Classification
Matching
PDF
-
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie
arXiv_CV
arXiv_CV
Transformer
Classification
Video_Classification
PDF
-
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition
Boyang Xia, Wenhao Wu, Haoran Wang, Rui Su, Dongliang He, Haosen Yang, Xiaoran Fan, Wanli Ouyang
arXiv_CV
arXiv_CV
Recognition
Salient
Review
Pose
Classification
Attention
Inference
Video_Classification
PDF
-
GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Huseyin Coskun, Alireza Zareian, Joshua L. Moore, Federico Tombari, Chen Wang
arXiv_CV
arXiv_CV
Unsupervised
Recognition
Represenation_Learning
Regularization
Self-Supervised
Pose
Action_Recognition
Action
Classification
Optical_Flow
Video_Retrieval
Video_Classification
PDF
-
Temporal and cross-modal attention for audio-visual zero-shot learning
Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata
arXiv_CV
arXiv_CV
Zero-Shot
Pose
Classification
Relation
Attention
Activity
Video_Classification
PDF
-
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian, Yeqing Li, Zheng Xu, Ming-Hsuan Yang, Serge Belongie, Yin Cui
arXiv_CV
arXiv_CV
Recognition
Zero-Shot
Classification
Optical_Flow
Language_Model
Video_Classification
PDF
-
Long-term Leap Attention, Short-term Periodic Shift for Video Classification
Hao Zhang, Lechao Cheng, Yanbin Hao, Chong-Wah Ngo
arXiv_CV
arXiv_CV
Transformer
Pose
Classification
Attention
Video_Classification
PDF
-
Beyond Transfer Learning: Co-finetuning for Action Localisation
Anurag Arnab, Xuehan Xiong, Alexey Gritsenko, Rob Romijnders, Josip Djolonga, Mostafa Dehghani, Chen Sun, Mario Lučić, Cordelia Schmid
arXiv_CV
arXiv_CV
Transfer_Learning
Pose
Action
Classification
Video_Classification
PDF
-
DAiSEE: Towards User Engagement Recognition in the Wild
Abhay Gupta, Arjun D'Cunha, Kamal Awasthi, Vineeth Balasubramanian
arXiv_CV
arXiv_CV
Recognition
Action
Classification
Inference
Video_Classification
PDF
-
Task-agnostic Defense against Adversarial Patch Attacks
Ke Xu, Yao Xiao, Zhaoheng Zheng, Kaijie Cai, Ram Nevatia
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Adversarial
Classification
Detection
Object_Detection
Image_Classification
Video_Classification
PDF
-
Transferring Textual Knowledge for Visual Recognition
Wenhao Wu, Zhun Sun, Wanli Ouyang
arXiv_CV
arXiv_CV
Transformer
Recognition
Knowledge
Classification
Language_Model
Video_Classification
PDF
-
Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos
Yuchen Wang, Zhongyu Li, Xiangxiang Cui, Liangliang Zhang, Xiang Luo, Meng Yang, Shi Chang
arXiv_CV
arXiv_CV
Recognition
Pose
Classification
Deep_Learning
Detection
Attention
Video_Classification
PDF
-
Automatic Concept Extraction for Concept Bottleneck-based Video Classification
Jeya Vikranth Jeyakumar, Luke Dickens, Luis Garcia, Yu-Hsi Cheng, Diego Ramirez Echavarria, Joseph Noor, Alessandra Russo, Lance Kaplan, Erik Blasch, Mani Srivastava
arXiv_CV
arXiv_CV
Pose
Action
Classification
Deep_Learning
Relation
Video_Classification
PDF
-
Analysis and Extensions of Adversarial Training for Video Classification
Kaleab A. Kinfu, René Vidal
arXiv_AI
arXiv_AI
Adversarial
Pose
Classification
Denoising
Image_Classification
Video_Classification
PDF
-
Learning Muti-expert Distribution Calibration for Long-tailed Video Classification
Yufan Hu, Junyu Gao, Changsheng Xu
arXiv_CV
arXiv_CV
Knowledge
Pose
Classification
Image_Classification
Video_Classification
PDF
-
Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization
Qinying Liu, Zilei Wang, Ruoxi Chen, Zhilin Li
arXiv_CV
arXiv_CV
Regularization
Action_Localization
Pose
Action
Classification
Attention
Prediction
Video_Classification
PDF
-
Attention in Attention: Modeling Context Correlation for Efficient Video Classification
Yanbin Hao, Shuo Wang, Pei Cao, Xinjian Gao, Tong Xu, Jinmeng Wu, Xiangnan He
arXiv_CV
arXiv_CV
Pose
Classification
Relation
Attention
Video_Classification
PDF
-
Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation
Xiyu Wang, Yuecong Xu, Kezhi Mao, Jianfei Yang
arXiv_CV
arXiv_CV
Unsupervised
Adversarial
Pose
Action
Classification
Attention
Prediction
Video_Classification
PDF
-
SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition
Victor Escorcia, Ricardo Guerrero, Xiatian Zhu, Brais Martinez
arXiv_AI
arXiv_AI
Recognition
Self-Supervised
Action_Recognition
Action
Classification
Detection
Relation
Object_Detection
Video_Classification
PDF
-
Long Movie Clip Classification with State-Space Video Models
Md Mohaiminul Islam, Gedas Bertasius
arXiv_CV
arXiv_CV
Transformer
Recognition
Pose
Action
Classification
Attention
Activity
Video_Classification
PDF
-
StyleFool: Fooling Video Classification Systems via Style Transfer
Yuxin Cao, Xi Xiao, Ruoxi Sun, Derui Wang, Minhui Xue, Sheng Wen
arXiv_CV
arXiv_CV
Style_Transfer
Adversarial
Pose
Classification
Denoising
Video_Classification
PDF
-
Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
Shi Pu, Kaili Zhao, Mao Zheng
arXiv_CV
arXiv_CV
Zero-Shot
Represenation_Learning
Pose
Classification
Video_Classification
PDF
-
Unsupervised Pre-training for Temporal Action Localization Tasks
Can Zhang, Tianyu Yang, Junwu Weng, Meng Cao, Jue Wang, Yuexian Zou
arXiv_CV
arXiv_CV
Transformer
Unsupervised
Represenation_Learning
Self-Supervised
Action_Localization
Pose
Contrastive_Learning
Action
Classification
Video_Classification
PDF
-
Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification
Sohini Roychowdhury
arXiv_CV
arXiv_CV
Pose
Classification
Deep_Learning
Detection
Object_Detection
Video_Classification
PDF
-
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang, Licheng Jiao, Xu Liu, Fang Liu, Shuyuan Yang, Zhixi Feng, Xu Tang
arXiv_CV
arXiv_CV
Transformer
Segmentation
Tracking
Object_Tracking
Review
Classification
Detection
Object_Detection
Attention
Image_Classification
Video_Classification
PDF
-
Adversarial Attacks on Deep Learning-based Video Compression and Classification Systems
Jung-Woo Chang, Mojan Javaheripi, Seira Hidano, Farinaz Koushanfar
arXiv_CV
arXiv_CV
Adversarial
Pose
Classification
Deep_Learning
Relation
Denoising
Video_Classification
PDF
-
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
Shiwen Zhang
arXiv_CV
arXiv_CV
Transformer
3D
RNN
Pose
Classification
Video_Classification
PDF
-
Recent Trends in 2D Object Detection and Applications in Video Event Recognition
Prithwish Jana, Partha Pratim Mohanta
arXiv_CV
arXiv_CV
Transformer
Recognition
Classification
Deep_Learning
Detection
Object_Detection
Video_Classification
PDF
-
A Dataset for Medical Instructional Video Classification and Question Answering
Deepak Gupta, Kush Attal, Dina Demner-Fushman
arXiv_CV
arXiv_CV
Pose
Classification
Medical
QA
Video_Classification
PDF
-
Learning To Recognize Procedural Activities with Distant Supervision
Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani
arXiv_CV
arXiv_CV
Recognition
Knowledge
Speech
Pose
Action
Classification
Language_Model
Video_Classification
PDF
-
Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition
Kiyoon Kim, Shreyank N Gowda, Oisin Mac Aodha, Laura Sevilla-Lara
arXiv_CV
arXiv_CV
Recognition
Video_Caption
3D
Pose
Action_Recognition
Action
Classification
Optical_Flow
Video_Classification
PDF
-
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Recognition
Represenation_Learning
Pose_Estimation
Pose
Classification
Detection
Relation
Object_Detection
Attention
CNN
Prediction
Video_Classification
PDF
-
Video Summarization Based on Video-text Representation
Li Haopeng, Ke Qiuhong, Gong Mingming, Zhang Rui
arXiv_CV
arXiv_CV
Self-Supervised
Pose
Classification
Relation
Summarization
Video_Classification
PDF
-
STAF: A Spatio-Temporal Attention Fusion Network for Few-shot Video Classification
Rex Liu, Huanle Zhang, Hamed Pirsiavash, Xin Liu
arXiv_CV
arXiv_CV
Embedding
3D
Pose
Classification
Few-Shot
Attention
CNN
Video_Classification
PDF
-
Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer
arXiv_CV
arXiv_CV
Transformer
Embedding
Recognition
Pose
Classification
Detection
Object_Detection
Attention
Video_Classification
PDF
-
PreViTS: Contrastive Pretraining with Video Tracking Supervision
Brian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik
arXiv_CV
arXiv_CV
Tracking
Unsupervised
Recognition
Self-Supervised
Pose
Action
Classification
Attention
Video_Classification
PDF
-
MorphMLP: A Self-Attention Free, MLP-Like Backbone for Image and Video
David Junhao Zhang, Kunchang Li, Yunpeng Chen, Yali Wang, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou
arXiv_CV
arXiv_CV
Transformer
Segmentation
Semantic_Segmentation
Knowledge
Pose
Classification
Attention
Image_Classification
Video_Classification
PDF
-
Unsupervised Action Localization Crop in Video Retargeting for 3D ConvNets
Prithwish Jana, Swarnabja Bhaumik, Partha Pratim Mohanta
arXiv_CV
arXiv_CV
Surveillance
Unsupervised
3D
Salient
Action_Localization
Pose
Action
Classification
Activity
Video_Classification
PDF
-
Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing
Xuanhan Wang, Xiaojia Chen, Lianli Gao, Lechao Chen, Jingkuan Song
arXiv_CV
arXiv_CV
Recognition
Pose
Face
Action_Recognition
Action
Classification
Detection
Object_Detection
Video_Classification
PDF
-
AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling
Alexandros Stergiou, Ronald Poppe
arXiv_CV
arXiv_CV
Super_Resolution
Pose
Classification
Detection
Object_Detection
CNN
Video_Classification
PDF
-
A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark
Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu
arXiv_CV
arXiv_CV
Recognition
Represenation_Learning
Pose
Action
Classification
Relation
Few-Shot
Video_Classification
PDF
-
Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations
Shasha Li, Abhishek Aich, Shitong Zhu, M. Salman Asif, Chengyu Song, Amit K. Roy-Chowdhury, Srikanth Krishnamurthy
arXiv_CV
arXiv_CV
Adversarial
Pose
Classification
Image_Classification
Video_Classification
PDF
-
Predicting Driver Self-Reported Stress by Analyzing the Road Scene
Cristina Bustos, Neska Elhaouij, Albert Sole-Ribalta, Javier Borge-Holthoefer, Agata Lapedriza, Rosalind Picard
arXiv_CV
arXiv_CV
Segmentation
Recognition
Pose
Classification
Image_Classification
Video_Classification
PDF
-
Overview of Tencent Multi-modal Ads Video Understanding Challenge
Zhenzhi Wang, Liyu Wu, Zhimin Li, Jiangfeng Xiong, Qinglin Lu
arXiv_CV
arXiv_CV
Segmentation
Video_Caption
Pose
Classification
Recommendation
Video_Classification
PDF
-
Goal-driven text descriptions for images
Ruotian Luo
arXiv_CV
arXiv_CV
Image_Caption
Recognition
Pose_Estimation
Speech
Pose
Classification
Deep_Learning
Detection
Object_Detection
Speech_Recognition
Caption
Image_Classification
Video_Classification
PDF
-
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games
Henrik Biermann, Jonas Theiner, Manuel Bassek, Dominik Raabe, Daniel Memmert, Ralph Ewerth
arXiv_CV
arXiv_CV
3D
Pose
Classification
Detection
Video_Classification
PDF
-
Hand Hygiene Video Classification Based on Deep Learning
Rashmi Bakshi
arXiv_CV
arXiv_CV
Recognition
Gesture
Review
Classification
Deep_Learning
Prediction
Video_Classification
PDF
-
Hand Pose Classification Based on Neural Networks
Rashmi Bakshi
arXiv_CV
arXiv_CV
Transfer_Learning
Gesture
Pose
Classification
Deep_Learning
Prediction
Video_Classification
PDF
-
Two-stream Convolutional Networks for Multi-frame Face Anti-spoofing
Zhuoyi Zhang, Cheng Jiang, Xiya Zhong, Chang Song, Yifeng Zhang
arXiv_CV
arXiv_CV
Recognition
Pose
Face
Classification
Face_Recognition
CNN
Video_Classification
PDF
-
Token Shift Transformer for Video Classification
Hao Zhang, Yanbin Hao, Chong-Wah Ngo
arXiv_CV
arXiv_CV
Transformer
Video_Caption
3D
Classification
Relation
Attention
CNN
Video_Classification
PDF
-
Temporal Alignment Prediction for Few-Shot Video Classification
Fei Pan, Chunlei Xu, Jie Guo, Yanwen Guo
arXiv_CV
arXiv_CV
Pose
Classification
Few-Shot
Prediction
Video_Classification
PDF
-
Fine-Grained AutoAugmentation for Multi-label Classification
Ya Wang, Hesen Chen, Fangyi Zhang, Yaohua Wang, Xiuyu Sun, Ming Lin, Hao Li
arXiv_CV
arXiv_CV
Reinforcement_Learning
Pose
Classification
Deep_Learning
Video_Classification
PDF
-
Aligning Correlation Information for Domain Adaptation in Action Recognition
Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See
arXiv_AI
arXiv_AI
Recognition
Adversarial
Pose
Action_Recognition
Action
Classification
Relation
Video_Classification
PDF
-
Attention Bottlenecks for Multimodal Fusion
Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun
arXiv_CV
arXiv_CV
Transformer
Classification
Attention
Prediction
Video_Classification
PDF
-
When Video Classification Meets Incremental Classes
Hanbin Zhao, Xin Qin, Shihao Su, Zibo Lin, Xi Li
arXiv_CV
arXiv_CV
Knowledge
Pose
Classification
Video_Classification
PDF
-
TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification
Andrés Villa, Juan-Manuel Perez-Rua, Vladimir Araujo, Juan Carlos Niebles, Victor Escorcia, Alvaro Soto
arXiv_CV
arXiv_CV
Recognition
Pose
Action
Classification
Few-Shot
Image_Classification
Inference
Video_Classification
PDF
-
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li, Xianhang Li, Yali Wang, Jun Wang, Yu Qiao
arXiv_CV
arXiv_CV
3D
Pose
Action
Classification
Attention
CNN
Video_Classification
PDF
-
A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP
El-Assal Mireille, Tirilly Pierre, Bilasco Ioan Marius
arXiv_CV
arXiv_CV
Unsupervised
Recognition
Video_Caption
Action_Recognition
Action
Classification
CNN
Video_Classification
PDF
-
Unsupervised Action Segmentation with Self-supervised Feature Learning and Co-occurrence Parsing
Zhe Wang, Hao Chen, Xinyu Li, Chunhui Liu, Yuanjun Xiong, Joseph Tighe, Charless Fowlkes
arXiv_CV
arXiv_CV
Segmentation
Unsupervised
Self-Supervised
Action
Classification
Relation
Activity
Video_Classification
PDF
-
IntFormer: Predicting pedestrian intention with the aid of the Transformer architecture
J. Lorenzo, I. Parra, M. A. Sotelo
arXiv_AI
arXiv_AI
Transformer
Classification
CNN
Video_Classification
PDF
-
Learning Implicit Temporal Alignment for Few-shot Video Classification
Songyang Zhang, Jiale Zhou, Xuming He
arXiv_AI
arXiv_AI
Pose
Classification
Few-Shot
Video_Classification
Matching
PDF
-
VidTr: Video Transformer Without Convolutions
Xinyu Li, Yanyi Zhang, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joseph Tighe
arXiv_CV
arXiv_CV
Transformer
3D
Pose
Action
Classification
Attention
Video_Classification
PDF
-
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories
Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry Davis, Heng Wang
arXiv_CV
arXiv_CV
Recognition
Optimization
Pose
Action_Recognition
Action
Classification
Detection
Prediction
Video_Classification
PDF
-
On the Pitfalls of Learning with Limited Data: A Facial Expression Recognition Case Study
Miguel Rodríguez Santander, Juan Hernández Albarracín, Adín Ramírez Rivera
arXiv_CV
arXiv_CV
Transfer_Learning
Recognition
Classification
Deep_Learning
Video_Classification
PDF
-
ViViT: A Video Vision Transformer
Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lučić, Cordelia Schmid
arXiv_CV
arXiv_CV
Transformer
3D
Pose
Classification
CNN
Image_Classification
Video_Classification
PDF
-
Video Classification with FineCoarse Networks
Guoxi Huang, Adrian G. Bors
arXiv_CV
arXiv_CV
Embedding
Pose
Classification
Video_Classification
PDF
-
Classifying Video based on Automatic Content Detection Overview
Yilin Wang, Jiayi Ye
arXiv_CV
arXiv_CV
Review
Classification
Detection
Relation
Image_Classification
Video_Classification
PDF
-
Revisiting ResNets: Improved Training and Scaling Strategies
Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph
arXiv_CV
arXiv_CV
Self-Supervised
Classification
Video_Classification
PDF
-
All at Once Network Quantization via Collaborative Knowledge Transfer
Ximeng Sun, Rameswar Panda, Chun-Fu Chen, Naigang Wang, Bowen Pan Kailash Gopalakrishnan, Aude Oliva, Rogerio Feris, Kate Saenko
arXiv_CV
arXiv_CV
Quantization
Knowledge
Pose
Classification
Inference
Video_Classification
PDF
-
On the Post-hoc Explainability of Deep Echo State Networks for Time Series Forecasting, Image and Video Classification
Alejandro Barredo Arrieta, Sergio Gil-Lopez, Ibai Laña, Miren Nekane Bilbao, Javier Del Ser
arXiv_AI
arXiv_AI
Knowledge
Pose
Classification
Relation
Video_Classification
PDF
-
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius, Heng Wang, Lorenzo Torresani
arXiv_CV
arXiv_CV
Transformer
Recognition
Video_Caption
3D
Action_Recognition
Action
Classification
Attention
CNN
Video_Classification
PDF
-
Distribution Adaptive INT8 Quantization for Training CNNs
Kang Zhao, Sida Huang, Pan Pan, Yinghan Li, Yingya Zhang, Zhenyu Gu, Yinghui Xu
arXiv_AI
arXiv_AI
Quantization
Pose
Classification
Detection
Object_Detection
CNN
Image_Classification
Inference
Video_Classification
PDF
-
RANP: Resource Aware Neuron Pruning at Initialization for 3D CNNs
Zhiwei Xu, Thalaiyasingam Ajanthan, Vibhav Vineet, Richard Hartley
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
3D
Optimization
Classification
CNN
Video_Classification
Matching
PDF
-
Privacy-Preserving Video Classification with Convolutional Neural Networks
Sikha Pentyala, Rafael Dowsley, Martine De Cock
arXiv_CV
arXiv_CV
Recognition
Pose
Emotion
Classification
CNN
Image_Classification
Video_Classification
PDF
-
Self-Supervised Pretraining of 3D Features on any Point-Cloud
Zaiwei Zhang, Rohit Girdhar, Armand Joulin, Ishan Misra
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Recognition
Point_Cloud
3D
Self-Supervised
Classification
Detection
Object_Detection
Video_Classification
PDF
-
Deep Learning Towards Edge Computing: Neural Networks Straight from Compressed Data
Samuel Felipe dos Santos, Jurandy Almeida
arXiv_CV
arXiv_CV
Pose
Face
Classification
Deep_Learning
CNN
Image_Classification
Video_Classification
PDF
-
Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates
Feiyan Hu, Eva Mohedano, Noel O'Connor, Kevin McGuinness
arXiv_CV
arXiv_CV
Pose
Classification
Deep_Learning
Prediction
Video_Classification
PDF
-
Smoothed Gaussian Mixture Models for Video Classification and Recommendation
Sirjan Kafle, Aman Gupta, Xue Xia, Ananth Sankar, Xi Chen, Di Wen, Liang Zhang
arXiv_CV
arXiv_CV
Recognition
Pose
Action_Recognition
Action
Classification
Recommendation
Video_Classification
PDF
-
VideoMix: Rethinking Data Augmentation for Video Classification
Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Jinhyung Kim
arXiv_CV
arXiv_CV
Recognition
Action_Localization
Pose
Action_Recognition
Action
Classification
Detection
Video_Classification
PDF
-
Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Youngwan Lee, Hyung-Il Kim, Kimin Yun, Jinyoung Moon
arXiv_CV
arXiv_CV
3D
Pose
Classification
Relation
Attention
Video_Classification
PDF
-
t-EVA: Time-Efficient t-SNE Video Annotation
Soroosh Poorgholi, Osman Semih Kayhan, Jan C. van Gemert
arXiv_CV
arXiv_CV
Video_Caption
Pose
Action
Classification
Attention
Activity
Video_Classification
PDF
-
Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
Qi Kuang, Xin Jin, Qinping Zhao, Bin Zhou
arXiv_CV
arXiv_CV
Pose
Classification
Detection
Drone
Video_Classification
PDF
-
RANP: Resource Aware Neuron Pruning at Initialization for 3D CNNs
Zhiwei Xu, Thalaiyasingam Ajanthan, Vibhav Vineet, Richard Hartley
arXiv_AI
arXiv_AI
Segmentation
Semantic_Segmentation
3D
Optimization
Classification
CNN
Video_Classification
PDF
-
Attention-Aware Noisy Label Learning for Image Classification
Zhenzhen Wang, Chunyan Xu, Yap-Peng Tan, Junsong Yuan
arXiv_CV
arXiv_CV
Knowledge
Pose
Classification
Attention
CNN
Image_Classification
Video_Classification
PDF
-
Dynamic Regions Graph Neural Networks for Spatio-Temporal Reasoning
Iulia Duta, Andrei Nicolicioiu
arXiv_CV
arXiv_CV
Salient
Pose
Action
Classification
Detection
Relation
Object_Detection
Video_Classification
PDF
-
Multi-Label Activity Recognition using Activity-specific Features
Yanyi Zhang, Xinyu Li, Ivan Marsic
arXiv_CV
arXiv_CV
Recognition
Action
Classification
Attention
Activity
Prediction
Video_Classification
PDF
-
Defending Against Multiple and Unforeseen Adversarial Videos
Shao-Yuan Lo, Vishal M. Patel
arXiv_CV
arXiv_CV
Segmentation
Adversarial
Pose
Classification
Detection
Object_Detection
Inference
Video_Classification
PDF
-
Learning Audio-Visual Representations with Active Contrastive Coding
Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song
arXiv_CV
arXiv_CV
Represenation_Learning
Self-Supervised
Pose
Classification
Video_Classification
PDF
-
Making a Case for 3D Convolutions for Object Segmentation in Videos
Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe
arXiv_CV
arXiv_CV
Segmentation
Unsupervised
3D
Salient
Video_Prediction
Pose
Classification
CNN
Prediction
Video_Classification
PDF
-
Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation
Hongyuan Yu, Yan Huang, Lihong Pi, Liang Wang
arXiv_CV
arXiv_CV
3D
Adversarial
Video_Prediction
Pose
Classification
GAN
CNN
Prediction
Video_Classification
PDF
-
Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report
Jing Shi, Zhiheng Li, Haitian Zheng, Yihang Xu, Tianyou Xiao, Weitao Tan, Xiaoning Guo, Sizhe Li, Bin Yang, Zhexin Xu, Ruitao Lin, Zhongkai Shangguan, Yue Zhao, Jingwen Wang, Rohan Sharma, Surya Iyer, Ajinkya Deshmukh, Raunak Mahalik, Srishti Singh, Jayant G Rohra, Yipeng Zhang, Tongyu Yang, Xuan Wen, Ethan Fahnestock, Bryce Ikeda, Ian Lawson, Alan Finkelstein, Kehao Guo, Richard Magnotti, Andrew Sexton, Jeet Ketan Thaker, Oscar Su, Chenliang Xu
arXiv_CV
arXiv_CV
Action
Classification
Video_Classification
PDF
-
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu, Chang Xu, Langwen Hui, Cewu Lu, Dacheng Tao
arXiv_CV
arXiv_CV
Recognition
Sparse
Pose
Action_Recognition
Action
Classification
Inference
Video_Classification
PDF
-
AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification
Xiaofang Wang, Xuehan Xiong, Maxim Neumann, AJ Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua
arXiv_CV
arXiv_CV
3D
Pose
Classification
Attention
CNN
Video_Classification
PDF
-
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos
Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid
arXiv_CV
arXiv_CV
Recognition
Weakly_Supervised
Action_Recognition
Action
Classification
Detection
Object_Detection
Prediction
Video_Classification
PDF
-
MotionSqueeze: Neural Motion Feature Learning for Video Understanding
Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho
arXiv_CV
arXiv_CV
Recognition
Video_Caption
Pose
Action_Recognition
Action
Classification
Optical_Flow
Prediction
Video_Classification
PDF
-
Region-based Non-local Operation for Video Classification
Guoxi Huang, Adrian G. Bors
arXiv_CV
arXiv_CV
Optimization
Pose
Classification
Attention
CNN
Video_Classification
PDF
-
3D CNN-PCA: A Deep-Learning-Based Parameterization for Complex Geomodels
Yimin Liu, Louis J. Durlofsky
arXiv_CV
arXiv_CV
Reconstruction
3D
Classification
CNN
Video_Classification
Matching
PDF
-
Generalized Many-Way Few-Shot Video Classification
Yongqin Xian, Bruno Korbar, Matthijs Douze, Bernt Schiele, Zeynep Akata, Lorenzo Torresani
arXiv_CV
arXiv_CV
Recognition
3D
Pose
Classification
Few-Shot
Image_Classification
Video_Classification
PDF
-
SmallBigNet: Integrating Core and Contextual Views for Video Classification
Xianhang Li, Yali Wang, Zhipeng Zhou, Yu Qiao
arXiv_CV
arXiv_CV
3D
Pose
Classification
Video_Classification
PDF
-
Video Understanding as Machine Translation
Bruno Korbar, Fabio Petroni, Rohit Girdhar, Lorenzo Torresani
arXiv_CV
arXiv_CV
Video_Caption
Speech
Self-Supervised
Pose
Classification
VQA
Caption
QA
Video_Classification
PDF
-
Learning Representative Temporal Features for Action Recognition
Ali Javidani, Ahmad Mahmoudi-Aznaveh
arXiv_CV
arXiv_CV
Recognition
Pose
Action_Recognition
Action
Classification
CNN
Optical_Flow
Video_Classification
PDF
-
Video Contents Understanding using Deep Neural Networks
Mohammadhossein Toutiaee, Abbas Keshavarzi, Abolfazl Farahani, John A. Miller
arXiv_CV
arXiv_CV
Transfer_Learning
Pose
Classification
Detection
Object_Detection
Video_Classification
PDF
-
PipeNet: Selective Modal Pipeline of Fusion Network for Multi-Modal Face Anti-Spoofing
Qing Yang, Xia Zhu, Jong-Kae Fwu, Yun Ye, Ganmei You, Yuan Zhu
arXiv_CV
arXiv_CV
Recognition
3D
Pose
Face
Classification
Prediction
Video_Classification
PDF
-
TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition
Rami Ben-Ari, Mor Shpigel, Ophir Azulai, Udi Barzelay, Daniel Rotman
arXiv_CV
arXiv_CV
Embedding
Recognition
Action_Recognition
Action
Classification
Detection
Few-Shot
Activity
Video_Classification
PDF
-
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka, Tenga Wakamiya, Kensho Hara, Yutaka Satoh
arXiv_CV
arXiv_CV
Recognition
3D
Pose
Classification
Relation
Activity
CNN
Video_Classification
PDF
-
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
arXiv_CV
arXiv_CV
Recognition
3D
Action
Classification
Detection
Image_Classification
Video_Classification
PDF
-
Revisiting Few-shot Activity Detection with Class Similarity Control
Huijuan Xu, Ximeng Sun, Eric Tzeng, Abir Das, Kate Saenko, Trevor Darrell
arXiv_CV
arXiv_CV
Pose
Classification
Detection
Few-Shot
Activity
Video_Classification
PDF
-
Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification
Renchun You, Zhiyao Guo, Lei Cui, Xiang Long, Yingze Bao, Shilei Wen
arXiv_CV
arXiv_CV
Embedding
Pose
Classification
Relation
Attention
Image_Classification
Video_Classification
PDF
-
Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior
Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang
arXiv_CV
arXiv_CV
Adversarial
Pose
Classification
Relation
Video_Classification
PDF
-
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
Osman Semih Kayhan, Jan C. van Gemert
arXiv_AI
arXiv_AI
Classification
CNN
Image_Classification
Video_Classification
Matching
PDF
-
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Biagio Brattoli, Joe Tighe, Fedor Zhdanov, Pietro Perona, Krzysztof Chalupka
arXiv_CV
arXiv_CV
3D
Zero-Shot
Pose
Classification
Deep_Learning
Video_Classification
PDF
-
VideoSSL: Semi-Supervised Learning for Video Classification
Longlong Jing, Toufiq Parag, Zhe Wu, Yingli Tian, Hongcheng Wang
arXiv_CV
arXiv_CV
Pose
Action
Classification
CNN
Video_Classification
PDF
-
Learning spatio-temporal representations with temporal squeeze pooling
Guoxi Huang, Adrian G. Bors
arXiv_CV
arXiv_CV
Embedding
Optimization
Represenation_Learning
Pose
Classification
CNN
Video_Classification
PDF
-
iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention
Qian Liu, Dongyang Cai, Jie Liu, Nan Ding, Tao Wang
arXiv_CV
arXiv_CV
Pose
Classification
Attention
Activity
Video_Classification
PDF
-
Appending Adversarial Frames for Universal Video Attack
Zhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Qi Tian
arXiv_CV
arXiv_CV
Adversarial
Classification
Image_Classification
Video_Classification
PDF
-
Zero-shot Recognition of Complex Action Sequences
Jonathan D. Jones, Tae Soo Kim, Michael Peven, Jin Bai, Zihao Xiao, Yi Zhang, Weichao Qiu, Alan Yuille, Gregory D. Hager
arXiv_CV
arXiv_CV
Surveillance
Segmentation
Recognition
Zero-Shot
Knowledge
Action
Classification
Detection
Object_Detection
Activity
Video_Classification
PDF
-
A Spectral Nonlocal Block for Neural Networks
Lei Zhu, Qi She, Lidan Zhang, Ping Guo
arXiv_CV
arXiv_CV
Pose
Classification
Video_Classification
PDF
-
LPAT: Learning to Predict Adaptive Threshold for Weakly-supervised Temporal Action Localization
Xudong Lin, Zheng Shou, Shih-Fu Chang
arXiv_CV
arXiv_CV
Knowledge
Action_Localization
Pose
Action
Classification
Activity
Video_Classification
PDF
-
Gated Channel Transformation for Visual Recognition
Zongxin Yang, Linchao Zhu, Yu Wu, Yi Yang
arXiv_CV
arXiv_CV
Segmentation
Recognition
Pose
Classification
Detection
Relation
Object_Detection
CNN
Image_Classification
Video_Classification
PDF
-
Self-Paced Video Data Augmentation with Dynamic Images Generated by Generative Adversarial Networks
Yumeng Zhang, Gaoguo Jia, Li Chen, Mingrui Zhang, Junhai Yong
arXiv_CV
arXiv_CV
Regularization
Adversarial
Pose
Action
Classification
GAN
Video_Classification
PDF
-
Metric-Based Few-Shot Learning for Video Action Recognition
Chris Careaga, Brian Hutchinson, Nathan Hodas, Lawrence Phillips
arXiv_CV
arXiv_CV
Embedding
Recognition
Pose
Action_Recognition
Action
Classification
Few-Shot
CNN
Image_Classification
Video_Classification
PDF
-
Identifying and Resisting Adversarial Videos Using Temporal Consistency
Xiaojun Jia, Xingxing Wei, Xiaochun Cao
arXiv_CV
arXiv_CV
Sparse
Adversarial
Pose
Classification
Detection
Video_Classification
PDF
-
Two-Stream Video Classification with Cross-Modality Attention
Lu Chi, Guiyu Tian, Yadong Mu, Qi Tian
arXiv_CV
arXiv_CV
Pose
Classification
Attention
Prediction
Video_Classification
PDF
-
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Shilei Wen
arXiv_CV
arXiv_CV
Recognition
Reinforcement_Learning
3D
RNN
Salient
Pose
Action
Classification
Activity
Video_Classification
PDF
-
AVD: Adversarial Video Distillation
Mohammad Tavakolian, Mohammad Sabokrou, Abdenour Hadid
arXiv_CV
arXiv_CV
Reconstruction
Recognition
3D
Adversarial
Pose
Classification
Activity
CNN
Video_Classification
PDF
-
Loss Switching Fusion with Similarity Search for Video Classification
Lei Wang, Du Q. Huynh, Moussa Reda Mansour
arXiv_CV
arXiv_CV
Surveillance
Pose
Classification
Video_Classification
PDF
-
Few-Shot Video Classification via Temporal Alignment
Kaidi Cao, Jingwei Ji, Zhangjie Cao, Chien-Yi Chang, Juan Carlos Niebles
arXiv_CV
arXiv_CV
Pose
Classification
Few-Shot
Video_Classification
PDF
-
Spatio-Temporal Fusion Networks for Action Recognition
Sangwoo Cho, Hassan Foroosh
arXiv_CV
arXiv_CV
Recognition
Action_Recognition
Action
Classification
Activity
Video_Classification
PDF
-
Learning Spatio-Temporal Representation with Local and Global Diffusion
Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Xinmei Tian, Tao Mei
arXiv_CV
arXiv_CV
Recognition
Represenation_Learning
Pose
Action_Recognition
Action
Classification
Detection
CNN
Video_Classification
PDF
-
FASTER Recurrent Networks for Video Classification
Linchao Zhu, Laura Sevilla-Lara, Du Tran, Matt Feiszli, Yi Yang, Heng Wang
arXiv_CV
arXiv_CV
Pose
Action
Classification
Relation
Attention
Inference
Prediction
Video_Classification
PDF
-
Hallucinating Optical Flow Features for Video Classification
Yongyi Tang, Lin Ma, Lianqiang Zhou
arXiv_CV
arXiv_CV
Pose
Classification
Relation
Optical_Flow
Video_Classification
PDF
-
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
arXiv_CV
arXiv_CV
Segmentation
Semantic_Segmentation
Recognition
Video_Caption
Video_Prediction
Pose
Face
Action_Recognition
Action
Classification
Optical_Flow
Inference
Prediction
Video_Classification
PDF
-
Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints
Mengtian Li, Ersin Yumer, Deva Ramanan
arXiv_CV
arXiv_CV
NAS
Segmentation
Semantic_Segmentation
Optimization
Classification
Detection
Object_Detection
Image_Classification
Video_Classification
PDF
-
On Flow Profile Image for Video Representation
Mohammadreza Babaee, David Full, Gerhard Rigoll
arXiv_CV
arXiv_CV
Surveillance
Recognition
Video_Caption
Optimization
Pose
Classification
Caption
Activity
Optical_Flow
Video_Classification
PDF
-
DynamoNet: Dynamic Action and Motion Network
Ali Diba, Vivek Sharma, Luc Van Gool, Rainer Stiefelhagen
arXiv_CV
arXiv_CV
Recognition
3D
Self-Supervised
Pose
Action_Recognition
Action
Classification
CNN
Prediction
Video_Classification
PDF
-
Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
Jiangliu Wang, Jianbo Jiao, Linchao Bao, Shengfeng He, Yunhui Liu, Wei Liu
arXiv_CV
arXiv_CV
3D
Represenation_Learning
Self-Supervised
Pose
Classification
Video_Classification
PDF
-
Video Classification with Channel-Separated Convolutional Networks
Du Tran, Heng Wang, Lorenzo Torresani, Matt Feiszli
arXiv_AI
arXiv_AI
3D
Regularization
Action
Classification
CNN
Image_Classification
Video_Classification
PDF
-
Video-based surgical skill assessment using 3D convolutional neural networks
Isabel Funke, Sören Torge Mees, Jürgen Weitz, Stefanie Speidel
arXiv_CV
arXiv_CV
Tracking
3D
Pose
Classification
Deep_Learning
CNN
Optical_Flow
Video_Classification
PDF
-
Semantic Adversarial Network with Multi-scale Pyramid Attention for Video Classification
De Xie, Cheng Deng, Hao Wang, Chao Li, Dapeng Tao
arXiv_CV
arXiv_CV
Adversarial
Pose
Classification
Attention
CNN
Optical_Flow
Video_Classification
PDF
-
Efficient Video Classification Using Fewer Frames
Shweta Bhardwaj, Mukundhan Srinivasan, Mitesh M. Khapra
arXiv_CV
arXiv_CV
Pose
Action
Classification
Inference
Video_Classification
PDF
-
Saliency Tubes: Visual Explanations for Spatio-Temporal Convolutions
Alexandros Stergiou, Georgios Kapidis, Grigorios Kalliatakis, Christos Chrysoulas, Remco Veltkamp, Ronald Poppe
arXiv_CV
arXiv_CV
Recognition
3D
Salient
Pose
Action
Classification
Deep_Learning
CNN
Video_Classification
PDF
-
Rate-Accuracy Trade-Off In Video Classification With Deep Convolutional Neural Networks
Mohammad Jubran, Alhabib Abbas, Aaron Chadha, Yiannis Andreopoulos
arXiv_CV
arXiv_CV
Surveillance
Recognition
Action_Recognition
Action
Classification
CNN
Optical_Flow
Video_Classification
PDF
-
Adversarial Framing for Image and Video Classification
Michał Zając, Konrad Żołna, Negar Rostamzadeh, Pedro O. Pinheiro
arXiv_AI
arXiv_AI
Adversarial
Pose
Classification
Video_Classification
PDF
-
MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language
Hamid Reza Vaezi Joze, Oscar Koller
arXiv_CV
arXiv_CV
Recognition
3D
Pose
Action_Recognition
Action
Classification
Video_Classification
PDF
-
Deep Multimodal Learning: An Effective Method for Video Classification
Tianqi Zhao
arXiv_CV
arXiv_CV
RNN
Pose
Action
Classification
Attention
Language_Model
Video_Classification
PDF
-
Unsupervised Meta-Learning For Few-Shot Image and Video Classification
Siavash Khodadadeh, Ladislau Bölöni, Mubarak Shah
arXiv_AI
arXiv_AI
Unsupervised
Knowledge
Pose
Classification
Few-Shot
Video_Classification
PDF
-
Deep RNN Framework for Visual Sequential Applications
Bo Pang, Kaiwen Zha, Hanwen Cao, Chen Shi, Cewu Lu
arXiv_CV
arXiv_CV
RNN
Pose
Classification
Prediction
Video_Classification
PDF
-
High Order Neural Networks for Video Classification
Jie Shao, Kai Hu, Yixin Bao, Yining Lin, Xiangyang Xue
arXiv_CV
arXiv_CV
Classification
Relation
Video_Classification
PDF
-
NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification
Rongcheng Lin, Jing Xiao, Jianping Fan
arXiv_CV
arXiv_CV
Video_Caption
Pose
Classification
Attention
Video_Classification
PDF
-
Compact Generalized Non-local Network
Kaiyu Yue, Ming Sun, Yuchen Yuan, Feng Zhou, Errui Ding, Fuxin Xu
arXiv_CV
arXiv_CV
Recognition
Optimization
Action
Classification
Relation
Video_Classification
PDF
-
Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization
Haisheng Su, Xu Zhao, Tianwei Lin
arXiv_CV
arXiv_CV
Weakly_Supervised
Adversarial
Action_Localization
Pose
Action
Classification
Detection
Attention
Activity
Video_Classification
PDF
-
Fine-grained Video Categorization with Redundancy Reduction Attention
Chen Zhu, Xiao Tan, Feng Zhou, Xiao Liu, Kaiyu Yue, Errui Ding, Yi Ma
arXiv_CV
arXiv_CV
Pose
Classification
Attention
Video_Classification
PDF
-
Where and When to Look? Spatio-temporal Attention for Action Recognition in Videos
Lili Meng, Bo Zhao, Bo Chang, Gao Huang, Frederick Tung, Leonid Sigal
arXiv_CV
arXiv_CV
Recognition
Salient
Pose
Action_Recognition
Action
Quantitative
Classification
Attention
Video_Classification
PDF
-
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification
Jinlai Liu, Zehuan Yuan, Xiaojie Wang, Changhu Wang
arXiv_CV
arXiv_CV
Classification
Video_Classification
PDF
-
Label Denoising with Large Ensembles of Heterogeneous Neural Networks
Pavel Ostyakov, Elizaveta Logacheva, Roman Suvorov, Vladimir Aliev, Gleb Sterkin, Oleg Khomenko, Sergey I. Nikolenko
arXiv_CV
arXiv_CV
Video_Caption
Knowledge
Classification
Denoising
CNN
Video_Classification
PDF
-
Approach for Video Classification with Multi-label on YouTube-8M Dataset
Kwangsoo Shin, Junhyeong Jeon, Seungbin Lee, Boyoung Lim, Minsoo Jeong, Jongho Nang
arXiv_CV
arXiv_CV
Classification
Video_Classification
PDF
-
Isometric Transformation Invariant Graph-based Deep Neural Network
Renata Khasanova, Pascal Frossard
arXiv_CV
arXiv_CV
Classification
CNN
Video_Classification
PDF
-
Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning
Uta Büchler, Biagio Brattoli, Björn Ommer
arXiv_CV
arXiv_CV
Unsupervised
Transfer_Learning
Reinforcement_Learning
Self-Supervised
Pose
Classification
CNN
Video_Classification
PDF
-
Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI
Pramit Saha, Praneeth Srungarapu, Sidney Fels
arXiv_CV
arXiv_CV
Recognition
Speech
Action_Recognition
Action
Classification
CNN
Video_Classification
PDF
-
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, Kevin Murphy
arXiv_CV
arXiv_CV
3D
Represenation_Learning
Action
Classification
Detection
CNN
Image_Classification
Video_Classification
PDF
-
Multimodal Classification with Deep Convolutional-Recurrent Neural Networks for Electroencephalography
Chuanqi Tan, Fuchun Sun, Wenchang Zhang, Jianhua Chen, Chunfang Liu
arXiv_CV
arXiv_CV
RNN
Pose
Face
Classification
CNN
Optical_Flow
Video_Classification
PDF
-
Deep Discriminative Model for Video Classification
Mohammad Tavakolian, Abdenour Hadid
arXiv_CV
arXiv_CV
Reconstruction
Unsupervised
Recognition
Scene_Classification
Sparse
Pose
Action_Recognition
Action
Classification
Deep_Learning
Video_Classification
PDF
-
Deep Architectures and Ensembles for Semantic Video Classification
Eng-Jon Ong, Sameed Husain, Mikel Bober, Miroslaw Bober
arXiv_CV
arXiv_CV
RNN
Pose
Classification
Relation
Video_Classification
PDF
-
Adversarial Perturbations Against Real-Time Video Classification Systems
Shasha Li, Ajaya Neupane, Sujoy Paul, Chengyu Song, Srikanth V. Krishnamurthy, Amit K. Roy Chowdhury, Ananthram Swami
arXiv_CV
arXiv_CV
Surveillance
Adversarial
Classification
Relation
GAN
Video_Classification
PDF
-
A convex method for classification of groups of examples
Dori Peleg
arXiv_CV
arXiv_CV
Optimization
Pose
Classification
Image_Classification
Video_Classification
PDF
-
Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos
Gunnar A. Sigurdsson, Abhinav Gupta, Cordelia Schmid, Ali Farhadi, Karteek Alahari
arXiv_CV
arXiv_CV
Video_Caption
Classification
Caption
Activity
Video_Classification
PDF
-
Visual Data Synthesis via GAN for Zero-Shot Video Classification
Chenrui Zhang, Yuxin Peng
arXiv_CV
arXiv_CV
Zero-Shot
Knowledge
Pose
Classification
Relation
GAN
Inference
Video_Classification
Matching
PDF
-
Fine-grained Video Classification and Captioning
Farzaneh Mahdisoltani, Guillaume Berger, Waseem Gharbieh, David Fleet, Roland Memisevic
arXiv_CV
arXiv_CV
Video_Caption
Action
Classification
Relation
Caption
Video_Classification
PDF
-
Deep Learning for Video Classification and Captioning
Zuxuan Wu, Ting Yao, Yanwei Fu, Yu-Gang Jiang
arXiv_CV
arXiv_CV
Video_Caption
Review
Action
Classification
Deep_Learning
Caption
Video_Classification
PDF
-
Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video
Lionel Pigou, Aäron van den Oord, Sander Dieleman, Mieke Van Herreweghe, Joni Dambre
arXiv_CV
arXiv_CV
Image_Caption
Recognition
Gesture
RNN
Speech
Pose
Classification
Speech_Recognition
Caption
Video_Classification
PDF
-
Deep End2End Voxel2Voxel Prediction
Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri
arXiv_CV
arXiv_CV
NAS
Segmentation
Semantic_Segmentation
3D
Classification
Deep_Learning
Detection
CNN
Optical_Flow
Prediction
Video_Classification
PDF