The Multi-Modal Video Reasoning and Analyzing Competition

2021-08-18 18:40:00

Haoran Peng, He Huang, Li Xu, Tianjiao Li, Jun Liu, Hossein Rahmani, Qiuhong Ke, Zhicheng Guo, Cong Wu, Rongchang Li, Mang Ye, Jiahao Wang, Jiaxu Zhang, Yuanzhong Liu, Tao He, Fuwei Zhang, Xianbin Liu, Tao Lin

arXiv_AI

arXiv_AI Recognition Action_Recognition Person_Re-identification Re-identification QA Pose Action

Abstract
Abstract (translated)
URL
PDF

Abstract

In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021. This competition is composed of four different tracks, namely, video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification, which are based on two datasets: SUTD-TrafficQA and UAV-Human. We summarize the top-performing methods submitted by the participants in this competition and show their results achieved in the competition.

Abstract (translated)

URL

https://arxiv.org/abs/2108.08344

PDF

https://arxiv.org/pdf/2108.08344.pdf