Paper Reading AI Learner

Bullying10K: A Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition

2023-06-20 13:59:20
Yiting Dong, Yang Li, Dongcheng Zhao, Guobin Shen, Yi Zeng

Abstract

The prevalence of violence in daily life poses significant threats to individuals' physical and mental well-being. Using surveillance cameras in public spaces has proven effective in proactively deterring and preventing such incidents. However, concerns regarding privacy invasion have emerged due to their widespread deployment. To address the problem, we leverage Dynamic Vision Sensors (DVS) cameras to detect violent incidents and preserve privacy since it captures pixel brightness variations instead of static imagery. We introduce the Bullying10K dataset, encompassing various actions, complex movements, and occlusions from real-life scenarios. It provides three benchmarks for evaluating different tasks: action recognition, temporal action localization, and pose estimation. With 10,000 event segments, totaling 12 billion events and 255 GB of data, Bullying10K contributes significantly by balancing violence detection and personal privacy persevering. And it also poses a challenge to the neuromorphic dataset. It will serve as a valuable resource for training and developing privacy-protecting video systems. The Bullying10K opens new possibilities for innovative approaches in these domains.

Abstract (translated)

日常生活中的暴力普遍存在,对个体的身体健康和心理健康构成严重威胁。在公共场所使用监控摄像头已经证明能够有效地预防并阻止此类事件的发生。然而,隐私侵犯问题也因为广泛使用监控摄像头而凸显出来。为了解决这一问题,我们利用动态视觉传感器(DVS)摄像头来检测暴力事件并保护隐私,因为它能够捕捉像素亮度的变化而不是静态图像。我们介绍了“欺凌10K数据集”,涵盖了各种行动、复杂动作和遮挡情况,提供了三个基准任务以评估不同的任务:行动识别、时间动作定位和姿态估计。该数据集有10,000个事件片段,总共涵盖了120亿事件和2.55GB的数据,通过平衡暴力检测和个人隐私坚持, significantly contribute to these domains' innovative approaches. 该数据集也挑战了神经可塑性数据集。它将成为训练和开发保护隐私的视频系统的有价值的资源。欺凌10K为这些领域的创新方法带来了新的可能性。

URL

https://arxiv.org/abs/2306.11546

PDF

https://arxiv.org/pdf/2306.11546.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot