Paper Reading AI Learner

Design and Implementation of A Soccer Ball Detection System with Multiple Cameras

2023-01-31 22:04:53
Lei Li, Tianfang Zhang, Zhongfeng Kang, Wenhan Zhang

Abstract

The detection of small and medium-sized objects in three dimensions has always been a frontier exploration problem. This technology has a very wide application in sports analysis, games, virtual reality, human animation and other fields. The traditional three-dimensional small target detection technology has the disadvantages of high cost, low precision and inconvenience, so it is difficult to apply in practice. With the development of machine learning and deep learning, the technology of computer vision algorithms is becoming more mature. Creating an immersive media experience is considered to be a very important research work in sports. The main work is to explore and solve the problem of football detection under the multiple cameras, aiming at the research and implementation of the live broadcast system of football matches. Using multi cameras detects a target ball and determines its position in three dimension with the occlusion, motion, low illumination of the target object. This paper designed and implemented football detection system under multiple cameras for the detection and capture of targets in real-time matches. The main work mainly consists of three parts, football detector, single camera detection, and multi-cameras detection. The system used bundle adjustment to obtain the three-dimensional position of the target, and the GPU to accelerates data pre-processing and achieve accurate real-time capture of the target. By testing the system, it shows that the system can accurately detect and capture the moving targets in 3D. In addition, the solution in this paper is reusable for large-scale competitions, like basketball and soccer. The system framework can be well transplanted into other similar engineering project systems. It has been put into the market.

Abstract (translated)

检测小和中型物体在三维空间中的检测一直是一个重要的前沿探索问题。该技术在体育分析、游戏、虚拟现实、人类动画和其他领域有着广泛的应用。传统的三维小型目标检测技术具有高成本、低精度和不便用的缺点,因此难以在实践中应用。随着机器学习和深度学习的发展,计算机视觉算法技术正在变得更加成熟。创造沉浸式媒体体验被认为是在体育领域中非常重要的研究工作。其主要工作是探索和解决在多个摄像头下检测足球的问题,旨在研究和实施足球比赛的实时广播系统。利用多个摄像头检测足球球并确定其在三维空间中的的位置,结合目标遮挡、运动和低照明等因素。本文设计和实现了在多个摄像头下检测足球的系统,以在实时比赛中检测和捕捉目标。其主要工作主要由三个部分组成:足球检测、单个摄像头检测和多个摄像头检测。系统使用了卷积调整来获取目标三维位置,并将GPU加速数据预处理,以实现准确的实时目标捕捉。通过测试系统,表明该系统可以准确地检测和捕捉在三维空间中的移动目标。此外,本文的解决方案可以用于大型竞赛,如篮球和足球。系统框架可以轻松移植到其他类似的工程项目系统中。已在市场上发布。

URL

https://arxiv.org/abs/2302.00123

PDF

https://arxiv.org/pdf/2302.00123.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot