Abstract
3D sensors, also known as RGB-D sensors, utilize depth images where each pixel measures the distance from the camera to objects, using principles like structured light or time-of-flight. Advances in artificial vision have led to affordable 3D cameras capable of real-time object detection without object movement, surpassing 2D cameras in information depth. These cameras can identify objects of varying colors and reflectivities and are less affected by lighting changes. The described prototype uses RGB-D sensors for bidirectional people counting in venues, aiding security and surveillance in spaces like stadiums or airports. It determines real-time occupancy and checks against maximum capacity, crucial during emergencies. The system includes a RealSense D415 depth camera and a mini-computer running object detection algorithms to count people and a 2D camera for identity verification. The system supports statistical analysis and uses C++, Python, and PHP with OpenCV for image processing, demonstrating a comprehensive approach to monitoring venue occupancy.
Abstract (translated)
3D传感器,也称为RGB-D传感器,利用深度图,其中每个像素测量相机到物体的距离,利用结构光或时间测距等原理。人工智能的进步使得价格实惠的3D相机能够实现实时物体检测,超过2D相机在信息深度方面的表现。这些相机可以识别各种颜色和反射率的物体,对光线变化的影响较小。描述的原型使用RGB-D传感器进行场馆双向人员计数,帮助体育场馆或机场等空间的安保和监控。它实时确定空位并检查最大容量,在紧急情况下至关重要。系统包括一个RealSense D415深度相机和一个运行物体检测算法的迷你计算机,以及一个2D相机用于身份验证。该系统支持统计分析,并使用C++、Python和PHP与OpenCV进行图像处理,展示了全面监测场馆占有率的方法。
URL
https://arxiv.org/abs/2403.12310