Out of the Box: A combined approach for handling occlusion in Human Pose Estimation

Abstract

Human pose estimation is a challenging problem, especially 3D pose estimation from 2D images, because of factors such as occlusion, depth ambiguity, people intertwined with one another, and crowds in general. 2D multi-person pose estimation in the wild suffers from the same problems: occlusion, ambiguity, and the difficulty of disentangling different people's body parts. Because it is a fundamental problem with many applications, including but not limited to surveillance, economical motion capture for video games and movies, and physiotherapy, it is interesting from both a practical and an intellectual perspective. Although there are cases in which no pose estimator can predict with 100% accuracy (cases where even humans would fail), several algorithms have set new state-of-the-art performance in human pose estimation in the wild. We look at a few algorithms that take different approaches and formulate our own approach to tackle a persistent problem: occlusion.

URL

https://arxiv.org/abs/1904.11157

PDF

https://arxiv.org/pdf/1904.11157.pdf