Unsupervised part representation by Flow Capsules

2020-11-27 18:59:42

Sara Sabour, Andrea Tagliasacchi, Soroosh Yazdani, Geoffrey E. Hinton, David J. Fleet

arXiv_CV

arXiv_CV Classification Relation Unsupervised Pose Self-Supervised

Abstract
Abstract (translated)
URL
PDF

Abstract

Capsule networks are designed to parse an image into a hierarchy of objects, parts and relations. While promising, they remain limited by an inability to learn effective low level part descriptions. To address this issue we propose a novel self-supervised method for learning part descriptors of an image. During training, we exploit motion as a powerful perceptual cue for part definition, using an expressive decoder for part generation and layered image formation with occlusion. Experiments demonstrate robust part discovery in the presence of multiple objects, cluttered backgrounds, and significant occlusion. The resulting part descriptors, a.k.a. part capsules, are decoded into shape masks, filling in occluded pixels, along with relative depth on single images. We also report unsupervised object classification using our capsule parts in a stacked capsule autoencoder.

Abstract (translated)

URL

https://arxiv.org/abs/2011.13920

PDF

https://arxiv.org/pdf/2011.13920.pdf