Paper Reading AI Learner

Pixel-level Correspondence for Self-Supervised Learning from Video

2022-07-08 12:50:13

Yash Sharma, Yi Zhu, Chris Russell, Thomas Brox

arXiv_CV

arXiv_CV Classification Image_Classification Tracking Represenation_Learning Prediction Pose Optical_Flow Self-Supervised Contrastive_Learning

Abstract
Abstract (translated)
URL
PDF

Abstract

While self-supervised learning has enabled effective representation learning in the absence of labels, for vision, video remains a relatively untapped source of supervision. To address this, we propose Pixel-level Correspondence (PiCo), a method for dense contrastive learning from video. By tracking points with optical flow, we obtain a correspondence map which can be used to match local features at different points in time. We validate PiCo on standard benchmarks, outperforming self-supervised baselines on multiple dense prediction tasks, without compromising performance on image classification.

Abstract (translated)

URL

https://arxiv.org/abs/2207.03866

PDF

https://arxiv.org/pdf/2207.03866.pdf