Learning from the Tangram to Solve Mini Visual Tasks

2021-12-12 02:02:14

Yizhou Zhao, Liang Qiu, Pan Lu, Feng Shi, Tian Han, Song-Chun Zhu

arXiv_CV Transformer Pose Few-Shot Contour Handwriting

Abstract

Current pre-training methods in computer vision focus on natural images from daily-life contexts. However, abstract diagrams such as icons and symbols are also common and important in the real world. This work is inspired by the tangram, a game that requires replicating an abstract pattern from seven dissected shapes. By recording human experience in solving tangram puzzles, we present the Tangram dataset and show that a neural model pre-trained on the Tangram dataset helps solve some mini visual tasks based on low-resolution vision. Extensive experiments demonstrate that our proposed method generates intelligent solutions for aesthetic tasks such as folding clothes and evaluating room layouts. The pre-trained feature extractor can speed up the convergence of few-shot learning tasks on human handwriting and improve accuracy in identifying icons by their contours. The Tangram dataset is available at this https URL.

URL

https://arxiv.org/abs/2112.06113

PDF

https://arxiv.org/pdf/2112.06113.pdf