Paper Reading AI Learner

SRN: Side-output Residual Network for Object Reflection Symmetry Detection and Beyond

2018-07-17 18:51:19
Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye

Abstract

In this paper, we establish a baseline for object reflection symmetry detection in complex backgrounds by presenting a new benchmark and an end-to-end deep learning approach, opening up a promising direction for symmetry detection in the wild. The new benchmark, Sym-PASCAL, spans challenges including object diversity, multi-objects, part-invisibility, and various complex backgrounds that are far beyond those in existing datasets. The end-to-end deep learning approach, referred to as a side-output residual network (SRN), leverages the output residual units (RUs) to fit the errors between the object ground-truth symmetry and the side-outputs of multiple stages. By cascading RUs in a deep-to-shallow manner, SRN exploits the 'flow' of errors among multiple stages to address the challenges of fitting complex output with limited convolutional layers, suppressing the complex backgrounds, and effectively matching object symmetry at different scales. SRN is further upgraded to a multi-task side-output residual network (MT-SRN) for joint symmetry and edge detection, demonstrating its generality to image-to-mask learning tasks. Experimental results validate both the challenging aspects of Sym-PASCAL benchmark related to real-world images and the state-of-the-art performance of the proposed SRN approach.

Abstract (translated)

在本文中,我们通过提出一个新的基准和端到端深度学习方法,为复杂背景下的物体反射对称检测建立基线,为野外对称检测开辟了一个有前景的方向。新的基准测试Sym-PASCAL涵盖了包括对象多样性,多对象,部分不可见性以及远远超出现有数据集的各种复杂背景等挑战。端到端深度学习方法,称为侧输出残余网络(SRN),利用输出残差单位(RU)来拟合对象地面对称性与多级侧向输出之间的误差。 。通过以深度到浅层的方式级联RU,SRN利用多个阶段之间的“流量”错误来解决利用有限卷积层拟合复杂输出,抑制复杂背景以及有效匹配不同尺度的对象对称性的挑战。 SRN进一步升级为多任务侧输出残差网络(MT-SRN),用于联合对称和边缘检测,展示了其对图像到掩模学习任务的通用性。实验结果验证了与真实世界图像相关的Sym-PASCAL基准测试的挑战性方面以及所提出的SRN方法的最新性能。

URL

https://arxiv.org/abs/1807.06621

PDF

https://arxiv.org/pdf/1807.06621.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot