Unifying conditional and unconditional semantic image synthesis with OCO-GAN

2022-11-25 13:43:21

Marlène Careil, Stéphane Lathuilière, Camille Couprie, Jakob Verbeek

arXiv_CV

arXiv_CV Segmentation Semantic_Segmentation GAN Adversarial Pose

Abstract
Abstract (translated)
URL
PDF

Abstract

Generative image models have been extensively studied in recent years. In the unconditional setting, they model the marginal distribution from unlabelled images. To allow for more control, image synthesis can be conditioned on semantic segmentation maps that instruct the generator the position of objects in the image. While these two tasks are intimately related, they are generally studied in isolation. We propose OCO-GAN, for Optionally COnditioned GAN, which addresses both tasks in a unified manner, with a shared image synthesis network that can be conditioned either on semantic maps or directly on latents. Trained adversarially in an end-to-end approach with a shared discriminator, we are able to leverage the synergy between both tasks. We experiment with Cityscapes, COCO-Stuff, ADE20K datasets in a limited data, semi-supervised and full data regime and obtain excellent performance, improving over existing hybrid models that can generate both with and without conditioning in all settings. Moreover, our results are competitive or better than state-of-the art specialised unconditional and conditional models.

Abstract (translated)

URL

https://arxiv.org/abs/2211.14105

PDF

https://arxiv.org/pdf/2211.14105.pdf