CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields

2024-04-23 12:22:32
Deheng Zhang, Clara Fernandez-Labrador, Christopher Schroers

Abstract

Creating artistic 3D scenes can be time-consuming and requires specialized knowledge. To address this, recent works such as ARF use a radiance-field-based approach with style constraints to generate 3D scenes that resemble a style image provided by the user. However, these methods lack fine-grained control over the resulting scenes. In this paper, we introduce Controllable Artistic Radiance Fields (CoARF), a novel algorithm for controllable 3D scene stylization. CoARF enables style transfer for specified objects, compositional 3D style transfer, and semantic-aware style transfer. We achieve controllability using segmentation masks with different label-dependent loss functions. We also propose a semantic-aware nearest neighbor matching algorithm to improve the style transfer quality. Our extensive experiments demonstrate that CoARF provides user-specified controllability of style transfer and superior style transfer quality with more precise feature matching.
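
The abstract does not spell out how the semantic-aware nearest neighbor matching is computed. As a rough illustration of the general idea only, the sketch below restricts nearest-neighbor matching so that a rendered feature can only be paired with style features carrying the same segmentation label. The function names, the use of cosine distance, and the choice to skip features whose label has no style counterpart are all assumptions made for this example, not details taken from the paper.

```python
import math

def cosine_distance(a, b):
    """Cosine distance between two feature vectors (1 - cosine similarity)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    # Assumption: treat a zero-length vector as maximally dissimilar.
    return 1.0 - dot / norm if norm > 0 else 1.0

def label_restricted_nn_loss(render_feats, render_labels, style_feats, style_labels):
    """Hypothetical loss: mean distance from each rendered feature to its
    nearest style feature that carries the same label."""
    total, count = 0.0, 0
    for feat, label in zip(render_feats, render_labels):
        # Only style features with a matching label are candidates.
        candidates = [
            cosine_distance(feat, s)
            for s, s_label in zip(style_feats, style_labels)
            if s_label == label
        ]
        # Assumption: a rendered feature with no same-label style feature is skipped.
        if candidates:
            total += min(candidates)
            count += 1
    return total / count if count else 0.0
```

With a single shared label, this reduces to plain nearest-neighbor matching. With distinct labels, a feature may be forced onto a farther match within its own semantic region, which is the kind of constraint the abstract attributes to the segmentation masks.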

URL

https://arxiv.org/abs/2404.14967

PDF

https://arxiv.org/pdf/2404.14967.pdf

