Paper Reading AI Learner

ViewCraft3D: High-Fidelity and View-Consistent 3D Vector Graphics Synthesis

2025-05-26 04:21:18
Chuang Wang, Haitao Zhou, Ling Luo, Qian Yu

Abstract

3D vector graphics play a crucial role in various applications including 3D shape retrieval, conceptual design, and virtual reality interactions due to their ability to capture essential structural information with minimal representation. While recent approaches have shown promise in generating 3D vector graphics, they often suffer from lengthy processing times and struggle to maintain view consistency. To address these limitations, we propose ViewCraft3D (VC3D), an efficient method that leverages 3D priors to generate 3D vector graphics. Specifically, our approach begins with 3D object analysis, employs a geometric extraction algorithm to fit 3D vector graphics to the underlying structure, and applies view-consistent refinement process to enhance visual quality. Our comprehensive experiments demonstrate that VC3D outperforms previous methods in both qualitative and quantitative evaluations, while significantly reducing computational overhead. The resulting 3D sketches maintain view consistency and effectively capture the essential characteristics of the original objects.

Abstract (translated)

三维矢量图形在包括三维形状检索、概念设计和虚拟现实交互等各种应用中扮演着重要角色,因为它们能够用最少的表示捕捉到结构信息的本质。尽管最近的方法在这方面显示出了一定的进步,但往往面临着处理时间过长以及难以保持视图一致性的挑战。为了克服这些限制,我们提出了一种名为ViewCraft3D (VC3D) 的高效方法,该方法利用三维先验来生成三维矢量图形。 具体来说,我们的方法从三维物体分析开始,使用几何提取算法将三维矢量图形拟合到底层结构,并应用视图一致的细化过程以提升视觉质量。通过全面实验,我们证明VC3D在定性和定量评估中均优于先前的方法,并且显著减少了计算开销。生成的三维草图保持了视图一致性并有效捕捉到了原始物体的主要特征。

URL

https://arxiv.org/abs/2505.19492

PDF

https://arxiv.org/pdf/2505.19492.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot