Paper Reading AI Learner

DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model

2024-05-04 15:32:57
Liangqi Lei, Keke Gai, Jing Yu, Liehuang Zhu

Abstract

Latent Diffusion Models (LDMs) enable a wide range of applications but raise ethical concerns regarding illegal utilization.Adding watermarks to generative model outputs is a vital technique employed for copyright tracking and mitigating potential risks associated with AI-generated content. However, post-hoc watermarking techniques are susceptible to evasion. Existing watermarking methods for LDMs can only embed fixed messages. Watermark message alteration requires model retraining. The stability of the watermark is influenced by model updates and iterations. Furthermore, the current reconstruction-based watermark removal techniques utilizing variational autoencoders (VAE) and diffusion models have the capability to remove a significant portion of watermarks. Therefore, we propose a novel technique called DiffuseTrace. The goal is to embed invisible watermarks in all generated images for future detection semantically. The method establishes a unified representation of the initial latent variables and the watermark information through training an encoder-decoder model. The watermark information is embedded into the initial latent variables through the encoder and integrated into the sampling process. The watermark information is extracted by reversing the diffusion process and utilizing the decoder. DiffuseTrace does not rely on fine-tuning of the diffusion model components. The watermark is embedded into the image space semantically without compromising image quality. The encoder-decoder can be utilized as a plug-in in arbitrary diffusion models. We validate through experiments the effectiveness and flexibility of DiffuseTrace. DiffuseTrace holds an unprecedented advantage in combating the latest attacks based on variational autoencoders and Diffusion Models.

Abstract (translated)

潜在扩散模型(LDMs)允许应用于广泛的领域,但涉及非法利用的伦理问题。在将水印添加到生成模型的输出中是保护版权跟踪和减轻与AI生成的内容相关的潜在风险的重要技术。然而,后置水印技术易被绕过。现有的LDM水印方法只能嵌入固定的消息。水印消息修改需要模型重构。水印的稳定性受模型更新和迭代的影响。此外,使用变分自编码器(VAE)和扩散模型基于重构的消歧水印去除技术具有去除大量水印的能力。因此,我们提出了名为DiffuseTrace的新技术。目标是将不可见的水印嵌入所有生成的图像中,供未来检测具有语义意义。该方法通过训练编码器-解码器模型,将初始潜在变量和 水印信息建立为统一表示。水印信息通过编码器整合到抽样过程中。通过反转扩散过程并利用解码器提取水印信息。DiffuseTrace不依赖于对扩散模型组件的微调。水印在图像空间语义上嵌入,同时不牺牲图像质量。编码器-解码器可以作为任意扩散模型的插件使用。通过实验验证DiffuseTrace的有效性和灵活性。DiffuseTrace在对抗基于变分自编码器(VAE)和扩散模型的最新攻击方面具有史无前例的优势。

URL

https://arxiv.org/abs/2405.02696

PDF

https://arxiv.org/pdf/2405.02696.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot