Abstract
We propose Pure and Lightning ID customization (PuLID), a novel tuning-free ID customization method for text-to-image generation. By incorporating a Lightning T2I branch with a standard diffusion one, PuLID introduces both contrastive alignment loss and accurate ID loss, minimizing disruption to the original model and ensuring high ID fidelity. Experiments show that PuLID achieves superior performance in both ID fidelity and editability. Another attractive property of PuLID is that the image elements (e.g., background, lighting, composition, and style) before and after the ID insertion are kept as consistent as possible. Codes and models will be available at this https URL
Abstract (translated)
我们提出了PuLID(纯和闪电ID定制),一种新的文本到图像生成的自定义ID方法。通过结合标准扩散一层的闪电T2I分支,PuLID引入了对比对齐损失和精确ID损失,最小化对原始模型的干扰并确保高ID保真度。实验结果表明,PuLID在ID保真度和编辑性方面都取得了卓越的性能。PuLID的另一个吸引人的特性是,在ID插入之前和之后,图像元素(例如背景、光照、构图和样式)保持一致。代码和模型将在这个https:// URL上提供。
URL
https://arxiv.org/abs/2404.16022