Skip to main content


PuLID: Pure and Lightning ID Customization via Contrastive Alignment


Abstract


We propose Pure and Lightning ID customization (PuLID), a novel tuning-free ID customization method for text-to-image generation. By incorporating a Lightning T2I branch with a standard diffusion one, PuLID introduces both contrastive alignment loss and accurate ID loss, minimizing disruption to the original model and ensuring high ID fidelity. Experiments show that PuLID achieves superior performance in both ID fidelity and editability. Another attractive property of PuLID is that the image elements (e.g., background, lighting, composition, and style) before and after the ID insertion are kept as consistent as possible. Codes and models will be available at this https URL


Paper: https://arxiv.org/abs/2404.16022

Code: https://github.com/ToTheBeginning/PuLID

in reply to Even_Adder

New Lemmy Post: PuLID: Pure and Lightning ID Customization via Contrastive Alignment (https://lemmyverse.link/lemmy.dbzer0.com/post/19505268)
Tagging: #StableDiffusion

(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)

I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md