2024-05-02 13:26:19
2024-04-22 18:17:21
2024-04-22 18:17:06
7931514
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Abstract
Recently, a series of diffusion-aware distillation algorithms have emerged to alleviate the computational overhead associated with the multi-step inference process of Diffusion Models (DMs). Current distillation techniques often dichotomize into two distinct aspects: i) ODE Trajectory Preservation; and ii) ODE Trajectory Reformulation. However, these approaches suffer from severe performance degradation or domain shifts. To address these limitations, we propose Hyper-SD, a novel framework that synergistically amalgamates the advantages of ODE Trajectory Preservation and Reformulation, while maintaining near-lossless performance during step compression. Firstly, we introduce Trajectory Segmented Consistency Distillation to progressively perform consistent distillation within pre-defined time-step segments, which facilitates the preservation of the original ODE trajectory from a higher-order perspective. Secondly, we incorporate human feedback learning to boost the performance of the model in a low-step regime and mitigate the performance loss incurred by the distillation process. Thirdly, we integrate score distillation to further improve the low-step generation capability of the model and offer the first attempt to leverage a unified LoRA to support the inference process at all steps. Extensive experiments and user studies demonstrate that Hyper-SD achieves SOTA performance from 1 to 8 inference steps for both SDXL and SD1.5. For example, Hyper-SDXL surpasses SDXL-Lightning by +0.68 in CLIP Score and +0.51 in Aes Score in the 1-step inference.
Paper:
Hugging Face Repo: https://huggingface.co/ByteDance/Hyper-SD
T2I Demo: https://huggingface.co/spaces/ByteDance/Hyper-SDXL-1Step-T2I
Scribble Demo: https://huggingface.co/spaces/ByteDance/Hyper-SD15-Scribble
Project Page: https://hyper-sd.github.io/
ByteDance/Hyper-SD · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.huggingface.co
Lemmy Tagginator
in reply to Even_Adder • • •New Lemmy Post: Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis (https://lemmyverse.link/lemmy.dbzer0.com/post/18929761)
Tagging: #StableDiffusion
(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)
I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md
lemmy-tagginator/README.md at main · db0/lemmy-tagginator
GitHub