Friendica Social Network

Even_Adder via Stable Diffusion

3 weeks ago • •

HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models

Abstract

Diffusion models have become a mainstream approach for high-resolution image synthesis. However, directly generating higher-resolution images from pretrained diffusion models will encounter unreasonable object duplication and exponentially increase the generation time. In this paper, we discover that object duplication arises from feature duplication in the deep blocks of the U-Net. Concurrently, We pinpoint the extended generation times to self-attention redundancy in U-Net's top blocks. To address these issues, we propose a tuning-free higher-resolution framework named HiDiffusion. Specifically, HiDiffusion contains Resolution-Aware U-Net~(RAU-Net) that dynamically adjusts the feature map size to resolve object duplication and engages Modified Shifted Window Multi-head Self-Attention(MSW-MSA) that utilizes optimized window attention to reduce computations. we can integrate HiDiffusion into various pretrained diffusion models to scale image generation resolutions even to 4096×4096 at 1.5-6× the inference speed of previous methods.

HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models

We introduce HiDiffusion, a tuning-free framework comprised of Resolution-Aware U-Net (RAU-Net) and Modified Shifted Window Multi-head Self-Attention (MSW-MSA) to enable pretrained large text-to-image diffusion models to efficiently generate high-res…

^arXiv.org

like this

in reply to Even_Adder

Lemmy Tagginator

in reply to Even_Adder • 3 weeks ago • •

New Lemmy Post: HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models (https://lemmyverse.link/lemmy.dbzer0.com/post/18992762)
Tagging: #StableDiffusion

(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)

I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md

lemmy-tagginator/README.md at main · db0/lemmy-tagginator

A script that attempts to tag all posts from specific communities - db0/lemmy-tagginator

^GitHub

#stablediffusion

in reply to Even_Adder

calabast

in reply to Even_Adder • 3 weeks ago • •

Very cool! I only have experience using automatic1111, so if anyone has any hints on how I could enable this using that tool, I'd love to try it out!

in reply to calabast

Even_Adder

in reply to calabast • 3 weeks ago • •

I think someone will have to implement it as an extension.

in reply to Even_Adder

wewbull

in reply to Even_Adder • 3 weeks ago • •

Does this mean, that because you're now liberated from the dimensions of the training data, that all training data will apply to all sizes? e.g. generated portrait images will be influenced by landscape training data.

⇧

Even_Adder via Stable Diffusion

Even_Adder 3 weeks ago • •

HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models

Abstract

HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models

Lemmy Tagginator

lemmy-tagginator/README.md at main · db0/lemmy-tagginator

calabast

Even_Adder

wewbull

Even_Adder
3 weeks ago • •