Stable Diffusion @lemmy.dbzer0.com pablonaj @feddit.de 1y ago

Technical report on SDXL released!

github.com generative-models/assets/sdxl_report.pdf at main · Stability-AI/generative-models

Generative Models by Stability AI. Contribute to Stability-AI/generative-models development by creating an account on GitHub.

We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone: the increase in model parameters is mainly due to more attention blocks and a larger cross-attention context, as SDXL uses a second text encoder. We design multiple novel conditioning schemes and train SDXL on multiple aspect ratios. We also introduce a refinement model which is used to improve the visual fidelity of samples generated by SDXL using a post-hoc image-to-image technique. We demonstrate that SDXL shows drastically improved performance compared to the previous versions of Stable Diffusion and achieves results competitive with those of black-box state-of-the-art image generators. In the spirit of promoting open research and fostering transparency in large model training and evaluation, we provide access to code and model weights.
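For anyone wondering what "train on multiple aspect ratios" could look like in practice, here is a rough sketch of aspect-ratio bucketing: each image is assigned to the resolution bucket whose shape is closest to its own. This is only an illustration, not code from the report or the repo. The function names (`make_buckets`, `nearest_bucket`) and the parameters (about 1024x1024 pixels per bucket, sides in multiples of 64, sides between 512 and 2048) are my own assumptions.

```python
import math


def make_buckets(target_area=1024 * 1024, step=64, min_side=512, max_side=2048):
    """Enumerate (height, width) buckets with sides that are multiples of
    `step` and an area close to `target_area`. All defaults are assumed
    values for illustration, not figures taken from the abstract."""
    buckets = []
    for h in range(min_side, max_side + 1, step):
        # Width (rounded to a multiple of `step`) that keeps h * w near the target area.
        w = round(target_area / h / step) * step
        if min_side <= w <= max_side:
            buckets.append((h, w))
    return buckets


def nearest_bucket(height, width, buckets):
    """Return the bucket whose aspect ratio is closest to the image's,
    comparing ratios in log space so tall and wide shapes are treated symmetrically."""
    log_ratio = math.log(height / width)
    return min(buckets, key=lambda b: abs(math.log(b[0] / b[1]) - log_ratio))


if __name__ == "__main__":
    buckets = make_buckets()
    # A 1080x1920 (height x width) frame would be resized/cropped to this bucket.
    print(nearest_bucket(1080, 1920, buckets))
```

With these assumed settings, a 1080x1920 frame lands in the (768, 1344) bucket and a square image lands in (1024, 1024). Batches would then be drawn from a single bucket at a time so every image in a batch shares one shape.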