High-Resolution Image Synthesis with Latent Diffusion Models

Posted: 2023-03-16 20:24:09


Overview
General pipeline
Code

Rombach R., Blattmann A., Lorenz D., Esser P. and Ommer B. High-resolution image synthesis with latent diffusion models. In IEEE Computer Vision and Pattern Recognition Conference (CVPR), 2022.

Overview

The diffusion model is moved into a lower-dimensional latent space to reduce the computational cost.

General pipeline

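Standard diffusion models run both the forward and the reverse process in the original image (pixel) space, so generating very high-resolution images is computationally expensive.
The authors therefore first train an encoder and a decoder, and then:
1. Map the original image \(x \in \mathbb{R}^{C \times H \times W}\) into a low-dimensional latent space.
2. Run the entire forward diffusion and reverse denoising process in this latent space.
3. At inference time, once a latent sample \(\hat{z}\) has been obtained, map it back to image space with the decoder.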
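The three steps above can be sketched in a few lines of NumPy. This is only a toy illustration under loud assumptions: `encode` and `decode` are hypothetical stand-ins (average pooling and nearest-neighbour upsampling) for the trained encoder and decoder, the downsampling factor `f=8` and the value `alpha_bar_t=0.5` are picked arbitrarily, and the true noise is reused where a trained denoiser would normally predict it.

```python
import numpy as np

def encode(x, f=8):
    """Toy stand-in for the trained encoder: average-pool each f x f patch."""
    c, h, w = x.shape
    return x.reshape(c, h // f, f, w // f, f).mean(axis=(2, 4))

def decode(z, f=8):
    """Toy stand-in for the trained decoder: nearest-neighbour upsampling."""
    return z.repeat(f, axis=1).repeat(f, axis=2)

def forward_diffuse(z0, alpha_bar_t, rng):
    """Sample z_t = sqrt(a) * z_0 + sqrt(1 - a) * eps (DDPM-style forward step)."""
    eps = rng.standard_normal(z0.shape)
    z_t = np.sqrt(alpha_bar_t) * z0 + np.sqrt(1.0 - alpha_bar_t) * eps
    return z_t, eps

rng = np.random.default_rng(0)
alpha_bar_t = 0.5                         # assumed noise level, for illustration

x = rng.standard_normal((3, 256, 256))    # image x in R^{C x H x W}
z0 = encode(x)                            # step 1: map the image to the latent space
z_t, eps = forward_diffuse(z0, alpha_bar_t, rng)  # step 2: diffusion acts on z, not x

# A trained denoiser would predict eps from (z_t, t); the true eps is reused here.
z_hat = (z_t - np.sqrt(1.0 - alpha_bar_t) * eps) / np.sqrt(alpha_bar_t)
x_hat = decode(z_hat)                     # step 3: decode the latent sample

print(x.size, z0.size)                    # 196608 3072
```

In this toy setting the diffusion process touches 3072 values instead of 196608, a 64-fold reduction, which is where the savings come from. The real latent shape depends on the trained autoencoder and will generally differ from this sketch.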
Note that the paper also proposes a cross-attention mechanism for modeling conditional distributions:

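\[\text{Attention}(Q, K, V) = \text{softmax}\left(\frac{QK^T}{\sqrt{d}}\right) \cdot V, \\ Q = W_Q^{(i)} \cdot \varphi_i (z_t), \quad K = W_K^{(i)} \cdot \tau_{\theta}(y), \quad V = W_V^{(i)} \cdot \tau_{\theta}(y). \]

Here \(\varphi_i(z_t)\) is a (flattened) intermediate representation of the denoising network, \(\tau_{\theta}(y)\) is the encoding of the condition \(y\) (e.g. a text prompt), and \(W_Q^{(i)}, W_K^{(i)}, W_V^{(i)}\) are learnable projection matrices.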
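A minimal NumPy sketch of the cross-attention formula may make the shapes concrete. Everything here is illustrative rather than the official implementation: the function name `cross_attention`, the random weights, and all dimensions are assumptions, and tokens are stored as rows, so the projections are written as `phi_z @ W_Q` instead of \(W_Q \cdot \varphi(z_t)\).

```python
import numpy as np

def softmax(a, axis=-1):
    """Numerically stable softmax along the given axis."""
    a = a - a.max(axis=axis, keepdims=True)
    e = np.exp(a)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(phi_z, tau_y, W_Q, W_K, W_V):
    """Queries come from the latent features, keys and values from the condition."""
    Q = phi_z @ W_Q                  # (N, d): one query per latent position
    K = tau_y @ W_K                  # (M, d): one key per condition token
    V = tau_y @ W_V                  # (M, d): one value per condition token
    d = Q.shape[-1]
    weights = softmax(Q @ K.T / np.sqrt(d), axis=-1)   # (N, M)
    return weights @ V, weights

rng = np.random.default_rng(0)
N, M = 16, 5                 # latent positions, condition tokens (assumed)
d_z, d_tau, d = 32, 24, 8    # feature sizes (assumed)

phi_z = rng.standard_normal((N, d_z))    # stands in for phi_i(z_t)
tau_y = rng.standard_normal((M, d_tau))  # stands in for tau_theta(y)
W_Q = rng.standard_normal((d_z, d))
W_K = rng.standard_normal((d_tau, d))
W_V = rng.standard_normal((d_tau, d))

out, weights = cross_attention(phi_z, tau_y, W_Q, W_K, W_V)
print(out.shape, weights.shape)          # (16, 8) (16, 5)
```

Each row of `weights` sums to 1, so every latent position receives a convex combination of the condition's value vectors. This is how the condition \(y\) enters the denoising process.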

Code

official

From: https://www.cnblogs.com/MTandHJ/p/17224002.html
