Stable Diffusion | NotionNext BLOG

This note is outdated and is kept only as an archive.

💡

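This note is meant only as an entry point for readers who know nothing about Stable Diffusion, so that the sheer volume of videos and guides online does not leave you wondering where to start. I hope it helps!

It collects high-quality tutorials from around the web, 😇 from getting started to giving up (just kidding), and is updated continuously. If anything is outdated or wrong, please contact me or comment at the relevant spot; if you find a better tutorial, please let me know so I can update the list. 😘 Thanks!

Installation:

Two options (pick one)

秋葉aaaki's all-in-one package and launcher (recommended: works out of the box, no tinkering)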

[AI Painting] Stable Diffusion all-in-one package v4 released! New acceleration, unzip and run, out-of-VRAM protection, get started with AI painting in three minutes ☆ updatable ☆ training ☆ Chinese localization - bilibili

Baidu Netdisk: https://pan.baidu.com/s/1_ibEk2OpKHxmEg4AnFOpSA (if the link does not show up even after following the uploader, read the video description first). This all-in-one package is built on the open-source Stable Diffusion WebUI and is intended only for learning AIGC-related techniques. Any consequences and liability arising from using the package and launcher to generate images are… Video by 秋葉aaaki.

https://www.bilibili.com/video/BV1iM4y1y7oA/?spm_id_from=333.999.0.0&vd_source=9ee7ccc193b00146b2d136b33b851724

Self-deployment (for those with GitHub experience; you can still use 秋葉aaaki's launcher)

GitHub - AUTOMATIC1111/stable-diffusion-webui: Stable Diffusion web UI

Stable Diffusion web UI. Contribute to AUTOMATIC1111/stable-diffusion-webui development by creating an account on GitHub.

https://github.com/AUTOMATIC1111/stable-diffusion-webui

Tutorials:

Basic usage (!!! must-watch for beginners !!!)

Bilibili's first systematic AI painting course! Learn Stable Diffusion from scratch; this is surely the easiest AI painting tutorial you have ever seen | SD WebUI step-by-step guide - bilibili

Bilibili's first systematic introductory AI painting course is here! To get going with Stable Diffusion from zero, just follow this video series. Prompt handbook: BV12X4y1r7QB. Model starter pack: BV1Us4y117Rg. Course materials: https://www.mubucm.com/doc/_2As4DSE4m (copy the link into a browser or Mubu to open it; it includes netdisk download links and is updated continuously). As an emerging skill category, what is currently on the market about AI… Video by Nenly同学.

https://www.bilibili.com/video/BV1As4y127HW/?spm_id_from=333.999.0.0&vd_source=9ee7ccc193b00146b2d136b33b851724

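This is a video series, and every video in it is very well made. @Bilibili Nenly同学

A Brief Discussion of Stable Diffusion (Part 3)

The third article in this series. If you have read the first two, you should already have some understanding of how Stable Diffusion works, its strengths and weaknesses, and even where it may be headed. Stable Diffusion has by now become highly commercial (practical), so what remains for us with Stable Diffusion…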

https://zhuanlan.zhihu.com/p/617026822

Illustrated written guide (a comprehensive overview of SD; read selectively)

Models:

The largest and most complete model site:

Civitai | Stable Diffusion models, embeddings, LoRAs and more

Civitai is a platform for Stable Diffusion AI Art models. Browse a collection of thousands of models from a growing number of creators. Join an engaged community in reviewing models and sharing images with prompts to get you started.

https://civitai.com/

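A proxy or VPN is needed to reach it from mainland China; NSFW warning.

A model site run by Chinese developers:

LiblibAI: China's largest community for sharing original AI models

https://www.liblibai.com/#/

Most of the commonly used models should be available here.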

DESAI

http://www.i-desai.com/#/

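Free Midjourney (MJ) and SD; in short, it is excellent.

↘ What kinds of models are there? [SD principles in 5 minutes: how prompt text makes its way into the image]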

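| Model | Required? | What it does | How to use | Install folder |
| --- | --- | --- | --- | --- |
| Checkpoint / Dreambooth (base model / large model) `.ckpt` `.safetensors` | Required | Checkpoint: the model parameters saved during training, used to restore or transfer a model (for safety reasons the safetensors format is now the usual choice) | Read the author's description! It almost always lists suitable parameters and companion models | `.\stable-diffusion-webui\models\Stable-diffusion` |
| VAE (variational autoencoder) `.pt` `.safetensors` | Required | Decodes latent variables into an image. In practice it shows up as richer colors or a more stable image structure (whether you need one depends on the base model's description) | ① Some base-model authors provide a VAE; ② some simply use one of the most common VAE models; ③ some need no separate VAE because the base model already includes one | `.\stable-diffusion-webui\models\VAE` |
| Embeddings / Textual Inversion (embedding model) `.pt` `.bin` | Optional | Teaches the base model new vocabulary for a specific concept from a few images that reflect it (pose, style, texture, etc.; internally the concept is represented as vectors) | Textual inversion: write the trigger word into the positive or negative prompt; see the model description for details | `.\stable-diffusion-webui\embeddings` |
| LoRA / LyCORIS `.pt` `.safetensors` | Optional | A method for fine-tuning the CLIP and U-Net weights (loading LyCORIS requires installing an extension) | Write it into the positive prompt (it cannot go in the negative prompt); some come with trigger words | `.\stable-diffusion-webui\models\Lora` |
| Hypernetworks `.pt` | Optional | Serves the same purpose as LoRA | Used the same way as LoRA; this kind of model is rarely used now | `.\stable-diffusion-webui\models\hypernetworks` |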
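
The file extensions and install folders in the table above can be turned into a small lookup helper. This is only a sketch: the type labels (`checkpoint`, `vae`, and so on) and the function name `install_path` are made up for illustration and are not part of the WebUI; the folders and extensions are copied from the table.

```python
from pathlib import PureWindowsPath

# Hypothetical labels -> (folder under the WebUI root, accepted extensions),
# both taken from the table above.
MODEL_TABLE = {
    "checkpoint": (r"models\Stable-diffusion", {".ckpt", ".safetensors"}),
    "vae": (r"models\VAE", {".pt", ".safetensors"}),
    "embedding": ("embeddings", {".pt", ".bin"}),
    "lora": (r"models\Lora", {".pt", ".safetensors"}),
    "hypernetwork": (r"models\hypernetworks", {".pt"}),
}


def install_path(webui_root, model_type, filename):
    """Return where a downloaded model file should be placed."""
    folder, extensions = MODEL_TABLE[model_type]
    suffix = PureWindowsPath(filename).suffix.lower()
    if suffix not in extensions:
        raise ValueError(f"{suffix!r} is not listed for {model_type!r} models")
    # PureWindowsPath only builds the path string; nothing is touched on disk.
    return PureWindowsPath(webui_root) / folder / filename


if __name__ == "__main__":
    print(install_path("stable-diffusion-webui", "lora", "style.safetensors"))
```

`PureWindowsPath` is used because the table gives Windows-style paths; on Linux or macOS the same folder names apply with forward slashes.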

Extensions:

`.\stable-diffusion-webui\extensions` is where every installed extension lives! Check whether you already have an extension before installing it.
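
The "check before installing" step can be scripted. The sketch below assumes that each installed extension is a subfolder of `extensions` (which is how extensions cloned from Git end up on disk); the function names are hypothetical and not part of the WebUI.

```python
from pathlib import Path


def installed_extensions(webui_root):
    """List the subfolder names under <webui_root>/extensions, sorted."""
    ext_dir = Path(webui_root) / "extensions"
    if not ext_dir.is_dir():
        return []
    # Assumption: one subfolder per extension; loose files are ignored.
    return sorted(p.name for p in ext_dir.iterdir() if p.is_dir())


def is_installed(webui_root, name):
    """Case-insensitive check for an extension folder called `name`."""
    wanted = name.lower()
    return any(n.lower() == wanted for n in installed_extensions(webui_root))


if __name__ == "__main__":
    for ext in installed_extensions("stable-diffusion-webui"):
        print(ext)
```

The folder name usually matches the extension's repository name, so compare against that rather than the display name shown in the UI.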

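| Basic extensions | Must-have and potentially useful extensions | How to use? |
| --- | --- | --- |
| Civitai model management [must-have] | | [How to tame a collection of too many, too messy models] Bilibili @插画师小光sir |
| Image control [must-have] | | ControlNet (CN): a large topic, and an important drawing capability that MJ currently cannot match |
| Recovering the prompt from an image | | [Stable Diffusion prompt study guide: tag interrogation] Bilibili @自带马赛克属性的阿尼 |
| Mixing LoRA models | | Does not merge the LoRAs; it only makes it easier to adjust each one's weight |
| Loading LyCORIS | | Installing it adds the support; no tutorial |
| Tag autocomplete | | The original author's documentation explains it clearly |
| Allowing a higher CFG Scale | | Mitigates the color breakdown that appears when CFG is too high |
| Low-VRAM generation or regional generation | | [Precise regional painting: the MultiDiffusion extension's region feature explained] Bilibili @筱旒 |
| UI theming | | [Kitchen: a tidier Stable Diffusion drawing UI] Bilibili @来真的 |

| Translation extensions | Choose carefully! These tend to cause problems | How to use? |
| --- | --- | --- |
| Prompt translation | | [Auto-translation ✓ multiple languages ✓ supports a1111-sd-webui-tagcomple] YouTube @番茄没有酱 |
| Prompt translation [bundled with the all-in-one package] | | [Use AI without knowing English: localization ✓ Chinese-English side-by-side translation ✓ Chinese tag autocomplete ✓ tag presets ✓] YouTube @番茄没有酱 |
| UI translation | | ↑↑↑ Same as above; use them together. I am not sure whether this extension is bundled with the all-in-one package |

| Advanced extensions | Extensions that take more study to master | How to use? |
| --- | --- | --- |
| Extra prompt syntax | | [Generate images in N styles with one click to work faster] YouTube @番茄没有酱 |
| Per-layer LoRA control | | [Advanced LoRA usage: per-layer control] Bilibili @大江户战士 |
| LoRA merging | | [Using per-layer LoRA control to merge art styles efficiently and purposefully] Bilibili @大江户战士 |
| Regional image generation | | [Draw the couple or multi-character image you want] Bilibili @大江户战士 |
| Different LoRAs for different prompts | | ↑↑↑ Same as above; use them together |
| Detecting hands and faces for inpainting | | (Face and hand repair) YouTube @番茄没有酱 |
| StableSR upscaling | | [StableSR: ultra-clear lossless image upscaling] YouTube @番茄没有酱 |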

Prompts ("spells"):

A quick prompt-building site for portraits:

https://tag.redsex.cc/

Danbooru Tag Supermarket

A website for building Danbooru tag combinations.

https://tags.novelai.dev/

The presets section is quite handy (it includes some Textual Inversion entries).

PromptHero - Search prompts for Stable Diffusion, ChatGPT & Midjourney

The #1 website for Artificial Intelligence and Prompt Engineering. Search the world's best AI prompts for models like Stable Diffusion, ChatGPT, Midjourney...

https://prompthero.com/

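It has prompts for both MJ and SD.

Compared with Midjourney, there are very few prompt sites dedicated to SD; SD works are mostly published on the model sites listed above. If you want to generate portraits, sites like the ones above will do. For anything else, I suggest looking for MJ prompt sites instead (there really are a lot of them).

Other

GPT-assisted prompt generation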

You are a master artist, well-versed in artistic terminology, with a vast vocabulary for describing visually the things that you see. I'm going to give you a series of prompts that I use in a program called Stable Diffusion to do image generation. I would like you to analyze the styles of these prompts, the sentence structures, how they're laid out, and the common patterns between all of them.

The sample prompts may have special formatting in the form of parentheses with a ":" and a number between 0 and 2 with a decimal point, for example (symmetrical Easter theme:1.3) or (cinematic:1.4) or (symmetrical:1.2). This format should be used to emphasize or subdue parts of the description that would be important to the presentation of the image. A value above 1 emphasizes whatever is in the parentheses, and a value below 1 de-emphasizes whatever is in the parentheses. Do not use the word "emphasis" when utilizing this format.

You will be asked to generate prompts based on your analysis. I want you to take note that the image generation software pays more attention to what's at the beginning of the prompt, and that attention declines the closer you get to the end of the positive prompt. The formatting may look something like this: [main character or focus of the image], [lighting style, shot style, and color], [other aesthetics like mood, emotion, peripheral subject matter], [possible artists or art styles]

Prompting: Order matters - words near the front of your prompt are weighted more heavily than the things at the end of your prompt.

Most prompts should have the following at the beginning: "((best quality)), ((masterpiece)), (detailed), " When using an artist name or artist reference, always use the format " in the style of [artist name]", for example " in the style of Greg Rutkowski" or " in the styles of Michelangelo and Leonardo da Vinci"

[Example low, medium and high quality prompts]:
[low detail prompt]: portrait of a futuristic beautiful woman and a futuristic city, deep vivid colors
[medium detail prompt]: modern alchemists lab drawn with watercolor pencil, magic potions, remedies, vials and concoctions, vivid colors, bubbling, bursting, full of life, organic, oily, bones, dark, scary
[High detailed, high quality prompt]: (best quality:1.4), (masterpiece:1.4), (detailed:1.3), 8K, portrait, elven queen in a lush forest, shimmering golden gown, jeweled tiara, enigmatic smile, surrounded by ethereal glowing fauna, (magic-infused:1.4), vivid colors, intricate foliage patterns, chiaroscuro lighting, (Pre-Raphaelite art style:1.2)

[Sample positive prompts to analyze]: put your sample prompts here

You will be asked to generate prompts based on the instructions above. You will provide a prompt based on the provided information every time a user instructs you, and in those prompts I would also like you to include a description of the camera angle used to get the shot.

You do not need to report on your analysis, please just respond with, "Tell me what you want to see.".
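
The `(text:weight)` format and the front-loaded ordering described in the prompt above can be sketched in a few lines of Python. This is an illustration only: the helper names are hypothetical, and the regular expression handles just the simple, non-nested `(text:number)` form, not the full prompt grammar of any particular UI.

```python
import re

# Matches simple, non-nested "(some text:1.3)" groups.
WEIGHT_RE = re.compile(r"\(([^():]+):(\d+(?:\.\d+)?)\)")

QUALITY_PREFIX = ["((best quality))", "((masterpiece))", "(detailed)"]


def weighted(text, weight):
    """Wrap text as (text:weight); >1 emphasizes, <1 de-emphasizes."""
    if not 0 <= weight <= 2:
        raise ValueError("weight is expected to be between 0 and 2")
    return f"({text}:{weight})"


def build_prompt(subject, lighting="", aesthetics="", style=""):
    """Join parts in the suggested order, most important first."""
    parts = QUALITY_PREFIX + [subject, lighting, aesthetics, style]
    return ", ".join(p for p in parts if p)


def parse_weights(prompt):
    """Return (text, weight) pairs for every (text:weight) group found."""
    return [(m.group(1), float(m.group(2))) for m in WEIGHT_RE.finditer(prompt)]


if __name__ == "__main__":
    prompt = build_prompt(
        "elven queen in a lush forest",
        lighting="chiaroscuro lighting",
        style=weighted("Pre-Raphaelite art style", 1.2),
    )
    print(prompt)
    print(parse_weights(prompt))
```

Because earlier words receive more attention, `build_prompt` keeps the subject directly after the quality tags and places style references last, mirroring the bracketed layout in the prompt template.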