Waifu Diffusion
Waifu Diffusion is a project that fine-tunes Stable Diffusion on images and captions downloaded from Danbooru.
(Note: This project has no affiliation with Danbooru.)
Prompt: touhou 1girl komeiji_koishi portrait
Documentation
All thanks go to CompVis and Stability AI for releasing this codebase!
Model Link: https://huggingface.co/hakurei/waifu-diffusion
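As a rough illustration of how the sample prompt and the model link fit together, here is a minimal sketch that joins Danbooru-style tags into a prompt and, optionally, generates one image. It assumes the `diffusers` and `torch` packages are installed and that the Hugging Face repository can be loaded with `StableDiffusionPipeline`. The names `build_prompt`, `generate`, `MODEL_ID`, and the output path `sample.png` are hypothetical helpers for this example, not part of this repository.

```python
from typing import Iterable

# Repository name taken from the model link above.
MODEL_ID = "hakurei/waifu-diffusion"


def build_prompt(tags: Iterable[str]) -> str:
    """Join Danbooru-style tags into one space-separated prompt (hypothetical helper)."""
    return " ".join(tag.strip() for tag in tags if tag.strip())


def generate(prompt: str, out_path: str = "sample.png") -> str:
    """Generate one image for `prompt` and save it to `out_path`.

    Assumption: `diffusers` and `torch` are installed and the weights can be
    downloaded from the Hugging Face Hub. Not called automatically below.
    """
    # Imported lazily so build_prompt works without the heavy dependencies.
    import torch
    from diffusers import StableDiffusionPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    pipe = StableDiffusionPipeline.from_pretrained(MODEL_ID).to(device)
    image = pipe(prompt).images[0]
    image.save(out_path)
    return out_path


# Build the sample prompt shown above; call generate(prompt) to render it.
prompt = build_prompt(["touhou", "1girl", "komeiji_koishi", "portrait"])
print(prompt)
```

Calling `generate(prompt)` downloads the weights on first use, so it is left out of the default run; a GPU is strongly recommended for reasonable generation times.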
Any questions? Come hop on by to our Discord server!
Stable Diffusion
Stable Diffusion was made possible thanks to a collaboration with Stability AI and Runway and builds upon our previous work, High-Resolution Image Synthesis with Latent Diffusion Models (arXiv:2112.10752).
Comments
- Our codebase for the diffusion models builds heavily on OpenAI's ADM codebase and https://github.com/lucidrains/denoising-diffusion-pytorch. Thanks for open-sourcing!
- The implementation of the transformer encoder is from x-transformers by lucidrains.
BibTeX
@misc{rombach2021highresolution,
  title={High-Resolution Image Synthesis with Latent Diffusion Models},
  author={Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer},
  year={2021},
  eprint={2112.10752},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}