General fine tuning for Stable Diffusion
Go to file
Xavier 3f909b58d5
Update README.md
2022-09-06 00:13:19 -07:00
configs initial commit 2022-09-06 00:00:21 -07:00
evaluation initial commit 2022-09-06 00:00:21 -07:00
img initial commit 2022-09-06 00:00:21 -07:00
ldm initial commit 2022-09-06 00:00:21 -07:00
models initial commit 2022-09-06 00:00:21 -07:00
scripts initial commit 2022-09-06 00:00:21 -07:00
LICENSE initial commit 2022-09-06 00:00:21 -07:00
README.md Update README.md 2022-09-06 00:13:19 -07:00
environment.yaml initial commit 2022-09-06 00:00:21 -07:00
main.py initial commit 2022-09-06 00:00:21 -07:00
merge_embeddings.py initial commit 2022-09-06 00:00:21 -07:00
setup.py initial commit 2022-09-06 00:00:21 -07:00

README.md

Dreambooth on Stable Diffusion

This is an implementtaion of Google's Dreambooth with Stable Diffusion. The original Dreambooth is based on Imagen text-to-image model. However, neither the model nor the pre-trained weights of Imagen is available. To enable people to fine-tune a text-to-image model with a few examples, I implemented the idea of Dreambooth on Stable diffusion.

This code repository is based on that of Textual Inversion. Note that Textual Inversion only optimizes word ebedding, while dreambooth fine-tunes the whole diffusion model.

The implementation makes minimum changes over the official codebase of Textual Inversion, and in fact some components in Textual Inversion, such as the embedding manager, are not deleted, although they will never be used here.

Usage