EveryDream2trainer/doc/BASEMODELS.md

# Prepare base models

In order to train, you need a base model on which to train.  

Make sure the trainer is installed properly first. See [SETUP.md](SETUP.md) for more details.

You can either [download one manually](#manual-download), or alternatively EveryDream2 can [automatically download](#automatic-download) a model from the Hugging Face hub for you.

It's strongly suggested to simple use automatic download.  The quick start here is you can simply use `resume_ckpt` of `stabilityai/stable-diffusion-2-1` to make a SD2.1 768 model, or `panopstor/EveryDream` for SD1.5 models.  If you are ok with the base SD1.5 and SD2.1 models, you can use those values and skip reading this document.

See below for more details.

## Manual download

First you have to download a `.ckpt` or `.safetensors` file for the base model, then you need to convert it to a "diffusers format" folder.  When you finish you should see something like this, come back to reference this picture as you go through the steps below:

![models](ckptcache.png) *(this picture is just an EXAMPLE)*

### Converting to 🧨diffusers format

Run these commands *one time* to prepare them. **It's very important to use the correct YAML!**

For SD1.x models, use this (note it will spill a lot of warnings to the console, but its fine):

    python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^
    --original_config_file v1-inference.yaml ^
    --image_size 512 ^
    --checkpoint_path my_sd15_model.ckpt ^
    --to_safetensors ^
    --prediction_type epsilon ^
    --dump_path "ckpt_cache/my_sd15_model"

...where `my_sd15_model.ckpt` is the filename you want to convert to prepare for training and `my_sd15_model` is all you need to set `resume_ckpt` to in your config file.

Almost the same exact thing for safetensors except one extra `from_safetensors` argument and make sure the file extension is `.safetensors`:

    python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^
    --original_config_file v1-inference.yaml ^
    --image_size 512 ^
    --checkpoint_path my_sd15_model.safetensors ^
    --from_safetensors ^
    --to_safetensors ^
    --prediction_type epsilon ^
    --dump_path "ckpt_cache/my_sd15_model"

And for any SD2.1 768 models (uses v2-v yaml and "v_prediction" prediction type):

    python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^
    --original_config_file v2-inference-v.yaml ^
    --image_size 768 ^
    --checkpoint_path my_sd21_model.ckpt ^
    --prediction_type v_prediction ^
    --upcast_attention True ^
    --dump_path "ckpt_cache/my_sd21_model"

Note the `v2-inference-v.yaml` and `v_prediction`.  This is because the SD2.1 768 model uses a different yaml and prediction type than the SD1.X models. 

And finally the SD2.0 512 base model (generally not recommended base model, no one tunes this either):

    python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^
    --original_config_file v2-inference.yaml ^
    --image_size 512 ^
    --checkpoint_path 512-base-ema.ckpt ^
    --prediction_type epsilon ^
    --upcast_attn True ^
    --dump_path "ckpt_cache/512-base-ema"

If you have other models, you need to know the base model that was used for them, **in particular use the correct yaml (original_config_file) or it will not properly convert.** Make sure to put some sort of name in the dump_path after "ckpt_cache/" so you can reference it later.

All of the above is one time.  After running, you will use `resume_ckpt` and just name the file without "ckpt_cache/"

ex.
```
train.json
{
    ...
    "resume_ckpt": "my_sd15_model",
    ...
}
```

or using the CLI arg:

    python train.py --resume_ckpt "my_sd21_model" ...


## Automatic download

If you don't want the hassle of downloading and converting ckpt files, you can pass a Hugging Face "repo id" for `resume-ckpt` and the model will be automatically downloaded from Huggingface if it exists.

For example, to use [Stable Diffusion 2.1](https://huggingface.co/stabilityai/stable-diffusion-2-1), you can pass the repo id `stabilityai/stable-diffusion-2-1` for `resume_ckpt`:

```
train.json
{
    ...
    "resume_ckpt": "stabilityai/stable-diffusion-2-1",
    ...
}
```

or with the CLI arg:

    python train.py --resume_ckpt stabilityai/stable-diffusion-2-1 ...

You can use any model on Huggingface which is saved in 🧨diffusers format, which is the vast majority of models on [this list](https://huggingface.co/models?pipeline_tag=text-to-image&sort=downloads). For example, to resume training using [nitrosocke's Modern Disney model](https://huggingface.co/nitrosocke/mo-di-diffusion) as a starting point, use `nitrosocke/mo-di-diffusion` as the repo id:

    python train.py --resume_ckpt nitrosocke/mo-di-diffusion ...

You can check if a 🧨diffusers format model is available by checking [the "Files" tab](https://huggingface.co/nitrosocke/mo-di-diffusion/tree/main) for a bunch of folders with names like `feature_extractor`, `unet`, and `vae`.

### Hugging Face login

If the model requires you to sign a license agreement (rare), you may need to login to the Hugging Face hub before downloads will work. You can do this by running the following command in the terminal window before you start training:
   
    huggingface-cli login

When prompted, paste a [Hugging Face User Access Token](https://huggingface.co/settings/tokens) into the terminal window (you may not see anything appear to show that you've pasted something), and then press Enter.  

> To get an Access Token you'll need to [create a Hugging Face account](https://huggingface.co/join) if you don't have one already. Login to your account and click `New token` on [your User Access Tokens page](https://huggingface.co/settings/tokens) to create an Access Token that you can then copy and paste into the terminal window.

> Note that on Windows you may have to right-click the terminal window -> Paste, rather than just using ctrl-V. You also may not see anything appear in the terminal to indicate that you've pasted something - just press Enter anyway. If downloading doesn't work after setting a token, double-check you have agreed to the license agreement and try running `huggingface-cli login` again. 

**Alternatively**, you can set the environment variable `HF_API_TOKEN` to your Access Token. On Windows:

    set HF_API_TOKEN=<token>

On Linux:

    export HF_API_TOKEN=<token>

Replace `<token>` with the Access Token you got from [your Hugging Face User Access Tokens page](https://huggingface.co/settings/tokens)

### Where are the files?

By default the downloaded Hugging Face files are stored in the Hugging Face cache folder. On Windows this is at `C:\Users\username\.cache\huggingface\hub`. On Linux it is at `~/.cache/huggingface/hub`. 

You can set the environment variable `HUGGINGFACE_HUB_CACHE` to change this. Eg, to put the cache on `Z:\stable-diffusion-big-files\huggingface-hub-cache` (on Windows):

    set HUGGINGFACE_HUB_CACHE=Z:\stable-diffusion-big-files\huggingface-hub-cache\

Make sure to do this before running `train.py`.

### Downloading from a subfolder

If the model you want to download is not stored in the root folder under the huggingface repo id (rare), you can pass `--hf_repo_subfolder` to set the subfolder where it should be downloaded from.
update conv docs 2023-07-06 21:47:07 -06:00			`# Prepare base models`
hey look ed2 2022-12-17 20:32:48 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			`In order to train, you need a base model on which to train.`
hey look ed2 2022-12-17 20:32:48 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			`Make sure the trainer is installed properly first. See [SETUP.md](SETUP.md) for more details.`
hey look ed2 2022-12-17 20:32:48 -07:00
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00			`You can either [download one manually](#manual-download), or alternatively EveryDream2 can [automatically download](#automatic-download) a model from the Hugging Face hub for you.`

update conv docs 2023-07-06 21:47:07 -06:00			It's strongly suggested to simple use automatic download. The quick start here is you can simply use `resume_ckpt` of `stabilityai/stable-diffusion-2-1` to make a SD2.1 768 model, or `panopstor/EveryDream` for SD1.5 models. If you are ok with the base SD1.5 and SD2.1 models, you can use those values and skip reading this document.
hey look ed2 2022-12-17 20:32:48 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			`See below for more details.`
hey look ed2 2022-12-17 20:32:48 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			`## Manual download`
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			First you have to download a `.ckpt` or `.safetensors` file for the base model, then you need to convert it to a "diffusers format" folder. When you finish you should see something like this, come back to reference this picture as you go through the steps below:
hey look ed2 2022-12-17 20:32:48 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			`![models](ckptcache.png) (this picture is just an EXAMPLE)`
hey look ed2 2022-12-17 20:32:48 -07:00
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00			`### Converting to 🧨diffusers format`

hey look ed2 2022-12-17 20:32:48 -07:00			`Run these commands one time to prepare them. It's very important to use the correct YAML!`

			`For SD1.x models, use this (note it will spill a lot of warnings to the console, but its fine):`

			`python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^`
			`--original_config_file v1-inference.yaml ^`
			`--image_size 512 ^`
update conv docs 2023-07-06 21:47:07 -06:00			`--checkpoint_path my_sd15_model.ckpt ^`
			`--to_safetensors ^`
			`--prediction_type epsilon ^`
			`--dump_path "ckpt_cache/my_sd15_model"`

			...where `my_sd15_model.ckpt` is the filename you want to convert to prepare for training and `my_sd15_model` is all you need to set `resume_ckpt` to in your config file.

			Almost the same exact thing for safetensors except one extra `from_safetensors` argument and make sure the file extension is `.safetensors`:

			`python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^`
			`--original_config_file v1-inference.yaml ^`
			`--image_size 512 ^`
			`--checkpoint_path my_sd15_model.safetensors ^`
			`--from_safetensors ^`
			`--to_safetensors ^`
hey look ed2 2022-12-17 20:32:48 -07:00			`--prediction_type epsilon ^`
update conv docs 2023-07-06 21:47:07 -06:00			`--dump_path "ckpt_cache/my_sd15_model"`
hey look ed2 2022-12-17 20:32:48 -07:00
update conv docs 2023-07-06 21:47:07 -06:00			`And for any SD2.1 768 models (uses v2-v yaml and "v_prediction" prediction type):`
hey look ed2 2022-12-17 20:32:48 -07:00
			`python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^`
			`--original_config_file v2-inference-v.yaml ^`
			`--image_size 768 ^`
update conv docs 2023-07-06 21:47:07 -06:00			`--checkpoint_path my_sd21_model.ckpt ^`
hey look ed2 2022-12-17 20:32:48 -07:00			`--prediction_type v_prediction ^`
update conv docs 2023-07-06 21:47:07 -06:00			`--upcast_attention True ^`
			`--dump_path "ckpt_cache/my_sd21_model"`

			Note the `v2-inference-v.yaml` and `v_prediction`. This is because the SD2.1 768 model uses a different yaml and prediction type than the SD1.X models.
hey look ed2 2022-12-17 20:32:48 -07:00
clarify convert doc 2023-07-06 23:11:59 -06:00			`And finally the SD2.0 512 base model (generally not recommended base model, no one tunes this either):`
hey look ed2 2022-12-17 20:32:48 -07:00
			`python utils/convert_original_stable_diffusion_to_diffusers.py --scheduler_type ddim ^`
			`--original_config_file v2-inference.yaml ^`
			`--image_size 512 ^`
			`--checkpoint_path 512-base-ema.ckpt ^`
			`--prediction_type epsilon ^`
Update BASEMODELS.md upcast attn for sd2 conv, not critical but technically correct 2023-02-14 10:41:50 -07:00			`--upcast_attn True ^`
hey look ed2 2022-12-17 20:32:48 -07:00			`--dump_path "ckpt_cache/512-base-ema"`

			`If you have other models, you need to know the base model that was used for them, in particular use the correct yaml (original_config_file) or it will not properly convert. Make sure to put some sort of name in the dump_path after "ckpt_cache/" so you can reference it later.`

clarify convert doc 2023-07-06 23:11:59 -06:00			All of the above is one time. After running, you will use `resume_ckpt` and just name the file without "ckpt_cache/"
hey look ed2 2022-12-17 20:32:48 -07:00
			`ex.`
clarify convert doc 2023-07-06 23:11:59 -06:00			```
			`train.json`
			`{`
			`...`
			`"resume_ckpt": "my_sd15_model",`
			`...`
			`}`
			```

			`or using the CLI arg:`

			`python train.py --resume_ckpt "my_sd21_model" ...`
hey look ed2 2022-12-17 20:32:48 -07:00

document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00			`## Automatic download`

moar docs 2023-07-06 23:16:02 -06:00			If you don't want the hassle of downloading and converting ckpt files, you can pass a Hugging Face "repo id" for `resume-ckpt` and the model will be automatically downloaded from Huggingface if it exists.
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00
moar docs 2023-07-06 23:16:02 -06:00			For example, to use [Stable Diffusion 2.1](https://huggingface.co/stabilityai/stable-diffusion-2-1), you can pass the repo id `stabilityai/stable-diffusion-2-1` for `resume_ckpt`:

			```
			`train.json`
			`{`
			`...`
			`"resume_ckpt": "stabilityai/stable-diffusion-2-1",`
			`...`
			`}`
			```

			`or with the CLI arg:`
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00
			`python train.py --resume_ckpt stabilityai/stable-diffusion-2-1 ...`

			You can use any model on Huggingface which is saved in 🧨diffusers format, which is the vast majority of models on [this list](https://huggingface.co/models?pipeline_tag=text-to-image&sort=downloads). For example, to resume training using [nitrosocke's Modern Disney model](https://huggingface.co/nitrosocke/mo-di-diffusion) as a starting point, use `nitrosocke/mo-di-diffusion` as the repo id:

			`python train.py --resume_ckpt nitrosocke/mo-di-diffusion ...`

			You can check if a 🧨diffusers format model is available by checking [the "Files" tab](https://huggingface.co/nitrosocke/mo-di-diffusion/tree/main) for a bunch of folders with names like `feature_extractor`, `unet`, and `vae`.

			`### Hugging Face login`

moar docs 2023-07-06 23:16:02 -06:00			`If the model requires you to sign a license agreement (rare), you may need to login to the Hugging Face hub before downloads will work. You can do this by running the following command in the terminal window before you start training:`
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00
			`huggingface-cli login`

			`When prompted, paste a [Hugging Face User Access Token](https://huggingface.co/settings/tokens) into the terminal window (you may not see anything appear to show that you've pasted something), and then press Enter.`

			> To get an Access Token you'll need to [create a Hugging Face account](https://huggingface.co/join) if you don't have one already. Login to your account and click `New token` on [your User Access Tokens page](https://huggingface.co/settings/tokens) to create an Access Token that you can then copy and paste into the terminal window.

			> Note that on Windows you may have to right-click the terminal window -> Paste, rather than just using ctrl-V. You also may not see anything appear in the terminal to indicate that you've pasted something - just press Enter anyway. If downloading doesn't work after setting a token, double-check you have agreed to the license agreement and try running `huggingface-cli login` again.

			Alternatively, you can set the environment variable `HF_API_TOKEN` to your Access Token. On Windows:

			`set HF_API_TOKEN=<token>`

			`On Linux:`

			`export HF_API_TOKEN=<token>`

			Replace `<token>` with the Access Token you got from [your Hugging Face User Access Tokens page](https://huggingface.co/settings/tokens)

			`### Where are the files?`

moar docs 2023-07-06 23:16:02 -06:00			By default the downloaded Hugging Face files are stored in the Hugging Face cache folder. On Windows this is at `C:\Users\username\.cache\huggingface\hub`. On Linux it is at `~/.cache/huggingface/hub`.
document the huggingface automatic downloader 2023-01-24 02:22:52 -07:00
			You can set the environment variable `HUGGINGFACE_HUB_CACHE` to change this. Eg, to put the cache on `Z:\stable-diffusion-big-files\huggingface-hub-cache` (on Windows):

			`set HUGGINGFACE_HUB_CACHE=Z:\stable-diffusion-big-files\huggingface-hub-cache\`

			Make sure to do this before running `train.py`.

			`### Downloading from a subfolder`

moar docs 2023-07-06 23:16:02 -06:00			If the model you want to download is not stored in the root folder under the huggingface repo id (rare), you can pass `--hf_repo_subfolder` to set the subfolder where it should be downloaded from.