diffusers/docs/source/training/unconditional_training.mdx

<!--Copyright 2022 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->

# Unconditional Image-Generation

In this section, we explain how one can train an unconditional image generation diffusion 
model. "Unconditional" because the model is not conditioned on any context to generate an image - once trained the model will simply generate images that resemble its training data 
distribution.

## Installing the dependencies

Before running the scripts, make sure to install the library's training dependencies:

```bash
pip install diffusers[training] accelerate datasets
```

And initialize an [🤗Accelerate](https://github.com/huggingface/accelerate/) environment with:

```bash
accelerate config
```

## Unconditional Flowers  

The command to train a DDPM UNet model on the Oxford Flowers dataset:

```bash
accelerate launch train_unconditional.py \
  --dataset_name="huggan/flowers-102-categories" \
  --resolution=64 \
  --output_dir="ddpm-ema-flowers-64" \
  --train_batch_size=16 \
  --num_epochs=100 \
  --gradient_accumulation_steps=1 \
  --learning_rate=1e-4 \
  --lr_warmup_steps=500 \
  --mixed_precision=no \
  --push_to_hub
```
An example trained model: https://huggingface.co/anton-l/ddpm-ema-flowers-64

A full training run takes 2 hours on 4xV100 GPUs.

<img src="https://user-images.githubusercontent.com/26864830/180248660-a0b143d0-b89a-42c5-8656-2ebf6ece7e52.png" width="700" />

## Unconditional Pokemon 

The command to train a DDPM UNet model on the Pokemon dataset:

```bash
accelerate launch train_unconditional.py \
  --dataset_name="huggan/pokemon" \
  --resolution=64 \
  --output_dir="ddpm-ema-pokemon-64" \
  --train_batch_size=16 \
  --num_epochs=100 \
  --gradient_accumulation_steps=1 \
  --learning_rate=1e-4 \
  --lr_warmup_steps=500 \
  --mixed_precision=no \
  --push_to_hub
```
An example trained model: https://huggingface.co/anton-l/ddpm-ema-pokemon-64

A full training run takes 2 hours on 4xV100 GPUs.

<img src="https://user-images.githubusercontent.com/26864830/180248200-928953b4-db38-48db-b0c6-8b740fe6786f.png" width="700" />


## Using your own data

To use your own dataset, there are 2 ways:
- you can either provide your own folder as `--train_data_dir`
- or you can upload your dataset to the hub (possibly as a private repo, if you prefer so), and simply pass the `--dataset_name` argument.

**Note**: If you want to create your own training dataset please have a look at [this document](https://huggingface.co/docs/datasets/image_process#image-datasets).

Below, we explain both in more detail.

### Provide the dataset as a folder

If you provide your own folders with images, the script expects the following directory structure:

```bash
data_dir/xxx.png
data_dir/xxy.png
data_dir/[...]/xxz.png
```

In other words, the script will take care of gathering all images inside the folder. You can then run the script like this:

```bash
accelerate launch train_unconditional.py \
    --train_data_dir <path-to-train-directory> \
    <other-arguments>
```

Internally, the script will use the [`ImageFolder`](https://huggingface.co/docs/datasets/v2.0.0/en/image_process#imagefolder) feature which will automatically turn the folders into 🤗 Dataset objects.

### Upload your data to the hub, as a (possibly private) repo

It's very easy (and convenient) to upload your image dataset to the hub using the [`ImageFolder`](https://huggingface.co/docs/datasets/v2.0.0/en/image_process#imagefolder) feature available in 🤗 Datasets. Simply do the following:

```python
from datasets import load_dataset

# example 1: local folder
dataset = load_dataset("imagefolder", data_dir="path_to_your_folder")

# example 2: local files (supported formats are tar, gzip, zip, xz, rar, zstd)
dataset = load_dataset("imagefolder", data_files="path_to_zip_file")

# example 3: remote files (supported formats are tar, gzip, zip, xz, rar, zstd)
dataset = load_dataset(
    "imagefolder",
    data_files="https://download.microsoft.com/download/3/E/1/3E1C3F21-ECDB-4869-8368-6DEBA77B919F/kagglecatsanddogs_3367a.zip",
)

# example 4: providing several splits
dataset = load_dataset(
    "imagefolder", data_files={"train": ["path/to/file1", "path/to/file2"], "test": ["path/to/file3", "path/to/file4"]}
)
```

`ImageFolder` will create an `image` column containing the PIL-encoded images.

Next, push it to the hub!

```python
# assuming you have ran the huggingface-cli login command in a terminal
dataset.push_to_hub("name_of_your_dataset")

# if you want to push to a private repo, simply pass private=True:
dataset.push_to_hub("name_of_your_dataset", private=True)
```

and that's it! You can now train your model by simply setting the `--dataset_name` argument to the name of your dataset on the hub.

More on this can also be found in [this blog post](https://huggingface.co/blog/image-search-datasets).
Docs (#45) * first pass at docs structure * minor reformatting, add github actions for docs * populate docs (primarily from README, some writing) 2022-07-13 09:42:05 -06:00			`<!--Copyright 2022 The HuggingFace Team. All rights reserved.`

			`Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with`
			`the License. You may obtain a copy of the License at`

			`http://www.apache.org/licenses/LICENSE-2.0`

			`Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on`
			`an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the`
			`specific language governing permissions and limitations under the License.`
			`-->`

[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`# Unconditional Image-Generation`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`In this section, we explain how one can train an unconditional image generation diffusion`
			`model. "Unconditional" because the model is not conditioned on any context to generate an image - once trained the model will simply generate images that resemble its training data`
			`distribution.`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`## Installing the dependencies`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
Fix typos and add Typo check GitHub Action (#483) * Fix typos * Add a typo check action * Fix a bug * Changed to manual typo check currently Ref: https://github.com/huggingface/diffusers/pull/483#pullrequestreview-1104468010 Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Removed a confusing message * Renamed "nin_shortcut" to "in_shortcut" * Add memo about NIN Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> 2022-09-16 07:36:51 -06:00			`Before running the scripts, make sure to install the library's training dependencies:`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			```bash
			`pip install diffusers[training] accelerate datasets`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00			```
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00
			`And initialize an [🤗Accelerate](https://github.com/huggingface/accelerate/) environment with:`

			```bash
			`accelerate config`
			```

			`## Unconditional Flowers`

			`The command to train a DDPM UNet model on the Oxford Flowers dataset:`

			```bash
			`accelerate launch train_unconditional.py \`
			`--dataset_name="huggan/flowers-102-categories" \`
			`--resolution=64 \`
			`--output_dir="ddpm-ema-flowers-64" \`
			`--train_batch_size=16 \`
			`--num_epochs=100 \`
			`--gradient_accumulation_steps=1 \`
			`--learning_rate=1e-4 \`
			`--lr_warmup_steps=500 \`
			`--mixed_precision=no \`
			`--push_to_hub`
			```
			`An example trained model: https://huggingface.co/anton-l/ddpm-ema-flowers-64`

			`A full training run takes 2 hours on 4xV100 GPUs.`

			`<img src="https://user-images.githubusercontent.com/26864830/180248660-a0b143d0-b89a-42c5-8656-2ebf6ece7e52.png" width="700" />`

			`## Unconditional Pokemon`

			`The command to train a DDPM UNet model on the Pokemon dataset:`

			```bash
			`accelerate launch train_unconditional.py \`
			`--dataset_name="huggan/pokemon" \`
			`--resolution=64 \`
			`--output_dir="ddpm-ema-pokemon-64" \`
			`--train_batch_size=16 \`
			`--num_epochs=100 \`
			`--gradient_accumulation_steps=1 \`
			`--learning_rate=1e-4 \`
			`--lr_warmup_steps=500 \`
			`--mixed_precision=no \`
			`--push_to_hub`
			```
			`An example trained model: https://huggingface.co/anton-l/ddpm-ema-pokemon-64`

			`A full training run takes 2 hours on 4xV100 GPUs.`

			`<img src="https://user-images.githubusercontent.com/26864830/180248200-928953b4-db38-48db-b0c6-8b740fe6786f.png" width="700" />`


			`## Using your own data`

			`To use your own dataset, there are 2 ways:`
			- you can either provide your own folder as `--train_data_dir`
			- or you can upload your dataset to the hub (possibly as a private repo, if you prefer so), and simply pass the `--dataset_name` argument.

			`Note: If you want to create your own training dataset please have a look at [this document](https://huggingface.co/docs/datasets/image_process#image-datasets).`

			`Below, we explain both in more detail.`

			`### Provide the dataset as a folder`

			`If you provide your own folders with images, the script expects the following directory structure:`

			```bash
			`data_dir/xxx.png`
			`data_dir/xxy.png`
			`data_dir/[...]/xxz.png`
			```

			`In other words, the script will take care of gathering all images inside the folder. You can then run the script like this:`

			```bash
			`accelerate launch train_unconditional.py \`
			`--train_data_dir <path-to-train-directory> \`
			`<other-arguments>`
			```

			Internally, the script will use the [`ImageFolder`](https://huggingface.co/docs/datasets/v2.0.0/en/image_process#imagefolder) feature which will automatically turn the folders into 🤗 Dataset objects.

			`### Upload your data to the hub, as a (possibly private) repo`

			It's very easy (and convenient) to upload your image dataset to the hub using the [`ImageFolder`](https://huggingface.co/docs/datasets/v2.0.0/en/image_process#imagefolder) feature available in 🤗 Datasets. Simply do the following:

			```python
			`from datasets import load_dataset`

			`# example 1: local folder`
			`dataset = load_dataset("imagefolder", data_dir="path_to_your_folder")`

Fix typos and add Typo check GitHub Action (#483) * Fix typos * Add a typo check action * Fix a bug * Changed to manual typo check currently Ref: https://github.com/huggingface/diffusers/pull/483#pullrequestreview-1104468010 Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Removed a confusing message * Renamed "nin_shortcut" to "in_shortcut" * Add memo about NIN Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> 2022-09-16 07:36:51 -06:00			`# example 2: local files (supported formats are tar, gzip, zip, xz, rar, zstd)`
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`dataset = load_dataset("imagefolder", data_files="path_to_zip_file")`

			`# example 3: remote files (supported formats are tar, gzip, zip, xz, rar, zstd)`
			`dataset = load_dataset(`
			`"imagefolder",`
			`data_files="https://download.microsoft.com/download/3/E/1/3E1C3F21-ECDB-4869-8368-6DEBA77B919F/kagglecatsanddogs_3367a.zip",`
			`)`

			`# example 4: providing several splits`
			`dataset = load_dataset(`
			`"imagefolder", data_files={"train": ["path/to/file1", "path/to/file2"], "test": ["path/to/file3", "path/to/file4"]}`
			`)`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00			```

[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`ImageFolder` will create an `image` column containing the PIL-encoded images.
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`Next, push it to the hub!`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			```python
			`# assuming you have ran the huggingface-cli login command in a terminal`
			`dataset.push_to_hub("name_of_your_dataset")`
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`# if you want to push to a private repo, simply pass private=True:`
			`dataset.push_to_hub("name_of_your_dataset", private=True)`
			```
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			and that's it! You can now train your model by simply setting the `--dataset_name` argument to the name of your dataset on the hub.
[Docs] Let's go (#385) 2022-09-07 03:31:13 -06:00
[Docs] Training docs (#415) finish training docs 2022-09-08 02:17:37 -06:00			`More on this can also be found in [this blog post](https://huggingface.co/blog/image-search-datasets).`