Overview

These examples show how to run Diffuser in Diffusers using the script run_diffuser_locomotion.py. There are two ways to use the script, selected by the variable n_guide_steps.

When n_guide_steps=0, trajectories are sampled from the diffusion model but are not guided to maximize reward in the environment. When n_guide_steps is greater than 0, the sampled trajectories are additionally nudged toward higher reward. By default, n_guide_steps=2 to match the original implementation.
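To make the effect of n_guide_steps concrete, here is a toy sketch in plain Python. It is not the code used by run_diffuser_locomotion.py: it assumes guidance amounts to a few gradient-ascent steps on a value estimate, and toy_value, toy_value_grad, guide, and scale are hypothetical names invented for this illustration.

```python
import random


def toy_value(trajectory):
    # Hypothetical stand-in for a learned value function:
    # the value is highest when every entry equals 1.0.
    return -sum((x - 1.0) ** 2 for x in trajectory)


def toy_value_grad(trajectory):
    # Analytic gradient of toy_value with respect to each entry.
    return [-2.0 * (x - 1.0) for x in trajectory]


def guide(trajectory, n_guide_steps, scale=0.1):
    # Take n_guide_steps gradient-ascent steps on toy_value.
    # With n_guide_steps=0 the trajectory is returned unchanged.
    guided = list(trajectory)
    for _ in range(n_guide_steps):
        grad = toy_value_grad(guided)
        guided = [x + scale * g for x, g in zip(guided, grad)]
    return guided


rng = random.Random(0)
# Stand-in for a trajectory sampled from the diffusion model.
sample = [rng.gauss(0.0, 1.0) for _ in range(8)]

unguided = guide(sample, n_guide_steps=0)
guided = guide(sample, n_guide_steps=2)

print("value without guidance:", toy_value(unguided))
print("value with 2 guide steps:", toy_value(guided))
```

In this toy setting the unguided call leaves the sample untouched, while two guide steps move it toward a higher value. The script applies the same idea to trajectories in the locomotion environment, using its own model and value function.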

You will need some RL-specific requirements to run the examples:

pip install -f https://download.pytorch.org/whl/torch_stable.html \
                free-mujoco-py \
                einops \
                gym==0.24.1 \
                protobuf==3.20.1 \
                git+https://github.com/rail-berkeley/d4rl.git \
                mediapy \
                Pillow==9.0.0
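After installing, a quick check that the packages can be found may save a failed run later. The sketch below uses only the standard library. The import names in REQUIRED are assumptions mapped from the pip package names above (for example, free-mujoco-py is assumed to provide mujoco_py and Pillow to provide PIL), and missing_modules is a hypothetical helper, not part of Diffusers.

```python
import importlib.util

# Assumed import names for the pip packages listed above.
REQUIRED = [
    "torch",
    "mujoco_py",        # from free-mujoco-py (assumed)
    "einops",
    "gym",
    "google.protobuf",  # from protobuf
    "d4rl",
    "mediapy",
    "PIL",              # from Pillow
]


def missing_modules(names):
    # Return the subset of names that cannot be located,
    # without importing the modules themselves.
    missing = []
    for name in names:
        try:
            spec = importlib.util.find_spec(name)
        except ModuleNotFoundError:
            # Raised when a parent package (e.g. "google") is absent.
            spec = None
        if spec is None:
            missing.append(name)
    return missing


not_found = missing_modules(REQUIRED)
if not_found:
    print("Missing modules:", ", ".join(not_found))
else:
    print("All required modules were found.")
```

This only confirms that each module can be located. It does not verify the pinned versions or that MuJoCo itself loads correctly.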