diffusers

Commit Graph

Author	SHA1	Message	Date
fboulnois	52eb0348e5	Standardize on using `image` argument in all pipelines (#1361 ) * feat: switch core pipelines to use image arg * test: update tests for core pipelines * feat: switch examples to use image arg * docs: update docs to use image arg * style: format code using black and doc-builder * fix: deprecate use of init_image in all pipelines	2022-12-01 16:55:22 +01:00
regisss	2579d42158	Add doc for Stable Diffusion on Habana Gaudi (#1496 ) * Add doc for Stable Diffusion on Habana Gaudi * Make style * Add benchmark * Center-align columns in the benchmark table	2022-12-01 15:43:48 +01:00
Patrick von Platen	c05356497a	Add better docs xformers (#1487 ) * Add better docs xformers * update * Apply suggestions from code review * fix	2022-11-30 13:57:45 +01:00
Ilmari Heikkinen	c28d3c82ce	StableDiffusion: Decode latents separately to run larger batches (#1150 ) * StableDiffusion: Decode latents separately to run larger batches * Move VAE sliced decode under enable_vae_sliced_decode and vae.enable_sliced_decode * Rename sliced_decode to slicing * fix whitespace * fix quality check and repository consistency * VAE slicing tests and documentation * API doc hooks for VAE slicing * reformat vae slicing tests * Skip VAE slicing for one-image batches * Documentation tweaks for VAE slicing Co-authored-by: Ilmari Heikkinen <ilmari@fhtr.org>	2022-11-29 13:28:14 +01:00
Kashif Rasul	462a79d39a	[Docs] fixed some typos (#1425 ) fixed typos	2022-11-25 17:44:07 +01:00
Patrick von Platen	6883294d44	SD2 docs (#1424 ) * up * up * up * up	2022-11-25 17:23:21 +01:00
Suraj Patil	9ec5084a9c	StableDiffusionUpscalePipeline (#1396 ) * StableDiffusionUpscalePipeline * fix a few things * make it better * fix image batching * run vae in fp32 * fix docstr * resize to mul of 64 * doc * remove safety_checker * add max_noise_level * fix Copied * begin tests * slow tests * default max_noise_level * remove kwargs * doc * fix * fix fast tests * fix fast tests * no sf * don't offload vae Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-11-25 16:13:16 +01:00
Patrick von Platen	2625fb59dc	[Versatile Diffusion] Add versatile diffusion model (#1283 ) * up * convert dual unet * revert dual attn * adapt for vd-official * test the full pipeline * mixed inference * mixed inference for text2img * add image prompting * fix clip norm * split text2img and img2img * fix format * refactor text2img * mega pipeline * add optimus * refactor image var * wip text_unet * text unet end to end * update tests * reshape * fix image to text * add some first docs * dual guided pipeline * fix token ratio * propose change * dual transformer as a native module * DualTransformer(nn.Module) * DualTransformer(nn.Module) * correct unconditional image * save-load with mega pipeline * remove image to text * up * uP * fix * up * final fix * remove_unused_weights * test updates * save progress * uP * fix dual prompts * some fixes * finish * style * finish renaming * up * fix * fix * fix * finish Co-authored-by: anton-l <anton@huggingface.co>	2022-11-23 19:03:45 +01:00
Suraj Patil	0eb507f2af	StableDiffusionImageVariationPipeline (#1365 ) * add StableDiffusionImageVariationPipeline * add ini init * use CLIPVisionModelWithProjection * fix _encode_image * add copied from * fix copies * add doc * handle tensor in _encode_image * add tests * correct model_id * remove copied from in enable_sequential_cpu_offload * fix tests * make slow tests pass * update slow tests * use temp model for now * fix test_stable_diffusion_img_variation_intermediate_state * fix test_stable_diffusion_img_variation_intermediate_state * check for torch.Tensor * quality * fix name * fix slow tests * install transformers from source * fix install * fix install * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * input_image -> image * remove deprication warnings * fix test_stable_diffusion_img_variation_multiple_images * make flake happy Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-23 14:36:39 +01:00
Manuel Brack	e50c25d808	Add Safe Stable Diffusion Pipeline (#1244 ) * Add pipeline_stable_diffusion_safe.py to pipelines * Fix repository consistency Ran make fix-copies after adding new pipline * Add Paper/Equation reference for parameters to doc string * Ensure code style and quality * Perform code refactoring * Fix copies inherited from merge with huggingface/main * Add docs * Fix code style * Fix errors in documentation * Fix refactoring error * remove debugging print statement * added Safe Latent Diffusion tests * Fix style * Fix style * Add pre-defined safety configurations * Fix line-break * fix some tests * finish * Change safety checker * Add missing safety_checker.py file * Remove unused imports Co-authored-by: PatrickSchrML <patrick_schramowski@hotmail.de> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-11-22 11:51:30 +01:00
shunxing1234	94b27fb8da	change the sample model (#1352 ) * Update alt_diffusion.mdx * Update alt_diffusion.mdx	2022-11-21 11:28:25 +01:00
Nathan Lambert	0cfbb51b0c	add docs for multi-modal examples (#1227 ) * add docs for multi-modal * many changes * fix docs build * fix links * Update docs/source/using-diffusers/other-modalities.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-17 10:25:49 -08:00
Patrick von Platen	b9b7039f0e	img2text Typo (#1329 ) * make fix copies again * Fix typo	2022-11-17 16:48:15 +01:00
Patrick von Platen	65d136e067	Add improved handling of pil (#1309 ) * Better error message for transformers dummy * [PIL] Better deprecation functionality * up	2022-11-16 15:58:22 +01:00
Patrick von Platen	8a73064576	Add AltDiffusion (#1299 ) * add conversion script for vae * up * up * some fixes * add text model * use the correct config * add docs * move model in it's own file * move model in its own file * pass attenion mask to text encoder * pass attn mask to uncond inputs * quality * fix image2image * add imag2image in init * fix import * fix one more import * fix import, dummy objetcs * fix copied from * up * finish Co-authored-by: patil-suraj <surajp815@gmail.com>	2022-11-15 21:32:26 +01:00
Patrick von Platen	a0520193e1	Add Scheduler.from_pretrained and better scheduler changing (#1286 ) * add conversion script for vae * uP * uP * more changes * push * up * finish again * up * up * up * up * finish * up * uP * up * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Suraj Patil <surajp815@gmail.com> * up * up Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-11-15 18:15:13 +01:00
Nathan Lambert	7c5fef81e0	Add UNet 1d for RL model for planning + colab (#105 ) * re-add RL model code * match model forward api * add register_to_config, pass training tests * fix tests, update forward outputs * remove unused code, some comments * add to docs * remove extra embedding code * unify time embedding * remove conv1d output sequential * remove sequential from conv1dblock * style and deleting duplicated code * clean files * remove unused variables * clean variables * add 1d resnet block structure for downsample * rename as unet1d * fix renaming * rename files * add get_block(...) api * unify args for model1d like model2d * minor cleaning * fix docs * improve 1d resnet blocks * fix tests, remove permuts * fix style * add output activation * rename flax blocks file * Add Value Function and corresponding example script to Diffuser implementation (#884) * valuefunction code * start example scripts * missing imports * bug fixes and placeholder example script * add value function scheduler * load value function from hub and get best actions in example * very close to working example * larger batch size for planning * more tests * merge unet1d changes * wandb for debugging, use newer models * success! * turns out we just need more diffusion steps * run on modal * merge and code cleanup * use same api for rl model * fix variance type * wrong normalization function * add tests * style * style and quality * edits based on comments * style and quality * remove unused var * hack unet1d into a value function * add pipeline * fix arg order * add pipeline to core library * community pipeline * fix couple shape bugs * style * Apply suggestions from code review Co-authored-by: Nathan Lambert <nathan@huggingface.co> * update post merge of scripts * add mdiblock / outblock architecture * Pipeline cleanup (#947) * valuefunction code * start example scripts * missing imports * bug fixes and placeholder example script * add value function scheduler * load value function from hub and get best actions in example * very close to working example * larger batch size for planning * more tests * merge unet1d changes * wandb for debugging, use newer models * success! * turns out we just need more diffusion steps * run on modal * merge and code cleanup * use same api for rl model * fix variance type * wrong normalization function * add tests * style * style and quality * edits based on comments * style and quality * remove unused var * hack unet1d into a value function * add pipeline * fix arg order * add pipeline to core library * community pipeline * fix couple shape bugs * style * Apply suggestions from code review * clean up comments * convert older script to using pipeline and add readme * rename scripts * style, update tests * delete unet rl model file * remove imports in src Co-authored-by: Nathan Lambert <nathan@huggingface.co> * Update src/diffusers/models/unet_1d_blocks.py * Update tests/test_models_unet.py * RL Cleanup v2 (#965) * valuefunction code * start example scripts * missing imports * bug fixes and placeholder example script * add value function scheduler * load value function from hub and get best actions in example * very close to working example * larger batch size for planning * more tests * merge unet1d changes * wandb for debugging, use newer models * success! * turns out we just need more diffusion steps * run on modal * merge and code cleanup * use same api for rl model * fix variance type * wrong normalization function * add tests * style * style and quality * edits based on comments * style and quality * remove unused var * hack unet1d into a value function * add pipeline * fix arg order * add pipeline to core library * community pipeline * fix couple shape bugs * style * Apply suggestions from code review * clean up comments * convert older script to using pipeline and add readme * rename scripts * style, update tests * delete unet rl model file * remove imports in src * add specific vf block and update tests * style * Update tests/test_models_unet.py Co-authored-by: Nathan Lambert <nathan@huggingface.co> * fix quality in tests * fix quality style, split test file * fix checks / tests * make timesteps closer to main * unify block API * unify forward api * delete lines in examples * style * examples style * all tests pass * make style * make dance_diff test pass * Refactoring RL PR (#1200) * init file changes * add import utils * finish cleaning files, imports * remove import flags * clean examples * fix imports, tests for merge * update readmes * hotfix for tests * quality * fix some tests * change defaults * more mps test fixes * unet1d defaults * do not default import experimental * defaults for tests * fix tests * fix-copies * fix * changes per Patrik's comments (#1285) * changes per Patrik's comments * update conversion script * fix renaming * skip more mps tests * last test fix * Update examples/rl/README.md Co-authored-by: Ben Glickenhaus <benglickenhaus@gmail.com>	2022-11-14 13:48:48 -08:00
Partho	c9b3463703	Fix wrong link in text2img fine-tuning documentation (#1282 ) fix link typo	2022-11-14 20:42:14 +01:00
ruanrz	8171566163	[Docs] improve img2img example (#1193 ) update img2img example	2022-11-11 12:28:20 +01:00
apolinario	a09d47532d	Add a reference to the name 'Sampler' (#1172 ) * Add a reference to the name 'Sampler' - Facilitate people that are familiar with the name samplers to understand that we call that schedulers - Better SEO if people are googling for samplers to find our library as well * Update README.md with a reference to 'Sampler'	2022-11-10 14:37:42 +01:00
Patrick von Platen	6c0335c7f9	DDIM docs (#1219 )	2022-11-09 16:02:11 +01:00
Duong A. Nguyen	5a59f9b717	Add LDM Super Resolution pipeline (#1116 ) * Add ldm super resolution pipeline * style * fix copies * style * fix doc * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * add doc * address comments * address comments * fix doc * minor * add tests * add tests * load text encoder from subfolder * fix test * fix test * style * style * handle mps latents * unfix typo * unfix typo * Update tests/pipelines/latent_diffusion/test_latent_diffusion_superresolution.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * fix set_timesteps mps * fix set_timesteps mps * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update src/diffusers/pipelines/latent_diffusion/pipeline_latent_diffusion_superresolution.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * style * test 64x64 instead of 256x256 Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-09 13:42:16 +01:00
Pedro Cuenca	fa6e5209a8	Link to Dreambooth blog post instead of W&B report (#1180 ) Link to Dreambooth blog post instead of W&B report.	2022-11-07 21:59:36 +01:00
Patrick von Platen	de7536281a	fix image docs	2022-11-07 17:25:13 +01:00
Patrick von Platen	b500df1155	[Docs] Add loading script (#1174 ) * add loading script * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Suraj Patil <surajp815@gmail.com> * correct * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * uP Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-07 17:15:41 +01:00
Pedro Cuenca	0dd8c6b4db	Fix community pipeline links (#1162 ) * Change title to match the sidebar in _toctree. * Fix custom pipe link, add link to contribute. * Fix community pipeline links.	2022-11-07 14:32:51 +01:00
Cheng Lu	b4a1ed8544	Add multistep DPM-Solver discrete scheduler (#1132 ) * add dpmsolver discrete pytorch scheduler * fix some typos in dpm-solver pytorch * add dpm-solver pytorch in stable-diffusion pipeline * add jax/flax version dpm-solver * change code style * change code style * add docs * add `add_noise` method for dpmsolver * add pytorch unit test for dpmsolver * add dummy object for pytorch dpmsolver * Update src/diffusers/schedulers/scheduling_dpmsolver_discrete.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update tests/test_config.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update tests/test_config.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * resolve the code comments * rename the file * change class name * fix code style * add auto docs for dpmsolver multistep * add more explanations for the stabilizing trick (for steps < 15) * delete the dummy file * change the API name of predict_epsilon, algorithm_type and solver_type * add compatible lists Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-11-06 22:49:55 +01:00
Chen Wu (吴尘)	9d8943b7e7	Add CycleDiffusion pipeline using Stable Diffusion (#888 ) * Add CycleDiffusion pipeline for Stable Diffusion * Add the option of passing noise to DDIMScheduler Add the option of providing the noise itself to DDIMScheduler, instead of the random seed generator. * Update README.md * Update README.md * Update pipeline_stable_diffusion_cycle_diffusion.py * Update pipeline_stable_diffusion_cycle_diffusion.py * Update pipeline_stable_diffusion_cycle_diffusion.py * Update pipeline_stable_diffusion_cycle_diffusion.py * Update scheduling_ddim.py * Update import format * Update pipeline_stable_diffusion_cycle_diffusion.py * Update scheduling_ddim.py * Update src/diffusers/schedulers/scheduling_ddim.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_ddim.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_ddim.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_ddim.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_ddim.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update scheduling_ddim.py * Update scheduling_ddim.py * Update scheduling_ddim.py * add two tests * Update pipeline_stable_diffusion_cycle_diffusion.py * Update pipeline_stable_diffusion_cycle_diffusion.py * Update README.md * Rename pipeline name as suggested in the latest reviewer comment * Update test_pipelines.py * Update test_pipelines.py * Update test_pipelines.py * Update pipeline_stable_diffusion_cycle_diffusion.py * Remove the generator This generator does not control all randomness during sampling, which can be misleading. * Update optimal hyperparameters * Update src/diffusers/pipelines/stable_diffusion/README.md Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update src/diffusers/pipelines/stable_diffusion/README.md Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update src/diffusers/pipelines/stable_diffusion/README.md Co-authored-by: Suraj Patil <surajp815@gmail.com> * Apply suggestions from code review * uP * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_cycle_diffusion.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * up * up * Replace assert with ValueError * finish docs Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-11-04 20:51:06 +01:00
Pedro Cuenca	118c5be94a	Docs: Do not require PyTorch nightlies (#1123 ) Do not require PyTorch nightlies.	2022-11-03 18:17:23 +01:00
Suraj Patil	7482178162	default fast model loading 🔥 (#1115 ) * make accelerate hard dep * default fast init * move params to cpu when device map is None * handle device_map=None * handle torch < 1.9 * remove device_map="auto" * style * add accelerate in torch extra * remove accelerate from extras["test"] * raise an error if torch is available but not accelerate * update installation docs * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * improve defautl loading speed even further, allow disabling fats loading * address review comments * adapt the tests * fix test_stable_diffusion_fast_load * fix test_read_init * temp fix for dummy checks * Trigger Build * Apply suggestions from code review Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-11-03 17:25:57 +01:00
Will Berman	ef2ea33c3b	VQ-diffusion (#658 ) * Changes for VQ-diffusion VQVAE Add specify dimension of embeddings to VQModel: `VQModel` will by default set the dimension of embeddings to the number of latent channels. The VQ-diffusion VQVAE has a smaller embedding dimension, 128, than number of latent channels, 256. Add AttnDownEncoderBlock2D and AttnUpDecoderBlock2D to the up and down unet block helpers. VQ-diffusion's VQVAE uses those two block types. * Changes for VQ-diffusion transformer Modify attention.py so SpatialTransformer can be used for VQ-diffusion's transformer. SpatialTransformer: - Can now operate over discrete inputs (classes of vector embeddings) as well as continuous. - `in_channels` was made optional in the constructor so two locations where it was passed as a positional arg were moved to kwargs - modified forward pass to take optional timestep embeddings ImagePositionalEmbeddings: - added to provide positional embeddings to discrete inputs for latent pixels BasicTransformerBlock: - norm layers were made configurable so that the VQ-diffusion could use AdaLayerNorm with timestep embeddings - modified forward pass to take optional timestep embeddings CrossAttention: - now may optionally take a bias parameter for its query, key, and value linear layers FeedForward: - Internal layers are now configurable ApproximateGELU: - Activation function in VQ-diffusion's feedforward layer AdaLayerNorm: - Norm layer modified to incorporate timestep embeddings * Add VQ-diffusion scheduler * Add VQ-diffusion pipeline * Add VQ-diffusion convert script to diffusers * Add VQ-diffusion dummy objects * Add VQ-diffusion markdown docs * Add VQ-diffusion tests * some renaming * some fixes * more renaming * correct * fix typo * correct weights * finalize * fix tests * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finish * finish * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-03 16:10:28 +01:00
Revist	d38c804320	feat: add repaint (#974 ) * feat: add repaint * fix: fix quality check with `make fix-copies` * fix: remove old unnecessary arg * chore: change default to DDPM (looks better in experiments) * ".to(device)" changed to "device=" Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * make generator device-specific Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * make generator device-specific and change shape Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * fix: add preprocessing for image and mask Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * fix: update test Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update src/diffusers/pipelines/repaint/pipeline_repaint.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Add docs and examples * Fix toctree Co-authored-by: fja <fja@zurich.ibm.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-11-03 15:42:46 +01:00
Patrick von Platen	d53ffbbdf4	Rename latent (#1102 ) * Rename latent * uP	2022-11-02 11:59:00 +01:00
Suraj Patil	8608795711	[docs] add euler scheduler in docs, how to use differnet schedulers (#1089 ) * add euler scheduler in docs * add a section for how to use different scheds * address patrck's comments	2022-11-02 11:32:46 +01:00
MatthieuTPHR	98c42134a5	Up to 2x speedup on GPUs using memory efficient attention (#532 ) * 2x speedup using memory efficient attention * remove einops dependency * Swap K, M in op instantiation * Simplify code, remove unnecessary maybe_init call and function, remove unused self.scale parameter * make xformers a soft dependency * remove one-liner functions * change one letter variable to appropriate names * Remove Env variable dependency, remove MemoryEfficientCrossAttention class and use enable_xformers_memory_efficient_attention method * Add memory efficient attention toggle to img2img and inpaint pipelines * Clearer management of xformers' availability * update optimizations markdown to add info about memory efficient attention * add benchmarks for TITAN RTX * More detailed explanation of how the mem eff benchmark were ran * Removing autocast from optimization markdown * import_utils: import torch only if is available Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>	2022-11-02 10:29:06 +01:00
Patrick von Platen	010bc4ea19	incorrect model id	2022-10-31 16:35:59 +00:00
Patrick von Platen	c18941b01a	[Better scheduler docs] Improve usage examples of schedulers (#890 ) * [Better scheduler docs] Improve usage examples of schedulers * finish * fix warnings and add test * finish * more replacements * adapt fast tests hf token * correct more * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Integrate compatibility with euler Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-10-31 17:26:30 +01:00
Nathan Lambert	12fd0736dc	clean incomplete pages (#1008 )	2022-10-29 09:28:26 +02:00
Minwoo Byeon	fc0ca47456	Fix speedup ratio in fp16.mdx (#837 )	2022-10-29 09:26:23 +02:00
Pedro Cuenca	6b185b6acd	Update training and fine-tuning docs (#1020 ) * Update training and fine-tuning docs. * Update examples README. * Update README. * Add Flax fine-tuning section. * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-10-28 21:02:08 +02:00
Pi Esposito	de00c63217	Document sequential CPU offload method on Stable Diffusion pipeline (#1024 ) * document cpu offloading method * address review comments Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-10-27 16:52:21 +02:00
Ella Charlaix	e2243de5f2	Fix typo in documentation title (#975 )	2022-10-25 20:20:16 +02:00
Patrick von Platen	88fa6b7d68	[Dance Diffusion] Add dance diffusion (#803 ) * start * add more logic * Update src/diffusers/models/unet_2d_condition_flax.py * match weights * up * make model work * making class more general, fixing missed file rename * small fix * make new conversion work * up * finalize conversion * up * first batch of variable renamings * remove c and c_prev var names * add mid and out block structure * add pipeline * up * finish conversion * finish * upload * more fixes * Apply suggestions from code review * add attr * up * uP * up * finish tests * finish * uP * finish * fix test * up * naming consistency in tests * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Nathan Lambert <nathan@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co> * remove hardcoded 16 * Remove bogus * fix some stuff * finish * improve logging * docs * upload Co-authored-by: Nathan Lambert <nol@berkeley.edu> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Nathan Lambert <nathan@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-10-25 18:39:25 +02:00
Pedro Cuenca	3d02c92187	mps changes for PyTorch 1.13 (#926 ) * Docs: refer to pre-RC version of PyTorch 1.13.0. * Remove temporary workaround for unavailable op. * Update comment to make it less ambiguous. * Remove use of contiguous in mps. It appears to not longer be necessary. * Special case: use einsum for much better performance in mps * Update mps docs. * Minor doc update. * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-10-25 16:41:51 +02:00
Nathan Lambert	2fb8fafa4b	add community pipeline docs; add minimal text to some empty doc pages (#930 ) * add community pipeline docs * fix style in code snippets (lol) * clean up loading docs * add license to doc files * fix some weird links	2022-10-24 14:20:08 -07:00
apolinario	8aac1f99d7	v1-5 docs updates (#921 ) * Update README.md Additionally add FLAX so the model card can be slimmer and point to this page * Find and replace all * v-1-5 -> v1-5 * revert test changes * Update README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/quicktour.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/quicktour.mdx Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update README.md Co-authored-by: Suraj Patil <surajp815@gmail.com> * Revert certain references to v1-5 * Docs changes * Apply suggestions from code review Co-authored-by: apolinario <joaopaulo.passos+multimodal@gmail.com> Co-authored-by: anton-l <anton@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-10-24 22:50:23 +02:00
Patrick von Platen	83f8a5ff70	[Stable Diffusion] Add components function (#889 ) * [Stable Diffusion] Add components function * uP	2022-10-20 13:28:11 +02:00
Pedro Cuenca	8124863d1f	Initial docs update for new in-painting pipeline (#910 ) Docs update for new in-painting pipeline. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-10-19 17:31:23 +02:00
Patrick von Platen	c1b6ea3dce	Update img2img.mdx	2022-10-12 00:52:30 +02:00
Pedro Cuenca	24b8b5cf5e	`mps`: Alternative implementation for `repeat_interleave` (#766 ) * mps: alt. implementation for repeat_interleave * style * Bump mps version of PyTorch in the documentation. * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * Simplify: do not check for device. * style * Fix repeat dimensions: - The unconditional embeddings are always created from a single prompt. - I was shadowing the batch_size var. * Split long lines as suggested by Suraj. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-10-11 20:30:09 +02:00
Omar Sanseviero	757babfcad	Fix indentation in the code example (#802 ) Update custom_pipelines.mdx	2022-10-11 20:26:52 +02:00
anton-l	970e30606c	Revert "[v0.4.0] Temporarily remove Flax modules from the public API (#755 )" This reverts commit `2e209c30cf`.	2022-10-06 18:35:40 +02:00
Anton Lozhkov	2e209c30cf	[v0.4.0] Temporarily remove Flax modules from the public API (#755 ) Temporarily remove Flax modules from the public API	2022-10-06 18:10:36 +02:00
Patrick von Platen	d9c449ea30	Custome Pipelines (#744 ) * [Custom Pipelines] * uP * make style * finish * finish * remove ipdb * upload * fix * finish docs * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: apolinario <joaopaulo.passos@gmail.com> * finish * final uploads * remove unnecessary test Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: apolinario <joaopaulo.passos@gmail.com>	2022-10-06 16:54:02 +02:00
Patrick von Platen	4deb16e830	[Docs] Advertise fp16 instead of autocast (#740 ) up	2022-10-05 22:20:53 +02:00
Suraj Patil	19e559d5e9	remove use_auth_token from remaining places (#737 ) remove use_auth_token	2022-10-05 17:40:49 +02:00
Patrick von Platen	78744b6a8f	No more use_auth_token=True (#733 ) * up * uP * uP * make style * Apply suggestions from code review * up * finish	2022-10-05 17:16:15 +02:00
Kashif Rasul	726aba089d	[Pytorch] pytorch only timesteps (#724 ) * pytorch timesteps * style * get rid of if-else * fix test Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-10-05 12:55:51 +02:00
Yuta Hayashibe	7e92c5bc73	Fix typos (#718 ) * Fix typos * Update examples/dreambooth/train_dreambooth.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-10-04 15:22:14 +02:00
Nouamane Tazi	daa22050c7	[docs] fix table in fp16.mdx (#683 )	2022-09-30 15:15:22 +02:00
Nouamane Tazi	9ebaea545f	Optimize Stable Diffusion (#371 ) * initial commit * make UNet stream capturable * try to fix noise_pred value * remove cuda graph and keep NB * non blocking unet with PNDMScheduler * make timesteps np arrays for pndm scheduler because lists don't get formatted to tensors in `self.set_format` * make max async in pndm * use channel last format in unet * avoid moving timesteps device in each unet call * avoid memcpy op in `get_timestep_embedding` * add `channels_last` kwarg to `DiffusionPipeline.from_pretrained` * update TODO * replace `channels_last` kwarg with `memory_format` for more generality * revert the channels_last changes to leave it for another PR * remove non_blocking when moving input ids to device * remove blocking from all .to() operations at beginning of pipeline * fix merging * fix merging * model can run in other precisions without autocast * attn refactoring * Revert "attn refactoring" This reverts commit 0c70c0e189cd2c4d8768274c9fcf5b940ee310fb. * remove restriction to run conv_norm in fp32 * use `baddbmm` instead of `matmul`for better in attention for better perf * removing all reshapes to test perf * Revert "removing all reshapes to test perf" This reverts commit 006ccb8a8c6bc7eb7e512392e692a29d9b1553cd. * add shapes comments * hardcore whats needed for jitting * Revert "hardcore whats needed for jitting" This reverts commit 2fa9c698eae2890ac5f8e367ca80532ecf94df9a. * Revert "remove restriction to run conv_norm in fp32" This reverts commit cec592890c32da3d1b78d38b49e4307aedf459b9. * revert using baddmm in attention's forward * cleanup comment * remove restriction to run conv_norm in fp32. no quality loss was noticed This reverts commit cc9bc1339c998ebe9e7d733f910c6d72d9792213. * add more optimizations techniques to docs * Revert "add shapes comments" This reverts commit 31c58eadb8892f95478cdf05229adf678678c5f4. * apply suggestions * make quality * apply suggestions * styling * `scheduler.timesteps` are now arrays so we dont need .to() * remove useless .type() * use mean instead of max in `test_stable_diffusion_inpaint_pipeline_k_lms` * move scheduler timestamps to correct device if tensors * add device to `set_timesteps` in LMSD scheduler * `self.scheduler.set_timesteps` now uses device arg for schedulers that accept it * quick fix * styling * remove kwargs from schedulers `set_timesteps` * revert to using max in K-LMS inpaint pipeline test * Revert "`self.scheduler.set_timesteps` now uses device arg for schedulers that accept it" This reverts commit 00d5a51e5c20d8d445c8664407ef29608106d899. * move timesteps to correct device before loop in SD pipeline * apply previous fix to other SD pipelines * UNet now accepts tensor timesteps even on wrong device, to avoid errors - it shouldnt affect performance if timesteps are alrdy on correct device - it does slow down performance if they're on the wrong device * fix pipeline when timesteps are arrays with strides	2022-09-30 09:49:13 +02:00
Tanishq Abraham	f5b9bc8b49	Update index.mdx (#670 )	2022-09-29 09:17:52 +02:00
Kashif Rasul	bd8df2da89	[Pytorch] Pytorch only schedulers (#534 ) * pytorch only schedulers * fix style * remove match_shape * pytorch only ddpm * remove SchedulerMixin * remove numpy from karras_ve * fix types * remove numpy from lms_discrete * remove numpy from pndm * fix typo * remove mixin and numpy from sde_vp and ve * remove remaining tensor_format * fix style * sigmas has to be torch tensor * removed set_format in readme * remove set format from docs * remove set_format from pipelines * update tests * fix typo * continue to use mixin * fix imports * removed unsed imports * match shape instead of assuming image shapes * remove import typo * update call to add_noise * use math instead of numpy * fix t_index * removed commented out numpy tests * timesteps needs to be discrete * cast timesteps to int in flax scheduler too * fix device mismatch issue * small fix * Update src/diffusers/schedulers/scheduling_pndm.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-09-27 15:27:34 +02:00
Younes Belkada	8b0be93596	Flax documentation (#589 ) * documenting `attention_flax.py` file * documenting `embeddings_flax.py` * documenting `unet_blocks_flax.py` * Add new objs to doc page * document `vae_flax.py` * Apply suggestions from code review * modify `unet_2d_condition_flax.py` * make style * Apply suggestions from code review * make style * Apply suggestions from code review * fix indent * fix typo * fix indent unet * Update src/diffusers/models/vae_flax.py * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Mishig Davaadorj <dmishig@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-09-23 13:24:16 +02:00
Ryan Russell	df80ccf7de	docs: `.md` readability fixups (#619 ) Signed-off-by: Ryan Russell <git@ryanrussell.org>	2022-09-23 12:02:27 +02:00
Ryan Russell	f149d037de	docs: fix `stochastic_karras_ve` ref (#618 ) Signed-off-by: Ryan Russell <git@ryanrussell.org> Signed-off-by: Ryan Russell <git@ryanrussell.org>	2022-09-22 18:36:29 +02:00
Yuta Hayashibe	76d492ea49	Fix typos and add Typo check GitHub Action (#483 ) * Fix typos * Add a typo check action * Fix a bug * Changed to manual typo check currently Ref: https://github.com/huggingface/diffusers/pull/483#pullrequestreview-1104468010 Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Removed a confusing message * Renamed "nin_shortcut" to "in_shortcut" * Add memo about NIN Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2022-09-16 15:36:51 +02:00
Jithin James	ab7a78e8f1	docs: bocken doc links for relative links (#504 ) fix: bocken doc links for relative links	2022-09-14 00:50:02 +02:00
Nathan Lambert	25a51b63ca	fix table formatting for stable diffusion pipeline doc (add blank line) (#471 ) fix table formatting (add blank line)	2022-09-12 10:28:27 +02:00
Partho	8eaaa546d8	Docs: fix installation typo (#453 ) installation doc typo fix	2022-09-09 15:17:17 -06:00
Patrick von Platen	44968e4204	[Docs] Correct links (#432 )	2022-09-08 21:29:24 +02:00
Patrick von Platen	1e98723e12	finish	2022-09-08 17:47:54 +02:00
Patrick von Platen	4e2c1f3a4d	Add config docs (#429 ) * advance * finish * finish	2022-09-08 17:46:03 +02:00
Kashif Rasul	5e6417e988	[Docs] Models (#416 ) * docs for attention * types for embeddings * unet2d docstrings * UNet2DConditionModel docstrings * fix typos * style and vq-vae docstrings * docstrings for VAE * Update src/diffusers/models/unet_2d.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make style * added inherits from sentence * docstring to forward * make style * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finish model docs * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-09-08 17:28:11 +02:00
Patrick von Platen	234e90cca7	[Docs] Using diffusers (#428 ) * [Docs] Using diffusers * up	2022-09-08 17:27:36 +02:00
Patrick von Platen	f6fb3282b1	[Outputs] Improve syntax (#423 ) * [Outputs] Improve syntax * improve more * fix docstring return * correct all * uP Co-authored-by: Mishig Davaadorj <dmishig@gmail.com>	2022-09-08 16:46:38 +02:00
Pedro Cuenca	1a79969d23	Initial ONNX doc (TODO: Installation) (#426 )	2022-09-08 16:46:24 +02:00
Patrick von Platen	43c585111d	[Docs] Outputs.mdx (#422 ) * up * remove bogus file	2022-09-08 14:47:14 +02:00
Patrick von Platen	46013e8e3f	[Docs] Fix scheduler docs (#421 ) * [Docs] Fix scheduler docs * up * Apply suggestions from code review	2022-09-08 14:04:09 +02:00
Patrick von Platen	e7457b377d	[Docs] DiffusionPipeline (#418 ) * Start * up * up * finish	2022-09-08 13:50:06 +02:00
Patrick von Platen	98f346835a	[Docs] Minor fixes in optimization section (#420 ) * uP * more	2022-09-08 13:13:46 +02:00
Satpal Singh Rathore	6b9906f6c2	[Docs] Pipelines for inference (#417 ) * Update conditional_image_generation.mdx * Update unconditional_image_generation.mdx	2022-09-08 12:42:13 +02:00
Patrick von Platen	a353c46ec0	[Docs] Training docs (#415 ) finish training docs	2022-09-08 10:17:37 +02:00
Pedro Cuenca	c29d81c3e3	Docs: fp16 page (#404 ) * Initial version of `fp16` page. * Fix typo in README. * Change titles of fp16 section in toctree. * PR suggestion Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * PR suggestion Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Clarify attention slicing is useful even for batches of 1 Explained by @patrickvonplaten after a suggestion by @keturn. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Do not talk about `batches` in `enable_attention_slicing`. * Use Tip (just for fun), add link to method. * Comment about fp16 results looking the same as float32 in practice. * Style: docstring line wrapping. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-09-08 09:17:51 +02:00
Nathan Lambert	b8894f181d	Docs fix some typos (#408 ) * fix small typos * capitalize Diffusers	2022-09-08 09:08:35 +02:00
Nathan Lambert	e6110f6856	[docs sprint] schedulers docs, will update (#376 ) * init schedulers docs * add some docstrings, fix sidebar formatting * add docstrings * [Type hint] PNDM schedulers (#335) * [Type hint] PNDM Schedulers * ran make style * updated timesteps type hint * apply suggestions from code review * ran make style * removed unused import * [Type hint] scheduling ddim (#343) * [Type hint] scheduling ddim * apply suggestions from code review apply suggestions to also return the return type Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make style * update class docstrings * add docstrings * missed merge edit * add general docs page * modify headings for right sidebar Co-authored-by: Partho <parthodas6176@gmail.com> Co-authored-by: Santiago Víquez <santi.viquez@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-09-08 09:07:44 +02:00
Nathan Lambert	cee3aa0dd4	Docs: fix undefined in toctree (#406 ) fix undefined in toctree	2022-09-07 23:02:36 +02:00
Pedro Cuenca	8d14edf27f	Docs: Stable Diffusion pipeline (#386 ) * Initial description of Stable Diffusion pipeline. * Placeholder docstrings to test preview. * Add docstrings to Stable Diffusion pipeline. * Style * Docs for all the SD pipelines + attention slicing. * Style: wrap long lines.	2022-09-07 18:48:49 +02:00
Pedro Cuenca	58d627aed6	Small changes to Philosophy (#403 ) Small changes to Philosophy.	2022-09-07 18:47:38 +02:00
Kashif Rasul	65ed5d2845	karras-ve docs (#401 ) * karras-ve docs for issue #293 * make style	2022-09-07 18:34:54 +02:00
Kashif Rasul	44091d8b2a	Score sde ve doc (#400 ) * initial score_sde_ve docs * fixed typo * fix VE term	2022-09-07 18:34:34 +02:00
Patrick von Platen	e0d836c813	[Docs] Finish Intro Section (#402 ) * up * up * finish	2022-09-07 18:00:49 +02:00
Patrick von Platen	8603ca6b09	[Docs] Quicktour (#397 ) * uP * better * up * finish * up	2022-09-07 16:29:34 +02:00
Kashif Rasul	fead3ba386	ddim docs (#396 ) * ddim docs for issue #293 * space	2022-09-07 16:29:06 +02:00
Pedro Cuenca	492f5c9a6c	Docs: optimization / special hardware (#390 ) Add mps documentation.	2022-09-07 16:27:14 +02:00
Kashif Rasul	71d737bfe2	added pndm docs (#391 ) for issue #293	2022-09-07 15:33:17 +02:00
Jonathan Whitaker	5b4f5951a9	Update text_inversion.mdx (#393 ) * Update text_inversion.mdx Getting in a bit of background info * fixed typo mode -> model * Link SD and re-write a few bits for clarity * Copied in info from the example script As suggested by surajpatil :) * removed an unnecessary heading	2022-09-07 18:48:34 +05:30
Patrick von Platen	3dcc5e9a5a	[Docs] Logging (#394 ) up	2022-09-07 14:58:21 +02:00
Kashif Rasul	9288fb1df8	[Pipeline Docs] ddpm docs for sprint (#382 ) * initial ddpm for issue #293 * initial ddpm pipeline doc * added docstrings * Update docs/source/api/pipelines/ddpm.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make style * fix docs * make style * fix doc strings Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-09-07 14:43:29 +02:00
Satpal Singh Rathore	a0592a13ee	[Pipeline Docs] Unconditional Latent Diffusion (#388 ) * initial description * add doc strings	2022-09-07 14:42:24 +02:00
Pedro Cuenca	cdb371f07b	Docs: Conceptual section (#392 ) Add contribution.mdx by copy/pasting and adapting.	2022-09-07 14:41:17 +02:00
Patrick von Platen	8ef1ee812d	[Pipeline Docs] Latent Diffusion (#377 ) * up * up * up * up * up * up * up	2022-09-07 12:53:03 +02:00
Patrick von Platen	5a38033de4	[Docs] Let's go (#385 )	2022-09-07 11:31:13 +02:00
Lysandre Debut	d87d5edf66	README improvements: credits and roadmap (#116 ) * Typos * Credits and roadmap * Second version	2022-07-21 10:06:16 +02:00
Patrick von Platen	e7fe901e5e	save intermediate (#87 ) * save intermediate * up * up	2022-07-14 12:29:06 +02:00
Nathan Lambert	c3d78cd306	Docs (#45 ) * first pass at docs structure * minor reformatting, add github actions for docs * populate docs (primarily from README, some writing)	2022-07-13 08:42:05 -07:00
Patrick von Platen	32c556731f	improve readme	2022-06-15 11:50:41 +02:00

... 2 3 4 5 6

257 Commits