diffusers

Commit Graph

Author	SHA1	Message	Date
Sayak Paul	5883d8d4d1	[Docs] update docs (Stable unCLIP) to reflect the updated ckpts. (#2815 ) * update docs to reflect the updated ckpts. * update: point about prompt. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * emove image resizing. * Apply suggestions from code review * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-24 17:24:19 +01:00
Bahjat Kawar	37a44bb283	Add ModelEditing pipeline (#2721 ) * TIME first commit * styling. * styling 2. * fixes; tests * apply styling and doc fix. * remove sups. * fixes * remove temp file * move augmentations to const * added doc entry * code quality * customize augmentations * quality * quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-03-24 13:01:39 +05:30
Sanchit Gandhi	b94880e536	Add AudioLDM (#2232 ) * Add AudioLDM * up * add vocoder * start unet * unconditional unet * clap, vocoder and vae * clean-up: conversion scripts * fix: conversion script token_type_ids * clean-up: pipeline docstring * tests: from SD * clean-up: cpu offload vocoder instead of safety checker * feat: adapt tests to audioldm * feat: add docs * clean-up: amend pipeline docstrings * clean-up: make style * clean-up: make fix-copies * fix: add doc path to toctree * clean-up: args for conversion script * clean-up: paths to checkpoints * fix: use conditional unet * clean-up: make style * fix: type hints for UNet * clean-up: docstring for UNet * clean-up: make style * clean-up: remove duplicate in docstring * clean-up: make style * clean-up: make fix-copies * clean-up: move imports to start in code snippet * fix: pass cross_attention_dim as a list/tuple to unet * clean-up: make fix-copies * fix: update checkpoint path * fix: unet cross_attention_dim in tests * film embeddings -> class embeddings * Apply suggestions from code review Co-authored-by: Will Berman <wlbberman@gmail.com> * fix: unet film embed to use existing args * fix: unet tests to use existing args * fix: make style * fix: transformers import and version in init * clean-up: make style * Revert "clean-up: make style" This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66. 
* clean-up: make style * clean-up: use pipeline tester mixin tests where poss * clean-up: skip attn slicing test * fix: add torch dtype to docs * fix: remove conversion script out of src * fix: remove .detach from 1d waveform * fix: reduce default num inf steps * fix: swap height/width -> audio_length_in_s * clean-up: make style * fix: remove nightly tests * fix: imports in conversion script * clean-up: slim-down to two slow tests * clean-up: slim-down fast tests * fix: batch consistent tests * clean-up: make style * clean-up: remove vae slicing fast test * clean-up: propagate changes to doc * fix: increase test tol to 1e-2 * clean-up: finish docs * clean-up: make style * feat: vocoder / VAE compatibility check * feat: possibly expand / cut audio waveform * fix: pipeline call signature test * fix: slow tests output len * clean-up: make style * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: William Berman <WLBberman@gmail.com>	2023-03-23 19:00:21 +01:00
Steven Liu	1870fb05a9	[docs] Add Colab notebooks and Spaces (#2713 ) * add colab notebook and spaces * fix image link	2023-03-23 09:48:58 -07:00
YiYi Xu	df91c44712	Flax controlnet (#2727 ) * add contronet flax --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-03-23 05:46:23 -10:00
Sayak Paul	0d7aac3e8d	[Docs] small fixes to the text to video doc. (#2787 ) * small fixes to the text to video doc. * add: Spaces link. * add: warning on research-only model.	2023-03-23 18:57:02 +05:30
Kashif Rasul	2ef9bdd76f	Music Spectrogram diffusion pipeline (#1044 ) * initial TokenEncoder and ContinuousEncoder * initial modules * added ContinuousContextTransformer * fix copy paste error * use numpy for get_sequence_length * initial terminal relative positional encodings * fix weights keys * fix assert * cross attend style: concat encodings * make style * concat once * fix formatting * Initial SpectrogramPipeline * fix input_tokens * make style * added mel output * ignore weights for config * move mel to numpy * import pipeline * fix class names and import * moved models to models folder * import ContinuousContextTransformer and SpectrogramDiffusionPipeline * initial spec diffusion converstion script * renamed config to t5config * added weight loading * use arguments instead of t5config * broadcast noise time to batch dim * fix call * added scale_to_features * fix weights * transpose laynorm weight * scale is a vector * scale the query outputs * added comment * undo scaling * undo depth_scaling * inital get_extended_attention_mask * attention_mask is none in self-attention * cleanup * manually invert attention * nn.linear need bias=False * added T5LayerFFCond * remove to fix conflict * make style and dummy * remove unsed variables * remove predict_epsilon * Move accelerate to a soft-dependency (#1134) * finish * finish * Update src/diffusers/modeling_utils.py * Update src/diffusers/pipeline_utils.py Co-authored-by: Anton Lozhkov <anton@huggingface.co> * more fixes * fix Co-authored-by: Anton Lozhkov <anton@huggingface.co> * fix order * added initial midi to note token data pipeline * added int to int tokenizer * remove duplicate * added logic for segments * add melgan to pipeline * move autoregressive gen into pipeline * added note_representation_processor_chain * fix dtypes * remove immutabledict req * initial doc * use np.where * require note_seq * fix typo * update dependency * added note-seq to test * added is_note_seq_available * fix import * added toc * 
added example usage * undo for now * moved docs * fix merge * fix imports * predict first segment * avoid un-needed copy to and from cpu * make style * Copyright * fix style * add test and fix inference steps * remove bogus files * reorder models * up * remove transformers dependency * make work with diffusers cross attention * clean more * remove @ * improve further * up * uP * Apply suggestions from code review * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py * loop over all tokens * make style * Added a section on the model * fix formatting * grammer * formatting * make fix-copies * Update src/diffusers/pipelines/__init__.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * added callback ad optional ionnx * do not squeeze batch dim * clean up more * upload * convert jax to nnumpy * make style * fix warning * make fix-copies * fix warning * add initial fast tests * add initial pipeline_params * eval mode due to dropout * skip batch tests as pipeline runs on a single file * make style * fix relative path * fix doc tests * Update src/diffusers/models/t5_film_transformer.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/models/t5_film_transformer.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen 
<patrick.v.platen@gmail.com> * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add MidiProcessor * format * fix org * Apply suggestions from code review * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py * make style * pin protobuf to <4 * fix formatting * white space * tensorboard needs protobuf --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2023-03-23 14:06:17 +01:00
Naoki Ainoya	14e3a28c12	Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732 ) The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.	2023-03-23 13:49:22 +01:00
Mishig	8e35ef0142	[doc wip] literalinclude (#2718 )	2023-03-23 13:42:54 +01:00
Sayak Paul	0d633a42f4	deduplicate training section in the docs. (#2788 )	2023-03-23 11:21:53 +05:30
Sayak Paul	9dc84448ac	[Examples] InstructPix2Pix instruct training script (#2478 ) * add: initial implementation of the pix2pix instruct training script. * shorten cli arg. * fix: main process check. * fix: dataset column names. * simplify tokenization. * proper placement of null conditions. * apply styling. * remove debugging message for conditioning do. * complete license. * add: requirements.tzt * wandb column name order. * fix: augmentation. * change: dataset_id. * fix: convert_to_np() call. * fix: reshaping. * fix: final ema copy. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * address PR comments. * add: readme details. * config fix. * downgrade version. * reduce image width in the readme. * note on hyperparameters during generation. * add: output images. * update readme. * minor edits to readme. * debugging statement. * explicitly placement of the pipeline. * bump minimum diffusers version. * fix: device attribute error. * weight dtype. * debugging. * add dtype inform. * add seoarate te and vae. * add: explicit casting/ * remove casting. * up. * up 2. * up 3. * autocast. * disable mixed-precision in the final inference. * debugging information. * autocasting. * add: instructpix2pix training section to the docs. * Empty-Commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-23 10:15:01 +05:30
Sayak Paul	c681ad1af2	add: section on multiple controlnets. (#2762 ) * add: section on multiple controlnets. Co-authored-by: William Berman <WLBberman@gmail.com> * fix: docs. * fix: docs. --------- Co-authored-by: William Berman <WLBberman@gmail.com>	2023-03-23 09:55:25 +05:30
Pedro Cuenca	92e1164e2e	`mps`: remove warmup passes (#2771 ) * Remove warmup passes in mps tests. * Update mps docs: no warmup pass in PyTorch 2 * Update imports.	2023-03-22 19:29:27 +01:00
Patrick von Platen	ca1a22296d	[MS Text To Video] Add first text to video (#2738 ) * [MS Text To Video} Add first text to video * upload * make first model example * match unet3d params * make sure weights are correcctly converted * improve * forward pass works, but diff result * make forward work * fix more * finish * refactor video output class. * feat: add support for a video export utility. * fix: opencv availability check. * run make fix-copies. * add: docs for the model components. * add: standalone pipeline doc. * edit docstring of the pipeline. * add: right path to TransformerTempModel * add: first set of tests. * complete fast tests for text to video. * fix bug * up * three fast tests failing. * add: note on slow tests * make work with all schedulers * apply styling. * add slow tests * change file name * update * more correction * more fixes * finish * up * Apply suggestions from code review * up * finish * make copies * fix pipeline tests * fix more tests * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * apply suggestions * up * revert --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-03-22 18:39:33 +01:00
Steven Liu	7fe88613fa	[docs] Clarify purpose of reproducibility docs (#2756 ) * clarify purpose of repro docs * apply feedback	2023-03-21 17:35:21 -07:00
Pedro Cuenca	a39d42b91d	[docs] update torch 2 benchmark (#2764 ) * Update benchmark for A100, 3090, 3090 Ti, 4090. * Link to PyTorch blog. * Update install instructions.	2023-03-21 17:41:13 +00:00
Patrick von Platen	2120b4eee3	Improve Contribution Doc (#2043 ) * first refactor * more text * improve * finish * up * up * up * up * finish * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> * up * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finished * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finished --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-03-21 13:41:29 +01:00
regisss	c10d6854c0	Update numbers for Habana Gaudi in documentation (#2734 ) Update numbers for Habana Gaudi in doc	2023-03-21 11:59:28 +01:00
Sayak Paul	73bdad08a1	add: controlnet entry to training section in the docs. (#2677 ) * add: controlnet entry to training section in the docs. * formatting. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * wrap in a tip block. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-03-21 07:23:24 +05:30
M. Tolga Cangöz	ba87c1607c	Update text_inversion.mdx (#2751 ) Fix typos	2023-03-20 13:20:50 -07:00
M. Tolga Cangöz	afe59a920e	Update philosophy.mdx (#2752 ) Fix typos	2023-03-20 13:19:43 -07:00
M. Tolga Cangöz	25ed7cb08b	Update dreambooth.mdx (#2742 ) Fix typos	2023-03-20 17:40:56 +00:00
M. Tolga Cangöz	af86b0ccac	Update fp16.mdx (#2746 ) Fix typos	2023-03-20 17:39:55 +00:00
M. Tolga Cangöz	a9f28b687c	Update torch2.0.mdx (#2748 ) Fix typos	2023-03-20 17:39:04 +00:00
M. Tolga Cangöz	d91dc57d8a	Update mps.mdx (#2749 ) Fix typos	2023-03-20 17:33:23 +00:00
Sayak Paul	a16957159e	[docs] Update ONNX doc to use `optimum` (#2702 ) * minor edits to onnx and openvino docs. * Apply suggestions from code review Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> --------- Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>	2023-03-17 18:17:42 +01:00
YiYi Xu	f4bbcb29c0	fix image link in inpaint doc (#2693 ) fix link Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-03-16 19:35:27 -10:00
Steven Liu	77e0ea8048	[docs] Add safety checker to ethical guidelines (#2699 ) add safety checker	2023-03-16 09:39:39 -07:00
Patrick von Platen	e828232780	Rename attention (#2691 ) * rename file * rename attention * fix more * rename more * up * more deprecation imports * fixes	2023-03-16 00:35:54 +01:00
Steven Liu	588e50bc57	[docs] Reorganize table of contents (#2671 ) * reorg toc * reorg toc some more * remove duplicate config	2023-03-15 16:28:18 -07:00
Steven Liu	a72d14fc8d	[docs] Create better navigation on index (#2658 ) * create updated nav for index * fix header * apply feedback	2023-03-15 11:58:04 -07:00
Steven Liu	1c2c594e3d	[docs] Add overviews to each section (#2657 ) * add overviews to each section * fix typo in toctree * apply feedbacks	2023-03-15 11:57:32 -07:00
M. Tolga Cangöz	c0b4d72095	Update unconditional_image_generation.mdx (#2686 ) Fix typos	2023-03-15 18:19:57 +01:00
M. Tolga Cangöz	78afb84436	Update controlling_generation.mdx (#2690 ) Fix typos	2023-03-15 18:18:41 +01:00
M. Tolga Cangöz	91570b2fda	Update conditional_image_generation.mdx (#2687 ) Fix typos	2023-03-15 18:16:32 +01:00
M. Tolga Cangöz	3584f6b345	Update img2img.mdx (#2688 ) Fix typos	2023-03-15 18:15:59 +01:00
M. Tolga Cangöz	b4bb5345cd	Update kerascv.mdx (#2685 ) Fix typos	2023-03-15 18:15:51 +01:00
M. Tolga Cangöz	e71f73d8df	Update custom_pipeline_overview.mdx (#2684 ) Fix typos	2023-03-15 18:14:37 +01:00
Sayak Paul	c9477bf8a8	[Docs] Adds a documentation page for evaluating diffusion models (#2516 ) * add a documentation page for evaluating diffuion models. * fix: checkpoint link. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> * formatting fixes. * formatting fixes. * link to partiprompts dataset on hub. * reflect on Pedro's comments. Co-authored-by: Pedro <pedro@huggingface.co> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * reflect on Pedro's comments. Co-authored-by: Pedro <pedro@huggingface.co> * update mention of FID. * Apply suggestions from code review Co-authored-by: Will Berman <wlbberman@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> * minor nit. * finish edges and add colab notebook. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * run formatting. * additional feedback. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: Pedro <pedro@huggingface.co> Co-authored-by: Will Berman <wlbberman@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-03-15 17:05:01 +05:30
M. Tolga Cangöz	fa7a576191	Update schedulers.mdx (#2647 ) Fix typos	2023-03-13 16:41:28 +01:00
M. Tolga Cangöz	bbab855322	Update loading.mdx (#2642 ) Fix typos	2023-03-11 16:49:05 +01:00
Steven Liu	d5ce55293c	[docs] Build Jax notebooks for real (#2641 ) build jax notebooks for real	2023-03-11 01:21:14 +01:00
M. Tolga Cangöz	f1ab955f64	Update basic_training.mdx (#2639 ) Add 'import os'	2023-03-10 14:19:12 +01:00
M. Tolga Cangöz	9360bb94c3	Update quicktour.mdx (#2637 ) Fix typo	2023-03-10 14:17:10 +01:00
Sian	4aa68291a9	add translated docs (#2587 ) * add translated docs * improve translated content * improve translated content * Modify the translation content	2023-03-10 13:55:12 +01:00
Patrick von Platen	0650d641a3	Revert "[docs] Build notebooks from Markdown" (#2625 ) Revert "[docs] Build notebooks from Markdown (#2570)" This reverts commit `78507bda24`.	2023-03-09 15:45:24 +01:00
Steven Liu	251a34add8	Migrate blog content to docs (#2477 ) * first draft * ✨ minor edits * 💄 make style * oops add to toc * 🖍 reframe around understanding components * 🖍 apply feedback * 🖍 apply feedback	2023-03-09 13:20:49 +01:00
YiYi Xu	a062e47ec3	add flax pipelines to api doc + doc string examples (#2600 ) * add api doc for flax pipeline + doc string examples * make style --------- Co-authored-by: yiyixuxu <yixu@yis-macbook-pro.lan> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-09 13:00:29 +01:00
Ella Charlaix	a5d2ee9d47	Add OpenVINO documentation (#2569 ) * Add OpenVINO documentation * Update docs/source/en/optimization/open_vino.mdx Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-03-08 20:07:44 +01:00
Steven Liu	68545a15d9	[docs] Update unconditional image generation docs (#2592 ) * 📝 update and minor refactor * ✨ minor edits	2023-03-08 09:47:49 -08:00

257 Commits