Shivam Shrirao
2c29830075
Merge pull request #240 from NatalieHanzhina/main
...
Accelerator issue is fixed
2023-06-08 13:24:20 +05:30
natkaha
b320b59ff6
Update train_dreambooth.py
2023-06-08 08:27:06 +04:00
Shivam Shrirao
2933b6b20b
Merge branch 'main' of github.com:ShivamShrirao/diffusers
2023-03-27 02:38:35 +05:30
Shivam Shrirao
17ff5f59d2
temporarily use python3 instead of accelerate lauch for now
2023-03-27 02:37:50 +05:30
Shivam Shrirao
52e0f5de8d
Merge remote-tracking branch 'upstream/main'
2023-03-27 02:27:04 +05:30
Bahjat Kawar
9fb0217548
StableDiffusionModelEditingPipeline documentation ( #2810 )
...
* comment update
* comment update
2023-03-24 22:41:31 +05:30
Sayak Paul
5883d8d4d1
[Docs] update docs (Stable unCLIP) to reflect the updated ckpts. ( #2815 )
...
* update docs to reflect the updated ckpts.
* update: point about prompt.
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* emove image resizing.
* Apply suggestions from code review
* Apply suggestions from code review
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-03-24 17:24:19 +01:00
Patrick von Platen
dbcb15c25f
[Stable UnCLIP] Finish Stable UnCLIP ( #2814 )
...
* up
* fix more 7
* up
* finish
2023-03-24 17:04:41 +01:00
PeixuanZuo
c4892f1855
Update onnxruntime package candidates ( #2666 )
...
* update import onnxruntime package, enable onnxruntime-rocm and onnxruntime-training
* add ort_nightly_gpu
2023-03-24 12:23:05 +01:00
Kashif Rasul
f6feb69991
Relax DiT test ( #2808 )
...
* Relax DiT test
* relax 2 more tests
* fix style
* skip test on mac due to older protobuf
2023-03-24 11:28:55 +01:00
Bahjat Kawar
37a44bb283
Add ModelEditing pipeline ( #2721 )
...
* TIME first commit
* styling.
* styling 2.
* fixes; tests
* apply styling and doc fix.
* remove sups.
* fixes
* remove temp file
* move augmentations to const
* added doc entry
* code quality
* customize augmentations
* quality
* quality
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2023-03-24 13:01:39 +05:30
Haofan Wang
4a98d6e097
Update train_text_to_image_lora.py ( #2795 )
2023-03-24 11:45:35 +05:30
Sanchit Gandhi
b94880e536
Add AudioLDM ( #2232 )
...
* Add AudioLDM
* up
* add vocoder
* start unet
* unconditional unet
* clap, vocoder and vae
* clean-up: conversion scripts
* fix: conversion script token_type_ids
* clean-up: pipeline docstring
* tests: from SD
* clean-up: cpu offload vocoder instead of safety checker
* feat: adapt tests to audioldm
* feat: add docs
* clean-up: amend pipeline docstrings
* clean-up: make style
* clean-up: make fix-copies
* fix: add doc path to toctree
* clean-up: args for conversion script
* clean-up: paths to checkpoints
* fix: use conditional unet
* clean-up: make style
* fix: type hints for UNet
* clean-up: docstring for UNet
* clean-up: make style
* clean-up: remove duplicate in docstring
* clean-up: make style
* clean-up: make fix-copies
* clean-up: move imports to start in code snippet
* fix: pass cross_attention_dim as a list/tuple to unet
* clean-up: make fix-copies
* fix: update checkpoint path
* fix: unet cross_attention_dim in tests
* film embeddings -> class embeddings
* Apply suggestions from code review
Co-authored-by: Will Berman <wlbberman@gmail.com>
* fix: unet film embed to use existing args
* fix: unet tests to use existing args
* fix: make style
* fix: transformers import and version in init
* clean-up: make style
* Revert "clean-up: make style"
This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66.
* clean-up: make style
* clean-up: use pipeline tester mixin tests where poss
* clean-up: skip attn slicing test
* fix: add torch dtype to docs
* fix: remove conversion script out of src
* fix: remove .detach from 1d waveform
* fix: reduce default num inf steps
* fix: swap height/width -> audio_length_in_s
* clean-up: make style
* fix: remove nightly tests
* fix: imports in conversion script
* clean-up: slim-down to two slow tests
* clean-up: slim-down fast tests
* fix: batch consistent tests
* clean-up: make style
* clean-up: remove vae slicing fast test
* clean-up: propagate changes to doc
* fix: increase test tol to 1e-2
* clean-up: finish docs
* clean-up: make style
* feat: vocoder / VAE compatibility check
* feat: possibly expand / cut audio waveform
* fix: pipeline call signature test
* fix: slow tests output len
* clean-up: make style
* make style
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: William Berman <WLBberman@gmail.com>
2023-03-23 19:00:21 +01:00
Steven Liu
1870fb05a9
[docs] Add Colab notebooks and Spaces ( #2713 )
...
* add colab notebook and spaces
* fix image link
2023-03-23 09:48:58 -07:00
YiYi Xu
df91c44712
Flax controlnet ( #2727 )
...
* add contronet flax
---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
2023-03-23 05:46:23 -10:00
Pedro Cuenca
aa0531fa8d
Skip `mps` in text-to-video tests ( #2792 )
...
* Skip mps in text-to-video tests.
* style
* Skip UNet3D mps tests.
2023-03-23 14:39:03 +01:00
Haofan Wang
dc5b4e2342
Update train_text_to_image_lora.py ( #2767 )
...
* Update train_text_to_image_lora.py
* Update train_text_to_image_lora.py
* Update train_text_to_image_lora.py
* Update train_text_to_image_lora.py
* format
2023-03-23 14:28:47 +01:00
Sayak Paul
0d7aac3e8d
[Docs] small fixes to the text to video doc. ( #2787 )
...
* small fixes to the text to video doc.
* add: Spaces link.
* add: warning on research-only model.
2023-03-23 18:57:02 +05:30
Nipun Jindal
055c90f589
[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline ( #2779 )
...
[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines
Co-authored-by: njindal <njindal@adobe.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-03-23 14:20:24 +01:00
Kashif Rasul
2ef9bdd76f
Music Spectrogram diffusion pipeline ( #1044 )
...
* initial TokenEncoder and ContinuousEncoder
* initial modules
* added ContinuousContextTransformer
* fix copy paste error
* use numpy for get_sequence_length
* initial terminal relative positional encodings
* fix weights keys
* fix assert
* cross attend style: concat encodings
* make style
* concat once
* fix formatting
* Initial SpectrogramPipeline
* fix input_tokens
* make style
* added mel output
* ignore weights for config
* move mel to numpy
* import pipeline
* fix class names and import
* moved models to models folder
* import ContinuousContextTransformer and SpectrogramDiffusionPipeline
* initial spec diffusion converstion script
* renamed config to t5config
* added weight loading
* use arguments instead of t5config
* broadcast noise time to batch dim
* fix call
* added scale_to_features
* fix weights
* transpose laynorm weight
* scale is a vector
* scale the query outputs
* added comment
* undo scaling
* undo depth_scaling
* inital get_extended_attention_mask
* attention_mask is none in self-attention
* cleanup
* manually invert attention
* nn.linear need bias=False
* added T5LayerFFCond
* remove to fix conflict
* make style and dummy
* remove unsed variables
* remove predict_epsilon
* Move accelerate to a soft-dependency (#1134 )
* finish
* finish
* Update src/diffusers/modeling_utils.py
* Update src/diffusers/pipeline_utils.py
Co-authored-by: Anton Lozhkov <anton@huggingface.co>
* more fixes
* fix
Co-authored-by: Anton Lozhkov <anton@huggingface.co>
* fix order
* added initial midi to note token data pipeline
* added int to int tokenizer
* remove duplicate
* added logic for segments
* add melgan to pipeline
* move autoregressive gen into pipeline
* added note_representation_processor_chain
* fix dtypes
* remove immutabledict req
* initial doc
* use np.where
* require note_seq
* fix typo
* update dependency
* added note-seq to test
* added is_note_seq_available
* fix import
* added toc
* added example usage
* undo for now
* moved docs
* fix merge
* fix imports
* predict first segment
* avoid un-needed copy to and from cpu
* make style
* Copyright
* fix style
* add test and fix inference steps
* remove bogus files
* reorder models
* up
* remove transformers dependency
* make work with diffusers cross attention
* clean more
* remove @
* improve further
* up
* uP
* Apply suggestions from code review
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
* loop over all tokens
* make style
* Added a section on the model
* fix formatting
* grammer
* formatting
* make fix-copies
* Update src/diffusers/pipelines/__init__.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* added callback ad optional ionnx
* do not squeeze batch dim
* clean up more
* upload
* convert jax to nnumpy
* make style
* fix warning
* make fix-copies
* fix warning
* add initial fast tests
* add initial pipeline_params
* eval mode due to dropout
* skip batch tests as pipeline runs on a single file
* make style
* fix relative path
* fix doc tests
* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* add MidiProcessor
* format
* fix org
* Apply suggestions from code review
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
* make style
* pin protobuf to <4
* fix formatting
* white space
* tensorboard needs protobuf
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>
2023-03-23 14:06:17 +01:00
Naoki Ainoya
14e3a28c12
Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' ( #2732 )
...
The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.
2023-03-23 13:49:22 +01:00
Mishig
8e35ef0142
[doc wip] literalinclude ( #2718 )
2023-03-23 13:42:54 +01:00
Patrick von Platen
a8315ce1a9
[UNet3DModel] Fix with attn processor ( #2790 )
...
* [UNet3DModel] Fix attn processor
* make style
2023-03-23 09:56:02 +01:00
Sayak Paul
0d633a42f4
deduplicate training section in the docs. ( #2788 )
2023-03-23 11:21:53 +05:30
Sayak Paul
9dc84448ac
[Examples] InstructPix2Pix instruct training script ( #2478 )
...
* add: initial implementation of the pix2pix instruct training script.
* shorten cli arg.
* fix: main process check.
* fix: dataset column names.
* simplify tokenization.
* proper placement of null conditions.
* apply styling.
* remove debugging message for conditioning do.
* complete license.
* add: requirements.tzt
* wandb column name order.
* fix: augmentation.
* change: dataset_id.
* fix: convert_to_np() call.
* fix: reshaping.
* fix: final ema copy.
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* address PR comments.
* add: readme details.
* config fix.
* downgrade version.
* reduce image width in the readme.
* note on hyperparameters during generation.
* add: output images.
* update readme.
* minor edits to readme.
* debugging statement.
* explicitly placement of the pipeline.
* bump minimum diffusers version.
* fix: device attribute error.
* weight dtype.
* debugging.
* add dtype inform.
* add seoarate te and vae.
* add: explicit casting/
* remove casting.
* up.
* up 2.
* up 3.
* autocast.
* disable mixed-precision in the final inference.
* debugging information.
* autocasting.
* add: instructpix2pix training section to the docs.
* Empty-Commit
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-03-23 10:15:01 +05:30
Sayak Paul
c681ad1af2
add: section on multiple controlnets. ( #2762 )
...
* add: section on multiple controlnets.
Co-authored-by: William Berman <WLBberman@gmail.com>
* fix: docs.
* fix: docs.
---------
Co-authored-by: William Berman <WLBberman@gmail.com>
2023-03-23 09:55:25 +05:30
Haofan Wang
e0d8c9ef83
Support for Offset Noise in examples ( #2753 )
...
* add noise offset
* make style
2023-03-23 09:36:17 +05:30
Pedro Cuenca
92e1164e2e
`mps`: remove warmup passes ( #2771 )
...
* Remove warmup passes in mps tests.
* Update mps docs: no warmup pass in PyTorch 2
* Update imports.
2023-03-22 19:29:27 +01:00
Patrick von Platen
ca1a22296d
[MS Text To Video] Add first text to video ( #2738 )
...
* [MS Text To Video} Add first text to video
* upload
* make first model example
* match unet3d params
* make sure weights are correcctly converted
* improve
* forward pass works, but diff result
* make forward work
* fix more
* finish
* refactor video output class.
* feat: add support for a video export utility.
* fix: opencv availability check.
* run make fix-copies.
* add: docs for the model components.
* add: standalone pipeline doc.
* edit docstring of the pipeline.
* add: right path to TransformerTempModel
* add: first set of tests.
* complete fast tests for text to video.
* fix bug
* up
* three fast tests failing.
* add: note on slow tests
* make work with all schedulers
* apply styling.
* add slow tests
* change file name
* update
* more correction
* more fixes
* finish
* up
* Apply suggestions from code review
* up
* finish
* make copies
* fix pipeline tests
* fix more tests
* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* apply suggestions
* up
* revert
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
2023-03-22 18:39:33 +01:00
Steven Liu
7fe88613fa
[docs] Clarify purpose of reproducibility docs ( #2756 )
...
* clarify purpose of repro docs
* apply feedback
2023-03-21 17:35:21 -07:00
Pedro Cuenca
a39d42b91d
[docs] update torch 2 benchmark ( #2764 )
...
* Update benchmark for A100, 3090, 3090 Ti, 4090.
* Link to PyTorch blog.
* Update install instructions.
2023-03-21 17:41:13 +00:00
Will Berman
ca1e40726e
stable diffusion depth batching fix ( #2757 )
2023-03-21 10:18:44 -07:00
1lint
b33bd91fae
Add option to set dtype in pipeline.to() method ( #2317 )
...
add test_to_dtype to check pipe.to(fp16)
2023-03-21 15:21:23 +01:00
Pedro Cuenca
1fcf279d74
Fix mps tests on torch 2.0 ( #2766 )
2023-03-21 15:19:31 +01:00
Hyowon Ha
58bcf46a8f
Add guidance start/end parameters to StableDiffusionControlNetImg2ImgPipeline ( #2731 )
...
* Add guidance start/end parameters to community controlnet img2img pipeline
* Fix formats
2023-03-21 14:38:43 +01:00
Nipun Jindal
0042efd015
[1929]: Add CLIP guidance for Img2Img stable diffusion pipeline ( #2723 )
...
* [Img2Img]: Copyover img2img pipeline
* [Img2Img]: img2img pipeline
* [Img2Img]: img2img pipeline
* [Img2Img]: img2img pipeline
---------
Co-authored-by: njindal <njindal@adobe.com>
2023-03-21 13:53:00 +01:00
Alexander Pivovarov
f024e00398
Fix typos ( #2715 )
...
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-03-21 13:45:04 +01:00
Patrick von Platen
2120b4eee3
Improve Contribution Doc ( #2043 )
...
* first refactor
* more text
* improve
* finish
* up
* up
* up
* up
* finish
* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
* up
* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* finished
* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* finished
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
2023-03-21 13:41:29 +01:00
regisss
c10d6854c0
Update numbers for Habana Gaudi in documentation ( #2734 )
...
Update numbers for Habana Gaudi in doc
2023-03-21 11:59:28 +01:00
Sayak Paul
73bdad08a1
add: controlnet entry to training section in the docs. ( #2677 )
...
* add: controlnet entry to training section in the docs.
* formatting.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* wrap in a tip block.
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-03-21 07:23:24 +05:30
M. Tolga Cangöz
ba87c1607c
Update text_inversion.mdx ( #2751 )
...
Fix typos
2023-03-20 13:20:50 -07:00
M. Tolga Cangöz
afe59a920e
Update philosophy.mdx ( #2752 )
...
Fix typos
2023-03-20 13:19:43 -07:00
M. Tolga Cangöz
25ed7cb08b
Update dreambooth.mdx ( #2742 )
...
Fix typos
2023-03-20 17:40:56 +00:00
M. Tolga Cangöz
af86b0ccac
Update fp16.mdx ( #2746 )
...
Fix typos
2023-03-20 17:39:55 +00:00
M. Tolga Cangöz
a9f28b687c
Update torch2.0.mdx ( #2748 )
...
Fix typos
2023-03-20 17:39:04 +00:00
M. Tolga Cangöz
d91dc57d8a
Update mps.mdx ( #2749 )
...
Fix typos
2023-03-20 17:33:23 +00:00
Patrick von Platen
fdcff560d0
Fix more slow tests
2023-03-18 19:41:38 +00:00
Patrick von Platen
ec2c1bc95f
Update README.md
2023-03-18 19:39:24 +01:00
Patrick von Platen
9ecd924859
[Tests] Correct PT2 ( #2724 )
...
* [Tests] Correct PT2
* correct more
* move versatile to nightly
* up
* up
* again
* Apply suggestions from code review
2023-03-18 18:38:04 +01:00
Andy
116f70cbf8
Enabling gradient checkpointing for VAE ( #2536 )
...
* updated black format
* update black format
* make style format
* updated line endings
* update code formatting
* Update examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/vae.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/vae.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* added vae gradient checkpointing test
* make style
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Will Berman <wlbberman@gmail.com>
2023-03-17 14:59:38 -07:00