diffusers

Commit Graph

Author	SHA1	Message	Date
Patrick von Platen	29b2c93c90	Make repo structure consistent (#1862 ) * move files a bit * more refactors * fix more * more fixes * fix more onnx * make style * upload * fix * up * fix more * up again * up * small fix * Update src/diffusers/__init__.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * correct Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-30 11:51:08 +01:00
Will Berman	53c8147afe	unCLIP image variation (#1781 ) * unCLIP image variation * remove prior comment re: @pcuenca * stable diffusion -> unCLIP re: @pcuenca * add copy froms re: @patil-suraj	2022-12-28 14:17:09 +01:00
camenduru	1f1b6c6544	Device to use (e.g. cpu, cuda:0, cuda:1, etc.) (#1844 ) * Device to use (e.g. cpu, cuda:0, cuda:1, etc.) * "cuda" if torch.cuda.is_available() else "cpu"	2022-12-27 14:42:56 +01:00
Mikołaj Siedlarek	8890758823	Correct help text for scheduler_type flag in scripts. (#1749 )	2022-12-19 11:27:23 +01:00
Will Berman	2dcf64b72a	kakaobrain unCLIP (#1428 ) * [wip] attention block updates * [wip] unCLIP unet decoder and super res * [wip] unCLIP prior transformer * [wip] scheduler changes * [wip] text proj utility class * [wip] UnCLIPPipeline * [wip] kakaobrain unCLIP convert script * [unCLIP pipeline] fixes re: @patrickvonplaten remove callbacks move denoising loops into call function * UNCLIPScheduler re: @patrickvonplaten Revert changes to DDPMScheduler. Make UNCLIPScheduler, a modified DDPM scheduler with changes to support karlo * mask -> attention_mask re: @patrickvonplaten * [DDPMScheduler] remove leftover change * [docs] PriorTransformer * [docs] UNet2DConditionModel and UNet2DModel * [nit] UNCLIPScheduler -> UnCLIPScheduler matches existing unclip naming better * [docs] SchedulingUnCLIP * [docs] UnCLIPTextProjModel * refactor * finish licenses * rename all to attention_mask and prep in models * more renaming * don't expose unused configs * final renaming fixes * remove x attn mask when not necessary * configure kakao script to use new class embedding config * fix copies * [tests] UnCLIPScheduler * finish x attn * finish * remove more * rename condition blocks * clean more * Apply suggestions from code review * up * fix * [tests] UnCLIPPipelineFastTests * remove unused imports * [tests] UnCLIPPipelineIntegrationTests * correct * make style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-12-18 15:15:30 -08:00
apolinario	b417042291	Fix wrong type checking in `convert_diffusers_to_original_stable_diffusion.py` (#1681 ) * Fix type checking remainders * Remove IS_V20_MODEL flag always being True Co-authored-by: apolinario <joaopaulo.passos+multimodal@gmail.com>	2022-12-13 12:44:20 +01:00
Patrick von Platen	3ce6380d3a	[SD] Make sure scheduler is correct when converting (#1667 )	2022-12-12 16:57:48 +01:00
Cyberes	d2dc4de303	Handle missing global_step key in scripts/convert_original_stable_diffusion_to_diffusers.py (#1612 ) handle missing global_step key and don't download config if it already exists	2022-12-12 16:10:52 +01:00
lawfordp2017	31444f5790	Add text encoder conversion (#1559 ) * Initial code for attempt at improving SD <--> diffusers conversions for v2.0 * Updates to support round-trip between orig. SD 2.0 and diffusers models * Corrected formatting to Black standard * Correcting import formatting * Fixed imports (properly this time) * add some corrections * remove inference files Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-12-12 10:07:42 +01:00
Patrick von Platen	896c98a2ae	Add paint by example (#1533 ) * add paint by example * mkae loading possibel * up * Update src/diffusers/models/attention.py * up * finalize weight structure * make example work * make it work * up * up * fix * del * add * update * Apply suggestions from code review * correct transformer 2d * finish * up * up * up * up * fix * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Apply suggestions from code review * up * finish Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-07 11:06:30 +01:00
Patrick von Platen	922d56a19c	Correct type from int to str in conversion script sd	2022-12-05 18:51:29 +00:00
Patrick von Platen	f21415d1d9	Update conversion script to correctly handle SD 2 (#1511 ) * Conversion SD 2 * finish	2022-12-02 12:28:01 +01:00
Anton Lozhkov	e65b71aba4	Add an explicit `--image_size` to the conversion script (#1509 ) * Add an explicit `--image_size` to the conversion script * style	2022-12-01 19:22:48 +01:00
Anton Lozhkov	86aa747da9	Fix ONNX conversion and inference (#1416 )	2022-11-25 14:51:17 +01:00
Patrick von Platen	9f10c545cb	Fix sample size conversion script (#1408 ) up	2022-11-25 11:26:27 +01:00
Patrick von Platen	2625fb59dc	[Versatile Diffusion] Add versatile diffusion model (#1283 ) * up * convert dual unet * revert dual attn * adapt for vd-official * test the full pipeline * mixed inference * mixed inference for text2img * add image prompting * fix clip norm * split text2img and img2img * fix format * refactor text2img * mega pipeline * add optimus * refactor image var * wip text_unet * text unet end to end * update tests * reshape * fix image to text * add some first docs * dual guided pipeline * fix token ratio * propose change * dual transformer as a native module * DualTransformer(nn.Module) * DualTransformer(nn.Module) * correct unconditional image * save-load with mega pipeline * remove image to text * up * uP * fix * up * final fix * remove_unused_weights * test updates * save progress * uP * fix dual prompts * some fixes * finish * style * finish renaming * up * fix * fix * fix * finish Co-authored-by: anton-l <anton@huggingface.co>	2022-11-23 19:03:45 +01:00
Will Berman	f1fcfdeec5	vq diffusion classifier free sampling (#1294 ) * vq diffusion classifier free sampling * correct * uP Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-11-16 17:51:43 +01:00
Patrick von Platen	4625f04bc0	remove bogus files	2022-11-15 17:34:00 +00:00
Patrick von Platen	554b374d20	Merge branch 'main' of https://github.com/huggingface/diffusers into main	2022-11-15 17:17:47 +00:00
Nathan Lambert	7c5fef81e0	Add UNet 1d for RL model for planning + colab (#105 ) * re-add RL model code * match model forward api * add register_to_config, pass training tests * fix tests, update forward outputs * remove unused code, some comments * add to docs * remove extra embedding code * unify time embedding * remove conv1d output sequential * remove sequential from conv1dblock * style and deleting duplicated code * clean files * remove unused variables * clean variables * add 1d resnet block structure for downsample * rename as unet1d * fix renaming * rename files * add get_block(...) api * unify args for model1d like model2d * minor cleaning * fix docs * improve 1d resnet blocks * fix tests, remove permuts * fix style * add output activation * rename flax blocks file * Add Value Function and corresponding example script to Diffuser implementation (#884) * valuefunction code * start example scripts * missing imports * bug fixes and placeholder example script * add value function scheduler * load value function from hub and get best actions in example * very close to working example * larger batch size for planning * more tests * merge unet1d changes * wandb for debugging, use newer models * success! * turns out we just need more diffusion steps * run on modal * merge and code cleanup * use same api for rl model * fix variance type * wrong normalization function * add tests * style * style and quality * edits based on comments * style and quality * remove unused var * hack unet1d into a value function * add pipeline * fix arg order * add pipeline to core library * community pipeline * fix couple shape bugs * style * Apply suggestions from code review Co-authored-by: Nathan Lambert <nathan@huggingface.co> * update post merge of scripts * add mdiblock / outblock architecture * Pipeline cleanup (#947) * valuefunction code * start example scripts * missing imports * bug fixes and placeholder example script * add value function scheduler * load value function from hub and get best actions in example * very close to working example * larger batch size for planning * more tests * merge unet1d changes * wandb for debugging, use newer models * success! * turns out we just need more diffusion steps * run on modal * merge and code cleanup * use same api for rl model * fix variance type * wrong normalization function * add tests * style * style and quality * edits based on comments * style and quality * remove unused var * hack unet1d into a value function * add pipeline * fix arg order * add pipeline to core library * community pipeline * fix couple shape bugs * style * Apply suggestions from code review * clean up comments * convert older script to using pipeline and add readme * rename scripts * style, update tests * delete unet rl model file * remove imports in src Co-authored-by: Nathan Lambert <nathan@huggingface.co> * Update src/diffusers/models/unet_1d_blocks.py * Update tests/test_models_unet.py * RL Cleanup v2 (#965) * valuefunction code * start example scripts * missing imports * bug fixes and placeholder example script * add value function scheduler * load value function from hub and get best actions in example * very close to working example * larger batch size for planning * more tests * merge unet1d changes * wandb for debugging, use newer models * success! * turns out we just need more diffusion steps * run on modal * merge and code cleanup * use same api for rl model * fix variance type * wrong normalization function * add tests * style * style and quality * edits based on comments * style and quality * remove unused var * hack unet1d into a value function * add pipeline * fix arg order * add pipeline to core library * community pipeline * fix couple shape bugs * style * Apply suggestions from code review * clean up comments * convert older script to using pipeline and add readme * rename scripts * style, update tests * delete unet rl model file * remove imports in src * add specific vf block and update tests * style * Update tests/test_models_unet.py Co-authored-by: Nathan Lambert <nathan@huggingface.co> * fix quality in tests * fix quality style, split test file * fix checks / tests * make timesteps closer to main * unify block API * unify forward api * delete lines in examples * style * examples style * all tests pass * make style * make dance_diff test pass * Refactoring RL PR (#1200) * init file changes * add import utils * finish cleaning files, imports * remove import flags * clean examples * fix imports, tests for merge * update readmes * hotfix for tests * quality * fix some tests * change defaults * more mps test fixes * unet1d defaults * do not default import experimental * defaults for tests * fix tests * fix-copies * fix * changes per Patrik's comments (#1285) * changes per Patrik's comments * update conversion script * fix renaming * skip more mps tests * last test fix * Update examples/rl/README.md Co-authored-by: Ben Glickenhaus <benglickenhaus@gmail.com>	2022-11-14 13:48:48 -08:00
Patrick von Platen	ec7c8d32b0	add conversion script for vae	2022-11-14 19:43:17 +00:00
Patrick von Platen	0248541dea	[Conversion] Improve conversion script (#1218 ) up	2022-11-09 15:46:08 +01:00
Duong A. Nguyen	3f7edc5f72	Fix layer names convert LDM script (#1206 ) fix script convert LDM	2022-11-09 12:08:30 +01:00
Anton Lozhkov	11f7d6f3cc	[ONNX] Improve ONNXPipeline scheduler compatibility, fix safety_checker (#1173 ) * [ONNX] Improve ONNX scheduler compatibility, fix safety_checker * typo	2022-11-08 14:39:11 +01:00
Will Berman	ef2ea33c3b	VQ-diffusion (#658 ) * Changes for VQ-diffusion VQVAE Add specify dimension of embeddings to VQModel: `VQModel` will by default set the dimension of embeddings to the number of latent channels. The VQ-diffusion VQVAE has a smaller embedding dimension, 128, than number of latent channels, 256. Add AttnDownEncoderBlock2D and AttnUpDecoderBlock2D to the up and down unet block helpers. VQ-diffusion's VQVAE uses those two block types. * Changes for VQ-diffusion transformer Modify attention.py so SpatialTransformer can be used for VQ-diffusion's transformer. SpatialTransformer: - Can now operate over discrete inputs (classes of vector embeddings) as well as continuous. - `in_channels` was made optional in the constructor so two locations where it was passed as a positional arg were moved to kwargs - modified forward pass to take optional timestep embeddings ImagePositionalEmbeddings: - added to provide positional embeddings to discrete inputs for latent pixels BasicTransformerBlock: - norm layers were made configurable so that the VQ-diffusion could use AdaLayerNorm with timestep embeddings - modified forward pass to take optional timestep embeddings CrossAttention: - now may optionally take a bias parameter for its query, key, and value linear layers FeedForward: - Internal layers are now configurable ApproximateGELU: - Activation function in VQ-diffusion's feedforward layer AdaLayerNorm: - Norm layer modified to incorporate timestep embeddings * Add VQ-diffusion scheduler * Add VQ-diffusion pipeline * Add VQ-diffusion convert script to diffusers * Add VQ-diffusion dummy objects * Add VQ-diffusion markdown docs * Add VQ-diffusion tests * some renaming * some fixes * more renaming * correct * fix typo * correct weights * finalize * fix tests * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finish * finish * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-03 16:10:28 +01:00
Patrick von Platen	d9cfe325a5	CompVis -> diffusers script - allow converting from merged checkpoint to either EMA or non-EMA (#991 ) * improve script * up	2022-10-26 12:32:07 +02:00
Patrick von Platen	88fa6b7d68	[Dance Diffusion] Add dance diffusion (#803 ) * start * add more logic * Update src/diffusers/models/unet_2d_condition_flax.py * match weights * up * make model work * making class more general, fixing missed file rename * small fix * make new conversion work * up * finalize conversion * up * first batch of variable renamings * remove c and c_prev var names * add mid and out block structure * add pipeline * up * finish conversion * finish * upload * more fixes * Apply suggestions from code review * add attr * up * uP * up * finish tests * finish * uP * finish * fix test * up * naming consistency in tests * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Nathan Lambert <nathan@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co> * remove hardcoded 16 * Remove bogus * fix some stuff * finish * improve logging * docs * upload Co-authored-by: Nathan Lambert <nol@berkeley.edu> Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Nathan Lambert <nathan@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-10-25 18:39:25 +02:00
SkyTNT	0b42b074b4	[Onnx] support half-precision and fix bugs for onnx pipelines (#932 ) * [Onnx] support half-precision and fix bugs for onnx pipelines * Update convert_stable_diffusion_checkpoint_to_onnx.py * style * fix has_nsfw_concept * Update convert_stable_diffusion_checkpoint_to_onnx.py * fix style	2022-10-25 16:48:53 +02:00
Anton Lozhkov	89d124945a	ONNX supervised inpainting (#906 ) * ONNX supervised inpainting * sync with the torch pipeline * fix concat * update ref values * back to 8 steps * type fix * make fix-copies	2022-10-19 17:03:31 +02:00
Anton Lozhkov	8eb9d9703d	Improve ONNX img2img numpy handling, temporarily fix the tests (#899 ) * [WIP] Onnx img2img determinism * more numpy + seed * numpy inpainting, tolerance * revert test workflow	2022-10-19 11:26:32 +02:00
Žilvinas Ledas	a9908ecfc1	Stable Diffusion image-to-image and inpaint using onnx. (#552 ) * * Stabe Diffusion img2img using onnx. * * Stabe Diffusion inpaint using onnx. * Export vae_encoder, upgrade img2img, add test * updated inpainting pipeline + test * style Co-authored-by: anton-l <anton@huggingface.co>	2022-10-18 17:44:01 +02:00
Justin Chu	75bb6d2d46	Fix ONNX conversion script opset argument type (#739 ) The opset argument should be an `int` but was set as a `str`.	2022-10-07 15:47:43 +02:00
Patrick von Platen	78744b6a8f	No more use_auth_token=True (#733 ) * up * uP * uP * make style * Apply suggestions from code review * up * finish	2022-10-05 17:16:15 +02:00
Kane Wallmann	b9eea06e9f	Include CLIPTextModel parameters in conversion (#695 )	2022-10-05 12:22:07 +02:00
Josh Achiam	4ff4d4db12	Checkpoint conversion script from Diffusers => Stable Diffusion (CompVis) (#701 ) * Conversion script * ran black * ran isort * remove unused import * map location so everything gets loaded onto CPU before conversion * ran black again * Update setup.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-10-04 13:33:38 +02:00
Anton Lozhkov	6bd005ebbe	[ONNX] Collate the external weights, speed up loading from the hub (#610 )	2022-09-21 22:26:30 +02:00
Yuta Hayashibe	76d492ea49	Fix typos and add Typo check GitHub Action (#483 ) * Fix typos * Add a typo check action * Fix a bug * Changed to manual typo check currently Ref: https://github.com/huggingface/diffusers/pull/483#pullrequestreview-1104468010 Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Removed a confusing message * Renamed "nin_shortcut" to "in_shortcut" * Add memo about NIN Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2022-09-16 15:36:51 +02:00
Suraj Patil	039958eae5	Stable diffusion text2img conversion script. (#154 ) * begin text2img conversion script * add fn to convert config * create config if not provided * update imports and use UNet2DConditionModel * fix imports, layer names * fix unet coversion * add function to convert VAE * fix vae conversion * update main * create text model * update config creating logic for unet * fix config creation * update script to create and save pipeline * remove unused imports * fix checkpoint loading * better name * save progress * finish * up * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-09-16 00:07:32 +02:00
Anton Lozhkov	8d9c4a531b	[ONNX] Stable Diffusion exporter and pipeline (#399 ) * initial export and design * update imports * custom prover, import fixes * Update src/diffusers/onnx_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/onnx_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * remove push_to_hub * Update src/diffusers/onnx_utils.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * remove torch_device * numpify the rest of the pipeline * torchify the safety checker * revert tensor * Code review suggestions + quality * fix tests * fix provider, add an end-to-end test * style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-09-08 15:17:28 +02:00
Patrick von Platen	cc59b05635	[ModelOutputs] Replace dict outputs with Dict/Dataclass and allow to return tuples (#334 ) * add outputs for models * add for pipelines * finish schedulers * better naming * adapt tests as well * replace dict access with . access * make schedulers works * finish * correct readme * make bcp compatible * up * small fix * finish * more fixes * more fixes * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update src/diffusers/models/vae.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Adapt model outputs * Apply more suggestions * finish examples * correct Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-09-05 14:49:26 +02:00
Anton Lozhkov	89793a97e2	Style the `scripts` directory (#250 ) Style scripts	2022-08-25 15:46:09 +02:00
Patrick von Platen	3100bc9670	[Vae and AutoencoderKL] Final clean of LDM checkpoints (#137 ) * [Vae and AutoencoderKL clean] * save intermediate finished work * more progress * more progress * finish modeling code * save intermediate * finish * Correct tests	2022-07-28 10:14:34 +02:00
Patrick von Platen	b1b99b59ac	some more cleaning	2022-07-21 02:11:28 +00:00
Patrick von Platen	836f3f35c2	Rename pipelines (#115 ) up	2022-07-21 01:39:46 +02:00
Patrick von Platen	9c3820d05a	Big Model Renaming (#109 ) * up * change model name * renaming * more changes * up * up * up * save checkpoint * finish api / naming * finish config renaming * rename all weights * finish really	2022-07-21 01:30:45 +02:00
Patrick von Platen	c3a15437f8	automatic logits verification >> visual logits verification	2022-07-19 16:14:17 +00:00
Patrick von Platen	8c31925b3b	Get diffusers ready 🚀🚀🚀 (#101 ) * big purge * more fixes * finish for now	2022-07-19 18:02:12 +02:00
Arthur	33344ed916	logits for google and compvis models (#100 ) * initial commit * quick fix	2022-07-19 18:02:04 +02:00
Patrick von Platen	37fe8e00b2	upload	2022-07-19 15:05:40 +00:00
Patrick von Platen	3f0b44b322	improve ddpm conversion script	2022-07-19 11:24:13 +00:00

1 2

59 Commits