diffusers

Sanchit Gandhi b94880e536 Add AudioLDM (#2232)	2023-03-23 19:00:21 +01:00

* Add AudioLDM
* up
* add vocoder
* start unet
* unconditional unet
* clap, vocoder and vae
* clean-up: conversion scripts
* fix: conversion script token_type_ids
* clean-up: pipeline docstring
* tests: from SD
* clean-up: cpu offload vocoder instead of safety checker
* feat: adapt tests to audioldm
* feat: add docs
* clean-up: amend pipeline docstrings
* clean-up: make style
* clean-up: make fix-copies
* fix: add doc path to toctree
* clean-up: args for conversion script
* clean-up: paths to checkpoints
* fix: use conditional unet
* clean-up: make style
* fix: type hints for UNet
* clean-up: docstring for UNet
* clean-up: make style
* clean-up: remove duplicate in docstring
* clean-up: make style
* clean-up: make fix-copies
* clean-up: move imports to start in code snippet
* fix: pass cross_attention_dim as a list/tuple to unet
* clean-up: make fix-copies
* fix: update checkpoint path
* fix: unet cross_attention_dim in tests
* film embeddings -> class embeddings
* Apply suggestions from code review
  Co-authored-by: Will Berman <wlbberman@gmail.com>
* fix: unet film embed to use existing args
* fix: unet tests to use existing args
* fix: make style
* fix: transformers import and version in init
* clean-up: make style
* Revert "clean-up: make style"
  This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66.
* clean-up: make style
* clean-up: use pipeline tester mixin tests where poss
* clean-up: skip attn slicing test
* fix: add torch dtype to docs
* fix: remove conversion script out of src
* fix: remove .detach from 1d waveform
* fix: reduce default num inf steps
* fix: swap height/width -> audio_length_in_s
* clean-up: make style
* fix: remove nightly tests
* fix: imports in conversion script
* clean-up: slim-down to two slow tests
* clean-up: slim-down fast tests
* fix: batch consistent tests
* clean-up: make style
* clean-up: remove vae slicing fast test
* clean-up: propagate changes to doc
* fix: increase test tol to 1e-2
* clean-up: finish docs
* clean-up: make style
* feat: vocoder / VAE compatibility check
* feat: possibly expand / cut audio waveform
* fix: pipeline call signature test
* fix: slow tests output len
* clean-up: make style
* make style

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: William Berman <WLBberman@gmail.com>

__init__.py	[Tests] Improve unet / vae tests (#1018)	2022-10-28 13:43:26 +02:00
test_models_unet_1d.py	[Copyright] 2023 (#2524)	2023-03-01 10:31:00 +01:00
test_models_unet_2d.py	[Tests] Correct PT2 (#2724)	2023-03-18 18:38:04 +01:00
test_models_unet_2d_condition.py	Add AudioLDM (#2232)	2023-03-23 19:00:21 +01:00
test_models_unet_2d_flax.py	Replace flake8 with ruff and update black (#2279)	2023-02-07 23:46:23 +01:00
test_models_unet_3d_condition.py	Skip `mps` in text-to-video tests (#2792)	2023-03-23 14:39:03 +01:00
test_models_vae.py	`mps`: remove warmup passes (#2771)	2023-03-22 19:29:27 +01:00
test_models_vae_flax.py	[Tests] Improve unet / vae tests (#1018)	2022-10-28 13:43:26 +02:00
test_models_vq.py	`mps`: remove warmup passes (#2771)	2023-03-22 19:29:27 +01:00