diffusers

Commit Graph

Author	SHA1	Message	Date
Will Berman	ef2ea33c3b	VQ-diffusion (#658 ) * Changes for VQ-diffusion VQVAE Add specify dimension of embeddings to VQModel: `VQModel` will by default set the dimension of embeddings to the number of latent channels. The VQ-diffusion VQVAE has a smaller embedding dimension, 128, than number of latent channels, 256. Add AttnDownEncoderBlock2D and AttnUpDecoderBlock2D to the up and down unet block helpers. VQ-diffusion's VQVAE uses those two block types. * Changes for VQ-diffusion transformer Modify attention.py so SpatialTransformer can be used for VQ-diffusion's transformer. SpatialTransformer: - Can now operate over discrete inputs (classes of vector embeddings) as well as continuous. - `in_channels` was made optional in the constructor so two locations where it was passed as a positional arg were moved to kwargs - modified forward pass to take optional timestep embeddings ImagePositionalEmbeddings: - added to provide positional embeddings to discrete inputs for latent pixels BasicTransformerBlock: - norm layers were made configurable so that the VQ-diffusion could use AdaLayerNorm with timestep embeddings - modified forward pass to take optional timestep embeddings CrossAttention: - now may optionally take a bias parameter for its query, key, and value linear layers FeedForward: - Internal layers are now configurable ApproximateGELU: - Activation function in VQ-diffusion's feedforward layer AdaLayerNorm: - Norm layer modified to incorporate timestep embeddings * Add VQ-diffusion scheduler * Add VQ-diffusion pipeline * Add VQ-diffusion convert script to diffusers * Add VQ-diffusion dummy objects * Add VQ-diffusion markdown docs * Add VQ-diffusion tests * some renaming * some fixes * more renaming * correct * fix typo * correct weights * finalize * fix tests * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finish * finish * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-03 16:10:28 +01:00
Anton Lozhkov	cca59ce3a2	Add Apple M1 tests (#796 ) * [CI] Add Apple M1 tests * setup-python * python build * conda install * remove branch * only 3.8 is built for osx-arm * try fetching prebuilt tokenizers * use user cache * update shells * Reports and cleanup * -> MPS * Disable parallel tests * Better naming * investigate worker crash * return xdist * restart * num_workers=2 * still crashing? * faulthandler for segfaults * faulthandler for segfaults * remove restarts, stop on segfault * torch version * change installation order * Use pre-RC version of PyTorch. To be updated when it is released. * Skip crashing test on MPS, add new one that works. * Skip cuda tests in mps device. * Actually use generator in test. I think this was a typo. * make style Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-10-17 20:27:30 +02:00
Patrick von Platen	f1484b81b0	[Utils] Add deprecate function and move testing_utils under utils (#659 ) * [Utils] Add deprecate function * up * up * uP * up * up * up * up * uP * up * fix * up * move to deprecation utils file * fix * fix * fix more	2022-10-03 23:44:24 +02:00
Anton Lozhkov	761f0297b0	[Tests] Fix spatial transformer tests on GPU (#531 )	2022-09-16 16:04:37 +02:00
Sid Sahai	f73ca908e5	[Tests] Test attention.py (#368 ) * add test for AttentionBlock, SpatialTransformer * add context_dim, handle device * removed dropout test * fixes, add dropout test	2022-09-16 12:59:42 +02:00
Anton Lozhkov	ed22b4fd07	Revive `make quality` (#203 ) * Revive Make utils * Add datasets for training too	2022-08-17 15:22:04 +02:00
Suraj Patil	4e2674934f	add tests for 1D Up/Downsample blocks (#72 )	2022-07-04 11:41:04 +02:00
Patrick von Platen	321f9791d6	Downsample / Upsample - clean to 1D and 2D (#68 ) * make unet rl work * uploaad files / code * upload files * make style correct * finish	2022-07-03 22:26:33 +02:00
patil-suraj	7b9b946cb2	add tests for downsample block	2022-06-27 18:03:51 +02:00
patil-suraj	dc7c49e4e4	add tests for upsample blocks	2022-06-27 15:50:54 +02:00
Patrick von Platen	c7a39d38ad	refactor all sinus embeddings	2022-06-27 11:37:37 +00:00
Patrick von Platen	02a76c2c81	consolidate timestep embeds	2022-06-27 10:14:54 +00:00
Patrick von Platen	45a09bebf3	add first files	2022-06-27 10:46:39 +02:00

13 Commits