hf_text-generation-inference/launcher
Nicolas Patry 57b3495823
Fixing exl2 and other quantize tests again. (#2419)
* Fixing exl2 and other quantize tests again.

* Mark exl2 as non-release (so CI tests them; needs to be removed later).

* Fixing exl2 (by disabling cuda graphs)

* Fix quantization defaults without cuda graphs on exl2 (linked to new
issues with it).

* Removing serde override.

* Go back to released exl2 and remove log.

* Adding warnings for deprecated bitsandbytes + upgrade info to warn.
2024-08-15 11:12:51 +02:00
..
src Fixing exl2 and other quantize tests again. (#2419) 2024-08-15 11:12:51 +02:00
Cargo.toml Upgrading to rust 1.78. (#1851) 2024-05-06 13:48:11 +02:00
build.rs chore(github): add templates (#264) 2023-05-02 15:43:19 +02:00