OlivierDehaene
|
53ec0b790b
|
feat(fp8): use fbgemm kernels and load fp8 weights directly (#2248)
* feat(fp8): add support for fbgemm
* allow loading fp8 weights directly
* update outlines
* fix makefile
* build fbgemm
* avoid circular import and fix dockerfile
* add default dtype
* refactored weights loader
* fix auto conversion
* fix quantization config parsing
* force new nccl on install
* missing get_weights implementation
* increase timeout
|
2024-07-20 19:02:04 +02:00 |