OlivierDehaene
|
0d794af6a5
|
feat: experimental support for cuda graphs (#1428)
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
|
2024-02-12 10:09:29 +01:00 |
OlivierDehaene
|
44b267ab22
|
fix: fix gpt-q params loading
|
2023-12-14 11:02:16 +01:00 |
OlivierDehaene
|
82670d9786
|
feat: add quant to mixtral (#1337)
|
2023-12-12 17:55:03 +01:00 |
OlivierDehaene
|
72ee382ded
|
chore: formatting
|
2023-12-11 14:49:52 +01:00 |
OlivierDehaene
|
3a521c92b3
|
feat: mixtral (#1328)
|
2023-12-11 14:43:40 +01:00 |
Nicolas Patry
|
9ecfa16b12
|
Speculative (#1308)
|
2023-12-11 12:46:30 +01:00 |
OlivierDehaene
|
3b56d7669b
|
feat: add mistral model (#1071)
|
2023-09-28 09:55:47 +02:00 |