chunks.py | server: use chunked inputs | 2024-06-07 08:09:04 +02:00
dist.py | add intel xpu support for TGI (#1475) | 2024-04-26 15:48:58 +02:00
hub.py | Fixing the download strategy for ibm-fms (#1917) | 2024-05-18 13:31:24 +02:00
log.py | v1.3.4 | 2023-12-22 15:46:04 +01:00
logits_process.py | Fixing frequency penalty (#1811) | 2024-04-30 12:13:23 +02:00
peft.py | fix: fix local loading for .bin models (#1419) | 2024-01-09 15:21:00 +01:00
speculate.py | chore: formatting | 2023-12-11 14:49:52 +01:00
tokens.py | Use the generation config. (#1808) | 2024-04-25 19:41:50 +02:00
watermark.py | Fixing watermark. (#851) | 2023-08-16 07:17:26 +02:00
weights.py | fix gptq tests, LLMM1 matrix bound | 2024-06-24 18:49:45 +02:00