Default Branch

3c54488638 · nix: downgrade to outlines 0.1.3 (#2768) · Updated 2024-11-21 05:00:26 -07:00

Branches

69dd51069f · unique hash for each image token · Updated 2024-09-03 06:56:02 -06:00

157 behind · 5 ahead

a258e8f66a · fix: Fix PR comments · Updated 2024-09-02 01:36:23 -06:00

158 behind · 12 ahead

5838f2139f · Tied embeddings in MLP speculator. · Updated 2024-08-29 04:30:26 -06:00

164 behind · 47 ahead

e152cb022b · fix: also show total memory after full warmup · Updated 2024-08-22 11:57:51 -06:00

169 behind · 2 ahead

d33fb9ed2c · extracting traceparent from header to span · Updated 2024-08-21 03:28:50 -06:00

170 behind · 1 ahead

2652e209e7 · Updated flake lock · Updated 2024-08-21 01:15:10 -06:00

170 behind · 15 ahead

b378fb4702 · Fixing exl2 (by disabling cuda graphs) · Updated 2024-08-14 11:44:54 -06:00

203 behind · 2 ahead

89707adbbb · Fixing exl2 (by disabling cuda graphs) · Updated 2024-08-14 11:41:29 -06:00

186 behind · 4 ahead

4b10c8c30b · fix: improve scales change and revert conditional · Updated 2024-08-14 10:38:15 -06:00

187 behind · 2 ahead

b84bb19ece · fix: prefer recent gptq changes · Updated 2024-08-12 09:51:19 -06:00

194 behind · 9 ahead

7bc16deb48 · wip: debug gemma and flash · Updated 2024-08-09 17:08:54 -06:00

204 behind · 1 ahead

7735b385dc · Prefix caching WIP · Updated 2024-08-09 08:52:59 -06:00

206 behind · 1 ahead

9f039ad4b3 · flake: use rust-overlay · Updated 2024-08-09 07:02:57 -06:00

210 behind · 2 ahead

e219397ee1 · fix: adjust syntax typo again · Updated 2024-08-07 18:31:24 -06:00

222 behind · 4 ahead

f230da8d63 · Keeping the benchmark somewhere · Updated 2024-08-06 06:36:15 -06:00

231 behind · 17 ahead

4379f0650a · feat: add release and sha tagged images · Updated 2024-08-05 11:13:52 -06:00

228 behind · 1 ahead

ab2ab2a0aa · pre-commit · Updated 2024-08-05 05:01:19 -06:00

233 behind · 16 ahead

060b2db0df · add 'mamba' as model config · Updated 2024-08-01 10:16:32 -06:00

230 behind · 1 ahead

31ebfd0dd7 · (launcher) default new server::run parameters to false for now · Updated 2024-07-31 03:06:52 -06:00

238 behind · 24 ahead

8fad7ae5a2 · add some more basic info in README.md · Updated 2024-07-30 02:45:29 -06:00

344 behind · 82 ahead