Default Branch

b57f370386 · Saving some VRAM. (#2790) · Updated 2024-12-02 20:04:21 -07:00

Branches

4a9615e8ff · Add to ToC · Updated 2023-08-11 07:05:10 -06:00 · 789 behind, 2 ahead

43ed6c217a · Dummy commit · Updated 2023-08-10 02:33:52 -06:00 · 794 behind, 1 ahead

4ddb6681ac · Add workflow to upload documentation · Updated 2023-08-07 23:49:45 -06:00 · 797 behind, 1 ahead

e994ad1172 · Added InferenceClient · Updated 2023-08-02 08:57:01 -06:00 · 811 behind, 11 ahead

7766fee9b1 · fix typo for dynamic rotary (#745) · Updated 2023-07-31 10:58:46 -06:00 · 805 behind, 0 ahead · Included

f555dabca8 · Putting back header inclusion (seems unused but still) · Updated 2023-07-20 09:46:51 -06:00 · 832 behind, 21 ahead

bfa3920aec · BNB 4bits. · Updated 2023-07-12 06:42:43 -06:00 · 858 behind, 7 ahead

db4efbf4bc · fix(server): T5 weights names. (#582) · Updated 2023-07-12 02:01:42 -06:00 · 854 behind, 0 ahead · Included

a4fd6905d8 · fmt · Updated 2023-06-23 07:01:05 -06:00 · 881 behind, 2 ahead

dca0fe2585 · Adding GPTQ integration tests. · Updated 2023-06-19 06:14:17 -06:00 · 884 behind, 19 ahead

17837b1e51 · Adding docs about GPTQ usage. · Updated 2023-06-15 11:41:04 -06:00 · 884 behind, 19 ahead

fb0840944c · Reducing number of reps while autotuning. · Updated 2023-06-06 05:56:10 -06:00 · 935 behind, 9 ahead

7ccb8eefdc · TMP. · Updated 2023-05-15 08:43:32 -06:00 · 929 behind, 4 ahead

a963495315 · add logic to queue · Updated 2023-04-26 05:40:20 -06:00 · 968 behind, 2 ahead

7caea42573 · feat(launcher): parse all shard logs · Updated 2023-04-15 13:25:02 -06:00 · 998 behind, 2 ahead

47ac334a21 · 0.4.0 · Updated 2023-03-12 03:06:15 -06:00 · 1091 behind, 9 ahead

60ed7b535c · first tests · Updated 2023-02-23 01:52:17 -07:00 · 1075 behind, 1 ahead

f11965c11d · support deepspeed · Updated 2022-10-13 03:05:44 -06:00 · 1157 behind, 1 ahead

Deleted by Ghost 2024-12-03 03:33:34 -07:00
