Commit Graph

  • feb7806ca4 fix(readme): Typo OlivierDehaene 2022-11-14 16:22:10 +0100
  • 91f5f86280 fix(router): Fix HTTP status codes OlivierDehaene 2022-11-14 14:34:15 +0100
  • 6c781025ae feat(rust): Update to 1.65 OlivierDehaene 2022-11-14 13:59:56 +0100
  • dccd5c2b1a feat(server): Clarify CausalLMBatch concatenate method OlivierDehaene 2022-11-09 18:24:07 +0100
  • fa43fb71be fix(server): Fix Transformers fork version OlivierDehaene 2022-11-08 17:42:38 +0100
  • 4236e41b0d feat(server): Improved doc OlivierDehaene 2022-11-07 12:53:56 +0100
  • cea6051eff feat(launcher): Pass CUDA_VISIBLE_DEVICES to the shard OlivierDehaene 2022-11-04 18:31:08 +0100
  • 427d7cc444 feat(server): Support AutoModelForSeq2SeqLM OlivierDehaene 2022-11-04 18:03:04 +0100
  • c5665f5c8b feat(server): Support generic AutoModelForCausalLM OlivierDehaene 2022-11-04 14:22:47 +0100
  • 755fc0e403 fix(models): Revert buggy support for AutoModel OlivierDehaene 2022-11-03 16:07:54 +0100
  • b3b7ea0d74 feat: Use json formatter by default in docker image OlivierDehaene 2022-11-02 17:29:56 +0100
  • 3cf6368c77 feat(server): Support all AutoModelForCausalLM on a best effort basis OlivierDehaene 2022-10-28 19:24:00 +0200
  • 09674e6df9 feat(server): Support bitsandbytes OlivierDehaene 2022-10-27 14:25:29 +0200
  • beb552127a feat(client): Simplify sharded logic OlivierDehaene 2022-10-22 23:40:05 +0200
  • c8ce9b2515
    feat(server): Use safetensors Nicolas Patry 2022-10-22 20:00:15 +0200
  • 75adbb3441 feat(weights): Support safetensors #1 OlivierDehaene 2022-10-22 19:46:05 +0200
  • be8827fe41
    Create LICENSE (#2) Thomas Wang 2022-10-22 10:44:52 +0200
  • 3398211873
    Create LICENSE #2 Thomas Wang 2022-10-21 23:15:02 +0200
  • 604b18bec2
    Reworked follwoing https://github.com/huggingface/transformers_bloom_parallel/pull/7 Nicolas Patry 2022-10-21 20:47:57 +0200
  • 457c9038ff
    Making bloom loadable with `safetensors`. Nicolas Patry 2022-10-21 18:02:04 +0200
  • c837893370 feat(router): Add max_waiting_tokens OlivierDehaene 2022-10-21 16:40:05 +0200
  • 895a341d06 fix(validation): Fix error messages OlivierDehaene 2022-10-21 10:59:15 +0200
  • f16f2f5ae1 v0.1.0 Olivier Dehaene 2022-10-18 15:19:03 +0200
  • 92c1ecd008 feat: Add arguments to CLI Olivier Dehaene 2022-10-17 18:27:33 +0200
  • 5e5d8766a2 feat: Improve error handling Olivier Dehaene 2022-10-17 14:59:00 +0200
  • 00e6ce44b1 Update aml deployment Olivier Dehaene 2022-10-17 10:39:59 +0200
  • bcb53903b8 feat: Add AML deployment Olivier Dehaene 2022-10-15 20:21:50 +0200
  • bf99afe916 feat: Docker image Olivier Dehaene 2022-10-14 15:56:21 +0200
  • f11965c11d support deepspeed feat/support_deepspeed Olivier Dehaene 2022-10-13 11:05:44 +0200
  • 39df4d9975 Use axum Olivier Dehaene 2022-10-11 18:14:39 +0200
  • e86ecbac63 ValidationError was not correctly handled Olivier Dehaene 2022-10-11 16:53:40 +0200
  • 4c693e6524 Refactored gRPC interface Added validation logic Olivier Dehaene 2022-10-11 16:50:54 +0200
  • fa9a088467 Add load testing Olivier Dehaene 2022-10-11 10:36:51 +0200
  • 1d986983d5 fix: cleanup Olivier Dehaene 2022-10-08 12:34:25 +0200
  • 295831a481 Init Olivier Dehaene 2022-10-08 12:30:12 +0200