docs(readme): provide link Logits Warper README (#154)
parent c0aeb32583
commit 9122e7bd9c
@@ -46,7 +46,7 @@ to power LLMs api-inference widgets.
 - Quantization with [bitsandbytes](https://github.com/TimDettmers/bitsandbytes)
 - [Safetensors](https://github.com/huggingface/safetensors) weight loading
 - Watermarking with [A Watermark for Large Language Models](https://arxiv.org/abs/2301.10226)
-- Logits warpers (temperature scaling, topk, repetition penalty ...)
+- Logits warper (temperature scaling, top-p, top-k, repetition penalty, more details see [transformers.LogitsProcessor](https://huggingface.co/docs/transformers/internal/generation_utils#transformers.LogitsProcessor))
 - Stop sequences
 - Log probabilities
 - Production ready (distributed tracing with Open Telemetry, Prometheus metrics)