af1ed38f39
IDK what else to add in this guide, I looked for relevant code in TGI codebase and saw that it's used in quantization as well (maybe I could add that?) |
||
---|---|---|
.. | ||
flash_attention.md | ||
safetensors.md | ||
streaming.md |