6073ece4fc
This PR improves the guidance docs and adds a section that explains how grammars are applied on a technical level |
||
---|---|---|
.. | ||
flash_attention.md | ||
guidance.md | ||
paged_attention.md | ||
quantization.md | ||
safetensors.md | ||
speculation.md | ||
streaming.md | ||
tensor_parallelism.md |