a103e3e9e2
This PR adds a tutorial to self distill and train medusa heads for a specific model --------- Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> |
||
---|---|---|
.. | ||
consuming_tgi.md | ||
gated_model_access.md | ||
launcher.md | ||
monitoring.md | ||
non_core_models.md | ||
preparing_model.md | ||
safety.md | ||
train_medusa.md | ||
using_cli.md | ||
using_guidance.md | ||
visual_language_models.md |