From 960cc95a0eadfa1fad7275a9e927f904e6f0bf36 Mon Sep 17 00:00:00 2001
From: Nicolas Patry
Date: Tue, 27 Feb 2024 15:55:37 +0100
Subject: [PATCH] Update speculation.md

---
 docs/source/conceptual/speculation.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/source/conceptual/speculation.md b/docs/source/conceptual/speculation.md
index 071b7b68..6f4e0db6 100644
--- a/docs/source/conceptual/speculation.md
+++ b/docs/source/conceptual/speculation.md
@@ -26,7 +26,7 @@ You can check a few existing fine-tunes for popular models:
 - [text-generation-inference/Mistral-7B-Instruct-v0.2-medusa](https://huggingface.co/text-generation-inference/Mistral-7B-Instruct-v0.2-medusa)
 
-In order to create your own medusa heads for your own finetune, you should check own the original medusa repo. [https://github.com/FasterDecoding/Medusa](https://github.com/FasterDecoding/Medusa)
+In order to create your own medusa heads for your own finetune, you should check own the original medusa repo. [https://github.com/FasterDecoding/Medusa](https://github.com/FasterDecoding/Medusa/pull/83).
 
 In order to use medusa models in TGI, simply point to a medusa enabled model, and everything will load automatically.