Загрузка видео...

Не удалось загрузить видео

На главную

PSA: DeepSeek R1 Distill Llama 70B speculative decoding version is now live on Groq Inc for Dev Tier. We just made fast even faster for instant reasoning. 🏁

44,675 просмотров • 1 год назад •via X (Twitter)

Комментарии: 11

Фото профиля Hatice Ozen
Hatice Ozen1 год назад

1/5 What is speculative decoding? It's a technique that uses a smaller, faster model to predict a sequence of tokens, which are then verified by the main, more powerful model in parallel. The main model evaluates these predictions and determines which tokens to keep or reject.

Фото профиля Hatice Ozen
Hatice Ozen1 год назад

2/5 Speculative decoding achieves faster inference because the main model can verify multiple tokens in parallel rather than generating them one-by-one. This parallel verification is significantly faster than traditional sequential token generation.

Фото профиля Hatice Ozen
Hatice Ozen1 год назад

3/5 Think of it like pair programming where your junior dev (small model) writes the first draft of code, and the senior dev (large model) reviews and corrects it. When the junior gets it right and the draft aligns, you save a lot of time.

Фото профиля Hatice Ozen
Hatice Ozen1 год назад

4/5 The efficiency comes from parallel verification - while the main model still verifies each token, it can do this simultaneously for many tokens. When wrong? No problem, the main model corrects course. This means much faster inference without having to compromise on quality.

Фото профиля Hatice Ozen
Hatice Ozen1 год назад

5/5 Really excited for you all to try it. Will get around to doc updates, but you can just use the `deepseek-r1-distill-llama-70b-specdec` model ID to try. Let us know what else you'd like to see below and have fun building with instant reasoning! 💪

Фото профиля Ben Everman
Ben Everman1 год назад

@GroqInc Any plans for 670B?

Фото профиля Jasper
Jasper1 год назад

@GroqInc The speed is insane! Would love to have your machines and models on our platform

Фото профиля Mike Sulka
Mike Sulka1 год назад

@GroqInc Great stuff!

Фото профиля Charlie Greenman
Charlie Greenman1 год назад

@GroqInc cool

Фото профиля Rish e/acc
Rish e/acc1 год назад

@GroqInc This is fkin awesome, I hadn’t heard of speculative deckding. Can you recommend any literature etc on the subject?

Фото профиля Hatice Ozen
Hatice Ozen1 год назад

@GroqInc 100% agree and recommend this white paper to learn more:

Похожие видео