Загрузка видео...

Не удалось загрузить видео

На главную

Check out mistral.​rs, our #Rust-based open source inference engine allowing for fast #LLM serving for a variety of architectures including X-LoRA mixture-of-expert (MoE) models, Llama-3, Mistral/Mixtral, Gemma & many others. Built on the Hugging Face #Candle framework for #Rust w/ custom CUDA kernels in the backend (as well as...

73,575 просмотров • 2 лет назад •via X (Twitter)

Комментарии: 10

Фото профиля nicoarq
nicoarq2 лет назад

@huggingface This is really rly cool! I thought it was only limited to Mistral related LLMs though maybe it's not too late to change the name to something generic to not confuse people?

Фото профиля Markus J. Buehler
Markus J. Buehler2 лет назад

@huggingface Yes, the tool with with many architectures, including Llama-3, Phi-3 (up to 128k context), Mistral/Mixtral, Gemma, X-LoRA mixture-of-expert (MoE) models, & many others. Also features in-situ quantization to any level.

Фото профиля Ingo Villnow (DM5DK) 🇺🇦🌻
Ingo Villnow (DM5DK) 🇺🇦🌻2 лет назад

@huggingface Awesome! More apps like this written in Rust 😍

Фото профиля Jonathan Eisenzopf
Jonathan Eisenzopf2 лет назад

@huggingface This looks really cool. I’ve been trying to figure out optimal branching in rust candle. Nice job.

Фото профиля Thomas Wolf
Thomas Wolf2 лет назад

@huggingface wow!

Фото профиля PΔBLØ ᄃΞ
PΔBLØ ᄃΞ2 лет назад

@huggingface 🦀

Фото профиля Alexander Ocsa
Alexander Ocsa2 лет назад

@huggingface Does it support GPU hardware acceleration ?

Фото профиля Nicolas Patry
Nicolas Patry2 лет назад

@huggingface Impressive work !

Фото профиля Markus J. Buehler
Markus J. Buehler2 лет назад

@huggingface Thank you!

Фото профиля iandanforth 🦋 @iandanforth.bsky.social
iandanforth 🦋 @iandanforth.bsky.social2 лет назад

@_philschmid @huggingface Is this affiliated with Mistral AI? The name certainly implies it.

Похожие видео