正在加载视频...

视频加载失败

Check out mistral.​rs, our #Rust-based open source inference engine allowing for fast #LLM serving for a variety of architectures including X-LoRA mixture-of-expert (MoE) models, Llama-3, Mistral/Mixtral, Gemma & many others. Built on the Hugging Face #Candle framework for #Rust w/ custom CUDA kernels in the backend (as well as...

73,575 次观看 • 2 年前 •via X (Twitter)

10 条评论

nicoarq 的头像
nicoarq2 年前

@huggingface This is really rly cool! I thought it was only limited to Mistral related LLMs though maybe it's not too late to change the name to something generic to not confuse people?

Markus J. Buehler 的头像
Markus J. Buehler2 年前

@huggingface Yes, the tool with with many architectures, including Llama-3, Phi-3 (up to 128k context), Mistral/Mixtral, Gemma, X-LoRA mixture-of-expert (MoE) models, & many others. Also features in-situ quantization to any level.

Ingo Villnow (DM5DK) 🇺🇦🌻 的头像
Ingo Villnow (DM5DK) 🇺🇦🌻2 年前

@huggingface Awesome! More apps like this written in Rust 😍

Jonathan Eisenzopf 的头像
Jonathan Eisenzopf2 年前

@huggingface This looks really cool. I’ve been trying to figure out optimal branching in rust candle. Nice job.

Thomas Wolf 的头像
Thomas Wolf2 年前

@huggingface wow!

PΔBLØ ᄃΞ 的头像
PΔBLØ ᄃΞ2 年前

@huggingface 🦀

Alexander Ocsa 的头像
Alexander Ocsa2 年前

@huggingface Does it support GPU hardware acceleration ?

Nicolas Patry 的头像
Nicolas Patry2 年前

@huggingface Impressive work !

Markus J. Buehler 的头像
Markus J. Buehler2 年前

@huggingface Thank you!

iandanforth 🦋 @iandanforth.bsky.social 的头像
iandanforth 🦋 @iandanforth.bsky.social2 年前

@_philschmid @huggingface Is this affiliated with Mistral AI? The name certainly implies it.

相关视频