Here is how to deploy and serve any LLM on HF with a single command in less than 3 minutes with llama.cpp $ bash -c "$(curl -s
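Once a llama.cpp server is up, it can be queried over plain HTTP. The following is a minimal Python sketch, not part of the original post. It assumes the server listens on the llama.cpp default address `127.0.0.1:8080` and exposes the `/completion` endpoint. The helper names `build_completion_request` and `complete` are illustrative and do not come from llama.cpp.

```python
import json
import urllib.request

# Assumption: the server listens on the llama.cpp default host and port.
# Adjust this if your deployment uses a different address.
SERVER_URL = "http://127.0.0.1:8080"


def build_completion_request(prompt, n_predict=64, base_url=SERVER_URL):
    """Build a POST request for the server's /completion endpoint."""
    payload = {"prompt": prompt, "n_predict": n_predict}
    return urllib.request.Request(
        f"{base_url}/completion",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, n_predict=64, base_url=SERVER_URL, timeout=60):
    """Send the prompt to the server and return the generated text."""
    request = build_completion_request(prompt, n_predict, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    # The /completion response carries the generated text in "content".
    return body["content"]
```

With the server running, `complete("Building a website can be done in 10 simple steps:")` would return the model's continuation as a string. Keeping request construction separate from the network call makes the payload easy to inspect without a live server.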

143,123 views • 2 years ago • via X (Twitter)

8 Comments

Georgi Gerganov • 2 years ago

More info

Fernando Vidal • 2 years ago

Would be cool to have some kind of auth system, where you can have it check against a list of auth tokens before serving the request.

Donneker • 2 years ago

Thanks for the demo, nicely done.

bornjre • 2 years ago

Does the server binary support LLaVA/BakLLaVA models?

wwwwg • 2 years ago

@memdotai mem it

Mem • 2 years ago

@ggerganov Saved! Here's the compiled thread: 🪄 AI-generated summary: "This thread provides instructions on how to deploy and serve any LLM on HF with a single command in less than 3 minutes using llama.cpp. More information can be found at the...

Tim Wu • 2 years ago

Time to fill up my runpod credits. 😁 It would be lovely if it supported LLaVA.

Filippo Broggini • 2 years ago

They should have made you CEO of OpenAI 😅😇💪
