Loading video...
Video Failed to Load
Here is how to deploy and serve any LLM on HF with a single command in less than 3 minutes with llama.cpp $ bash -c "$(curl -s
143,123 views • 2 years ago •via X (Twitter)
8 Comments

Georgi Gerganov2 years ago
More info

Fernando Vidal2 years ago
Would be cool to have some kind of auth system, where you can have it check against a list of auth tokens before serving the request.

Donneker2 years ago
thanks for the demo, nice done

bornjre2 years ago
Does server binary supports LLaVA/BakLLaVA models ?

wwwwg2 years ago
@memdotai mem it

Mem2 years ago
@ggerganov Saved! Here's the compiled thread: 🪄 AI-generated summary: "This thread provides instructions on how to deploy and serve any LLM on HF with a single command in less than 3 minutes using llama.cpp. More information can be found at the...

Tim Wu2 years ago
Time to fill up my runpod credits. 😁 It would be lovely if supports llava.

Filippo Broggini2 years ago
They should have made you CEO of OpenAI 😅😇💪

