Here is how to deploy and serve any LLM on HF with a single command in less than 3 minutes with llama.cpp:

$ bash -c "$(curl -s

Georgi Gerganov

59,901 subscribers

143,123 views • 2 years ago • via X (Twitter)

8 Comments

Georgi Gerganov • 2 years ago

More info

Fernando Vidal • 2 years ago

Would be cool to have some kind of auth system, where you can have it check against a list of auth tokens before serving the request.
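
The check suggested in this comment could look roughly like the following. This is a minimal sketch and not a llama.cpp feature: the `ALLOWED_TOKENS` set, the `is_authorized` helper, and the use of an `Authorization: Bearer` header are all assumptions made for illustration.

```python
import hmac

# Hypothetical allowlist; a real deployment would load tokens from a file or env var.
ALLOWED_TOKENS = {"token-alpha", "token-beta"}


def is_authorized(headers: dict[str, str], allowed: set[str] = ALLOWED_TOKENS) -> bool:
    """Return True if the Authorization header carries an allowlisted bearer token."""
    value = headers.get("Authorization", "")
    scheme, _, token = value.partition(" ")
    if scheme.lower() != "bearer" or not token:
        return False
    # compare_digest compares each candidate without an early exit on mismatch.
    return any(hmac.compare_digest(token.encode(), t.encode()) for t in allowed)
```

A server wrapper would call `is_authorized(request_headers)` before forwarding the request and return a 401 response when it is false.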

Donneker • 2 years ago

Thanks for the demo, nicely done.

bornjre • 2 years ago

Does the server binary support LLaVA/BakLLaVA models?

wwwwg • 2 years ago

@memdotai mem it

Mem • 2 years ago

@ggerganov Saved! Here's the compiled thread: 🪄 AI-generated summary: "This thread provides instructions on how to deploy and serve any LLM on HF with a single command in less than 3 minutes using llama.cpp. More information can be found at the...

Tim Wu • 2 years ago

Time to fill up my runpod credits. 😁 It would be lovely if it supported LLaVA.

Filippo Broggini • 2 years ago

They should have made you CEO of OpenAI 😅😇💪
