Loading video...

Video Failed to Load

Go Home

llama.vscode (powered by Qwen Coder)

77,488 views • 1 year ago •via X (Twitter)

9 Comments

Georgi Gerganov's profile picture
Georgi Gerganov1 year ago

This is a lightweight and very efficient VS Code extension using llama.cpp directly to provide local LLM-assisted code and text completions:

Georgi Gerganov's profile picture
Georgi Gerganov1 year ago

The llama.cpp server provides unique context reuse techniques that allow you to efficiently use large contexts to enhance the completions based on the contents of your codebase. The setup is simple, no RAG is necessary and the performance is good even on low-end hardware. Enjoy!

malico.'s profile picture
malico.1 year ago

nvim?? 🥹🥹

Georgi Gerganov's profile picture
Georgi Gerganov1 year ago

i got you pal

Daniel Nguyen ⚡'s profile picture
Daniel Nguyen ⚡1 year ago

Great work. Thanks

Neil Chudleigh's profile picture
Neil Chudleigh1 year ago

Lets go! Nice work.

Nikita 🤙's profile picture
Nikita 🤙1 year ago

Great start!

Raymond Weitekamp's profile picture
Raymond Weitekamp1 year ago

wow - thank you so much for giving the different suggestions for different hardware (RAM). two follow up questions: - can i do this on PC? (my PC happens to have way more GPU/RAM) - from a purely personal perspective - what is the minimum "useful" RAM for this?

Georgi Gerganov's profile picture
Georgi Gerganov1 year ago

Yes, this runs on Mac, Linux and Windows - see the setup instructions in the readme. The 7B models are pretty good, so if you can run this (i.e. ~8GB of VRAM) then go for this. Otherwise - use the 3B model (~4GB)

Related Videos