正在加载视频...
视频加载失败
llama.vscode (powered by Qwen Coder)
77,488 次观看 • 1 年前 •via X (Twitter)
9 条评论

This is a lightweight and very efficient VS Code extension using llama.cpp directly to provide local LLM-assisted code and text completions:

The llama.cpp server provides unique context reuse techniques that allow you to efficiently use large contexts to enhance the completions based on the contents of your codebase. The setup is simple, no RAG is necessary and the performance is good even on low-end hardware. Enjoy!

nvim?? 🥹🥹

i got you pal

Great work. Thanks

Lets go! Nice work.

Great start!

wow - thank you so much for giving the different suggestions for different hardware (RAM). two follow up questions: - can i do this on PC? (my PC happens to have way more GPU/RAM) - from a purely personal perspective - what is the minimum "useful" RAM for this?

Yes, this runs on Mac, Linux and Windows - see the setup instructions in the readme. The 7B models are pretty good, so if you can run this (i.e. ~8GB of VRAM) then go for this. Otherwise - use the 3B model (~4GB)
