正在加载视频...

视频加载失败

llama.vscode (powered by Qwen Coder)

77,488 次观看 • 1 年前 •via X (Twitter)

9 条评论

Georgi Gerganov 的头像
Georgi Gerganov1 年前

This is a lightweight and very efficient VS Code extension using llama.cpp directly to provide local LLM-assisted code and text completions:

Georgi Gerganov 的头像
Georgi Gerganov1 年前

The llama.cpp server provides unique context reuse techniques that allow you to efficiently use large contexts to enhance the completions based on the contents of your codebase. The setup is simple, no RAG is necessary and the performance is good even on low-end hardware. Enjoy!

malico. 的头像
malico.1 年前

nvim?? 🥹🥹

Georgi Gerganov 的头像
Georgi Gerganov1 年前

i got you pal

Daniel Nguyen ⚡ 的头像
Daniel Nguyen ⚡1 年前

Great work. Thanks

Neil Chudleigh 的头像
Neil Chudleigh1 年前

Lets go! Nice work.

Nikita 🤙 的头像
Nikita 🤙1 年前

Great start!

Raymond Weitekamp 的头像
Raymond Weitekamp1 年前

wow - thank you so much for giving the different suggestions for different hardware (RAM). two follow up questions: - can i do this on PC? (my PC happens to have way more GPU/RAM) - from a purely personal perspective - what is the minimum "useful" RAM for this?

Georgi Gerganov 的头像
Georgi Gerganov1 年前

Yes, this runs on Mac, Linux and Windows - see the setup instructions in the readme. The 7B models are pretty good, so if you can run this (i.e. ~8GB of VRAM) then go for this. Otherwise - use the 3B model (~4GB)

相关视频