Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

LET'S GO! Cursor using local 🤗 transformers models! You can now test ANY transformers-compatible LLM against your codebase. From hacking to production, it takes only a few minutes: anything `transformers` does, you can serve into your app 🔥 Here's a demo with Qwen3 4B:

33,081 Aufrufe • vor 11 Monaten •via X (Twitter)

11 Kommentare

Profilbild von João Gante
João Gantevor 11 Monaten

We've iterated on our local `transformers serve`, a server with `transformers` backend, and it now supports more advanced requests -- including the requests from Cursor. Testing new models, quantization methods, KV caches, decoding methods, (...) should be much easier now 🫶

Profilbild von João Gante
João Gantevor 11 Monaten

👉5-minute instructions to replicate this demo: (this link will die at some point, and the following will work: 👉The PR where it happened:

Profilbild von Andres Franco
Andres Francovor 11 Monaten

If this works as well as it sounds, it’s really going to make so many things possible.

Profilbild von MAGA1776_PATRIOT
MAGA1776_PATRIOTvor 11 Monaten

Spent much of the day working with LM Studio and Ollama, Mistral 7b. They are getting a lot better. I'm doing a serious build for local AI next month.

Profilbild von Kevin Rossi
Kevin Rossivor 11 Monaten

This has been possible for a while but Cursor still makes calls out to their servers. What happens if you turn off your internet connection?

Profilbild von João Gante
João Gantevor 11 Monaten

To go fully offline, a different IDE has to be used :(

Profilbild von Zach Mueller
Zach Muellervor 11 Monaten

How well does this fully work? IIRC last I checked @cursor_ai strongly advised against doing self-hosted models? @srush_nlp has that changed?

Profilbild von David Siroky
David Sirokyvor 11 Monaten

@ClementDelangue Does that mean I can run cursor fully offline, and point at a local endpoint on my network?

Profilbild von João Gante
João Gantevor 11 Monaten

@ClementDelangue Sadly no -- Cursor makes requests through their server (i.e. your request + codebase -> cursor server -> llm -> cursor server -> your cursor app) The best would be to use a different IDE.

Profilbild von 🇺🇦🇮🇱dmitriy samsonov
🇺🇦🇮🇱dmitriy samsonovvor 11 Monaten

It’s either any open-ai compatible endpoint and token+model settings and no data being sent to cursor’s servers or nothing

Profilbild von João Gante
João Gantevor 11 Monaten

sadly data still goes to cursor :( see my comment here

Ähnliche Videos