First step was getting llama.cpp to play nice with Electron so that the model I fine-tuned can run on the client. Hit a couple of errors, but eventually got it wired up with the node-llama-cpp bindings. This way the model + app can be shipped to the user...

16,499 views • 1 year ago • via X (Twitter)

10 comments

anton · 1 year ago

The nice thing about llama.cpp is that the user will be able to run inference on CPU or GPU (CUDA, or Metal on Mac), depending on what they have.
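To make the CPU/GPU fallback concrete, here is a minimal sketch of how an app could choose a backend. Everything in it is an assumption for illustration: `pickBackend`, `HostInfo`, and the `hasCudaGpu` flag are hypothetical names, not part of llama.cpp or node-llama-cpp, and the bindings may well do their own detection.

```typescript
// Hypothetical helper, not an API of llama.cpp or node-llama-cpp.
type Backend = "metal" | "cuda" | "cpu";

interface HostInfo {
    // Expected to hold a value like Node's process.platform ("darwin", "win32", "linux").
    platform: string;
    // Assumed to be detected elsewhere (for example by probing for an NVIDIA driver).
    hasCudaGpu: boolean;
}

function pickBackend(host: HostInfo): Backend {
    // Simplification: treat every Mac as Metal-capable.
    if (host.platform === "darwin") return "metal";
    // On other platforms, use CUDA only when a compatible GPU was found.
    if (host.hasCudaGpu) return "cuda";
    // CPU inference is the fallback that always works.
    return "cpu";
}
```

For example, `pickBackend({ platform: "linux", hasCudaGpu: false })` falls through to `"cpu"`, so a user with no supported GPU can still run the model.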

anton · 1 year ago

Stack is electron-vite, React, and llama.cpp via the node-llama-cpp bindings. Model is still TBD, but I'm currently working with a fine-tuned Qwen2 500M.

Stocko 👊🤖 · 1 year ago

wow, that’s amazingly fast

Alloy🐍🍀 · 1 year ago

Isn't this going to be a massive download or are you downloading the model within the client app and then working "offline"?

anton · 1 year ago

The app will ship without the model, which will be downloaded after you install it. How big is the app (w/o the model file)? It's 227 MB (will work on bundle size later, honestly).
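As a rough sketch of the "download the model after install" step, the app could check on launch what is already on disk and decide whether to skip, start, or resume the download. The names `planModelDownload` and `DownloadPlan` are hypothetical, and the expected file size is assumed to be known up front (for instance from a manifest). The only real-world piece is the standard HTTP `Range` header syntax `bytes=N-`.

```typescript
// Hypothetical planning helper; it performs no I/O itself.
type DownloadPlan =
    | { action: "skip" }                           // model already complete
    | { action: "download" }                       // fetch from scratch
    | { action: "resume"; rangeHeader: string };   // continue a partial file

function planModelDownload(bytesOnDisk: number | null, expectedBytes: number): DownloadPlan {
    // No file yet (or an empty one): start a fresh download.
    if (bytesOnDisk === null || bytesOnDisk === 0) return { action: "download" };
    // File has the expected size: nothing to do.
    if (bytesOnDisk === expectedBytes) return { action: "skip" };
    // Partial file: ask the server for the remaining bytes only.
    if (bytesOnDisk < expectedBytes) {
        return { action: "resume", rangeHeader: `bytes=${bytesOnDisk}-` };
    }
    // Larger than expected: treat it as corrupt and download again.
    return { action: "download" };
}
```

Keeping this decision separate from the network code makes it easy to test, and a size check alone does not prove the file is intact, so a checksum would be a sensible addition.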

Yam Peleg · 1 year ago

very nice work! an integration like this done well has amazing potential

nigh8w0lf · 1 year ago

Looks nice! Llamafile, but with a cleaner JS interface.

Caleb · 1 year ago

I’m really intrigued by using Transformers.js to do code autocomplete or something in the browser. Excited to follow along on this.

el · 1 year ago

what machine is this on?

Ravi Chandra Veeramachaneni · 1 year ago

@abacaj Have you tried or considered Swift for this? Lately I've been seeing lots of apps built with native Swift and its cross-platform bindings.
