Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

first step was getting llama cpp to play nice with electron js so that we can run the model I fine tuned on the client, a couple errors but eventually got it wired up with node-llama-cpp bindings. this way the model + app can be shipped to the user...

16,499 görüntüleme • 2 yıl önce •via X (Twitter)

10 Yorum

anton profil fotoğrafı
anton2 yıl önce

the nice thing about llama cpp is the user will be able to run inference on CPU or GPU (cuda + metal for mac) in case they have either

anton profil fotoğrafı
anton2 yıl önce

stack is electron-vite, react, llama-cpp using the node-llama-cpp bindings and model is still tbd but currently working with a fine tuned qwen2 500M

Stocko 👊🤖 profil fotoğrafı
Stocko 👊🤖2 yıl önce

wow, that’s amazingly fast

Alloy🐍🍀 profil fotoğrafı
Alloy🐍🍀2 yıl önce

Isn't this going to be a massive download or are you downloading the model within the client app and then working "offline"?

anton profil fotoğrafı
anton2 yıl önce

the app will ship without the model, which will be downloaded after you install it. how big is the app (w/o the model file)? it is 227mb (will work on bundle size later honestly)

Yam Peleg profil fotoğrafı
Yam Peleg2 yıl önce

very nice work! an integration like this done well has amazing potential

nigh8w0lf profil fotoğrafı
nigh8w0lf2 yıl önce

Looks nice! Lamafile but with a cleaner JS interface.

Caleb profil fotoğrafı
Caleb2 yıl önce

I’m really intrigued at using transformers js to do code autocomplete or something in the browser. Excited to follow along on this

el profil fotoğrafı
el2 yıl önce

what machine is this on?

Ravi Chandra Veeramachaneni profil fotoğrafı
Ravi Chandra Veeramachaneni2 yıl önce

@abacaj Have you tried or considered swift for the purpose. Lately been seeing lots of apps coming out of the swift native and its cross platform bindings.

Benzer Videolar