The first step was getting llama.cpp to play nice with Electron so that the model I fine-tuned can run on the client. I hit a couple of errors but eventually got it wired up with the node-llama-cpp bindings. This way the model + app can be shipped to the user...

16,499 views • 2 years ago • via X (Twitter)

Comments: 10

anton • 2 years ago

The nice thing about llama.cpp is that the user will be able to run inference on CPU or GPU (CUDA, plus Metal on Mac), depending on which they have.
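A minimal sketch of the CPU/GPU fallback described here. It assumes the app already knows the platform string and whether an NVIDIA GPU is present. `pickBackend` and `HostInfo` are hypothetical names, not part of llama.cpp or node-llama-cpp, and the result is only a label; how it would be handed to the bindings is left out.

```typescript
// Hypothetical helper: the names and detection inputs are illustrative only.
type Backend = "metal" | "cuda" | "cpu";

interface HostInfo {
  platform: string;      // e.g. the value of process.platform in Node
  hasNvidiaGpu: boolean; // assumed to be detected elsewhere
}

function pickBackend(host: HostInfo): Backend {
  // Metal is Apple's GPU API, so it only applies on macOS ("darwin").
  if (host.platform === "darwin") return "metal";
  // CUDA requires an NVIDIA GPU.
  if (host.hasNvidiaGpu) return "cuda";
  // Otherwise fall back to CPU inference.
  return "cpu";
}
```

For example, `pickBackend({ platform: "win32", hasNvidiaGpu: false })` falls through to `"cpu"`, so a machine without a supported GPU still gets a working path.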

anton • 2 years ago

The stack is electron-vite, React, and llama.cpp via the node-llama-cpp bindings. The model is still TBD, but I'm currently working with a fine-tuned Qwen2 500M.
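For a rough sense of what a 500M-parameter model means in file size, here is a back-of-the-envelope sketch. The bit widths are assumptions (the thread does not say which quantization is used), and the estimate ignores file metadata and per-block quantization overhead, so real files will be somewhat larger. `estimateModelBytes` is a hypothetical helper.

```typescript
// Rough weight-storage estimate: parameters * bits per weight / 8.
// Ignores metadata and quantization overhead (assumption).
function estimateModelBytes(paramCount: number, bitsPerWeight: number): number {
  return Math.ceil((paramCount * bitsPerWeight) / 8);
}

const PARAMS = 500e6; // "500M" as stated in the thread

// Bit widths below are illustrative, not the author's actual setting.
for (const bits of [16, 8, 4]) {
  const megabytes = estimateModelBytes(PARAMS, bits) / 1e6;
  console.log(`${bits}-bit weights: ~${megabytes} MB`);
}
```

Under these assumptions the weights come to roughly 1000 MB at 16 bits, 500 MB at 8 bits, and 250 MB at 4 bits.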

Stocko 👊🤖 • 2 years ago

wow, that’s amazingly fast

Alloy🐍🍀 • 2 years ago

Isn't this going to be a massive download or are you downloading the model within the client app and then working "offline"?

anton • 2 years ago

The app will ship without the model, which will be downloaded after you install it. As for how big the app is without the model file: it's 227 MB (I'll work on bundle size later, honestly).
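One way the post-install download step could be planned is sketched below. It assumes the app knows the expected model size in bytes and that the server honors HTTP `Range` requests. Neither is confirmed in the thread, and `planModelDownload` and `DownloadPlan` are hypothetical names.

```typescript
// Hypothetical planner for a first-run model download.
type DownloadPlan =
  | { action: "skip" }                         // model already fully present
  | { action: "full" }                         // download from scratch
  | { action: "resume"; rangeHeader: string }; // continue a partial file

function planModelDownload(
  localBytes: number | null, // size of the local file, or null if missing
  expectedBytes: number      // assumed to be known, e.g. from a manifest
): DownloadPlan {
  if (localBytes === null || localBytes === 0) return { action: "full" };
  if (localBytes === expectedBytes) return { action: "skip" };
  if (localBytes < expectedBytes) {
    // Standard HTTP Range syntax: request everything from this offset on.
    return { action: "resume", rangeHeader: `bytes=${localBytes}-` };
  }
  // Larger than expected: treat the file as corrupt and start over.
  return { action: "full" };
}
```

A size check alone does not prove the file is intact, so a real implementation would likely also verify a checksum before loading the model.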

Yam Peleg • 2 years ago

Very nice work! An integration like this, done well, has amazing potential.

nigh8w0lf • 2 years ago

Looks nice! Like llamafile, but with a cleaner JS interface.

Caleb • 2 years ago

I’m really intrigued by the idea of using Transformers.js to do code autocomplete or something similar in the browser. Excited to follow along on this.

el • 2 years ago

What machine is this on?

Ravi Chandra Veeramachaneni • 2 years ago

@abacaj Have you tried or considered Swift for this purpose? Lately I've been seeing lots of apps built with native Swift and its cross-platform bindings.
