Загрузка видео...
Не удалось загрузить видео
Quantized Gemma 2B runs pretty fast on my iPhone 15 pro in MLX Swift. code & docs: Comparable to GPT 3.5 turbo and Mixtral 8x7B in LMSYS Org benchmarks but runs efficiently on an iPhone. Pretty wild.
79,702 просмотров • 1 год назад •via X (Twitter)
Комментарии: 10

Logan Kilpatrick1 год назад
@lmsysorg Cost of intelligence takes another hit today : )

Christian Schoppe1 год назад
@lmsysorg I have the 6 bit quantized version running on my Pixel. Not quite as fast as yours but still quite usable. After a few initial tests, I still prefer Phi-3-mini.

Eric Hartford1 год назад
@lmsysorg that's awesome!

Kirito (e/acc) 🏴☠️1 год назад
@lmsysorg Great work we all saw it coming - privacy and intelligence at the palm of your hand

Rami El-Masri1 год назад
@lmsysorg Running advanced models like Gemma 2B efficiently on mobile devices is a game-changing milestone.

Tris Warkentin1 год назад
@lmsysorg What an incredible demo -- speed and quality are very impressive. Now to work on accessibility =)

NFTPerks 🇵🇹1 год назад
@lmsysorg awesome

Stavros Kassinos1 год назад
@lmsysorg 🚀🚀

Mani1 год назад
@lmsysorg Is it 4bit quantized?

Awni Hannun1 год назад
@lmsysorg Yes
Похожие видео
Gemma 4 E2B on iPhone 17 Pro Max in AI Edge Gallery!
Max Weinbach
177,412 просмотров • 2 месяцев назад
