Quantized Gemma 2B runs pretty fast on my iPhone 15 Pro in MLX Swift. Code & docs: Comparable to GPT-3.5 Turbo and Mixtral 8x7B in LMSYS Org benchmarks but runs efficiently on an iPhone. Pretty wild.
79,702 views • 1 year ago • via X (Twitter)
10 Comments

Logan Kilpatrick • 1 year ago
@lmsysorg Cost of intelligence takes another hit today : )

Christian Schoppe • 1 year ago
@lmsysorg I have the 6-bit quantized version running on my Pixel. Not quite as fast as yours but still quite usable. After a few initial tests, I still prefer Phi-3-mini.

Eric Hartford • 1 year ago
@lmsysorg that's awesome!

Kirito (e/acc) 🏴☠️ • 1 year ago
@lmsysorg Great work we all saw it coming - privacy and intelligence at the palm of your hand

Rami El-Masri • 1 year ago
@lmsysorg Running advanced models like Gemma 2B efficiently on mobile devices is a game-changing milestone.

Tris Warkentin • 1 year ago
@lmsysorg What an incredible demo -- speed and quality are very impressive. Now to work on accessibility =)

NFTPerks 🇵🇹 • 1 year ago
@lmsysorg awesome

Stavros Kassinos • 1 year ago
@lmsysorg 🚀🚀

Mani • 1 year ago
@lmsysorg Is it 4-bit quantized?

Awni Hannun • 1 year ago
@lmsysorg Yes
