Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Quantized Gemma 2B runs pretty fast on my iPhone 15 pro in MLX Swift. code & docs: Comparable to GPT 3.5 turbo and Mixtral 8x7B in LMSYS Org benchmarks but runs efficiently on an iPhone. Pretty wild.

79,702 Aufrufe • vor 1 Jahr •via X (Twitter)

10 Kommentare

Profilbild von Logan Kilpatrick
Logan Kilpatrickvor 1 Jahr

@lmsysorg Cost of intelligence takes another hit today : )

Profilbild von Christian Schoppe
Christian Schoppevor 1 Jahr

@lmsysorg I have the 6 bit quantized version running on my Pixel. Not quite as fast as yours but still quite usable. After a few initial tests, I still prefer Phi-3-mini.

Profilbild von Eric Hartford
Eric Hartfordvor 1 Jahr

@lmsysorg that's awesome!

Profilbild von Kirito (e/acc) 🏴‍☠️
Kirito (e/acc) 🏴‍☠️vor 1 Jahr

@lmsysorg Great work we all saw it coming - privacy and intelligence at the palm of your hand

Profilbild von Rami El-Masri
Rami El-Masrivor 1 Jahr

@lmsysorg Running advanced models like Gemma 2B efficiently on mobile devices is a game-changing milestone.

Profilbild von Tris Warkentin
Tris Warkentinvor 1 Jahr

@lmsysorg What an incredible demo -- speed and quality are very impressive. Now to work on accessibility =)

Profilbild von NFTPerks 🇵🇹
NFTPerks 🇵🇹vor 1 Jahr

@lmsysorg awesome

Profilbild von Stavros Kassinos
Stavros Kassinosvor 1 Jahr

@lmsysorg 🚀🚀

Profilbild von Mani
Manivor 1 Jahr

@lmsysorg Is it 4bit quantized?

Profilbild von Awni Hannun
Awni Hannunvor 1 Jahr

@lmsysorg Yes

Ähnliche Videos