Got continuous batching working with SSMs in mlx-lm. Here's...

Awni Hannun's profile picture

Awni Hannun

35,078 views • 4 months ago

Running DeepSeek-V3 on M4 Mac Mini AI Cluster 671B...

EXO Labs's profile picture

EXO Labs

719,005 views • 1 year ago

No-one: But can you do 16 generations on your...

Awni Hannun's profile picture

Awni Hannun

46,707 views • 8 months ago

Batching for vision models is now available in Beta...

LM Studio's profile picture

LM Studio

46,015 views • 22 days ago

Qwen QwQ 32B fp16 on M4 Max and M2...

Ivan Fioravanti ᯅ's profile picture

Ivan Fioravanti ᯅ

62,377 views • 1 year ago

How much faster is the new MacBook Pro for...

Alex Cheema - e/acc's profile picture

Alex Cheema - e/acc

527,894 views • 1 year ago

Another demo of the iPhone 17 Pro’s on-device LLM...

Adrien Grondin's profile picture

Adrien Grondin

46,205 views • 8 months ago

NVIDIA Nemotron 3 Nano Omni, a new multimodal reasoning...

NVIDIA Robotics's profile picture

NVIDIA Robotics

15,828 views • 1 month ago

DeepSeek-Prover (4-bit 7B) running at 114 toks/sec in MLX...

Awni Hannun's profile picture

Awni Hannun

16,077 views • 1 year ago

M4 Mac Mini AI Cluster Uses EXO Labs with...

Alex Cheema's profile picture

Alex Cheema

3,515,852 views • 1 year ago

Tested the new MacBook Pro M4 Pro vs. the...

01000010's profile picture

01000010

111,446 views • 1 year ago

Managed to get Ling Mini 16B (1.4B active) running...

Awni Hannun's profile picture

Awni Hannun

92,422 views • 8 months ago

Sam 3 by Facebook now on MLX 🚀 Here...

Prince Canuma's profile picture

Prince Canuma

180,242 views • 2 months ago

DeepSeek R1 Qwen 7B 4bit M2 Ultra vs M4...

Ivan Fioravanti ᯅ's profile picture

Ivan Fioravanti ᯅ

59,734 views • 1 year ago

Currently working on a retro inspired action horror game...

Kathy (Prii)'s profile picture

Kathy (Prii)

83,894 views • 2 years ago

Introducing MON Protocol Partner - Hybrid Hybrid is an...

MON Protocol 🐉 $MON's profile picture

MON Protocol 🐉 $MON

182,596 views • 2 years ago

Qwen 3.5 0.8B, Gated DeltaNet attention is running on...

Anemll's profile picture

Anemll

13,589 views • 3 months ago

Gotta love MoEs on Apple silicon with MLX. Kimi's...

Awni Hannun's profile picture

Awni Hannun

23,209 views • 1 year ago

Sparsely activated models like MOEs and Apple silicon +...

Awni Hannun's profile picture

Awni Hannun

27,452 views • 1 year ago

Mac owners don't miss this: MLX LM is now...

Victor M's profile picture

Victor M

204,554 views • 1 year ago

Unsloth Studio now installs in just one line of...

Unsloth AI's profile picture

Unsloth AI

113,048 views • 2 months ago