Got continuous batching working with SSMs in mlx-lm. Here's...

Awni Hannun's profile picture

Awni Hannun

35,078 Aufrufe • vor 5 Monaten

Running DeepSeek-V3 on M4 Mac Mini AI Cluster 671B...

EXO Labs's profile picture

EXO Labs

719,005 Aufrufe • vor 1 Jahr

No-one: But can you do 16 generations on your...

Awni Hannun's profile picture

Awni Hannun

46,713 Aufrufe • vor 8 Monaten

Batching for vision models is now available in Beta...

LM Studio's profile picture

LM Studio

46,015 Aufrufe • vor 27 Tagen

First steps for a specialized DeepSeek v4 Flash inference...

antirez's profile picture

antirez

14,155 Aufrufe • vor 1 Monat

Qwen QwQ 32B fp16 on M4 Max and M2...

Ivan Fioravanti ᯅ's profile picture

Ivan Fioravanti ᯅ

62,377 Aufrufe • vor 1 Jahr

How much faster is the new MacBook Pro for...

Alex Cheema - e/acc's profile picture

Alex Cheema - e/acc

527,894 Aufrufe • vor 1 Jahr

Another demo of the iPhone 17 Pro’s on-device LLM...

Adrien Grondin's profile picture

Adrien Grondin

46,205 Aufrufe • vor 8 Monaten

NVIDIA Nemotron 3 Nano Omni, a new multimodal reasoning...

NVIDIA Robotics's profile picture

NVIDIA Robotics

15,828 Aufrufe • vor 1 Monat

DeepSeek-Prover (4-bit 7B) running at 114 toks/sec in MLX...

Awni Hannun's profile picture

Awni Hannun

16,077 Aufrufe • vor 1 Jahr

M4 Mac Mini AI Cluster Uses EXO Labs with...

Alex Cheema's profile picture

Alex Cheema

3,515,929 Aufrufe • vor 1 Jahr

Tested the new MacBook Pro M4 Pro vs. the...

01000010's profile picture

01000010

111,446 Aufrufe • vor 1 Jahr

Managed to get Ling Mini 16B (1.4B active) running...

Awni Hannun's profile picture

Awni Hannun

92,422 Aufrufe • vor 8 Monaten

Sam 3 by Facebook now on MLX 🚀 Here...

Prince Canuma's profile picture

Prince Canuma

180,245 Aufrufe • vor 2 Monaten

DeepSeek R1 Qwen 7B 4bit M2 Ultra vs M4...

Ivan Fioravanti ᯅ's profile picture

Ivan Fioravanti ᯅ

59,734 Aufrufe • vor 1 Jahr

Currently working on a retro inspired action horror game...

Kathy (Prii)'s profile picture

Kathy (Prii)

83,894 Aufrufe • vor 2 Jahren

Introducing MON Protocol Partner - Hybrid Hybrid is an...

MON Protocol 🐉 $MON's profile picture

MON Protocol 🐉 $MON

182,628 Aufrufe • vor 2 Jahren

Qwen 3.5 0.8B, Gated DeltaNet attention is running on...

Anemll's profile picture

Anemll

13,589 Aufrufe • vor 3 Monaten

Gotta love MoEs on Apple silicon with MLX. Kimi's...

Awni Hannun's profile picture

Awni Hannun

23,209 Aufrufe • vor 1 Jahr

Sparsely activated models like MOEs and Apple silicon +...

Awni Hannun's profile picture

Awni Hannun

27,452 Aufrufe • vor 1 Jahr

Mac owners don't miss this: MLX LM is now...

Victor M's profile picture

Victor M

204,554 Aufrufe • vor 1 Jahr