First steps for a specialized DeepSeek v4 Flash inference...

antirez's profile picture

antirez

14,176 views • 1 month ago

DeepSeek DeepSeek is now live on Heurist. Open source...

Heurist's profile picture

Heurist

25,657 views • 1 year ago

AGI at home Running DeepSeek R1 across my 7...

Alex Cheema's profile picture

Alex Cheema

1,934,687 views • 1 year ago

Running DeepSeek R1 on my desk Uses EXO Labs...

Alex Cheema's profile picture

Alex Cheema

992,163 views • 1 year ago

Nemotron 3 Ultra is fast and genuinely good Compared...

GMI Cloud's profile picture

GMI Cloud

224,625 views • 21 days ago

Batching for vision models is now available in Beta...

LM Studio's profile picture

LM Studio

46,015 views • 1 month ago

Watching llama.cpp do 40 tok/s inference of the 7B...

Nat Friedman's profile picture

Nat Friedman

1,764,077 views • 3 years ago

How much faster is the new MacBook Pro for...

Alex Cheema's profile picture

Alex Cheema

529,673 views • 1 year ago

Happy Friday! We just put DeepSeek-V4-Pro up on It’s...

NVIDIA AI's profile picture

NVIDIA AI

202,637 views • 2 months ago

Laika AI x Inference Labs Excited to announce our...

Laika AI's profile picture

Laika AI

13,727 views • 1 year ago

Another demo of the iPhone 17 Pro’s on-device LLM...

Adrien Grondin's profile picture

Adrien Grondin

46,205 views • 9 months ago

Introducing a 100% free coding agent with DeepSeek v4...

James Grugett's profile picture

James Grugett

407,283 views • 1 month ago

Got continuous batching working with SSMs in mlx-lm. Here's...

Awni Hannun's profile picture

Awni Hannun

35,078 views • 5 months ago

The world’s fastest inference for Llama 4 Scout is...

Poe's profile picture

Poe

17,286 views • 1 year ago

Real-time Moondream inference using our new inference engine

vik's profile picture

vik

144,409 views • 3 months ago

Want to run Deepseek R1 ? Text-generation-inference v3.1.0 is...

Nicolas Patry's profile picture

Nicolas Patry

28,859 views • 1 year ago

Today, we're excited to announce a partnership with Manta...

Nesa's profile picture

Nesa

35,180 views • 1 year ago

My AI broke the world record on Tempest yesterday!...

Dave W Plummer's profile picture

Dave W Plummer

37,372 views • 4 months ago

Step 4 to achieve truly serverless GPUs for AI...

Charles 🎉 Frye's profile picture

Charles 🎉 Frye

17,452 views • 1 month ago

Deepseek V4 Flash is now free via Nous Portal...

Nous Research's profile picture

Nous Research

517,610 views • 1 month ago

NVIDIA Nemotron 3 Nano Omni, a new multimodal reasoning...

NVIDIA Robotics's profile picture

NVIDIA Robotics

15,828 views • 1 month ago