First steps for a specialized DeepSeek v4 Flash inference...

antirez's profile picture

antirez

14,176 Aufrufe • vor 1 Monat

DeepSeek DeepSeek is now live on Heurist. Open source...

Heurist's profile picture

Heurist

25,657 Aufrufe • vor 1 Jahr

AGI at home Running DeepSeek R1 across my 7...

Alex Cheema's profile picture

Alex Cheema

1,934,652 Aufrufe • vor 1 Jahr

Running DeepSeek R1 on my desk Uses EXO Labs...

Alex Cheema's profile picture

Alex Cheema

992,119 Aufrufe • vor 1 Jahr

Nemotron 3 Ultra is fast and genuinely good Compared...

GMI Cloud's profile picture

GMI Cloud

224,436 Aufrufe • vor 15 Tagen

Batching for vision models is now available in Beta...

LM Studio's profile picture

LM Studio

46,015 Aufrufe • vor 1 Monat

Watching llama.cpp do 40 tok/s inference of the 7B...

Nat Friedman's profile picture

Nat Friedman

1,764,052 Aufrufe • vor 3 Jahren

How much faster is the new MacBook Pro for...

Alex Cheema - e/acc's profile picture

Alex Cheema - e/acc

527,894 Aufrufe • vor 1 Jahr

Happy Friday! We just put DeepSeek-V4-Pro up on It’s...

NVIDIA AI's profile picture

NVIDIA AI

202,087 Aufrufe • vor 1 Monat

Laika AI x Inference Labs Excited to announce our...

Laika AI's profile picture

Laika AI

13,727 Aufrufe • vor 1 Jahr

Another demo of the iPhone 17 Pro’s on-device LLM...

Adrien Grondin's profile picture

Adrien Grondin

46,205 Aufrufe • vor 9 Monaten

Introducing a 100% free coding agent with DeepSeek v4...

James Grugett's profile picture

James Grugett

403,143 Aufrufe • vor 1 Monat

Got continuous batching working with SSMs in mlx-lm. Here's...

Awni Hannun's profile picture

Awni Hannun

35,078 Aufrufe • vor 5 Monaten

The world’s fastest inference for Llama 4 Scout is...

Poe's profile picture

Poe

17,286 Aufrufe • vor 1 Jahr

Real-time Moondream inference using our new inference engine

vik's profile picture

vik

144,409 Aufrufe • vor 2 Monaten

Want to run Deepseek R1 ? Text-generation-inference v3.1.0 is...

Nicolas Patry's profile picture

Nicolas Patry

28,859 Aufrufe • vor 1 Jahr

Today, we're excited to announce a partnership with Manta...

Nesa's profile picture

Nesa

35,180 Aufrufe • vor 1 Jahr

My AI broke the world record on Tempest yesterday!...

Dave W Plummer's profile picture

Dave W Plummer

37,372 Aufrufe • vor 3 Monaten

Step 4 to achieve truly serverless GPUs for AI...

Charles 🎉 Frye's profile picture

Charles 🎉 Frye

17,384 Aufrufe • vor 1 Monat

Deepseek V4 Flash is now free via Nous Portal...

Nous Research's profile picture

Nous Research

517,475 Aufrufe • vor 1 Monat

NVIDIA Nemotron 3 Nano Omni, a new multimodal reasoning...

NVIDIA Robotics's profile picture

NVIDIA Robotics

15,828 Aufrufe • vor 1 Monat