Real-time Moondream inference using our new inference engine

vik

144,409 views • 2 months ago

Laika AI x Inference Labs Excited to announce our...

Laika AI

13,727 views • 1 year ago

Step 4 to achieve truly serverless GPUs for AI...

Charles 🎉 Frye

17,384 views • 1 month ago

First steps for a specialized DeepSeek v4 Flash inference...

antirez

14,159 views • 1 month ago

Introducing DeepThought-8B: Transparent reasoning model built on LLaMA-3.1 with...

Ruliad

219,315 views • 1 year ago

Batching for vision models is now available in Beta...

LM Studio

46,015 views • 1 month ago

The team at Runway is pushing the frontier of...

Modal

68,896 views • 2 months ago

Video generation is powerful but too slow for real-world...

Shuang Li

67,367 views • 1 year ago

Excited to share our NeurIPS 2024 Oral, Convolutional Differentiable...

Felix Petersen

157,435 views • 1 year ago

Distributed Inference, Now in Hybrid. Try it now:

Gradient

34,773 views • 10 months ago

Fireworks blazing fast LLM inference is now available on...

Fireworks AI

93,334 views • 2 years ago

For fellows. What is your inference from this LV angiogram?

Dr G Rajesh (Gopalan Nair Rajesh).

33,810 views • 1 year ago

The Inference - Episode 7 👾 Our second “The...

Warden

20,068 views • 5 months ago

Grok 2 mini is now 2x faster than it...

Igor Babuschkin

1,803,113 views • 1 year ago

Move from experimentation to real AI outcomes with secure...

HPE

1,743,946 views • 3 months ago

just walked into one of our ai inference teammates’...

Ramin

82,749 views • 11 days ago

Real-time distributed inference monitoring is live on exo intern...

Alex Cheema - e/acc

17,798 views • 1 year ago

We’re amazed: AMD now beats Nvidia in inference for...

Higgsfield AI 🧩

841,844 views • 1 year ago

🏎️ Incredible Speeds with Groq's LPU-Powered Inference 🕔 The...

LangChain

27,387 views • 2 years ago

[ Comparison Video ] Truth & Inference Series Optimization...

Identity V | News

167,443 views • 2 months ago

PRIMA.CPP Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday...

AK

48,241 views • 1 year ago