Introducing DeepThought-8B: Transparent reasoning model built on LLaMA-3.1 with...

Ruliad's profile picture

Ruliad

219,315 views • 1 year ago

Llama 2: Now on Hugging Chat 🤗🦙 Try out...

Hugging Face's profile picture

Hugging Face

403,558 views • 2 years ago

`transformers` + `torchao` quantization + `torch.compile` for faster inference...

Marc Sun's profile picture

Marc Sun

24,515 views • 1 year ago

starting the week with a true groundbreaking work 💥...

apolinario 🌐's profile picture

apolinario 🌐

11,833 views • 1 year ago

First came pre-training scaling; then came inference-time scaling. Now...

Leonard Tang's profile picture

Leonard Tang

111,298 views • 1 year ago

Introducing 𝗦𝘂𝗽𝗲𝗿 𝗝𝗦𝗢𝗡 𝗠𝗼𝗱𝗲, a framework for low latency...

Varun Shenoy's profile picture

Varun Shenoy

166,119 views • 2 years ago

Introducing ✨ Aya Vision ✨ - an open-weights model...

Cohere Labs's profile picture

Cohere Labs

206,502 views • 1 year ago

You can now run inference directly on the Llama...

Together AI's profile picture

Together AI

21,489 views • 1 year ago

Laika AI x Inference Labs Excited to announce our...

Laika AI's profile picture

Laika AI

13,727 views • 1 year ago

NVIDIA Nemotron 3 Nano Omni, a new multimodal reasoning...

NVIDIA Robotics's profile picture

NVIDIA Robotics

15,828 views • 1 month ago

Llama 3.1 Nemotron 70B is the latest model from...

Akash Network's profile picture

Akash Network

38,472 views • 1 year ago

The easiest way to use this new model is...

Paul Couvert's profile picture

Paul Couvert

81,620 views • 1 year ago

pip install spectralquant ✂️ Up to 6.62x KV cache...

ani's profile picture

ani

16,583 views • 23 days ago

Introducing Alpamayo 1.5. Based on community feedback, we’ve updated...

NVIDIA DRIVE's profile picture

NVIDIA DRIVE

50,693 views • 3 months ago

MolmoAct2 is landing in LeRobot! Ai2's open Action Reasoning...

LeRobot's profile picture

LeRobot

24,523 views • 27 days ago

PRIMA.CPP Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday...

AK's profile picture

AK

48,241 views • 1 year ago

🤖 ExoBrain is a compute device built for large-scale...

ExoBrain's profile picture

ExoBrain

21,529 views • 3 months ago

We just shipped support for tool use and JSON...

Hatice Ozen's profile picture

Hatice Ozen

43,225 views • 1 year ago

🔥🔥🔥We’ve been listening to your feedback! Our latest world...

Tencent Hy's profile picture

Tencent Hy

20,581 views • 5 months ago

Llama 3.3 70B is live on AkashChat. The latest...

Akash Network's profile picture

Akash Network

16,463 views • 1 year ago

Enterprise wants privacy. Builders want flexibility. Users want speed....

Nesa's profile picture

Nesa

52,336 views • 6 months ago