Microsoft just released a 1-bit LLM with 2B parameters that...

Shubham Saboo

260,049 views • 1 year ago

You are not prepared for this, 250+ tokens/sec, 1B...

anton

372,371 views • 2 years ago

LightVAE + ComfyUI node: High-performance video VAE; runs 2–3x...

Wildminder

38,092 views • 8 months ago

LM Studio 0.3.4 ships with Apple MLX 🚢🍎 Run...

LM Studio

171,777 views • 1 year ago

Llama 3.2 1B in 4-bit runs at ~60 toks/sec...

Awni Hannun

492,413 views • 1 year ago

GPT-4o level multimodal LLM running on your phone. MiniCPM-V...

Shubham Saboo

18,112 views • 10 months ago

How much faster is the new MacBook Pro for...

Alex Cheema

529,673 views • 1 year ago

Latest mlx-lm has faster and lower memory prompt processing!...

Awni Hannun

22,156 views • 1 year ago

`transformers` + `torchao` quantization + `torch.compile` for faster inference...

Marc Sun

24,515 views • 1 year ago

Today Meta released "Code Llama", a large language model...

Marcel Pociot 🧪

50,094 views • 2 years ago

RAG is not Memory for AI Agents. 5 AI...

Unwind AI

85,045 views • 10 months ago

Llama 3.2 is the latest open-source AI model from...

Akash Network

37,087 views • 1 year ago

Microsoft is testing a new feature in Windows 11...

Pirat_Nation 🔴

145,459 views • 1 month ago

You can run and monitor Claude Code from literally...

Unwind AI

15,573 views • 10 months ago

Meta released LongVU: a new video LM that can...

merve

49,546 views • 1 year ago

RAG engine that just works for complex real-world documents....

Shubham Saboo

45,212 views • 1 year ago

Indian man finds romance with Fido while his H-1B processes

LXXPBUH*

4,830,005 views • 1 year ago

Perplexity's Sonar—built on Llama 3.3 70b—outperforms GPT-4o-mini and Claude...

Perplexity

565,953 views • 1 year ago

I just created my own LaTeX-OCR app using Llama...

Avi Chawla

15,610 views • 1 year ago

RAG is not Memory. AI agents need long-term memory...

Unwind AI

293,513 views • 1 year ago