Video yükleniyor...

Video Yüklenemedi

Bu video yüklenirken bir sorun oluştu. Bu geçici bir ağ sorunundan kaynaklanıyor olabilir veya video kullanılamıyor olabilir.

Ana Sayfaya Dön

Introducing Parallax, the first fully distributed inference and serving engine for large language models. Try it now: 🧵

Gradient

712,573 subscribers

161,065 görüntüleme • 1 yıl önce •via X (Twitter)

Bilim & Teknoloji

Anya Rossi• Live Now

Private livecam show

10 Yorum

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

AI is reaching a bottleneck. LLMs are reshaping how we think, build, and create, but their demand for tokens is outpacing what centralized infra can deliver. Chips saturated; Power grids strained; Intelligence remains locked behind high-cost silos. We need a new paradigm.

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

Parallax reimagines model inference as a global, collaborative process, one where models are no longer chained to centralized infrastructure, but are instead recomposed, executed, and verified across a global mesh of compute.

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

The engine introduces 3 foundational shifts: – Intelligence sovereignty: serve models from the hardware you trust – Composable inference: GPUs, Apple Silicon, desktops working in harmony – Latent compute: activate into the world’s untapped compute

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

The Parallax Runtime Layer is the core orchestration engine for high-throughput, server-side LLM serving across distributed, heterogeneous networks. It delivers server-grade optimizations—from continuous batching to paged KV-cache—and is the first MLX-based framework to enable professional-grade inference on Apple Silicon. By unifying NVIDIA GPUs and Apple devices into a single compute fabric, Parallax brings frictionless decentralized AI to everyone.

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

Parallax runs on a distributed architecture called the Swarm: a dynamic network of nodes that collaboratively serve LLMs. Each prompt is processed across heterogeneous nodes, with each handling a segment of the model. The result: real-time inference that is decentralized, fluid, and verifiable.

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

Compared to Petals (BitTorrent-style serving), Parallax running Qwen2.5-72B on 2× RTX 5090s achieved: – 3.1× lower end-to-end latency, 5.3× faster inter-token latency – 2.9× faster time-to-first-token, 3.1× higher I/O throughput Results were consistent and showed great scalability across different input configurations, and this is just the beginning.

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

Now live: a chatbot fully powered by Parallax. Every response is generated peer-to-peer with no centralized server involved. Experience decentralized LLM inference:

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

The swarm is growing. Apply to join the Edge Host Pilot Program to scale the world’s intelligence:

Gradient Network profil fotoğrafı

Gradient Network1 yıl önce

Parallax is a major step toward a future where intelligence is hosted, served, and owned by all. Together with Lattica, the decentralized AI base stack is taking shape. Parallax will be open-sourced soon. Read our full blog on Parallax 👇

rw./ 🌐 profil fotoğrafı

rw./ 🌐1 yıl önce

./ This is just the start… 👀🌐

Benzer Videolar

Today, we're introducing our document parser built specifically for RAG. The parser combines the best vision, OCR, and vision language models to deliver unmatched accuracy. Try it for free today—the first 500+ pages are on us! 🧵 1/

Today, we're introducing our document parser built specifically for RAG. The parser combines the best vision, OCR, and vision language models to deliver unmatched accuracy. Try it for free today—the first 500+ pages are on us! 🧵 1/

Douwe Kiela

1,308,593 görüntüleme • 1 yıl önce

🚀 Excited to announce the first release of a novel open source programming language and platform for language model interaction! Combining prompts, constraints & scripting, LMQL elevates the capabilities of large language models. 🧵1/6 A quick tour.

🚀 Excited to announce the first release of a novel open source programming language and platform for language model interaction! Combining prompts, constraints & scripting, LMQL elevates the capabilities of large language models. 🧵1/6 A quick tour.

LMQL (Language Model Query Language)

198,966 görüntüleme • 3 yıl önce

Gradient recently launched 2 game-changing technologies for decentralized AI: • Parallax - distributed inference engine • Lattica - peer-to-peer communication layer for AI Together, they break the centralized mold of AI. Here’s everything you need to know (in 90 seconds) 👇

Gradient recently launched 2 game-changing technologies for decentralized AI: • Parallax - distributed inference engine • Lattica - peer-to-peer communication layer for AI Together, they break the centralized mold of AI. Here’s everything you need to know (in 90 seconds) 👇

Cipher

34,599 görüntüleme • 1 yıl önce

Introducing the Luma✨Unreal Engine alpha! Fully volumetric Luma NeRFs running realtime on Windows in UE 5 for incredible cinematic shots and experiences, starting today! Try now:

Introducing the Luma✨Unreal Engine alpha! Fully volumetric Luma NeRFs running realtime on Windows in UE 5 for incredible cinematic shots and experiences, starting today! Try now:

Luma

628,364 görüntüleme • 3 yıl önce

Introducing Magenta RealTime 2 (MRT2): the live music model you can play as an instrument. MRT2 offers MIDI and prompt controls, and runs natively on a MacBook with <200ms latency. Open weights. Open source inference engine. Suite of apps and plugins. Hear what it can do and try it out for yourself below 🧵

Introducing Magenta RealTime 2 (MRT2): the live music model you can play as an instrument. MRT2 offers MIDI and prompt controls, and runs natively on a MacBook with <200ms latency. Open weights. Open source inference engine. Suite of apps and plugins. Hear what it can do and try it out for yourself below 🧵

Google Magenta Project

112,452 görüntüleme • 1 ay önce

Introducing Archer The global trading engine for every asset, fully onchain on Solana. Live now.

Introducing Archer The global trading engine for every asset, fully onchain on Solana. Live now.

ARCHER

71,922 görüntüleme • 2 ay önce

Diffusion language models are SO FAST!! A new startup, Inception Labs, has released Mercury Coder, "the first commercial-scale diffusion large language model" It's 5-10x faster than current gen LLMs, providing high-quality responses at low costs. And you can try it now!

Diffusion language models are SO FAST!! A new startup, Inception Labs, has released Mercury Coder, "the first commercial-scale diffusion large language model" It's 5-10x faster than current gen LLMs, providing high-quality responses at low costs. And you can try it now!

Tanishq, Ph.D. at ICML

354,254 görüntüleme • 1 yıl önce

Introducing GRID: the General Robot Intelligence Development platform, designed for prototyping smart and safe robots rapidly using foundation models, LLMs, and simulation. Paper: Try now: GitHub: 🧵👇(1/N)

Introducing GRID: the General Robot Intelligence Development platform, designed for prototyping smart and safe robots rapidly using foundation models, LLMs, and simulation. Paper: Try now: GitHub: 🧵👇(1/N)

Sai Vemprala

277,299 görüntüleme • 2 yıl önce

🏥 Introducing the new and upgraded Medical Sphere—our biggest step yet toward building a global community for evaluating AI models on medical tasks. 🌐Try it now: 📝 Blog post: 🧵Here’s what’s new:

🏥 Introducing the new and upgraded Medical Sphere—our biggest step yet toward building a global community for evaluating AI models on medical tasks. 🌐Try it now: 📝 Blog post: 🧵Here’s what’s new:

Lavita.AI

12,122 görüntüleme • 1 yıl önce

introducing trainers for Qwen-2512 and Z-Image. now you can train LoRAs for these two models and use them in Krea Image. try it now!

introducing trainers for Qwen-2512 and Z-Image. now you can train LoRAs for these two models and use them in Krea Image. try it now!

Krea

26,849 görüntüleme • 6 ay önce

New Threads. 🧵🧵 And introducing for the first time the 𝑺𝒉𝒂𝒅𝒐𝒘 uniform. #RELENTLESS

New Threads. 🧵🧵 And introducing for the first time the 𝑺𝒉𝒂𝒅𝒐𝒘 uniform. #RELENTLESS

Michigan State Football

153,399 görüntüleme • 3 yıl önce

introducing Qwen Edit. this new models offers incredible image editing capabilities from text prompts. try it now for free!

introducing Qwen Edit. this new models offers incredible image editing capabilities from text prompts. try it now for free!

Krea

76,683 görüntüleme • 11 ay önce

Introducing Lasso: The natural language search engine for blockchain data ✨ Search the chain, get data, and find alpha in your own words -- no SQL or complex smart contract understanding needed! Sign up for early access here -> A 🧵

Introducing Lasso: The natural language search engine for blockchain data ✨ Search the chain, get data, and find alpha in your own words -- no SQL or complex smart contract understanding needed! Sign up for early access here -> A 🧵

Lasso

130,136 görüntüleme • 3 yıl önce

Introducing DeepProve—Lagrange’s zkML Library—a breakthrough in verifiable AI inference. We can now verify AI decisions instead of blindly trusting black-box models. And we can do it up to 158x faster than ever before. The future of AI is ZK. The future of humanity is Lagrange: 🧵

Introducing DeepProve—Lagrange’s zkML Library—a breakthrough in verifiable AI inference. We can now verify AI decisions instead of blindly trusting black-box models. And we can do it up to 158x faster than ever before. The future of AI is ZK. The future of humanity is Lagrange: 🧵

LAGRANGE

1,740,772 görüntüleme • 1 yıl önce

Introducing The Largest Organized and Incentivized Human-Data Fleet for General-Purpose Humanoid-Robotics and large language model training Our users are creators that complete daily data quests with crypto incentives through our Mobile App 🧵

Introducing The Largest Organized and Incentivized Human-Data Fleet for General-Purpose Humanoid-Robotics and large language model training Our users are creators that complete daily data quests with crypto incentives through our Mobile App 🧵

Mecka

385,649 görüntüleme • 2 yıl önce

1/ Introducing RL Swarm’s new backend: GenRL. A modular reinforcement learning library built for distributed, fault-tolerant training - now powering RL Swarm from the ground up. 🧵

1/ Introducing RL Swarm’s new backend: GenRL. A modular reinforcement learning library built for distributed, fault-tolerant training - now powering RL Swarm from the ground up. 🧵

gensyn

82,174 görüntüleme • 1 yıl önce

I’m thrilled to introduce my very first full-stack app: LLM Explorer! ✨ ✨ Explore 190+ Large Language Models and their specifics, from well-known GPTs to hidden gems from across the world! Let’s dive in! 🧵↓

I’m thrilled to introduce my very first full-stack app: LLM Explorer! ✨ ✨ Explore 190+ Large Language Models and their specifics, from well-known GPTs to hidden gems from across the world! Let’s dive in! 🧵↓

Charly Wargnier

37,582 görüntüleme • 1 yıl önce

Proteins can now talk. Introducing BioReason-Pro, the first reasoning model for protein function. A thread🧵

Proteins can now talk. Introducing BioReason-Pro, the first reasoning model for protein function. A thread🧵

Adib

204,333 görüntüleme • 4 ay önce