Loading video...

Video Failed to Load

There was a problem loading this video. This could be due to a temporary network issue or the video might be unavailable.

Introducing OpenDiLoCo, an open-source implementation and scaling of DeepMind’s Distributed Low-Communication (DiLoCo) method, enabling globally distributed AI model training.

Prime Intellect

60,558 subscribers

260,002 views • 2 years ago •via X (Twitter)

Education News & Politics Science & Technology

Anya Rossi• Live Now

Private livecam show

11 Comments

Prime Intellect2 years ago

Last week, we released the first step in our masterplan by launching the PI Compute Exchange. Today, we are thrilled to announce a major step forward on the second part by open-sourcing our framework to enable collaborative model development across globally distributed GPUs.

Prime Intellect2 years ago

We reproduced DeepMind's DiLoCo experiments in a scalable, decentralized training framework. We trained a model across 3 countries with 90-95% compute utilization and scaled it to 3x the size of the original work, proving its effectiveness for billion-parameter models.

Prime Intellect2 years ago

DiLoCo Recent work by @GoogleDeepMind introduced an approach that enables training of language models on devices that are poorly connected. The method allows for data parallel training, but requires synchronization of gradients only every 500 steps.

Prime Intellect2 years ago

OpenDiLoCo To foster collaboration in this promising research direction to democratize AI, we have released our code for OpenDiLoCo under an open-source license: Our implementation is built on top of the Hivemind library, enabling a real-world decentralized training setup for DiLoCo, including: - On/Off ramping of compute resources - Fault tolerance training - Peer-to-Peer: There is no master node.

Prime Intellect2 years ago

We replicate the main experiment results and show that DiLoCo with 8 replicas significantly outperforms the baseline without any replicas and matches the performance of a stronger baseline with the same compute budget, despite 500x lower communication.

Prime Intellect2 years ago

Scaling DiLoCo to Billion Parameter Models The original DiLoCo work only experimented with model sizes of up to 400 million parameters. We scale the method to a 1.1 billion parameter model. While we demonstrate that DiLoCo works at the billion-parameter scale, we believe further work is needed to make it effective with larger batch sizes and increased local steps.

Prime Intellect2 years ago

Globally Distributed Training Setting We train a billion parameter scale model across three countries. Due to DiLoCo’s reduction in communication time, the all-reduce bottleneck only took up 6.9% of the training time, minimally impacting the overall training speed.

Prime Intellect2 years ago

Blog: Code: Paper:

Prime Intellect2 years ago

We are excited about OpenDiLoCo's practical applications and look forward to building on it for the third part of our masterplan: To collaboratively train and contribute to open AI models in high-impact domains like language, agents, code, and science for collective ownership.

Prime Intellect2 years ago

Join us in building the open future of decentralized AI! - Apply for open roles: - Collaborate on AI initiatives: - Contribute compute & earn ownership

Prime Intellect2 years ago

We want to thank @m_ryabinin for his guidance and help with the Hivemind library, and @Ar_Douillard for his work on DiLoCo and helping us figure out the details of reproducing the original experiments!

Related Videos

Today, we're laying the foundation to accelerate open & decentralized AI Introducing our protocol & testnet: A peer-to-peer compute and intelligence network. Enabling collective creation, ownership, and access of sovereign open-source AI Towards an open superintelligence future

Today, we're laying the foundation to accelerate open & decentralized AI Introducing our protocol & testnet: A peer-to-peer compute and intelligence network. Enabling collective creation, ownership, and access of sovereign open-source AI Towards an open superintelligence future

Prime Intellect

266,737 views • 1 year ago

There's no point in doing decentralized training without efficient communication. >> DiLoCo (H=15) ships ~480mb/merge with 163 syncs. >> SparseLoCo (H=15) ships ~5.5–17mb/merge at 0.78–3.12% density with 163 syncs Top-K Compression + 2 bit comms ~28–89× smaller per sync than DiLoCo. Subnet 3 :: Luis el grande If you have the algorithm, you can train large language models across disparate compute, collectively. "In the space of eight months or nine months, we've been able to scale our model from 1.2B to 70B, which represents 58x improvement" Distributed State Research paper :: Full Episode059 + const :: The holy grail of distributed AI training SN3 :: Templar :: Luis el grande_ai SN39 :: Basilica :: basilica SN81 :: Grail :: grail #SN3 #SN39 #SN81 #Bittensor

There's no point in doing decentralized training without efficient communication. >> DiLoCo (H=15) ships ~480mb/merge with 163 syncs. >> SparseLoCo (H=15) ships ~5.5–17mb/merge at 0.78–3.12% density with 163 syncs Top-K Compression + 2 bit comms ~28–89× smaller per sync than DiLoCo. Subnet 3 :: Luis el grande If you have the algorithm, you can train large language models across disparate compute, collectively. "In the space of eight months or nine months, we've been able to scale our model from 1.2B to 70B, which represents 58x improvement" Distributed State Research paper :: Full Episode059 + const :: The holy grail of distributed AI training SN3 :: Templar :: Luis el grande_ai SN39 :: Basilica :: basilica SN81 :: Grail :: grail #SN3 #SN39 #SN81 #Bittensor

Openτensor Foundaτion

17,767 views • 10 months ago

Introducing Open Deep Research 🔭 An open source AI Agent that reasons large amounts of web data extracted with Open source. Powered by the AI SDK

Introducing Open Deep Research 🔭 An open source AI Agent that reasons large amounts of web data extracted with Open source. Powered by the AI SDK

Nicolas Camara

215,028 views • 1 year ago

mlx distributed uses mpi for machine to machine communication, and this is how you can handle uneven distribution of model layers. Thanks Awni Hannun for the help

mlx distributed uses mpi for machine to machine communication, and this is how you can handle uneven distribution of model layers. Thanks Awni Hannun for the help

Alex Ziskind

31,265 views • 1 year ago

Introducing Open Source Reve I built an open-source version of Reve using AI Studio and Nano Banana, making creative image generation and editing available for everyone.

Introducing Open Source Reve I built an open-source version of Reve using AI Studio and Nano Banana, making creative image generation and editing available for everyone.

CHOI

57,361 views • 10 months ago

Introducing Ψ₀ ( — an open foundation model for universal humanoid loco-manipulation. 🏆 Outperforms GR00T N1.6 by 40%+ overall success rate 📉 Uses only ~10% of the pre-training data 📦 Fully open-source: model, data, code, and deployment pipeline 1/10

Introducing Ψ₀ ( — an open foundation model for universal humanoid loco-manipulation. 🏆 Outperforms GR00T N1.6 by 40%+ overall success rate 📉 Uses only ~10% of the pre-training data 📦 Fully open-source: model, data, code, and deployment pipeline 1/10

Yue Wang

20,363 views • 4 months ago

Announcing INTELLECT-1: the first-ever decentralized training of a 10B model Scaling decentralized training 10x beyond prior efforts. Anyone can join us to build open-source AGI 🦋

Announcing INTELLECT-1: the first-ever decentralized training of a 10B model Scaling decentralized training 10x beyond prior efforts. Anyone can join us to build open-source AGI 🦋

Prime Intellect

790,586 views • 1 year ago

Introducing Bambot, an open source, low cost (~$300) humanoid robot. Inspired by LeRobot , so-100 arm and lekiwi

Introducing Bambot, an open source, low cost (~$300) humanoid robot. Inspired by LeRobot , so-100 arm and lekiwi

Tim Qian

193,417 views • 1 year ago

🪣 RustFS: high-performance, distributed object storage system built in Rust - offers full S3 compatibility, is completely open-source, and is optimized for data lakes, AI, and big data workloads

🪣 RustFS: high-performance, distributed object storage system built in Rust - offers full S3 compatibility, is completely open-source, and is optimized for data lakes, AI, and big data workloads

AstraKernel 💫

34,101 views • 7 months ago

📢 Introducing World Vibe Web: a distributed, open-source app store. Anyone can create an apps.json, register their store, and their apps appear on Fully static on GitHub Pages. 2 stores w 18 apps live now. Add yours with a PR.

📢 Introducing World Vibe Web: a distributed, open-source app store. Anyone can create an apps.json, register their store, and their apps appear on Fully static on GitHub Pages. 2 stores w 18 apps live now. Add yours with a PR.

fatih kadir akın

33,298 views • 4 months ago

Introducing TogetherLink! An open source CLI to run any open source model inside your favorite coding harness. Run GLM 5.2 directly in Codex and Claude Code.

Introducing TogetherLink! An open source CLI to run any open source model inside your favorite coding harness. Run GLM 5.2 directly in Codex and Claude Code.

Hassan

33,312 views • 15 days ago

Introducing Project Stera by FPV Labs, an open data infra for embodied AI research. Project Stera includes Stera-10M, with 10M+ frames of long-horizon data with persistent state tracking, and an open-source pipeline that converts raw data into training-ready formats.

Introducing Project Stera by FPV Labs, an open data infra for embodied AI research. Project Stera includes Stera-10M, with 10M+ frames of long-horizon data with persistent state tracking, and an open-source pipeline that converts raw data into training-ready formats.

FPV Labs

15,025 views • 2 months ago

Akash Homenode is currently in beta taking in 3090s, 4090s, & 5090s to form a network for inference and start off Akash's foothold in the future of distributed training. Greg Osuri 🇺🇸 talks about why he believes the future of AI model training is through home GPUs, everywhere.

Akash Homenode is currently in beta taking in 3090s, 4090s, & 5090s to form a network for inference and start off Akash's foothold in the future of distributed training. Greg Osuri 🇺🇸 talks about why he believes the future of AI model training is through home GPUs, everywhere.

Akash Network

30,778 views • 1 month ago

Introducing... Amica! 👩‍🦰🤖 Amica is an open source interface for interactive communication with 3D characters with voice synthesis, speech recognition, visual understanding, and an emotion system. 🔗

Introducing... Amica! 👩‍🦰🤖 Amica is an open source interface for interactive communication with 3D characters with voice synthesis, speech recognition, visual understanding, and an emotion system. 🔗

Arbius

41,769 views • 2 years ago

Introducing Whisper – an open source voice note taking app! Record voice notes and transcribe them into lists, blogs, & more with AI. 100% free & open source.

Introducing Whisper – an open source voice note taking app! Record voice notes and transcribe them into lists, blogs, & more with AI. 100% free & open source.

Hassan

280,769 views • 1 year ago

Introducing Passmark: an open-source AI agent purpose-built for regression testing at scale. Built on Playwright: natural language tests, multi-model assertions, smart caching, telemetry, AI gateway support and more! github: bug0inc/passmark

Introducing Passmark: an open-source AI agent purpose-built for regression testing at scale. Built on Playwright: natural language tests, multi-model assertions, smart caching, telemetry, AI gateway support and more! github: bug0inc/passmark

Sandeep

34,933,218 views • 3 months ago

Introducing LogoCreator! An open source logo generator that creates professional logos in seconds using Flux Pro 1.1 on Together AI. 100% free and open source. Demo + code:

Hassan

349,316 views • 1 year ago

AI that senses coding frustration, is this the future of learning? In our latest GitHub Podcast episode we talk to Angie Jones about Goose, Block’s open source AI agent and reference implementation of the Model Context Protocol (MCP).

AI that senses coding frustration, is this the future of learning? In our latest GitHub Podcast episode we talk to Angie Jones about Goose, Block’s open source AI agent and reference implementation of the Model Context Protocol (MCP).

GitHub

16,414 views • 8 months ago

BG2. Ep 17. Double $NVDA! System Level Comp Moat, “Insane Demand”, Inference Explosion 1 B x, Memphis Supercluster, OpenAI, & more. Brad Gerstner Clark Tang Bill Gurley (00:00) Intro (1:50) The Evolution of AGI and Personal Assistants (06:03) NVIDIA's Competitive Moat (15:51 ) The Future of Inference and Training in AI (19:01) Building the AI Infrastructure (31:35) Inventing a New Market in an AI Future (38:40) The Impact of OpenAI (43:25) The Future of AI Models (46.44) and Memphis Supercluster (51:21) Distributed Computing and Inference Scaling (55:54) Inference Time Reasoning and Its Importance (01:00:46) AI's Role in Growing Business and Improving Productivity (01:08:00) Ensuring Safe AI Development (01:12:31) The Balance of Open Source and Closed Source AI

BG2. Ep 17. Double $NVDA! System Level Comp Moat, “Insane Demand”, Inference Explosion 1 B x, Memphis Supercluster, OpenAI, & more. Brad Gerstner Clark Tang Bill Gurley (00:00) Intro (1:50) The Evolution of AGI and Personal Assistants (06:03) NVIDIA's Competitive Moat (15:51 ) The Future of Inference and Training in AI (19:01) Building the AI Infrastructure (31:35) Inventing a New Market in an AI Future (38:40) The Impact of OpenAI (43:25) The Future of AI Models (46.44) and Memphis Supercluster (51:21) Distributed Computing and Inference Scaling (55:54) Inference Time Reasoning and Its Importance (01:00:46) AI's Role in Growing Business and Improving Productivity (01:08:00) Ensuring Safe AI Development (01:12:31) The Balance of Open Source and Closed Source AI

Bg2 Pod

487,379 views • 1 year ago