Loading video...

Video Failed to Load

There was a problem loading this video. This could be due to a temporary network issue or the video might be unavailable.

Running Minimax M2.1 (MiniMax (official)) with OpenCode (OpenCode) and mlx_lm.server. Works quite well on an M3 Ultra. Once the KV cache is warm the prompt processing is pretty quick. And token generation is very fast.

Awni Hannun

37,077 subscribers

32,329 views • 5 months ago •via X (Twitter)

Education News & Politics Science & Technology

Anya Rossi• Live Now

Private livecam show

0 Comments

No comments available

Comments from the original post will appear here

Related Videos

Running four simultaneous OpenCode agents works well with mlx_lm.server continuous batching and MiniMax M2.1 on an M3 Ultra:

Running four simultaneous OpenCode agents works well with mlx_lm.server continuous batching and MiniMax M2.1 on an M3 Ultra:

Awni Hannun

95,013 views • 5 months ago

Running four high-level OpenCode agents + subagents with mlx_lm.server continuous batching and MiniMax M2.5 (6-bit). Fits easily on a 512GB M3 Ultra. Generation is quite fast. But prefill is still slow compared to cloud servers.

Running four high-level OpenCode agents + subagents with mlx_lm.server continuous batching and MiniMax M2.5 (6-bit). Fits easily on a 512GB M3 Ultra. Generation is quite fast. But prefill is still slow compared to cloud servers.

Awni Hannun

25,535 views • 3 months ago

PSA: MiniMax (official) M2.5 is freely available on OpenCode Great opportunity to check out the power of open models for coding

PSA: MiniMax (official) M2.5 is freely available on OpenCode Great opportunity to check out the power of open models for coding

Niels Rogge

49,016 views • 3 months ago

You can use a 100% open source and MUCH cheaper/free alternative to Claude Code and Opus 4.5 OpenCode + MiniMax M2.1 can even build a 3D website using pure vibe coding. Steps are really simple: 1. Install OpenCode using the command 'npm i -g opencode-ai' 2. Get your MiniMax API key here: 3. Configure the MiniMax (official) mode Just type "opencode connect minimax" Coding plans start at $2… 10x cheaper than Claude Code (yes). You can also invite friends so they can get 10% off and you’ll get 10% API credits. (You can also use it locally if you have the config) And you're ready to build and iterate almost endlessly since the model is both way faster and cheaper than Opus in CC.

You can use a 100% open source and MUCH cheaper/free alternative to Claude Code and Opus 4.5 OpenCode + MiniMax M2.1 can even build a 3D website using pure vibe coding. Steps are really simple: 1. Install OpenCode using the command 'npm i -g opencode-ai' 2. Get your MiniMax API key here: 3. Configure the MiniMax (official) mode Just type "opencode connect minimax" Coding plans start at $2… 10x cheaper than Claude Code (yes). You can also invite friends so they can get 10% off and you’ll get 10% API credits. (You can also use it locally if you have the config) And you're ready to build and iterate almost endlessly since the model is both way faster and cheaper than Opus in CC.

Paul Couvert

42,443 views • 4 months ago

I did it! It works! Using GLM-4.7-4bit with mlx_lm.server and opencode to fix real code locally! 🔥 Here single M3 Ultra 512GB, nex step phase will be 2 using Tensor Parallelism and then apply same changes to exo. Prefill is slow on a single machine, but generation is good.

I did it! It works! Using GLM-4.7-4bit with mlx_lm.server and opencode to fix real code locally! 🔥 Here single M3 Ultra 512GB, nex step phase will be 2 using Tensor Parallelism and then apply same changes to exo. Prefill is slow on a single machine, but generation is good.

Ivan Fioravanti ᯅ

44,000 views • 5 months ago

This is MiniMax-M2.5 MLX running in LM Studio on an Apple Mac Studio M3 Ultra 512GB. Fast enough out of the box for hosting OpenClaw, n8n workflows, and Open WebUI for the team.

This is MiniMax-M2.5 MLX running in LM Studio on an Apple Mac Studio M3 Ultra 512GB. Fast enough out of the box for hosting OpenClaw, n8n workflows, and Open WebUI for the team.

Patrick J Kennedy

73,547 views • 3 months ago

Opencode + MiniMax M2.1 have created an amazing fashion style website 👀 No skills added, just plain combo: CLI + Model. Zero-shot! WOW 🔥 I can cancel my Lovable subscription 🤷🏻‍♂️ Prompt used, below.

Opencode + MiniMax M2.1 have created an amazing fashion style website 👀 No skills added, just plain combo: CLI + Model. Zero-shot! WOW 🔥 I can cancel my Lovable subscription 🤷🏻‍♂️ Prompt used, below.

Ivan Fioravanti ᯅ

29,986 views • 5 months ago

MiniMax M2.1 in 4-bit cruises on an M3 Ultra with mlx-lm. Generated a space invaders game using 5098 tokens at 47.2 tok/sec:

MiniMax M2.1 in 4-bit cruises on an M3 Ultra with mlx-lm. Generated a space invaders game using 5098 tokens at 47.2 tok/sec:

Awni Hannun

93,826 views • 5 months ago

i built an ipod inspired yt music desktop client using MiniMax (official) and opencode in 1 night 100% rust. check it out

i built an ipod inspired yt music desktop client using MiniMax (official) and opencode in 1 night 100% rust. check it out

shydev

15,971 views • 4 months ago

I wasn't expecting such an intelligence model on my local machine. Minimax 2.1 on MLX🔥 git clone cd mlx-lm && pip install -e . mlx_lm.server --model mlx-community/MiniMax-M2.1-4bit

I wasn't expecting such an intelligence model on my local machine. Minimax 2.1 on MLX🔥 git clone cd mlx-lm && pip install -e . mlx_lm.server --model mlx-community/MiniMax-M2.1-4bit

mzba

30,377 views • 5 months ago

We ran the same prompt and identical starting context on MiniMax-M2.1 and Claude Sonnet 4.5. MiniMax-M2.1 by MiniMax (official) reached a usable result faster, required fewer structural fixes, and produced a more consistent visual and interaction flow from the first pass. To test this properly, we asked both models to build a complex single-page web animation with real-world visual and physics constraints. Comparison video below 👇

We ran the same prompt and identical starting context on MiniMax-M2.1 and Claude Sonnet 4.5. MiniMax-M2.1 by MiniMax (official) reached a usable result faster, required fewer structural fixes, and produced a more consistent visual and interaction flow from the first pass. To test this properly, we asked both models to build a complex single-page web animation with real-world visual and physics constraints. Comparison video below 👇

GitHub Projects Community

29,252 views • 5 months ago

MLX + OpenCode + Qwen3.5-122B-A10B-4bit on M3 Ultra created a great snake game! Work zero-shot. Video clearly in super fast mode during generation. I generated the prompt using Grok 4.20, it's in the article.

MLX + OpenCode + Qwen3.5-122B-A10B-4bit on M3 Ultra created a great snake game! Work zero-shot. Video clearly in super fast mode during generation. I generated the prompt using Grok 4.20, it's in the article.

Ivan Fioravanti ᯅ

74,659 views • 3 months ago

The new MiniMax M2.1 model is now available in the Blackbox CLI. 3 games built with a similar prompt, here was the result. Get started here

The new MiniMax M2.1 model is now available in the Blackbox CLI. 3 games built with a similar prompt, here was the result. Get started here

BLACKBOX AI

15,642 views • 5 months ago

Minimax M3 is excellent at SVG generation, reaching close to Gemini 3.5 Flash levels and beating Opus 4.7 on SVG-Bench. With 1M context, native multimodality, strong agentic/coding ability and open weights coming soon, the closed-source moat is thinning fast. Full Video:

Minimax M3 is excellent at SVG generation, reaching close to Gemini 3.5 Flash levels and beating Opus 4.7 on SVG-Bench. With 1M context, native multimodality, strong agentic/coding ability and open weights coming soon, the closed-source moat is thinning fast. Full Video:

WorldofAI

16,499 views • 10 days ago

We built Openwork! This time with OpenCode + Composio This agent can now: - use opencode zen models like Big Pickle, GLM 4.7, MiniMax - access local files and your terminal via opencode - chain actions across local files and cloud apps in one flow completely free and open source:

We built Openwork! This time with OpenCode + Composio This agent can now: - use opencode zen models like Big Pickle, GLM 4.7, MiniMax - access local files and your terminal via opencode - chain actions across local files and cloud apps in one flow completely free and open source:

Karan Vaidya

62,689 views • 4 months ago

So I created a simple Expo app for remotely connecting to an OpenCode server (running on my mac) so I can remotely control & prompt from my iPad or phone. Not very polished but it works pretty well and its kinda interesting that you can see the feedback loop live.

So I created a simple Expo app for remotely connecting to an OpenCode server (running on my mac) so I can remotely control & prompt from my iPad or phone. Not very polished but it works pretty well and its kinda interesting that you can see the feedback loop live.

ryan vogel

50,706 views • 5 months ago

MiniMax (official) M2.1 cooked so hard like this is so good looking website with no external assets 🤯 i am sharing prompts as it was very demanded , for you guys made it in a template so just change first line of the prompt and make beautiful site and tag me ❤️

MiniMax (official) M2.1 cooked so hard like this is so good looking website with no external assets 🤯 i am sharing prompts as it was very demanded , for you guys made it in a template so just change first line of the prompt and make beautiful site and tag me ❤️

Chetaslua

18,115 views • 5 months ago

MiniMax is now official supported provider for OpenClaude! cc MiniMax (official)

MiniMax is now official supported provider for OpenClaude! cc MiniMax (official)

GitLawb

42,422 views • 1 month ago

Opus 4.6 vs. Minimax M2.5 Prompt: Build an interactive solar system from scratch. Opus 4.6 tried to create a beautiful UI, but the sun’s shadow ruined it. Minimax M2.5 built a simple version that works beautifully. The winner is: 🥇 Minimax M2.5 🥈 Opus 4.6

Opus 4.6 vs. Minimax M2.5 Prompt: Build an interactive solar system from scratch. Opus 4.6 tried to create a beautiful UI, but the sun’s shadow ruined it. Minimax M2.5 built a simple version that works beautifully. The winner is: 🥇 Minimax M2.5 🥈 Opus 4.6

Okara

80,546 views • 3 months ago

Qwen 3.5 397B prompt processing on M3 Ultra (with MLX distributed + JACCL) - 3.4× speedup on 4 chips - scaling improves as context increases Really fun to use with opencode; generated a playable Asteroids clone in ~4 minutes (real time, including me playing it a bit).

Qwen 3.5 397B prompt processing on M3 Ultra (with MLX distributed + JACCL) - 3.4× speedup on 4 chips - scaling improves as context increases Really fun to use with opencode; generated a playable Asteroids clone in ~4 minutes (real time, including me playing it a bit).

Angelos Katharopoulos

23,348 views • 3 months ago