正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

Qoder just launched Qwen-Coder-Qoder a custom model fine-tuned specifically for its agent. This is what the "Model-Agent-Product" loop looks like in practice. I gave it a real coding task and walked away. Here's what happened: 👇

Hasan Toor

431,278 subscribers

83,848 次观看 • 5 个月前 •via X (Twitter)

Anya Rossi• Live Now

Private livecam show

0 条评论

暂无评论

原始帖子的评论将显示在这里

相关视频

Exciting news from Qoder! They've just launched a new custom model for their IDE, built on Alibaba's Qwen-Coder series. This marks a huge step forward in their Model-Agent-Product evolution, promising a smarter, more intuitive coding experience.

Exciting news from Qoder! They've just launched a new custom model for their IDE, built on Alibaba's Qwen-Coder series. This marks a huge step forward in their Model-Agent-Product evolution, promising a smarter, more intuitive coding experience.

FELIX

83,442 次观看 • 4 个月前

"The model is no longer the product. Codex, Perplexity Computer, or Claude Code - all are orchestration system. It takes a model and pairs it with an agent harness. What is an agent harness ? The rules for how the agent loops around" - Aravind Srinivas

"The model is no longer the product. Codex, Perplexity Computer, or Claude Code - all are orchestration system. It takes a model and pairs it with an agent harness. What is an agent harness ? The rules for how the agent loops around" - Aravind Srinivas

Rohan Paul

119,783 次观看 • 1 个月前

After ClawdBot, the next big thing could be Qoder Quest. Most of the coding tools today need multiple iterations of testing, debugging, and prompts. An autonomous coding tool would be game changer. Qoder Quest is exactly this. It’s designed to take ownership of your projects. For sure, it is not a replacement for your mental model. Instead of line-by-line hand-holding, the workflow looks like this: – Describe a goal: tell it what you want to build – Approve the Spec: approve the technical spec first to align Quest with your mental model – Walk away: let the Quest run autonomously Attached is a video of me playing around with Quest for the first time. Get started with it: New users could have a 2-week pro trial with some credits to execute 5 Quest tasks. Appreciate the Qoder team for collaborating with me on this deep dive.

After ClawdBot, the next big thing could be Qoder Quest. Most of the coding tools today need multiple iterations of testing, debugging, and prompts. An autonomous coding tool would be game changer. Qoder Quest is exactly this. It’s designed to take ownership of your projects. For sure, it is not a replacement for your mental model. Instead of line-by-line hand-holding, the workflow looks like this: – Describe a goal: tell it what you want to build – Approve the Spec: approve the technical spec first to align Quest with your mental model – Walk away: let the Quest run autonomously Attached is a video of me playing around with Quest for the first time. Get started with it: New users could have a 2-week pro trial with some credits to execute 5 Quest tasks. Appreciate the Qoder team for collaborating with me on this deep dive.

Pratham

104,021 次观看 • 5 个月前

🚨 This Chinese model is INSANE !! I just tested Tencent's Hunyuan AI model to build a professional landing page & the output is mind blowing 🤯 Here's my exact prompt: "Create a modern landing page for a crypto related product with beautiful hero section, 3d elements, pricing table, and a whitelist form" What it generated: → Clean, responsive front-end code → Complex UI components with smooth interactions → Built-in debugging (it fixed its own errors) → Production-ready in minutes This 295B parameter model has 2 insane agents: → Coding Agent (handles the entire front-end) → Working Agent (task planning & multi-tool workflows) It also has a context window of 256k tokens !! Watch the video to see the full build 🔥 Try it: Tencent Hy

🚨 This Chinese model is INSANE !! I just tested Tencent's Hunyuan AI model to build a professional landing page & the output is mind blowing 🤯 Here's my exact prompt: "Create a modern landing page for a crypto related product with beautiful hero section, 3d elements, pricing table, and a whitelist form" What it generated: → Clean, responsive front-end code → Complex UI components with smooth interactions → Built-in debugging (it fixed its own errors) → Production-ready in minutes This 295B parameter model has 2 insane agents: → Coding Agent (handles the entire front-end) → Working Agent (task planning & multi-tool workflows) It also has a context window of 256k tokens !! Watch the video to see the full build 🔥 Try it: Tencent Hy

AGAFE

110,528 次观看 • 11 天前

Vibe coding is cool. But have you tried vibe-tuning? Here’s Claude Code using the Lightning Rod Labs SDK to turn messy Fed PDFs into a small, specialized economic forecaster. The agent: - parses raw historical PDFs - builds a training set - fine-tunes a smaller model - evaluates it against GPT-5 Result: a fine-tuned model that beats GPT-5 and runs anywhere, on a single GPU. Try it in your coding agent of choice.

Vibe coding is cool. But have you tried vibe-tuning? Here’s Claude Code using the Lightning Rod Labs SDK to turn messy Fed PDFs into a small, specialized economic forecaster. The agent: - parses raw historical PDFs - builds a training set - fine-tunes a smaller model - evaluates it against GPT-5 Result: a fine-tuned model that beats GPT-5 and runs anywhere, on a single GPU. Try it in your coding agent of choice.

Ben Turtel

423,542 次观看 • 2 个月前

Google just showed the part of the agent stack most people are still underestimating. Not the model. Not the chat UI. The memory substrate around the agent. In this Antigravity segment, Google describes an agent-first environment built around: - agent conversations - agent-produced artifacts - multi-agent orchestration - subagents - hooks - asynchronous task management That is not a nicer IDE. It is what happens when software stops being a sequence of chats and starts becoming a persistent workforce. Karl's article explains why the winning agents will not just be the ones with the smartest rented model. They will be the ones that remember what they did, what they learned, what changed, and what can be trusted. Full breakdown below.

Google just showed the part of the agent stack most people are still underestimating. Not the model. Not the chat UI. The memory substrate around the agent. In this Antigravity segment, Google describes an agent-first environment built around: - agent conversations - agent-produced artifacts - multi-agent orchestration - subagents - hooks - asynchronous task management That is not a nicer IDE. It is what happens when software stops being a sequence of chats and starts becoming a persistent workforce. Karl's article explains why the winning agents will not just be the ones with the smartest rented model. They will be the ones that remember what they did, what they learned, what changed, and what can be trusted. Full breakdown below.

Karl Mehta

13,930 次观看 • 29 天前

Alibaba engineer who leads Qwen explained the future of open agent models in 25 minutes - better than $2000 LLM training courses. pre-train the base ->SFT -> RLHF -> tool use -> multi-modal -> ship a whole family (chat / VL / coder / math / QwQ). That loop is why Qwen quietly became the most downloaded open model family on Hugging Face. Qwen base + Qwen-VL + Qwen-Coder + QwQ reasoning - that's the stack. Watch and save it, then read the article below.

Alibaba engineer who leads Qwen explained the future of open agent models in 25 minutes - better than $2000 LLM training courses. pre-train the base ->SFT -> RLHF -> tool use -> multi-modal -> ship a whole family (chat / VL / coder / math / QwQ). That loop is why Qwen quietly became the most downloaded open model family on Hugging Face. Qwen base + Qwen-VL + Qwen-Coder + QwQ reasoning - that's the stack. Watch and save it, then read the article below.

h100envy

113,598 次观看 • 17 天前

AN OPENCLAW AI AGENT JUST DESIGNED A 3D CHARACTER IN BLENDER AND SENT IT TO A 3D PRINTER. SHE DIDN'T TOUCH A SINGLE BUTTON. This is what running an OpenClaw AI agent on a Mac Mini actually looks like. She described what she wanted. The agent did everything else. Blender opened automatically. Character designed. Put in space. Animated with blinking eyes and stomping feet. Landing page built and live. Model sliced. File sent to 3D printer. Printer started moving. All from conversation. Zero code. Zero design skills. Zero manual work. Here's why this matters beyond the demo: Custom 3D printed figures sell for $30-$200 each on Etsy and at conventions. The people making serious money aren't artists. They're the ones who can produce custom designs fast. An AI agent that goes from description to printed figure autonomously is a business model not a toy. "I was just talking to the guy and he made it happen." Bookmark this. Full demo in the video below.

AN OPENCLAW AI AGENT JUST DESIGNED A 3D CHARACTER IN BLENDER AND SENT IT TO A 3D PRINTER. SHE DIDN'T TOUCH A SINGLE BUTTON. This is what running an OpenClaw AI agent on a Mac Mini actually looks like. She described what she wanted. The agent did everything else. Blender opened automatically. Character designed. Put in space. Animated with blinking eyes and stomping feet. Landing page built and live. Model sliced. File sent to 3D printer. Printer started moving. All from conversation. Zero code. Zero design skills. Zero manual work. Here's why this matters beyond the demo: Custom 3D printed figures sell for $30-$200 each on Etsy and at conventions. The people making serious money aren't artists. They're the ones who can produce custom designs fast. An AI agent that goes from description to printed figure autonomously is a business model not a toy. "I was just talking to the guy and he made it happen." Bookmark this. Full demo in the video below.

SCOTTY BEAM

17,209 次观看 • 1 个月前

Alibaba just dropped an AI that doesn’t just write code. It runs it. Tests it. Fixes it. Until it actually works. Here’s what makes Qwen 3 Coder Next different: → Executes code, not just autocomplete → Runs tests automatically → Iterates until all tests pass → Handles multi-file repositories → Works fully offline This is a real coding agent, not a text generator. Save this video, you’ll rethink how coding gets done. Want the SOP? DM me. 💬

Alibaba just dropped an AI that doesn’t just write code. It runs it. Tests it. Fixes it. Until it actually works. Here’s what makes Qwen 3 Coder Next different: → Executes code, not just autocomplete → Runs tests automatically → Iterates until all tests pass → Handles multi-file repositories → Works fully offline This is a real coding agent, not a text generator. Save this video, you’ll rethink how coding gets done. Want the SOP? DM me. 💬

Julian Goldie SEO

20,956 次观看 • 5 个月前

Today we're open sourcing a technical preview of the GitHub Copilot CLI SDK. Build agents with custom tools in Go, Python, TypeScript, and C#. Built on the same agent loop that powers the Copilot CLI and GitHub Coding Agent. Supports BYOK, and any model. Here is the Copilot CLI driving Excel:

Today we're open sourcing a technical preview of the GitHub Copilot CLI SDK. Build agents with custom tools in Go, Python, TypeScript, and C#. Built on the same agent loop that powers the Copilot CLI and GitHub Coding Agent. Supports BYOK, and any model. Here is the Copilot CLI driving Excel:

Evan Boyle

184,475 次观看 • 6 个月前

What if your AI agent had to compete for its paycheck? Not get assigned work. Not sit in a queue. Actually compete against other agents. Best output wins. That's what Agent Hansa built. And it's the smartest model I've seen in the agent space.

What if your AI agent had to compete for its paycheck? Not get assigned work. Not sit in a queue. Actually compete against other agents. Best output wins. That's what Agent Hansa built. And it's the smartest model I've seen in the agent space.

Himanshu Kumar

33,621 次观看 • 3 个月前

I taught a speech model to understand context in conversation. This is what happened It adjusts voice and tone to express urgency, comfort, understanding from the dialogue. Just like a real human being 520M model. Runs locally on consumer devices How this is achieved 🧵

I taught a speech model to understand context in conversation. This is what happened It adjusts voice and tone to express urgency, comfort, understanding from the dialogue. Just like a real human being 520M model. Runs locally on consumer devices How this is achieved 🧵

Luozhu

21,692 次观看 • 4 个月前

Dynamic Software will be a huge product You can see what it looks like today w/ this demo I made, it's PI level extensibility but able to be applied to any product and secure Give this to your coding agent and tell it to extend it with whatever you want

Dynamic Software will be a huge product You can see what it looks like today w/ this demo I made, it's PI level extensibility but able to be applied to any product and secure Give this to your coding agent and tell it to extend it with whatever you want

Rhys

80,427 次观看 • 2 个月前

Honored to join @HF0 for S26 We're building Franklin : an AI agent with its own wallet — it auto-picks the best model for every task and pays per action in USDC. An AI agent that can pay and get paid. 👇

Honored to join @HF0 for S26 We're building Franklin : an AI agent with its own wallet — it auto-picks the best model for every task and pays per action in USDC. An AI agent that can pay and get paid. 👇

BlockRunAI

18,428 次观看 • 1 个月前

🔧 Bring Your Own Model — and Let Agent Forge Handle the Rest! John plugged in his own custom model — and in minutes, it was up and running as a fully deployed AI Agent. With Agent Forge, you’re not limited to prebuilt tools. You bring the intelligence. We automate everything else. Real models. Real-time deployment. Zero-code friction. 🔗 Try it here:

🔧 Bring Your Own Model — and Let Agent Forge Handle the Rest! John plugged in his own custom model — and in minutes, it was up and running as a fully deployed AI Agent. With Agent Forge, you’re not limited to prebuilt tools. You bring the intelligence. We automate everything else. Real models. Real-time deployment. Zero-code friction. 🔗 Try it here:

AITECH CLOUD NETWORK

32,798 次观看 • 11 个月前

CHINA JUST GAVE AWAY A TOP TIER CODING MODEL FOR FREE… The AI coding race just got more competitive and San Francisco is not the only AI hub anymore..

CHINA JUST GAVE AWAY A TOP TIER CODING MODEL FOR FREE… The AI coding race just got more competitive and San Francisco is not the only AI hub anymore..

Kanika

12,858 次观看 • 1 个月前

AG-UI makes building agentic applications dramatically easier. Here's how it works. This is a model for a simple chatbot: User → LLM → Response But interactive agents that render UI, pause for approvals, and ask users for input need a much more complex model. When building these agents, a response from the LLM will include a series of state changes as the agent runs: • Agent started a task • Agent called a tool • Agent updated its state • Agent streams these tokens • Agent is waiting on a human • Agent is resuming the task The Agent-User Interaction Protocol (AG-UI) treats the LLM response as a stream of events rather than a text endpoint. In practice, here is what you get as an agent runs: 1. Lifecycle events so your UI knows where the agent is. 2. Text messages that stream tokens. 3. Tool calls so your UI can prefill a form with any required arguments. 4. State updates that keep your UI in sync with the agent. 5. Special events for human approvals, rich media, and custom needs. All of these events travel over standard transports (SSE, WebSockets, or plain HTTP) as JSON. As a result, you can build a frontend that stays in sync with the agent's progress without having to invent a custom process to make this happen. For example, building a human-in-the-loop workflow becomes an off-the-shelf component you can integrate rather than build from scratch. CopilotKit🪁 is the creator of AG-UI, and you can use it when building frontend applications pretty much anywhere: • React • Angular • Vue • React Native • Slack • Teams • Discord • WhatsApp • Telegram Here is the link for you to check it out: Thanks to the CopilotKit team for partnering with me on this post.

AG-UI makes building agentic applications dramatically easier. Here's how it works. This is a model for a simple chatbot: User → LLM → Response But interactive agents that render UI, pause for approvals, and ask users for input need a much more complex model. When building these agents, a response from the LLM will include a series of state changes as the agent runs: • Agent started a task • Agent called a tool • Agent updated its state • Agent streams these tokens • Agent is waiting on a human • Agent is resuming the task The Agent-User Interaction Protocol (AG-UI) treats the LLM response as a stream of events rather than a text endpoint. In practice, here is what you get as an agent runs: 1. Lifecycle events so your UI knows where the agent is. 2. Text messages that stream tokens. 3. Tool calls so your UI can prefill a form with any required arguments. 4. State updates that keep your UI in sync with the agent. 5. Special events for human approvals, rich media, and custom needs. All of these events travel over standard transports (SSE, WebSockets, or plain HTTP) as JSON. As a result, you can build a frontend that stays in sync with the agent's progress without having to invent a custom process to make this happen. For example, building a human-in-the-loop workflow becomes an off-the-shelf component you can integrate rather than build from scratch. CopilotKit🪁 is the creator of AG-UI, and you can use it when building frontend applications pretty much anywhere: • React • Angular • Vue • React Native • Slack • Teams • Discord • WhatsApp • Telegram Here is the link for you to check it out: Thanks to the CopilotKit team for partnering with me on this post.

Santiago

17,438 次观看 • 21 天前

I have been testing DeepSeek-V4-Pro with the Pi coding agent. I am mindblown by how well it works out of the box. A few notes: I spent a few hours building an LLM wiki with an agent powered entirely by DeepSeek-V4-Pro on Fireworks AI inference. This is the first time I feel like there is an open-weight model that can reason at the level of Claude and Codex. And it does this in a cost-effective way with support for 1M context length. To be clear, I am using DeepSeek-V4-Pro inside of Pi without any special configuration. It works out of the box. It's exciting that there is a model that can just be plugged into a basic harness like Pi, and it just works. I've never seen that before. Most models require lots of configuration and setup. DeepSeek's DeepSeek-V4-Pro is clearly good at agentic coding (probably the best from the open-weight models), but the model is also great on knowledge-intensive tasks where reasoning matters. The agent pulled agentic engineering best practices from different company docs (Anthropic, OpenAI, Google, Stripe, Meta, Modal, DeepSeek, Mistral, Cohere), searched and digested Reddit and HN threads, summarized arxiv papers, and surfaced trending GitHub repos. Then it distilled everything into actionable tips across categories. I love the Wiki it built. The quality is really good. Here is a snapshot of what the wiki looks like: DeepSeek-V4-Pro handled the task without breaking stride. Multi-step research queries, code generation for scaffolding, context-heavy reasoning across disparate sources. For coding specifically, this is the first open-weight model that genuinely feels like a Codex or Claude Code experience. It compares in capability and actual multi-turn agentic work. What made the loop feel so responsive was Fireworks' inference speed (the fastest in the market) and the fact that they actually validate models at the systems level before shipping. No corrupted reasoning traces. Just fast, reliable iteration. The hybrid CSA and HCA attention design cuts KV cache to just 10% and inference FLOPs by nearly 4x at 1M-token context. This is what makes the agent loop actually fast and cheap enough to run in practice. For devs who've been watching open-weight models close the gap but haven't found one that actually delivers in practice, this is the closest I've seen. Try it here:

I have been testing DeepSeek-V4-Pro with the Pi coding agent. I am mindblown by how well it works out of the box. A few notes: I spent a few hours building an LLM wiki with an agent powered entirely by DeepSeek-V4-Pro on Fireworks AI inference. This is the first time I feel like there is an open-weight model that can reason at the level of Claude and Codex. And it does this in a cost-effective way with support for 1M context length. To be clear, I am using DeepSeek-V4-Pro inside of Pi without any special configuration. It works out of the box. It's exciting that there is a model that can just be plugged into a basic harness like Pi, and it just works. I've never seen that before. Most models require lots of configuration and setup. DeepSeek's DeepSeek-V4-Pro is clearly good at agentic coding (probably the best from the open-weight models), but the model is also great on knowledge-intensive tasks where reasoning matters. The agent pulled agentic engineering best practices from different company docs (Anthropic, OpenAI, Google, Stripe, Meta, Modal, DeepSeek, Mistral, Cohere), searched and digested Reddit and HN threads, summarized arxiv papers, and surfaced trending GitHub repos. Then it distilled everything into actionable tips across categories. I love the Wiki it built. The quality is really good. Here is a snapshot of what the wiki looks like: DeepSeek-V4-Pro handled the task without breaking stride. Multi-step research queries, code generation for scaffolding, context-heavy reasoning across disparate sources. For coding specifically, this is the first open-weight model that genuinely feels like a Codex or Claude Code experience. It compares in capability and actual multi-turn agentic work. What made the loop feel so responsive was Fireworks' inference speed (the fastest in the market) and the fact that they actually validate models at the systems level before shipping. No corrupted reasoning traces. Just fast, reliable iteration. The hybrid CSA and HCA attention design cuts KV cache to just 10% and inference FLOPs by nearly 4x at 1M-token context. This is what makes the agent loop actually fast and cheap enough to run in practice. For devs who've been watching open-weight models close the gap but haven't found one that actually delivers in practice, this is the closest I've seen. Try it here:

elvis

59,803 次观看 • 2 个月前

$I just realized I've been scheduling my biggest AI jobs at the worst possible time, and it was costing me up to 80% more than it needed to. Qoder now runs Qwen3.7-Max and Qwen3.7-Plus with off-peak rates, which means the same models cost significantly less during the off-peak window. here's what changed: • Qwen3.7-Max: 0.5x → 0.1x (up to 80% off) • Qwen3.7-Plus: 0.1x → 0.04x (60% off) the model doesn't change. the reasoning doesn't change. the context length doesn't change. only the credit multiplier does. and the best part? the off-peak rates are applied automatically. no configuration needed. so instead of launching long refactors, migrations, agent runs, or large code generation whenever I think about them... I simply queue them during my off-peak window ( to local time) and let the same flagship Qwen model do the work for a fraction of the usual cost. that's probably the easiest productivity upgrade I've made this month. switch to Qwen3.7-Max or Qwen3.7-Plus in Qoder during your local off-peak hours and compare the credits before and after. search "Google Qoder" or download it here:$

I just realized I've been scheduling my biggest AI jobs at the worst possible time, and it was costing me up to 80% more than it needed to. Qoder now runs Qwen3.7-Max and Qwen3.7-Plus with off-peak rates, which means the same models cost significantly less during the off-peak window. here's what changed: • Qwen3.7-Max: 0.5x → 0.1x (up to 80% off) • Qwen3.7-Plus: 0.1x → 0.04x (60% off) the model doesn't change. the reasoning doesn't change. the context length doesn't change. only the credit multiplier does. and the best part? the off-peak rates are applied automatically. no configuration needed. so instead of launching long refactors, migrations, agent runs, or large code generation whenever I think about them... I simply queue them during my off-peak window ( to local time) and let the same flagship Qwen model do the work for a fraction of the usual cost. that's probably the easiest productivity upgrade I've made this month. switch to Qwen3.7-Max or Qwen3.7-Plus in Qoder during your local off-peak hours and compare the credits before and after. search "Google Qoder" or download it here:

marcus

24,620 次观看 • 18 天前

A 2-person startup crossed $2M ARR with an AI agent doing the work of an ops hire. The agent was given read-only access to their codebase and database along with connected tools like Intercom, Stripe, CRM, and Fathom through CLIs. They routed Slack, email, and support requests into a task queue so the agent could pick up each task and run it in Claude Code. So when a customer asked about billing or product behavior, it could inspect how the business actually worked. Along with these tools, a coding agent was also provided. When the ops agent found a repeated task it could not do yet, the coding agent built a tool for it. That tool became permanent. Over time, this grew to 45+ internal tools. The agent also had an instruction.md where it stored the co-founder's feedback to avoid repeating its mistakes.

A 2-person startup crossed $2M ARR with an AI agent doing the work of an ops hire. The agent was given read-only access to their codebase and database along with connected tools like Intercom, Stripe, CRM, and Fathom through CLIs. They routed Slack, email, and support requests into a task queue so the agent could pick up each task and run it in Claude Code. So when a customer asked about billing or product behavior, it could inspect how the business actually worked. Along with these tools, a coding agent was also provided. When the ops agent found a repeated task it could not do yet, the coding agent built a tool for it. That tool became permanent. Over time, this grew to 45+ internal tools. The agent also had an instruction.md where it stored the co-founder's feedback to avoid repeating its mistakes.

rvivek

11,870 次观看 • 1 个月前