Загрузка видео...

Не удалось загрузить видео

Возникла проблема при загрузке этого видео. Это может быть связано с временными проблемами сети или видео может быть недоступно.

На главную

Kimi K2 is so good at tool calling and agentic loops, can call multiple tools in parallel and reliably, and knows "when to stop", which is another important property. It's the first model I feel comfortable using in production since Claude 3.5 Sonnet.

Pietro Schirano

102,467 subscribers

174,490 просмотров • 11 месяцев назад •via X (Twitter)

Наука и технологии Образование Искусство

Anya Rossi• Live Now

Private livecam show

Комментарии: 10

Фото профиля Karim Chaanine

Karim Chaanine11 месяцев назад

been testing kimi k2 for mario (my sales agent) and honestly? the tool calling is chef's kissswitched from claude for one specific workflow where we needed 3 parallel api calls conditional logic. kimi just... handles it. no weird loops, knows when to stopstill claude for most things but kimi's nailing the complex agentic stuff

Фото профиля javi

javi11 месяцев назад

you don’t feel comfortable using Opus 4 in prod? 🤯 hype!!!

Фото профиля Pietro Schirano

Pietro Schirano11 месяцев назад

More so the first non-Anthropic model

Фото профиля Jesse Merrigan

Jesse Merrigan11 месяцев назад

Agreed - given it a proper go today and am astounded by the quality. General vibe checks have it providing better answers than o3 pro and Grok 4 on a bunch of tasks

Фото профиля Sturgis Steele

Sturgis Steele11 месяцев назад

When you say the last model you were comfortable using in production was Claude 3.5 sonnet, does that mean you did not use Claude 4 for production?

Фото профиля Pietro Schirano

Pietro Schirano11 месяцев назад

I didn’t word it properly, I meant it's the first non-Anthropic model I feel I can use for agentic loops.

Фото профиля Oscar Le

Oscar Le11 месяцев назад

The "call multiple tools in parallel" that is done on the server side of Kimi, not your own implementation, right? This means that when 3rd parties host Kimi K2, we will not have that tool calling capability?

Фото профиля Pietro Schirano

Pietro Schirano11 месяцев назад

You just build that

Фото профиля 🐧 lalo adrian morales 𝕏

🐧 lalo adrian morales 𝕏11 месяцев назад

what are you using to run it? is it local?

Фото профиля Pietro Schirano

Pietro Schirano11 месяцев назад

No @OpenRouterAI. You can't really run this locally unless you have insane specs.

Похожие видео

Wild. Kimi K2 Thinking just released and it's insane. It's an AI model that can run by itself for hours on end and make HUNDREDS of tool calls It's the 1st model I think that can replace humans In this video I show why it's so special and how to use it to build your first app

Wild. Kimi K2 Thinking just released and it's insane. It's an AI model that can run by itself for hours on end and make HUNDREDS of tool calls It's the 1st model I think that can replace humans In this video I show why it's so special and how to use it to build your first app

Alex Finn

70,004 просмотров • 7 месяцев назад

PSA: Kimi.ai just launched a huge update to Kimi-K2-0905 and it's now live on Groq Inc for instant inference. Highlights? 256K context window + improvements to coding/tool calling capabilities that outperform Claude Sonnet 4. 🚀

PSA: Kimi.ai just launched a huge update to Kimi-K2-0905 and it's now live on Groq Inc for instant inference. Highlights? 256K context window + improvements to coding/tool calling capabilities that outperform Claude Sonnet 4. 🚀

Hatice Ozen

21,842 просмотров • 9 месяцев назад

Building an Agentic Search System Building an agentic system is not too hard. Loops, function calling, tool execution, and the model. That's it! I show in this video how to build a search agent from scratch. ~350 lines of code!

Building an Agentic Search System Building an agentic system is not too hard. Loops, function calling, tool execution, and the model. That's it! I show in this video how to build a search agent from scratch. ~350 lines of code!

elvis

57,290 просмотров • 1 год назад

Introducing Codespace A virtual computer that runs custom Claude Code on server. It is using 2 models: Kimi K2 + Claude Sonnet 4 in a sync. I achieved same code quality with 2x better reasoning and 53% less model cost. Access it today CodeGuide (500 credits for everyone)

Introducing Codespace A virtual computer that runs custom Claude Code on server. It is using 2 models: Kimi K2 + Claude Sonnet 4 in a sync. I achieved same code quality with 2x better reasoning and 53% less model cost. Access it today CodeGuide (500 credits for everyone)

CJ Zafir

31,070 просмотров • 10 месяцев назад

The MiniMax M2 model is mind-blowing! It's open-source. It outperforms Gemini 2.5, Claude 4.1, and Qwen3 across coding and tool-use benchmarks. Right now, it's one of the world's top 5 models in intelligence! And here is the best part: Claude is one of the best models you can use today, and MiniMax M2 costs only 8% of that! It's smaller, faster, and cheaper. Extremely efficient at using tokens. Minimax M2's biggest strength: High agentic capabilities. The model can plan and execute complex multi-tool workflows. It's reliable and very robust at executing long-horizon tool chains. In summary: • Low latency • Very cheap • Excels at agentic tasks • Open-source The model currently powers the MiniMax Agent and is available for a free global trial. You can access MiniMax M2's API here: To access the agent: And here is the MiniMax website: Thanks to the MiniMax team for showing me the ropes and partnering with me on this post.

The MiniMax M2 model is mind-blowing! It's open-source. It outperforms Gemini 2.5, Claude 4.1, and Qwen3 across coding and tool-use benchmarks. Right now, it's one of the world's top 5 models in intelligence! And here is the best part: Claude is one of the best models you can use today, and MiniMax M2 costs only 8% of that! It's smaller, faster, and cheaper. Extremely efficient at using tokens. Minimax M2's biggest strength: High agentic capabilities. The model can plan and execute complex multi-tool workflows. It's reliable and very robust at executing long-horizon tool chains. In summary: • Low latency • Very cheap • Excels at agentic tasks • Open-source The model currently powers the MiniMax Agent and is available for a free global trial. You can access MiniMax M2's API here: To access the agent: And here is the MiniMax website: Thanks to the MiniMax team for showing me the ropes and partnering with me on this post.

Santiago

91,142 просмотров • 7 месяцев назад

Kimi K2 Thinking is here! Scale up reasoning with more thinking tokens and tool-call steps. Now live on the Kimi app, and API.

Kimi K2 Thinking is here! Scale up reasoning with more thinking tokens and tool-call steps. Now live on the Kimi app, and API.

Kimi.ai

138,394 просмотров • 7 месяцев назад

Kimi K2.6 is godly in terms of webdev and still SOTA chinese model 😼 > best in deep SWE bench even after launches of multiple models from different chinese labs Look at the watery feel one shotted it < using custom harness - 4 kimi in parallel just like grok >

Kimi K2.6 is godly in terms of webdev and still SOTA chinese model 😼 > best in deep SWE bench even after launches of multiple models from different chinese labs Look at the watery feel one shotted it < using custom harness - 4 kimi in parallel just like grok >

Chetaslua

82,515 просмотров • 12 дней назад

Kimi K2: I can finally unveil that I was testing it in the last days using: Claude Code and Open WebUI 🚀 Video standard speed beginning and end, ultra speed up in the middle.

Kimi K2: I can finally unveil that I was testing it in the last days using: Claude Code and Open WebUI 🚀 Video standard speed beginning and end, ultra speed up in the middle.

Ivan Fioravanti ᯅ

41,317 просмотров • 11 месяцев назад

Anthropic’s Claude 3.7 Sonnet is now available to all customers on GitHub Copilot Pro, Business, and Enterprise. You can start using the new model in standard and thinking mode via the model selector in VS Code and on GitHub. VS and JetBrains IDE coming soon.

Anthropic’s Claude 3.7 Sonnet is now available to all customers on GitHub Copilot Pro, Business, and Enterprise. You can start using the new model in standard and thinking mode via the model selector in VS Code and on GitHub. VS and JetBrains IDE coming soon.

Thomas Dohmke

187,708 просмотров • 1 год назад

Mr Scamster Dhruv Rathee Is using GPT 4 and saying its GPT 5 Using Claude 3.5 and saying its Sonnet 4 watch this video

Mr Scamster Dhruv Rathee Is using GPT 4 and saying its GPT 5 Using Claude 3.5 and saying its Sonnet 4 watch this video

Humi

97,835 просмотров • 10 месяцев назад

Wow. The fastest, most powerful agentic AI model just dropped And oh yeah: it's completely free Kimi K2 has taken the AI world by storm. Here's how you can use it to build apps, even if you've never coded a day in your life before: 00:00 Intro 00:29 Using Kimi K2 00:50 Benchmarks 02:08 OpenAI Panic 03:09 Installing Kimi K2 05:19 Building the app 06:50 Final results

Wow. The fastest, most powerful agentic AI model just dropped And oh yeah: it's completely free Kimi K2 has taken the AI world by storm. Here's how you can use it to build apps, even if you've never coded a day in your life before: 00:00 Intro 00:29 Using Kimi K2 00:50 Benchmarks 02:08 OpenAI Panic 03:09 Installing Kimi K2 05:19 Building the app 06:50 Final results

Alex Finn

32,132 просмотров • 11 месяцев назад

We are excited to be official launch partner of Kimi.ai K2 on Thinking! - 7-day 50% off: For the next 7 days, using K2 in Verdent is 50% off. - 2x credits on Verdent subs: With our limited-time double credits bonus, K2 costs ONLY 25% of the official price. - Native tool calling: Our integration leverages Kimi's native tool calling capabilities for optimal performance. - Performance & Speed: In our internal benchmarks, K2 Thinking matches frontier model in performance, but K2 is ~2x faster in tokens/s! Try Kimi K2 now on Verdent.

We are excited to be official launch partner of Kimi.ai K2 on Thinking! - 7-day 50% off: For the next 7 days, using K2 in Verdent is 50% off. - 2x credits on Verdent subs: With our limited-time double credits bonus, K2 costs ONLY 25% of the official price. - Native tool calling: Our integration leverages Kimi's native tool calling capabilities for optimal performance. - Performance & Speed: In our internal benchmarks, K2 Thinking matches frontier model in performance, but K2 is ~2x faster in tokens/s! Try Kimi K2 now on Verdent.

Verdent

923,875 просмотров • 7 месяцев назад

Devin is like Claude Code except it lives in the cloud and runs against all of your repos vs your local filesystem. So it never turns off, can be run from anywhere including your phone + Slack, and runs as many tasks as you can send it in parallel. It's complementary to all agentic IDEs and CLIs, and for the first time ever it's free to get started.

Devin is like Claude Code except it lives in the cloud and runs against all of your repos vs your local filesystem. So it never turns off, can be run from anywhere including your phone + Slack, and runs as many tasks as you can send it in parallel. It's complementary to all agentic IDEs and CLIs, and for the first time ever it's free to get started.

nader dabit

49,534 просмотров • 3 месяцев назад

here's something kind of weird and neat 🤯 since o3 can call tools, and you can call an API as a tool i got o3 to call itself as a tool now THAT is self-recursive AI. 🤖 🤝 🤖

here's something kind of weird and neat 🤯 since o3 can call tools, and you can call an API as a tool i got o3 to call itself as a tool now THAT is self-recursive AI. 🤖 🤝 🤖

Dan Mac

35,084 просмотров • 1 год назад

OK Qwen3 Coder from Alibaba Group is really good You can basically replace Claude Sonnet 4 with it to build anything using its CLI tool. This is also 7x cheaper and 100% open source. How to use it and examples below

OK Qwen3 Coder from Alibaba Group is really good You can basically replace Claude Sonnet 4 with it to build anything using its CLI tool. This is also 7x cheaper and 100% open source. How to use it and examples below

Paul Couvert

193,480 просмотров • 11 месяцев назад

The new Claude 3.5 Sonnet is the first frontier AI model to offer computer use in public beta. While groundbreaking, computer use is still experimental—at times error-prone. We're releasing it early for feedback from developers.

The new Claude 3.5 Sonnet is the first frontier AI model to offer computer use in public beta. While groundbreaking, computer use is still experimental—at times error-prone. We're releasing it early for feedback from developers.

Anthropic

384,679 просмотров • 1 год назад

Vibe coded this football game with threejs, @windsurf and Claude Sonnet 3.5 in 2hrs over the weekend Using for the realtime dynamic commentary

Vibe coded this football game with threejs, @windsurf and Claude Sonnet 3.5 in 2hrs over the weekend Using for the realtime dynamic commentary

Anurag Bhagsain

758,685 просмотров • 1 год назад

In-line doc editor by Bay Gross and Matthew Slotkin Claude 3.5 Sonnet powered tool that reads your doc and drops in comments and suggestions right where you need them.

In-line doc editor by Bay Gross and Matthew Slotkin Claude 3.5 Sonnet powered tool that reads your doc and drops in comments and suggestions right where you need them.

Alex Albert

21,579 просмотров • 1 год назад

I just updated Claude Engineer to support multiple file/folder creation and edits, simultaneously. It's the closest I've felt to having real-life superpowers. Watch 3.5 Sonnet create a fully functional web app, 6 files, and 5 folders in one shot. Available now in the repo 🔥

I just updated Claude Engineer to support multiple file/folder creation and edits, simultaneously. It's the closest I've felt to having real-life superpowers. Watch 3.5 Sonnet create a fully functional web app, 6 files, and 5 folders in one shot. Available now in the repo 🔥

Pietro Schirano

297,607 просмотров • 1 год назад

i built an ai agent that does marketing for me on autopilot! 🤯 it searches reddit for relevant posts, provides a valuable response to the users & promotes my product in a subtle and natural way. i'm using claude 3.5 sonnet, it's so good i can't actually believe it haha

i built an ai agent that does marketing for me on autopilot! 🤯 it searches reddit for relevant posts, provides a valuable response to the users & promotes my product in a subtle and natural way. i'm using claude 3.5 sonnet, it's so good i can't actually believe it haha

Fekri

779,699 просмотров • 2 лет назад