Making OpenCode as lean as Pi agent? Just trimmed 25k out of...

Uploaded: 2026-05-28T05:50:40.000Z
Duration: PT11.349S
Channel: raymel 👋

0:16

HTML Artifacts are a big part of how I work with agents now. Artifacts can be more than just static files. When combined with agents, they can take action or help you take action. This unlocks all kinds of interesting ways to work with agents. This is clearly the future. Check out this writing and scheduler artifact I built in a few minutes. It uses a bit of HTML and JS. All the data is in markdown (Obsidian vaults), so the agent can access and modify it at any time. No DB needed. No sophisticated functionalities. The agent decides all that for me based on the skills, context, and memory it has access to. The best part about this simple stack is that all the important information stays with me. This has allowed me to build a recursive self-improving system and automations that can better tap into coding agents like Codex or Claude Code. I could have paid or built an entire app for scheduling posts, and there are so many of them out there. But I don't need to. I've realized a simple artifact does the job. And the simplicity of it is actually an advantage. Very little maintenance for very high returns on personalization, time, and efficiency. The other benefit of this is that I can add features as I please. That level of personalization feels magical, and we should all be pursuing more of it. All of this just keeps compounding. Of course, this example is just about writing. But I have similar artifacts for research, design, experimentation, evaluation, and so much more. And no, I didn't actually publish the post example I shared in the clip. It was just for demonstration purposes. I actually spend more time than this when writing together with agents. Lastly, having built my own agent orchestrator tool has made me realize that simplifying the tool stack is a superpower. If you are curious about how all this works, I will do a live session next week:

elvis

18,374 views • 2 months ago
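
The "markdown vault, no DB" setup in the post above can be sketched roughly as follows. This is a minimal illustration, not the author's implementation: the front-matter keys `status` and `publish_at`, the flat vault layout, and the function names are all assumptions made for the example.

```python
from datetime import datetime
from pathlib import Path


def parse_post(text):
    """Split a note into (metadata dict, body) using a simple '---' front-matter block."""
    meta, body = {}, text
    if text.startswith("---\n"):
        header, sep, rest = text[4:].partition("\n---\n")
        if sep:  # only treat it as front matter if the block is closed
            body = rest
            for line in header.splitlines():
                key, colon, value = line.partition(":")
                if colon:
                    meta[key.strip()] = value.strip()
    return meta, body.strip()


def due_posts(vault, now):
    """Return (filename, body) for notes marked 'scheduled' whose publish_at is <= now.

    'status' and 'publish_at' are hypothetical keys chosen for this sketch.
    """
    due = []
    for path in sorted(Path(vault).glob("*.md")):
        meta, body = parse_post(path.read_text(encoding="utf-8"))
        if meta.get("status") != "scheduled" or "publish_at" not in meta:
            continue
        if datetime.fromisoformat(meta["publish_at"]) <= now:
            due.append((path.name, body))
    return due
```

Because the notes stay plain markdown, an agent or an HTML artifact could read and rewrite the same files without any extra storage layer, which seems to be the point the post is making.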

0:19

Updated the subagents system in my custom Pi extension to include:
- Pi subagents
- Codex subagents
- Claude Code subagents

The Pi thread can call and spin up any of them, and instead of just using the "codex exec" or whatever, it'll do a tool call that wraps it up in a first class way. Have a decent UI for them already added in as well. Pi being the main thread then having Codex/Claude Code as tools when needed feels so good. A flow I'm already using a ton is: "Implement feature ___, have a cc fable subagent research and plan out the api design + feature, then have a codex subagent implement it, then have a cc fable subagent do a review/fixes focused on simplicity and correctness" I also have a dynamic workflows extension setup and working, but it's not nearly as polished yet. Probably gonna make the subagent system more robust, then build it around that so u could have a workflow that flows between codex/cc/pi...

Ben Davis

24,140 views • 15 days ago
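
The research, implement, review flow quoted in the post above could be modeled along these lines. It is only a sketch under stated assumptions: the `Stage` type, `run_workflow`, and the stub runners are hypothetical names invented here, and no real Pi, Codex, or Claude Code interface is being called or described.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A runner takes a prompt and returns the subagent's text output (hypothetical interface).
Runner = Callable[[str], str]


@dataclass
class Stage:
    agent: str  # key into the runners table, e.g. "claude-code" or "codex"
    task: str   # instruction for this stage


def run_workflow(feature: str, stages: List[Stage], runners: Dict[str, Runner]) -> List[str]:
    """Run stages in order, handing each stage the previous stage's output as context."""
    outputs: List[str] = []
    context = f"Feature: {feature}"
    for stage in stages:
        prompt = f"{stage.task}\n\n{context}"
        result = runners[stage.agent](prompt)
        outputs.append(result)
        context = f"Feature: {feature}\nPrevious output:\n{result}"
    return outputs


def fake_runner(name: str) -> Runner:
    """Stub runner so the sketch runs without any real agent installed."""
    return lambda prompt: f"[{name}] handled: {prompt.splitlines()[0]}"


STAGES = [
    Stage("claude-code", "Research and plan the API design"),
    Stage("codex", "Implement the plan"),
    Stage("claude-code", "Review and fix, focusing on simplicity and correctness"),
]
```

In a real setup each runner would wrap whatever tool call the main thread exposes for that subagent; keeping the runner a plain callable means the stage order stays independent of how each agent is actually invoked.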

0:18

Building a personal knowledge base for my agents is increasingly where I spend my time these days. Like Andrej Karpathy, I also use Obsidian for my MD vaults. What's different in my approach is that I curate research papers on a daily basis and have actually tuned a Skill for months to find high-signal, relevant papers. I was reviewing and curating papers manually for some time, but now it's all automated as it has gotten so good at capturing what I consider the best of the best. There are so many papers these days, so this is a big deal. You all get to benefit from that with the papers I feature in my timeline and on DAIR.AI. The papers are indexed using tobi lutke qmd cli tool (all of it in markdown files along with useful metadata). So good for semantic search and surfacing insights, unlike anything out there. I am a visual person, so I then started to experiment with how to leverage this personal knowledge base of research papers inside my new interactive artifact generator (mcp tools inside my agent orchestrator system). The result is what you see in the clip. 100s of papers with all sorts of insights visualized. I keep track of research papers daily, so believe me when I tell you that this system is absolutely insane at surfacing insights. This is the result of months of tinkering on how to index research and leverage agent automations for wikification and robust documentation. But this is just the beginning. The visual artifact (which is interactive too) can be changed dynamically as I please. I can prompt my agent to throw any data at it. I can add different views to the data. Different interactions. I feel like this is the most personalized research system I have ever built and used, and it's not even close. The knowledge that the agents are able to surface from this basic setup is already extremely useful as I experiment with new agentic engineering concepts. 
I feel like this knowledge layer and the higher-level ones I am working on will allow me to maximize other automation tools like autoresearch. The research is only as good as the research questions. And the research questions are only as good as the insights the agents have access to. Where I am spending time now is on how to make this more actionable. I am obsessed about the search problem here. The automations, autoresearch, ralph research loop (I built one months ago) are easier to build but are only as good as what you feed them. Work in progress. More updates soon. Back to building.

elvis

464,892 views • 3 months ago

0:14

Excited to launch a new way to upskill with AI agents. This is how we are making it possible for anyone to learn to build with coding agents. To start, we are launching 4 new hands-on labs on the following topics:
- Agent Skills
- Agentic Image Generation
- 30 Days of Hermes Agents
- Prompt Engineering with Agents

I am confident that with our new DAIR.AI platform, anyone can learn to become a top AI builder by building and acquiring highly-demanded AI skills. And there is a lot more landing in the coming weeks.

elvis

19,058 views • 1 month ago

0:10

Simplicity is at the heart of great software. This is one of the reasons why Claude Code has been sticky for me. As a builder, I love planning and brainstorming, and this is now a key focus of Claude Code. I use Shift + Tab a lot to cycle between brainstorming, planning, and execution. This functionality provides the appropriate interface for me to either be very involved or less involved as I please. This works particularly well when building out new and complex features or entire new projects. This saves a huge amount of time. It allows me to tune Claude Code to execute and build more effectively. It also builds a loop of trust, and I often (surprisingly) find Claude Code asking for clarifications when it's confused. Coding agents don't normally do that. I have shared before on the power of brainstorming with AI for longer times. Try it and you will not be disappointed. Vibe coding is fun, but pair it with intentional development cycles, and you watch how far you can take a project with coding agents today.

elvis

81,765 views • 9 months ago

0:13

this is the worst local ai will ever be. it only gets better from here. if you are not expanding your mind with these small models you are missing what's happening right now 99 percent tool call success rate. when steered well with the right skills and a framework like hermes agent the node becomes a cognition layer. not a chatbot. not a toy. an extension of how you think. i was cranking this node at 35 to 50 tok/s all day on personal experiments and now after all the work is done qwen 3.5 9B is iterating on its own code. the game it created. fixing its own bugs autonomously. and the part you should probably not miss is that all of this is happening on a RTX 3060. not an H100. not an A100. the card most of you have sitting in a drawer right now. if you just open that drawer and put that intelligence to work every tensor core on that card should be running for you. your work. your experiments. your thinking. you all have it but because nobody told you what this hardware can actually do in 2026 you never tried. the day it unlocks is the day you test your workload, understand the tradeoffs, debug the loops, and then decide if you need to scale the hardware. there is no point buying 3 mac studios when things done well you can squeeze a similar level of intelligence from 9B compared to 70B. but only when you create the right environment for your model through the right harness. and let me tell you i have tried claude code as a local harness. i have tried opencode. i have tried various others. somehow i landed on hermes agent and never left. there is something magical going on at Nous Research. the tool call parsers, the skills system, the way it handles small models natively. nothing else comes close for local inference. own your cognition. your AI. your agent. your prompts. your experiments. why give them away for free. those are who you are and they don't belong on someone else's servers being monitored. just give it a shot with your existing hardware. 
you run into a problem the community will help you. and if you are migrating from openclaw to hermes i will personally help you make the switch.

Sudo su

58,717 views • 4 months ago

0:12

Flipping houses, or rehab and resell, is a game that seems to be the craze these days. People with little (myself) or tons of money think they can buy a dump of a house, shine it up, and make $100,000 without a problem. So, I gave it a try, putting my proverbial chips in the middle and going all in. Renovated it in 63 days. New roof, HVAC system, electrical panel, kitchen, bathroom, doors, trim, flooring and paint. I listed it, and then it just sat—hours, days, weeks, months — costing me $100 per day in holding costs. We adjusted the price, staged furniture, and reshot photos, but it sat, but why? The basement was wet, which is the kiss of death in real estate, I have learned. As a GC, I have access to a network of subcontractors and skill sets that many people don't, though that doesn't mean repairs are free. After spending $60,000 in renovations, $14,000 in closing costs, and $32,000 in holding costs I was taking a loss on selling this house. Then, I had to put another $15,000 into it to stop water from entering the basement. It is what it is. The house is now under contract, and we feel good about things, which is a distraction from the other house I bought in the middle of all of this until my guy sent me this video yesterday as he was leaving….

Nathan Quinlan GC

47,865 views • 1 year ago

0:15

New feature in Claude Code 2.1.14 just dropped! You can now search and install plugins from the marketplaces installed in your current Claude Code session. This is huge if you’re building plugins on top of Claude Code’s marketplace layer (Skills, Agents, Hooks, etc).

How it works:
- Run /plugin
- The official Claude marketplace is installed by default
- Use the search bar to find the plugin you want
- Select one or multiple plugins with space, then press i to install
- Go to the Installed tab to browse and enable them

With the exponential growth of Skills and Agent-based components running in the CLI, improving plugin discoverability is a big win. Pretty sure more marketplace-related features are coming

Daniel San

40,994 views • 6 months ago

0:14

There’s two kinds of traders. The one that is able to walk away and the one that revenge trades. Overtrading kills more accounts than anything else. I used to be this way, getting angry that price wasn’t going my way and pushing the button again and again until I blew up all my accounts. It takes a lot of inner work, but believe me it’s worth it to work on not just your strategy and executions, but your mental game as well. Being able to walk away from the screen before it’s too late is a real skill set to work on. Don’t be the guy that revenge trades.

Strategies that worked for me:
1. Writing down my game plan at the start of the morning. As long as I followed my plan, it’s a win.
2. Having pictures of family on my trading desk. Trading for others and not just yourself is important, it keeps you grounded.
3. Having alarms go off on my phone at the end of the session to bring me back to reality and remind me to walk away.

Tanja Trades

64,142 views • 4 months ago

0:12

I learned this the hard way: do NOT use SwiftUI if you want your app to look and feel amazing. At least when coding with AI. (sorry, Apple colleagues reading this 😅) I'm sharing my process vibe coding this calorie tracker. I get a lot of questions about the fluid transition in the video. Here's the whole story. Initially, Claude built the grid with SwiftUI. It was quick and easy, and looked good! But the transition to the day view was a boring navigation push/pop. No fun. I wanted something custom. I asked Claude to make it a fluid transition that remaps the food tiles from their source to destination positions. All hell broke loose. Claude tried a bunch of horrible things. Initially it used matched geometry effects, which worked OK but didn't lend themselves well to gesture-driven animations. So it resorted to SwiftUI preference keys + geometry readers to figure out the source and destination positions and calculate the interpolated position based on gesture progress, coordinating across grid and day views. But this meant it had to write a custom layout because it couldn't reposition tiles inside the native SwiftUI grid. And it had to do an awkward handoff between views, which always created ugly pops or jumps. And don't get me started on trying to put it on a bouncy spring, that only made the math 10x buggier. Fortunately, Claude Fable was smart enough to see that this was becoming a disaster (and discover most of the issues itself, in the simulator), so it pivoted away from SwiftUI. Opus might not be so wise, so you'll have to pay attention and intervene. Ultimately, it rewrote it in plain UIKit and everything turned out great. After that, we moved from 2D images to 3D assets, which introduced a new set of performance challenges and yet another rewrite to a single Metal layer, which is what you see below. I can write more about the 2D-to-3D saga if anyone's interested. 
If I were to do it again, I'd just say "Don't use SwiftUI" from the very first prompt, and save a few hours of headaches. SwiftUI can be amazing for a human iterating directly in code. But agents don't benefit from any of its advantages. Plus, agents have seen decades of UIKit training data, so they're great at writing it, and it's far more flexible. Here's hoping we see more agent-friendly iterations of SwiftUI in the future. Till then, I'm probably going to avoid it.

Anshu

111,617 views • 1 month ago
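
The "interpolated position based on gesture progress" step mentioned in the post above comes down to a linear interpolation between a tile's source and destination frames. The sketch below shows that arithmetic in Python for illustration only; it is not the author's SwiftUI or UIKit code, and `Frame` and `interpolate_frame` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    x: float
    y: float
    width: float
    height: float


def lerp(a: float, b: float, t: float) -> float:
    """Linear interpolation: a at t=0, b at t=1."""
    return a + (b - a) * t


def interpolate_frame(source: Frame, dest: Frame, progress: float) -> Frame:
    """Frame of a tile partway through the transition.

    progress is clamped to [0, 1] here; a spring animation would overshoot
    that range on purpose, which this sketch deliberately does not model.
    """
    t = max(0.0, min(1.0, progress))
    return Frame(
        lerp(source.x, dest.x, t),
        lerp(source.y, dest.y, t),
        lerp(source.width, dest.width, t),
        lerp(source.height, dest.height, t),
    )
```

The formula itself is simple; the difficulty described in the post is getting both views to agree on the source and destination frames and handing tiles off without a visible jump.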

0:18

LLM Artifacts

Connected to Andrej Karpathy's LLM Knowledge base idea, I've been building out a fun way to generate dynamic artifacts from these knowledge bases with the goal of discovering and revealing meaningful and deeper insights. LLM KBs are hard to consume for humans, as I think they are more built for agents. So the question is, what form would be useful for humans to take actions and make important decisions? That's what I am trying to figure out with these artifacts. The artifact example shows a pulse on HN discussions around AI-related stories. The insights can go deeper, of course, but this is already super fun and thought-provoking, like some of my favorite podcasts. The format and depth matter a lot. The aggregation skills of agents are outstanding if you tune the prompts and skill carefully. I built this artifact generator in a few minutes through an agent skill, but I feel like there are so many ways that LLM-generated information can be used and consumed. Like generating deeper insights and analysis, and things that are just not feasible for humans today. The generated artifact (including its data and design) serves as a reusable template or can be updated in real time via automations, which is something I am also working on. It is truly an insane way to monitor and track information. Better than a newsletter. Better than newspapers. There is something about this that gets me really excited about the future of AI agents for knowledge generation and discovery. Lots of hidden gems everywhere just waiting to be discovered and acted on if the information is presented correctly. This is not perfect. The format, style/prose can be improved, but this is easy to customize via skill. You can personalize it to your liking. I feel like these dynamic artifacts are going to emerge as a strong new medium to stay on the cutting edge of things, both for agents and humans. My target is research, of course. This was just a basic example. 
Besides animation, I am also targeting other components like voice, videos, images, slides, etc. This space is full of opportunities to explore. Skill for this coming soon.

elvis

31,213 views • 3 months ago

0:10

Learning is something you and your baby do together. You can think of the process as happening in three distinct stages, during which skills are transferred gradually from you to your little one: During the first stage, your baby is observing the behavior and skills of others. During the second, they begin to emulate these behaviors - and can find success with the support of a helpful adult (you) or more expert peer (often a sibling). And gradually they internalize these skills and perform them all by themselves. This video is a great example of the shared second phase. Infants explore the world with their mouths. But an important lesson of toddlerhood is that some things are for putting in our mouths, while others are not. This little one knows that we don’t eat the Play-Doh. But it sure is tempting! Watch as he breaks off a piece and brings it to his mouth. As he does his eyes lift and he realizes that Mom is watching - which alone prompts some introspection. He grins broadly, shakes his head and exclaims “No, no, no” - using Mom’s past words to affirm his decision to place the Play-Doh back on the table. Left to his own devices, who knows? But together, without exchanging a word, he managed to make the right choice. As a parent it’s important to remember the key role you play in the learning process. And that extends to your child’s behavior. Self-regulation begins as co-regulation. So be there. This sweet little guy was shared to IG by parentosa.

Dan Wuori

75,169 views • 2 years ago

0:18

ANTHROPIC JUST TURNED AI AGENTS INTO GIT REPOS

Anthropic shipped "ant" - a CLI that runs every Claude API endpoint straight from your terminal. The headline isn't the terminal access. It's that you can now version-control an AI agent as YAML in Git and have CI sync it to the Claude Platform, the same way you ship code.
- Every API resource is a subcommand: messages, models, files, agents, sessions
- Define an agent in a YAML file, check it into your repo, and keep it in sync with one update command
- Spin up a session, send it an event, then pull every event and tool call back from the same CLI
- Claude Code knows how to drive ant out of the box - it shells out and reads the results with no glue code

Agents just stopped being prompts you babysit and became infrastructure you deploy.

BuBBliK

200,319 views • 1 month ago

0:19

NEW HTML VIDEO SKILL: Claude can now make photo-grid promo video ads like these. These are very popular offer-style ads that work well for bottom-of-funnel conversions. Brands will often run these when they're doing seasonal sales, for example. It's pure HTML, so it's very cheap to make because there's no video generation cost. You can create dozens of variants very fast and then test them in Meta. All you have to do is install the skills and give Claude your brand website. This skill is insane - it'll make the entire Ad and give you a link where you can download the mp4 file.
> If you want more variants, just mention in the prompt
> If you have a specific concept in mind, just mention in the prompt
> If you want any edits on the generated ad, just mention in the prompt

Comment Goose and I'll DM you the skill (must be following so I can DM you)

Shiv

91,654 views • 14 days ago

0:10

Here's how to get 100% consistent product ads from one seedance 2.0 generation. I did it all in a single chat using the Comfy MCP. The real control here comes from calling my existing workflows (shared below) instead of the agent improvising a pipeline. I directed the agent to call my ComfyUI workflow for cinematic product ads. I specified the close-up shot of the sprite animating, the bezel turn flipping the screen, the display changing to the time 10:04. Now for the consistent variations. The driving video does the heavy lifting but you need to get it right → depthanything v3 pass blended with canny edge lines to show the fine detail... it's why the tiny debossed logo is there → the initial sprite outline lived in those edge lines too, and every gen kept inheriting it. claude suggested a sam3 mask over the screen to hide it (s/o the agent) → with the screen masked, the new star sprite is just prompting: one gpt-image-2 still to generate a reference, one extra line in the seedance 2.0 prompt, and it animates oh and the whole process is a claude skill now.

rob - comfyui

19,940 views • 20 days ago

0:15

Karpathy's Agentic Engineering finally has proper tooling! (built by Google)

Karpathy defined agentic engineering as the discipline that separates production agent work from vibe coding. The core skills he listed were spec design, eval loops, and security oversight. The problem has been that practicing this still requires a different tool for every phase:
- editor for code
- a terminal for scaffolding
- a browser for testing
- a cloud console for deployment
- and a separate framework for evals.

Every transition is a context switch. The solution to production-grade Agentic Engineering is now actually implemented in Google’s Agents CLI. It covers the entire workflow in one place for scaffolding, evaluating, and deploying ADK agents. One setup command injects 7 ADK-specific skills into a coding agent's context, which lets it handle scaffolding, evals, deployment, and enterprise registration through natural language.

I tested this end-to-end by building a RAG agent from scratch using Claude Code. It scaffolded the full project from the ADK agentic_rag template, generated 20 eval scenarios with LLM-as-judge scoring, and returned a quantitative scorecard. Finally, it also deployed everything to Agent Runtime and registered the agent to Gemini Enterprise, so the entire org can discover and use it. The video below shows this in action, and I worked with the Google Cloud team to put this together.

Agents CLI GitHub repo → (don't forget to star it ⭐ )

I wrote up the full build covering all six steps from install to enterprise registration. It includes the eval scorecard, the instruction loophole the eval caught before deployment, and what the deployment process actually looks like end-to-end. Read it below.

Akshay 🚀

256,388 views • 1 month ago

0:12

⚔️ Kingdom Come Deliverance first impressions ⚔️

Loving it so far, basically a medieval detective simulator that really doesn't care that you are the main character. And I'm all here for it.

▪️The WORLD is the real star of the show here and even though it's got plenty of jank and lots of copy and paste NPC faces, it just feels so IMMERSIVE. Even the UI just transports me to the times with a bright colourful medieval art style.
▪️The MUSIC I love, absolutely sells the world and basically ASMR as you trot around on your horse through the world.
▪️THE Combat is a real interesting one, it's got quite the learning curve which I actually LIKE, it definitely has some jank to it as well but I really appreciate the attempt at an original and nuanced combat system. (Having to stop your bleeding with bandages is really cool)
▪️The Story has gripped me (19 hours in so far) And while it seems a simple revenge story on the face of it, I think the story is more about Henry making his way through the world after the horrors of Skalitz. The writing quality is top notch as well as the quest design also.
▪️The CHARACTERS are amazing and the humour is top notch. I'm not sure the last time I laughed so much at a game. Henry is great and so well voiced by Tom McKay.

This really feels like Warhorse Studios have put a lot of love and work into making an authentic medieval world and as a bit of a medieval nerd I can't get enough of this game. A True RPG as well by all accounts, the game really makes me think hard about how to approach situations. Also I can't wait to get to KCD2.

KJPlays

63,735 views • 6 months ago

0:16

one of my favorite ways to use claude code skills right now - combining remotion with claude-in-chrome for motion video creation. the workflow is addictive. the clip you see here was produced with minimal prompting effort let me know if you would like me to write a full break down of this process. you describe to claude code what you want, claude code writes the remotion components, opens the remotion studio (via browser) with claude-in-chrome, sees the actual rendered output, and iterates on it in real time. need the arrows pointing to the center of the bubbles instead of the edge? just say it. need the layout shifted to the center? say it. claude sees the preview, adjusts the code, re-renders. then when you're happy you tell claude code to render the final video. but skills are what make this possible. remotion knowledge + browser automation + the taste to iterate visually. no copy pasting screenshots back and forth. no "can you try moving it 10px to the right" over chat. it just looks and fixes. this is the kind of workflow that makes you realize how much further claude code can go beyond just writing code in a terminal. i didn't touch any code while working on the clip you see if you haven't tried combining skills together like this - start experimenting. the skills combos is where the magic is at.

elvis

46,464 views • 5 months ago

0:13

While I have premium I definitely want to do a long post breaking down the main scene from this show that I obsess with. There's so much that I feel could be learned from it. Not just for expansion animators either. There's honestly a LOT that can be learned from how this show directed its expansion scenes that can be applied to even safe for work animations. Obviously yes, the expansion alone is really good. What makes it so good though is just the fact that there's things you pick up on that you likely don't even realize until you re-watch. I'm using this scene as an example as it's not AS good I would say, but it still has a lot going for it. One thing I've always loved from it that I want to use in my own animations someday is that the expansion sort of comes in waves as opposed to one consistent growth. In a way, it makes it feel more natural while also selling how tight the top is getting. It could have been accompanied by showing the knot getting tighter or smaller with each growth. It's a small part of the animation but just that one thing already adds another layer to appreciate. It's not like most expansion scenes where there's very little outside of just "growing" on its own. There's a bunch of little things that subtly improve it without being obnoxious. The first expansion scene has WAY more that I want to talk about honestly. Far more than this one has. It will probably be a really long post now that I think about it...

FancyPlanks 🐀

16,420 views • 7 months ago
