
am.will
@LLMJunky • 26,308 subscribers
StarSwap // Life in Color Director of n number of agents. Thoughts are my own. Also not a car.

Look ma new Codex Updates! 0.119.0 and 0.120.0 are here. And with it, a HUGE number of quality of life updates and bug fixes!
> Hooks now render in a dedicated live area above the composer. They only persist when they have output, so your terminal stays clean. If you're running PreToolUse or PostToolUse hooks, this is a huge readability win.
> Hooks are now available again on Windows
> CTRL+O copies the last agent output. Small but clutch when you're pulling a code block into another file or chat.
> New statusline option: context usage as a graphical bar instead of a percentage. Easier to glance at mid-session when you're trying to gauge how much runway you have left.
> Zellij support is here with no scrollback bugs. If you've been stuck on tmux just because Codex was broken in Zellij, you're free now (shout out Felipe Coury 🦀)
> Memory extensions just landed. The consolidation agent can now discover plugin folders under memories_extensions/ and read their instructions.md to learn how to interpret new memory sources. Drop a folder in, give it guidance, and the agent picks it up automatically during summarization. No core code changes needed. This is the first real extension point for Codex's memory system, and it opens the door for third-party memory plugins.
> Did you know, you can /rename a thread? But what's really cool about that is, after you rename it, you can resume it with the same name, no more UUIDs.
codex resume mynewapp
or directly from the TUI:
/resume mynewapp
> Multi agents v2 got an update to tool descriptions
More reliable multi agent environments and inter agent communication
> You can now enable TUI notifications whether Codex is in focus or not. Modify this in your config:
[tui]
notification_condition = "always"
> MAJOR overhaul to Codex MCP functionality:
1. Codex Tool Search now works with custom MCP servers, so tools can be searched and deferred instead of all being exposed up front.
2. Custom MCP servers can now trigger elicitations, meaning they can stop and ask for user approval or input mid-flow.
3. MCP tool results now preserve richer metadata, which improves app/UI handoff behavior.
4. Codex can now read MCP resources directly, letting apps return resource URIs that the client can actually open.
5. File params for Codex Apps are smoother: local file paths can be uploaded and remapped automatically.
6. Plugin cache refresh and fallback sync behavior are more reliable, especially for custom and curated plugins.
> Composer and chat behavior smoother overall, resize bugs remain though.
> Realtime v2 got several significant improvements as well.
> You're still reading? What a legend. 🫶
npm i -g @openai/codex to update
am.will • 738,700 views • 1 month ago
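The memory extensions note above describes a discovery step: look for folders under memories_extensions/ and read each folder's instructions.md. The sketch below illustrates that idea in Python. It is not Codex's actual implementation; the function name `discover_memory_extensions` is hypothetical, and skipping folders that lack an instructions.md is an assumption of this sketch.

```python
from pathlib import Path


def discover_memory_extensions(root: Path) -> dict[str, str]:
    """Map each extension folder name to the text of its instructions.md.

    Hypothetical helper: folders without an instructions.md are skipped,
    which is an assumption of this sketch, not documented Codex behavior.
    """
    extensions: dict[str, str] = {}
    if not root.is_dir():
        return extensions
    for folder in sorted(root.iterdir()):
        guide = folder / "instructions.md"
        if folder.is_dir() and guide.is_file():
            extensions[folder.name] = guide.read_text(encoding="utf-8")
    return extensions
```

Calling `discover_memory_extensions(Path("memories_extensions"))` would return one entry per plugin folder, which a summarization step could then consult as guidance.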

Introducing Lynk, a brand new way to interact with your favorite harnesses on the go. Compatible with OpenClaw, Hermes, Codex, and local edge models. Fully featured client allowing you to easily and quickly switch between your favorite agents. Lynk has been my absolute favorite way to kick off tasks on the go, completely replacing Telegram. Only available as a beta on Android at present, but an iOS version is in development. Works via local network or Tailscale.
Features:
- Create and continue threads with your favorite harness
- OpenClaw, Hermes, Codex, and local models
- Android Phone control
- Realtime voice agent
- Speech-to-text transcription
- Codex Pets + live notifications
- Draw over screen quick access chat overlay
- Open Source software
Join the beta in the comments.
am.will • 35,971 views • 5 days ago

The Codex app server was such a brilliant stroke of foresight that really doesn't get enough love. Not only are you allowed to use your ChatGPT account with any harness, but you can build your own apps directly on top of theirs. They just make building on and with Codex such a great experience. To demonstrate this utility, I want to highlight the kitty litter app, made by SIGKITTEN. Instead of having to build the entire harness, and all the infrastructure, he's plugged into the app server for a unified experience between mobile and dev machine. When I create a session on my computer, it's automatically available on my phone. All of the chats you see in this video automatically populated when we connected to the app server. All my skills. My agents. My sessions. My folders. My prompts. They're all ready to use - automatically. Because they're exposed by the app server, along with many other endpoints. It's a great UX/DX that really deserves some love. It's almost like they want you to build on top of their products ;) Btw Litter is great 👍
am.will • 265,184 views • 2 months ago

Nice. Cursor just dropped their new "Glass" alpha, and they're leaning heavily into the simplified coding GUI trend that's been blowing up lately. First impressions are really positive. And just look at how insanely fast Composer 2 is. First impressions? Drop yours 👇
am.will • 217,069 views • 2 months ago

This is so cool. In the next Codex update, multi agents will get a massive flexibility upgrade. "Hey Codex, when you implement this plan, I want you to delegate all the lower complexity tasks to GPT 5.3 Spark subagents" Instead of needing to create 100 different custom agent roles for different situations, you can just prompt your agent to spawn whatever model or reasoning level you want. With only natural language. No config files. No pre-defined roles. Just tell the orchestrator what to use and it listens.
am.will • 127,465 views • 2 months ago

I got so tired of everyone raving about how great cmux is. Panes this. Browser that. EXHAUSTING. And that's because I'm on Linux, where we get none of the coolest toys. So...I built it myself. And my God. You were right. It's amazing. Introducing Limux, a GPU-accelerated terminal workspace manager for Linux, powered by Ghostty's rendering engine, with split panes, tabbed workspaces, and a built-in browser. Think cmux, but native Linux. If you're interested in something like this, be sure to leave a comment and I'll release it. Special thanks to Manaflow and Mitchell Hashimoto for making this possible.
am.will • 108,626 views • 2 months ago

WOW! I'm so excited about this. OpenAI Developers said Codex was good at Computer Use, but I wasn't prepared for this. For the last two weeks I've been working on a Computer Use skill to work with Linux. And while I had some success, it was a pretty frustrating experience. That is...until the breakthrough. Using accessibility tools, Codex can now control my entire computer, not just the browser. There are limits to this, of course, but what a time to be alive. This Computer Use skill will unlock an entirely new set of automations, all powered by Codex. Demonstration below 👇
am.will • 75,650 views • 2 months ago

I have fallen in love with Ghostty Terminal. Look at this subtle, but incredibly useful opacity trick you can use to show "focus" when working in multiple panes. Shout out Daniel San who showed me this. What a legend. Ghostty config file: unfocused-split-opacity = 0.55
am.will • 86,085 views • 3 months ago

Calling on Codex Fans! I need your help 🫵 I'm introducing Codex Marketplace, a community collection of plugins, skills, and hooks curated by YOU. Yes, you! Help me make this the best resource for everyone who wants to build incredible software with Codex. Getting started is simple: submit your artifacts via your GitHub repository. If you own the repo, it'll be auto-approved; otherwise it'll be reviewed. Upvote your favorite artifacts. Add/remove what you like with a simple npx command. Best part? Everything lives on GitHub, so you always get the latest version. 👉
am.will • 40,580 views • 1 month ago

Codex team is back in the kitchen with a really nice quality of life upgrade for subagents. With the advent of custom roles, they have also upgraded the TUI experience in two really meaningful ways.
> All agents now get a name for better readability
> Additionally, the agent role is declared.
> Subagent name, role, and status are now color coded.
> Subagent rendering was also optimized for readability
> /agents slash command shows all agents, even 2+ layers deep.
And here's the biggest and most important change. Subagent injection. Before, the orchestration agent would sometimes continue working and lose track of what a subagent was doing. Now, when a subagent is blocked or completed, it injects a message back up the chain to ensure that the parent sees it. This is a really big improvement overall, and leads to much more reliable inter-agent communication and better DX. In this example, I used the parent agent to spin up a worker agent, which then spawned two more "Spark" agents a second layer deep. I was able to easily tell them apart, switch between the threads, and see exactly what they were prompted. All of this will be available in update 0.105.0. I don't know who JIF is at OpenAI, but they are truly a legend.
am.will • 78,267 views • 3 months ago
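The subagent injection behavior described above can be pictured with a toy model. The sketch below is an illustration only, not how Codex implements it: the `Agent` class, its fields, and the message format are all hypothetical, and delivering the message to the direct parent only is an assumption of this sketch.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Agent:
    """Toy agent with a name, a role, a status, and an inbox (all hypothetical)."""

    name: str
    role: str
    parent: Agent | None = None
    status: str = "running"
    inbox: list[str] = field(default_factory=list)

    def spawn(self, name: str, role: str) -> Agent:
        # A child keeps a reference to the agent that created it.
        return Agent(name=name, role=role, parent=self)

    def set_status(self, status: str, note: str = "") -> None:
        self.status = status
        # Assumption: only "blocked" and "completed" trigger an injected message,
        # and it goes to the direct parent.
        if status in {"blocked", "completed"} and self.parent is not None:
            self.parent.inbox.append(f"[{self.name}/{self.role}] {status}: {note}")


orchestrator = Agent("orchestrator", "parent")
worker = orchestrator.spawn("worker-1", "worker")
spark = worker.spawn("spark-1", "spark")
spark.set_status("completed", "tests pass")
print(worker.inbox)
```

The point of the model is that the parent never has to poll: the child's state change is what puts the message in front of it.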

This is what 30 hours of work, a handful of AI bux, and a sore back looks like:
2x RTX 6000 Pro
320GB RAM (192GB VRAM + 128GB)
48,128 CUDA Cores
1.8TB/s Memory Bandwidth
9950X (16c/32t)
Corsair DDR5 6400
ASUS ProArt X870E
4TB Crucial T705 5x4 SSD
4TB Crucial P3 4x4 SSD
4TB Crucial T500 4x4 SSD
ASRock Platinum 1600W
Corsair XD5 Reservoir
EKWB Waterblock
2x 360mm EKWB Radiators
Way too much debt
I'm so stupid
But it's glorious
I bought everything used or repurposed old equipment except for the water block, 1 GPU, and 3 fans. Even though I'm not thrilled about spending the cash, I only have about $10.5K into it (not counting parts I paid for years ago). Safe to say I did pretty damn good all things considered. I'd like to thank Central Computers for helping me get this project off the ground. Now that I have this together, my aim is to support the OSS community. If you're interested in running local models, consider following me on this journey. Next chapter loading....
am.will • 48,372 views • 2 months ago

Your Plan SUCKS! 👀 Inspired by the legend Andrej Karpathy himself, I created a new skill: LLM Council. Create better plans, by committee. Supports: > Codex CLI > Gemini CLI > Claude Code > OpenCode Call the skill with the feature that you'd like to build. It will ask you some clarifying questions, and then launch up to four parallel planning agents to create a detailed plan. Once the plans are in, all plans are anonymized and "The Judge" will critique and choose the best plan *OR* the best parts from all of them. Finally, it will output a final-plan, which you can review and refine. I even created a nice UI for you to review and refine your plans. This Skill has been tested on Linux only, but should work on other platforms. Please report any bugs! Links in the comments.
am.will • 67,101 views • 4 months ago
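The anonymize-then-judge step described above could be sketched roughly as follows. This is not the skill's actual code: `run_council`, the "Plan A" style labels, and the `judge` callable are hypothetical, and the sketch only covers the case where the judge picks a single plan rather than merging the best parts.

```python
import random
from typing import Callable, Optional


def run_council(
    plans: dict[str, str],
    judge: Callable[[dict[str, str]], str],
    seed: Optional[int] = None,
) -> tuple[str, dict[str, str]]:
    """Hide which agent wrote which plan, then let the judge pick a label.

    Returns (winning_source, label_to_source). All names are hypothetical.
    """
    sources = list(plans)
    # Shuffle so a label's position reveals nothing about its author.
    random.Random(seed).shuffle(sources)
    labels = [f"Plan {chr(ord('A') + i)}" for i in range(len(sources))]
    label_to_source = dict(zip(labels, sources))
    anonymized = {label: plans[src] for label, src in label_to_source.items()}
    chosen_label = judge(anonymized)
    return label_to_source[chosen_label], label_to_source
```

The judge only ever sees the labels, so it cannot favor a plan because of the tool that produced it; the mapping is kept on the side to reveal the author afterwards.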

With all the buzz around the Codex App, OpenAI Developers quietly snuck out a new CLI update (0.94.0) as well. And boy is it an important update! Codex Plan mode is now officially released to the general audience! I am very excited about this one as it has a really strong prompt that is unlike any other plan mode I've personally used. Codex Plan mode doesn't necessarily just ask you 3 questions up front. It goes, collects context, asks questions, collects more context, asks more questions (sometimes), and then writes an incredibly high quality plan. It is my favorite implementation of plan mode thus far. It also comes with Codex's own version of "AskUserQuestion!" Although, it only works in Plan mode for now. They really need to allow people to use it in Code mode as well, but one win at a time. npm i -g @openai/codex Below is a demo of how it works. Let me know what you think!
am.will • 57,121 views • 4 months ago

Codex 0.107.0 is here, and with it: FORKS🍴 This minor update includes the ability to fork your current chat history directly with custom multi agents with just a quick prompt. "Fork this session using n subagent(s)" This unlocks some interesting use-cases where you can launch multiple agents that retain all of your chat history up to that point, each in their own independent session. For example, you've been discussing architecture with Codex for 20 minutes, talked through constraints, reviewed existing code, settled on a model. Now you want to see two different implementation strategies side by side. Fork lets both children start from that full shared understanding. With a normal spawn, you'd have to re-explain the entire context to each child, and they'd inevitably miss nuance from the earlier conversation - which is basically a form of compaction/compression. With something like this, you could also just tell it to fork itself 5 times, and implement the same UI/frontend 5 different times in their own worktree to see which one you like the most. Other potential uses: - branching after reviews - try and rollback patterns using worktrees - parallel debugging strategies Otherwise, this is a minor update with improvements to the app server, realtime transcription, and memory. Enjoy!
am.will • 42,504 views • 3 months ago
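To make the fork-versus-spawn distinction above concrete, here is a toy model. It is an assumption-laden sketch, not Codex's internals: the `Session` class and its `fork` and `spawn` methods are hypothetical, and a "normal spawn" is modeled as a child that starts from a single briefing message.

```python
import copy
from dataclasses import dataclass, field


@dataclass
class Session:
    """Toy chat session holding an ordered message history (hypothetical)."""

    history: list[str] = field(default_factory=list)

    def fork(self, n: int) -> list["Session"]:
        # Each child gets its own full copy of the history so far,
        # so later messages in one child never leak into another.
        return [Session(history=copy.deepcopy(self.history)) for _ in range(n)]

    def spawn(self, briefing: str) -> "Session":
        # A plain spawn starts from whatever the parent chose to re-explain.
        return Session(history=[briefing])


parent = Session(history=["constraints", "code review", "chosen model"])
strategy_a, strategy_b = parent.fork(2)
strategy_a.history.append("implement strategy A")
strategy_b.history.append("implement strategy B")
```

Both children start from the same three messages and then diverge, while the parent's history stays untouched.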

No internet? No problem. I ran a research preview of Gemma 4 E4B on my Galaxy 26 Ultra entirely locally, and so far it's been impressive. It's a 4GB model with two generation speeds. Not perfect but still awesome! I am excited for the future of Edge AI. Google AI cooked.
am.will • 26,502 views • 2 months ago

Codex built me a full Memory Manager overnight using my new "Teams" strategy. Flawless first test. One prompt spun up 70 domain-expert subagents, organized into specialized teams, all working in parallel to one-shot the entire app while I slept. I've built over 200 custom subagent roles for this. Playbook dropping soon. This is a massive unlock for parallelization. I'll be improving this and testing it on much larger projects.
am.will • 31,458 views • 3 months ago

Codex 0.93 is here, and with it, shiny new features! App Connectors have arrived. Connect to an array of cloud apps directly from your terminal. No config files. No setting up MCP servers, or hunting down docs. Just two clicks, and you're off! GitHub, Notion, Google Apps, Microsoft Apps, Vercel, Adobe, Canva, Dropbox, Expedia, Figma, Coursera, HubSpot, Linear, Monday, Instacart, SendGrid, Resend, Stripe, Target, and Peloton? Plus more. OpenAI is going for a unified experience from cloud to terminal, and these connectors unlock a bunch of capabilities for your terminal agent. I believe that with this direction, they are truly leaning into a complete AI assistant, not just for coding. While these capabilities are not exactly new, they are much easier to install and use, greatly improving the UX for everyone. I, for one, appreciate it. To get started, update Codex, and type /experimental. Turn on "Apps" and restart your session. If you want more videos like this, please drop a comment!
am.will • 34,023 views • 4 months ago

I just built an entire app in 7 minutes using Codex Spark. EveryMCP: a universal MCP server installer that configures 15+ coding agents from a single CLI command. 12 Codex Spark subagents. All running in parallel. At the same time. The project itself isn't complicated, but just imagine what this looks like when you scale the complexity. My setup: • Codex 5.3 High to plan • Codex 5.3 Medium to orchestrate • 12 Codex Spark agents (xhigh) to implement in one shot Spark isn't competing with the big models. It's a different tool entirely. You orchestrate it. You run it in swarms. And it moves at a speed that wasn't possible before, even for the top 1% of AI coders. More complex workflows coming soon. Comment below if you want me to tell you how I did it.
am.will • 16,916 views • 3 months ago
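The "12 agents in parallel" pattern above is, at its core, a fan-out over independent tasks. The sketch below shows that shape with Python's standard thread pool. It does not call Codex at all: `implement` is a hypothetical stand-in for handing a task to a subagent, and the default of 12 workers simply mirrors the number mentioned in the post.

```python
from concurrent.futures import ThreadPoolExecutor


def implement(task: str) -> str:
    """Hypothetical stand-in for handing one task to a subagent."""
    return f"done: {task}"


def run_swarm(tasks: list[str], max_workers: int = 12) -> list[str]:
    # Fan the tasks out to a pool of workers; map() returns results
    # in the same order as the input tasks.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(implement, tasks))
```

In a real workflow, the orchestrator would be the caller of something like `run_swarm`, and the planning step would be whatever produces the task list.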