am.will's banner

am.will

@LLMJunky • 27,446 subscribers

StarSwap // Lynk // DevX Director of n number of agents Also not a car. https://t.co/FAnFG220xi https://t.co/9mauIYOCx8

Shorts

The human brain is truly a marvel of nature. If you horribly reductive, and boiled it down to a language model, you'd be looking at roughly 100 trillon parameters running as a sparse MoE architecture Only about 1-5% of neurons fire at any given moment, meaning the brain "activates" maybe 1-5 trillion parameters per inference step. For context, the largest AI models we've built probably top out around 5 trillion parameters. The brain is roughly 100x larger. Even its active params at any given moment are larger than almost every model in existence today. Here's what melts my brain (pun intnended) though Your brain does all of this on about 20 watts of power, less than a dim light bulb. Training a frontier AI model consumes enough electricity to power small cities for months. Running inference across data centers pulls megawatts. Your brain runs 24/7 for 80+ years on the equivalent of a phone charger. We haven't come close to matching the brain's scale. And we're not even in the same universe when it comes to efficiency. Evolution spent 500 million yrs optimizing the most energy-efficient intelligence architecture ever known. we're trying to brute force our way there with compute and electricity. Nature is still the best engineer in the room.

The human brain is truly a marvel of nature. If you horribly reductive, and boiled it down to a language model, you'd be looking at roughly 100 trillon parameters running as a sparse MoE architecture Only about 1-5% of neurons fire at any given moment, meaning the brain "activates" maybe 1-5 trillion parameters per inference step. For context, the largest AI models we've built probably top out around 5 trillion parameters. The brain is roughly 100x larger. Even its active params at any given moment are larger than almost every model in existence today. Here's what melts my brain (pun intnended) though Your brain does all of this on about 20 watts of power, less than a dim light bulb. Training a frontier AI model consumes enough electricity to power small cities for months. Running inference across data centers pulls megawatts. Your brain runs 24/7 for 80+ years on the equivalent of a phone charger. We haven't come close to matching the brain's scale. And we're not even in the same universe when it comes to efficiency. Evolution spent 500 million yrs optimizing the most energy-efficient intelligence architecture ever known. we're trying to brute force our way there with compute and electricity. Nature is still the best engineer in the room.

130,733 görüntüleme

Codex update 0.105.0 is out! Despite the fairly pedestrian changelog, this one's a doosie. It's a laundry list of quality of life improvements across the board. - Wispr Voice dictation (hold space to talk) - Theme picker - Codex can prevent sleep on Linux & Windows (I just know there's a joke in there) - Customize Plan Mode reasoning - Many other fixes/updates There's also a complete overhaul to subagents: - New names for readability - Visual display overhaul (way cleaner) - Allow for multi-layered subagent depth (max_depth) - Custom multi-agent role definitions (custom subagents) - /agents now shows both agent names, agent roles, and "dead agents" for auditibility This is the largest single update of Codex I've ever seen! Absolutely massive if you love to use multi-agents. To turn on Voice Transcription, enable: [features] voice_transcription = true Does not work on Linux yet. Well done OpenAI Developers 👏

Codex update 0.105.0 is out! Despite the fairly pedestrian changelog, this one's a doosie. It's a laundry list of quality of life improvements across the board. - Wispr Voice dictation (hold space to talk) - Theme picker - Codex can prevent sleep on Linux & Windows (I just know there's a joke in there) - Customize Plan Mode reasoning - Many other fixes/updates There's also a complete overhaul to subagents: - New names for readability - Visual display overhaul (way cleaner) - Allow for multi-layered subagent depth (max_depth) - Custom multi-agent role definitions (custom subagents) - /agents now shows both agent names, agent roles, and "dead agents" for auditibility This is the largest single update of Codex I've ever seen! Absolutely massive if you love to use multi-agents. To turn on Voice Transcription, enable: [features] voice_transcription = true Does not work on Linux yet. Well done OpenAI Developers 👏

115,392 görüntüleme

Codex CLI Update: Let there be Search Whatup nerds, back so soon looking or yet ANOTHER update?! I got you. Update 0.121.0 is here! > You can now search through previous user prompts with CTRL+R. Just trigger search and enter your search string, you can easily arrow through all matches. See video below! > 🥔 Support for Spud! Is not here yet. Sorry. Maybe tomorrow. 🫢 > v0.121.0 adds custom marketplace installs in Codex: you can run codex marketplace add to register marketplaces from GitHub shorthand, git URLs, or local directories. Codex validates the marketplace layout and stores it in your user config so it shows up consistently in plugin discovery. > Improved memory features, including a new /memories TUI menu with use/generate toggles, a reset-all-memories action, app-server support for setting thread memory mode and clearing memories, and cleanup of stale memory-extension resources. Note: Not working on Linux for me. /memories command unavailable. > Codex MCP got further upgrades with direct app tool calls, cleaner namespacing, and safe optional parallel execution for faster workflows > Codex realtime got better controls (text/audio + clear “done” signals), easier history syncing, and safer file handling. > Hardened devcontainer setup plus smarter macOS socket allowlists for safer local runtime access. > Dozens of other bug fixes, see repo below. Toodles! ✌️

Codex CLI Update: Let there be Search Whatup nerds, back so soon looking or yet ANOTHER update?! I got you. Update 0.121.0 is here! > You can now search through previous user prompts with CTRL+R. Just trigger search and enter your search string, you can easily arrow through all matches. See video below! > 🥔 Support for Spud! Is not here yet. Sorry. Maybe tomorrow. 🫢 > v0.121.0 adds custom marketplace installs in Codex: you can run codex marketplace add to register marketplaces from GitHub shorthand, git URLs, or local directories. Codex validates the marketplace layout and stores it in your user config so it shows up consistently in plugin discovery. > Improved memory features, including a new /memories TUI menu with use/generate toggles, a reset-all-memories action, app-server support for setting thread memory mode and clearing memories, and cleanup of stale memory-extension resources. Note: Not working on Linux for me. /memories command unavailable. > Codex MCP got further upgrades with direct app tool calls, cleaner namespacing, and safe optional parallel execution for faster workflows > Codex realtime got better controls (text/audio + clear “done” signals), easier history syncing, and safer file handling. > Hardened devcontainer setup plus smarter macOS socket allowlists for safer local runtime access. > Dozens of other bug fixes, see repo below. Toodles! ✌️

21,842 görüntüleme

Codex App for Linux v26.415.20818 (latest) Available on my Github Link in comments Computer Use Plugin is not available, sorry!

Codex App for Linux v26.415.20818 (latest) Available on my Github Link in comments Computer Use Plugin is not available, sorry!

19,558 görüntüleme

Rejoice. Just following up with another quick W in Codex You can now configure your reasoning level in plan mode separately directly from your config file. This is huge for Plus users who want to plan with high or xhigh reasoning levels, and then switch over to medium reasoning for implementation, without needing the slash command. This is a great way to save your usage limits, and now it happens automatically. Even if you're on Pro, this should make you very happy. Prior to this, it was switching you automatically to medium every time you planned, which was pretty annoying. Place this near the top of your config file: plan_mode_reasoning = "high" (or xhigh) 0.150.0 is a massive quality of life update. They're clearly listening. This time I am shouting out Charlie. 🙏

Rejoice. Just following up with another quick W in Codex You can now configure your reasoning level in plan mode separately directly from your config file. This is huge for Plus users who want to plan with high or xhigh reasoning levels, and then switch over to medium reasoning for implementation, without needing the slash command. This is a great way to save your usage limits, and now it happens automatically. Even if you're on Pro, this should make you very happy. Prior to this, it was switching you automatically to medium every time you planned, which was pretty annoying. Place this near the top of your config file: plan_mode_reasoning = "high" (or xhigh) 0.150.0 is a massive quality of life update. They're clearly listening. This time I am shouting out Charlie. 🙏

25,887 görüntüleme

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

YEAH.... Fable is a very good model. I'm kinda mind blown right now. Fable one shot this Rocket League clone and it is so good that I'm honestly in shock. Psychosis level 1000 rn We are entering a new era of AI coding.

YEAH.... Fable is a very good model. I'm kinda mind blown right now. Fable one shot this Rocket League clone and it is so good that I'm honestly in shock. Psychosis level 1000 rn We are entering a new era of AI coding.

489,198 görüntüleme • 17 gün önce

Can confirm Bonsai 1.7B, works great locally on my Galaxy S26U with Lynk! Very, very fast even on CPU.

Can confirm Bonsai 1.7B, works great locally on my Galaxy S26U with Lynk! Very, very fast even on CPU.

94,733 görüntüleme • 4 gün önce

I tried this so you don't have to. I know this is going to absolutely shock you but no this does not match the performance of Mythos. A few early thoughts: 1. The limits are pretty bad. I used 100% of my 5-hour usage in less than 1 prompt. 2. I specifically gave it a threejs task because it is an area that SOTA models have made big strides in, that other models just are not great at. I asked it to build a replica of Rocket League. I'll put the prompt in the comments. The game was pretty bad and notably worse than GPT 5.5. Even after multiple fixes, it took 7-8 back and forth with Codex just to get it an almost playable condition. Prior to these fixes, the game was not playable. Maybe it's really strong in other disciplines. I'd love to test that but I hit my limit in 1 prompt lol. GPT 5.5 by contrast did a pretty good job and required no follow ups. Fable would have absolutely nailed this as well. But yeah, early impressions...not great. But I hope I'm wrong. More testing tomorrow.

I tried this so you don't have to. I know this is going to absolutely shock you but no this does not match the performance of Mythos. A few early thoughts: 1. The limits are pretty bad. I used 100% of my 5-hour usage in less than 1 prompt. 2. I specifically gave it a threejs task because it is an area that SOTA models have made big strides in, that other models just are not great at. I asked it to build a replica of Rocket League. I'll put the prompt in the comments. The game was pretty bad and notably worse than GPT 5.5. Even after multiple fixes, it took 7-8 back and forth with Codex just to get it an almost playable condition. Prior to these fixes, the game was not playable. Maybe it's really strong in other disciplines. I'd love to test that but I hit my limit in 1 prompt lol. GPT 5.5 by contrast did a pretty good job and required no follow ups. Fable would have absolutely nailed this as well. But yeah, early impressions...not great. But I hope I'm wrong. More testing tomorrow.

585,915 görüntüleme • 27 gün önce

I'll bet you didnt know you could do this with Claude Code

I'll bet you didnt know you could do this with Claude Code

294,701 görüntüleme • 22 gün önce

Look ma new Codex Updates! 0.119.0 and 0.120.0 are here. And with it, a HUGE number of quality of life updates and bug fixes! > Hooks now render in a dedicated live area above the composer. They only persist when they have output, so your terminal stays clean. If you're running PreToolUse or PostToolUse hooks, this is a huge readability win. > Hooks are now available again on Windows > CTRL+O copies the last agent output. Small but clutch when you're pulling a code block into another file or chat. > New statusline option: context usage as a graphical bar instead of a percentage. Easier to glance at mid-session when you're trying to gauge how much runway you have left. > Zellij support is here with no scrollback bugs. If you've been stuck on tmux just because Codex was broken in Zellij, you're free now (shout out Felipe Coury 🦀) > Memory extensions just landed. The consolidation agent can now discover plugin folders under memories_extensions/ and read their instructions.md to learn how to interpret new memory sources. Drop a folder in, give it guidance, and the agent picks it up automatically during summarization. No core code changes needed. This is the first real extension point for Codex's memory system, and it opens the door for third-party memory plugins. > Did you know, you can /rename a thread? But what's really cool about that is, after you rename it, you can resume it with the same name, no more UUIDs. codex resume mynewapp or directly from the TUI: /resume mynewapp > Multi agents v2 got an update to tool descriptions More reliable multi agent environments and inter agent communication > You can now enable TUI notifications whether Codex is in focus or not. Modify this in your config: [tui] notification_condition = "always" > MAJOR overhaul to Codex MCP functionality: 1. Codex Tool Search now works with custom MCP servers, so tools can be searched and deferred instead of all being exposed up front. 2. Custom MCP servers can now trigger elicitations, meaning they can stop and ask for user approval or input mid-flow. 3. MCP tool results now preserve richer metadata, which improves app/UI handoff behavior. 4. Codex can now read MCP resources directly, letting apps return resource URIs that the client can actually open. 5. File params for Codex Apps are smoother: local file paths can be uploaded and remapped automatically. 6. Plugin cache refresh and fallback sync behavior are more reliable, especially for custom and curated plugins. > Composer and chat behavior smoother overall, resize bugs remain though. > Realtime v2 got several significant improvements as well. > You're still reading? What a legend. 🫶 npm i -g @openai/codex to update

Look ma new Codex Updates! 0.119.0 and 0.120.0 are here. And with it, a HUGE number of quality of life updates and bug fixes! > Hooks now render in a dedicated live area above the composer. They only persist when they have output, so your terminal stays clean. If you're running PreToolUse or PostToolUse hooks, this is a huge readability win. > Hooks are now available again on Windows > CTRL+O copies the last agent output. Small but clutch when you're pulling a code block into another file or chat. > New statusline option: context usage as a graphical bar instead of a percentage. Easier to glance at mid-session when you're trying to gauge how much runway you have left. > Zellij support is here with no scrollback bugs. If you've been stuck on tmux just because Codex was broken in Zellij, you're free now (shout out Felipe Coury 🦀) > Memory extensions just landed. The consolidation agent can now discover plugin folders under memories_extensions/ and read their instructions.md to learn how to interpret new memory sources. Drop a folder in, give it guidance, and the agent picks it up automatically during summarization. No core code changes needed. This is the first real extension point for Codex's memory system, and it opens the door for third-party memory plugins. > Did you know, you can /rename a thread? But what's really cool about that is, after you rename it, you can resume it with the same name, no more UUIDs. codex resume mynewapp or directly from the TUI: /resume mynewapp > Multi agents v2 got an update to tool descriptions More reliable multi agent environments and inter agent communication > You can now enable TUI notifications whether Codex is in focus or not. Modify this in your config: [tui] notification_condition = "always" > MAJOR overhaul to Codex MCP functionality: 1. Codex Tool Search now works with custom MCP servers, so tools can be searched and deferred instead of all being exposed up front. 2. Custom MCP servers can now trigger elicitations, meaning they can stop and ask for user approval or input mid-flow. 3. MCP tool results now preserve richer metadata, which improves app/UI handoff behavior. 4. Codex can now read MCP resources directly, letting apps return resource URIs that the client can actually open. 5. File params for Codex Apps are smoother: local file paths can be uploaded and remapped automatically. 6. Plugin cache refresh and fallback sync behavior are more reliable, especially for custom and curated plugins. > Composer and chat behavior smoother overall, resize bugs remain though. > Realtime v2 got several significant improvements as well. > You're still reading? What a legend. 🫶 npm i -g @openai/codex to update

742,134 görüntüleme • 3 ay önce

This new Siri update is absolutely maddening. It's a tale as old as time, Siri being a complete joke. Google has historically just been far better at basically every stage of their lifecycle in terms of voice to text, and phone control. Google Assistant was completely dominant. So you would think considering that Google makes their own frontier models, and the fact they are one of the industry leaders in small models capable of running on Edge devices, that they would easily dominate the mobile space, right? I mean surely that would be true considering Apple has more or less been on the sidelines in the AI race, right? I seriously don't understand how this happened. On device Gemini, even with "Personal Intelligence" really has very little additional utility over the previous deterministic STT "Google Assistant" we've had for the last decade. What are we doing?

This new Siri update is absolutely maddening. It's a tale as old as time, Siri being a complete joke. Google has historically just been far better at basically every stage of their lifecycle in terms of voice to text, and phone control. Google Assistant was completely dominant. So you would think considering that Google makes their own frontier models, and the fact they are one of the industry leaders in small models capable of running on Edge devices, that they would easily dominate the mobile space, right? I mean surely that would be true considering Apple has more or less been on the sidelines in the AI race, right? I seriously don't understand how this happened. On device Gemini, even with "Personal Intelligence" really has very little additional utility over the previous deterministic STT "Google Assistant" we've had for the last decade. What are we doing?

205,878 görüntüleme • 1 ay önce

The Codex app server was such a brilliant stroke of foresight that really doesn't get enough love Not only are you allowed to use your chatgpt account with any harness, but you can build your own apps directly on top of theirs. They just make building on and with codex such a great experience To demonstrate this utility, I want to highlight the kitty litter app, made by SIGKITTEN. Instead of having to build the entire harness, and all the infrastructure, he's plugged into the app server for a unified experience between mobile and dev machine. When I create a session on my computer, it's automatically available on my phone. All of the chats you see in this video automatically populated when we connected to the app server. All my skills. My agents. My sessions. My folders. My prompts. They're all ready to use - automatically. Because they're exposed by the app server, along with many other endpoints. It's a great ux/dx that really deserves some love. It's almost like they want you to build on top of their products ;) Btw Litter is great 👍

The Codex app server was such a brilliant stroke of foresight that really doesn't get enough love Not only are you allowed to use your chatgpt account with any harness, but you can build your own apps directly on top of theirs. They just make building on and with codex such a great experience To demonstrate this utility, I want to highlight the kitty litter app, made by SIGKITTEN. Instead of having to build the entire harness, and all the infrastructure, he's plugged into the app server for a unified experience between mobile and dev machine. When I create a session on my computer, it's automatically available on my phone. All of the chats you see in this video automatically populated when we connected to the app server. All my skills. My agents. My sessions. My folders. My prompts. They're all ready to use - automatically. Because they're exposed by the app server, along with many other endpoints. It's a great ux/dx that really deserves some love. It's almost like they want you to build on top of their products ;) Btw Litter is great 👍

265,669 görüntüleme • 3 ay önce

Introducing Lynk, a brand new way to interact with your favorite harnesses on the go. Compatible with OpenClaw, Hermes, Codex, and local edge models. Fully featured client allowing you to easily and quickly switch between your favorite agents. Lynk has been my absolute favorite way to kick off tasks on the go, completely replacing Telegram. Only available as a beta on Android at present, but iOS version is in development. Works via local network or Tailscale. Features: - Create and continue threads with your favorite harness - OpenClaw, Hermes, Codex, and local models - Android Phone control - Realtime voice agent - Speech-to-text transcription - Codex Pets + live notifications - Draw over screen quick access chat overlay - Open Source software Join the beta in the comments.

97,949 görüntüleme • 1 ay önce

Nice. Cursor just dropped their new "Glass" alpha, and they're leaning heavily into the simplified coding GUI trend that's been blowing up lately. First impressions are really positive. And just look at how insanely fast Composer 2 is. First impressions? Drop yours 👇

Nice. Cursor just dropped their new "Glass" alpha, and they're leaning heavily into the simplified coding GUI trend that's been blowing up lately. First impressions are really positive. And just look at how insanely fast Composer 2 is. First impressions? Drop yours 👇

217,130 görüntüleme • 4 ay önce

This is so cool. In the next Codex update, multi agents will get a massive flexibility upgrade. "Hey Codex, when you implement this plan, I want you to delegate all the lower complexity tasks to GPT 5.3 Spark subagents" Instead of needing to create 100 different custom agent roles for different situations, you can just prompt your agent to spawn whatever model or reasoning level you want. With only natural language. No config files. No pre-defined roles. Just tell the orchestrator what to use and it listens.

This is so cool. In the next Codex update, multi agents will get a massive flexibility upgrade. "Hey Codex, when you implement this plan, I want you to delegate all the lower complexity tasks to GPT 5.3 Spark subagents" Instead of needing to create 100 different custom agent roles for different situations, you can just prompt your agent to spawn whatever model or reasoning level you want. With only natural language. No config files. No pre-defined roles. Just tell the orchestrator what to use and it listens.

128,125 görüntüleme • 4 ay önce

I got so tired of everyone raving about how great cmux is. Panes this. Browser that. EXHAUSTING. And that's because I'm on Linux, where we get none of the coolest toys. So...I built it myself. And my God. You were right. It's amazing. Introducing Limux, a a GPU-accelerated terminal workspace manager for Linux, powered by Ghostty's rendering engine, with split panes, tabbed workspaces, and a built-in browser. Think cmux, but native Linux. If you're interested in something like this, be sure to leave a comment and I'll release it. Special thanks to Manaflow and Mitchell Hashimoto for making this possible.

I got so tired of everyone raving about how great cmux is. Panes this. Browser that. EXHAUSTING. And that's because I'm on Linux, where we get none of the coolest toys. So...I built it myself. And my God. You were right. It's amazing. Introducing Limux, a a GPU-accelerated terminal workspace manager for Linux, powered by Ghostty's rendering engine, with split panes, tabbed workspaces, and a built-in browser. Think cmux, but native Linux. If you're interested in something like this, be sure to leave a comment and I'll release it. Special thanks to Manaflow and Mitchell Hashimoto for making this possible.

108,847 görüntüleme • 4 ay önce

He said Codex Computer Use on macOS isn't special. Omarchy can do the same thing. Challenge accepted. So I had Codex draw him a painting using only its mouse, no other tools. Holler at me when you can do this.

He said Codex Computer Use on macOS isn't special. Omarchy can do the same thing. Challenge accepted. So I had Codex draw him a painting using only its mouse, no other tools. Holler at me when you can do this.

64,077 görüntüleme • 2 ay önce

As it turns out, GPT 5.5 is also pretty good at building in CAD! This is insanely impressive compared to just a few months ago. Required a handful of extra prompts though, $45 in cost, roughly. I don't think we're that far away from models being able to build almost anything.

As it turns out, GPT 5.5 is also pretty good at building in CAD! This is insanely impressive compared to just a few months ago. Required a handful of extra prompts though, $45 in cost, roughly. I don't think we're that far away from models being able to build almost anything.

32,617 görüntüleme • 1 ay önce

WOW! I'm so excited about this. OpenAI Developers said Codex was good at Computer Use, but I wasn't prepared for this. For the last two weeks I've been working on a Computer Use skill to work with Linux. And while I had some success, it was a pretty frustrating experience. That is...until the breakthrough. Using accessibility tools, Codex can now control my entire computer, not just the browser. There are limits to this, of course, but what a time to be alive. This Computer Use skill will unlock and entirely new set of automations, all powered by Codex. Demonstration below 👇

WOW! I'm so excited about this. OpenAI Developers said Codex was good at Computer Use, but I wasn't prepared for this. For the last two weeks I've been working on a Computer Use skill to work with Linux. And while I had some success, it was a pretty frustrating experience. That is...until the breakthrough. Using accessibility tools, Codex can now control my entire computer, not just the browser. There are limits to this, of course, but what a time to be alive. This Computer Use skill will unlock and entirely new set of automations, all powered by Codex. Demonstration below 👇

79,647 görüntüleme • 3 ay önce

I have fallen in love with Ghostty Terminal. Look at this subtle, but incredibly useful opacity trick you can use to show "focus" when working in multiple panes. Shout out Daniel San who showed me this. What a legend. Ghostty config file: unfocused-split-opacity = 0.55

I have fallen in love with Ghostty Terminal. Look at this subtle, but incredibly useful opacity trick you can use to show "focus" when working in multiple panes. Shout out Daniel San who showed me this. What a legend. Ghostty config file: unfocused-split-opacity = 0.55

86,158 görüntüleme • 4 ay önce

Codex team is back in the kitchen with a really nice quality of life upgrade for subagents. With the advent of custom roles, they have also upgraded the TUI experience in two really meaningful ways. > All agents now get a name for better readability > Additionally, the agent role is declared. > Subagent name, role, and status are now color coded. > Subagent rendering was also optimized for readability > /agents slash command shows all agents, even 2+ layers deep. And here's the biggest and most important change. Subagent injection. Before, sometimes the orchestration agent would continue work and lose track of the work of a subagent. Now, when a subagent is blocked or completed, it injects a message back up the chain to ensure that the parent sees the message. This is a really big improvement overall, and leads to much more reliable inter-agent communication, reliability, and DX. In this example, I used the parent agent to spin up a worker agent, which then spawned two more "Spark" agents a second layer deep. I was able to easily tell them apart, switch between the threads, and see exactly what they were prompted. All of this will be available in update 0.105.0 I don't know who JIF is at OpenAI, but they are truly a legend.

Codex team is back in the kitchen with a really nice quality of life upgrade for subagents. With the advent of custom roles, they have also upgraded the TUI experience in two really meaningful ways. > All agents now get a name for better readability > Additionally, the agent role is declared. > Subagent name, role, and status are now color coded. > Subagent rendering was also optimized for readability > /agents slash command shows all agents, even 2+ layers deep. And here's the biggest and most important change. Subagent injection. Before, sometimes the orchestration agent would continue work and lose track of the work of a subagent. Now, when a subagent is blocked or completed, it injects a message back up the chain to ensure that the parent sees the message. This is a really big improvement overall, and leads to much more reliable inter-agent communication, reliability, and DX. In this example, I used the parent agent to spin up a worker agent, which then spawned two more "Spark" agents a second layer deep. I was able to easily tell them apart, switch between the threads, and see exactly what they were prompted. All of this will be available in update 0.105.0 I don't know who JIF is at OpenAI, but they are truly a legend.

78,267 görüntüleme • 4 ay önce

Calling on Codex Fans! I need your help 🫵 I'm introducing Codex Marketplace, a community collection of plugins, skills, and hooks curated by YOU. Yes, you! Help me make this the best resource for everyone who wants to build incredible software with Codex. Getting started is simple, submit your artifacts via your Github repository. If you own the repo, it'll be auto approved, else they'll be reviewed. Upvote your favorite artifacts. Add/remove what you like with a simple npx command. Best part? Everything lives on Github, so you always get the latest version. 👉

Calling on Codex Fans! I need your help 🫵 I'm introducing Codex Marketplace, a community collection of plugins, skills, and hooks curated by YOU. Yes, you! Help me make this the best resource for everyone who wants to build incredible software with Codex. Getting started is simple, submit your artifacts via your Github repository. If you own the repo, it'll be auto approved, else they'll be reviewed. Upvote your favorite artifacts. Add/remove what you like with a simple npx command. Best part? Everything lives on Github, so you always get the latest version. 👉

41,342 görüntüleme • 2 ay önce

This is what 30 hours of work, a handful of AI bux, and and a sore back looks like: 2x RTX 6000 Pro 320GB RAM (192V+128GB) 48,128 CUDA Cores 1.8TB/s Memory Bandwidth 9950X (16c/32t) Corsair DDR5 6400 ASUS ProArt X870E 4TB Crucial T705 5x4 SSD 4TB Crucial P3 4x4 SSD 4TB Crucial T500 4x4 SSD Asrock Platinum 1600W Corsair XD5 Reservoir EKWB Waterblock 2x 360mm EKWB Radiators Way too much debt I'm so stupid But it's glorious I bought everything used or repurposed old equipment except for the water block, 1 GPU, and 3 fans. Even though I'm not thrilled about spending the cash, I only have about $10.5K into it (not counting parts I paid for years ago). Safe to say I did pretty damn good all things considered. I'd like to thank Central Computers for helping me get this project off the ground. Now that I have this together, my aim is to support the OSS community. If you're interested in running local models, consider following me on this journey. Next chapter loading....

This is what 30 hours of work, a handful of AI bux, and and a sore back looks like: 2x RTX 6000 Pro 320GB RAM (192V+128GB) 48,128 CUDA Cores 1.8TB/s Memory Bandwidth 9950X (16c/32t) Corsair DDR5 6400 ASUS ProArt X870E 4TB Crucial T705 5x4 SSD 4TB Crucial P3 4x4 SSD 4TB Crucial T500 4x4 SSD Asrock Platinum 1600W Corsair XD5 Reservoir EKWB Waterblock 2x 360mm EKWB Radiators Way too much debt I'm so stupid But it's glorious I bought everything used or repurposed old equipment except for the water block, 1 GPU, and 3 fans. Even though I'm not thrilled about spending the cash, I only have about $10.5K into it (not counting parts I paid for years ago). Safe to say I did pretty damn good all things considered. I'd like to thank Central Computers for helping me get this project off the ground. Now that I have this together, my aim is to support the OSS community. If you're interested in running local models, consider following me on this journey. Next chapter loading....

48,479 görüntüleme • 3 ay önce

Your Plan SUCKS! 👀 Inspired by the legend Andrej Karpathy himself, I created a new skill: LLM Council. Create better plans, by committee. Supports: > Codex CLI > Gemini CLI > Claude Code > OpenCode Call the skill with your feature that you'd like to build. It will ask you some clarifying question, and then launch up to four parallel planning agents to create a detailed plan. Once the plans are in, all plans are anonymized and the "The Judge" will critique and choose the best plan *OR* the best parts from all of them. Finally, it will output a final-plan, which you can review and refine. I even created a nice UI for you to review and refine your plans. This Skill has been tested on Linux only, but should work on other platforms. Please report any bugs! Links in the comments.

Your Plan SUCKS! 👀 Inspired by the legend Andrej Karpathy himself, I created a new skill: LLM Council. Create better plans, by committee. Supports: > Codex CLI > Gemini CLI > Claude Code > OpenCode Call the skill with your feature that you'd like to build. It will ask you some clarifying question, and then launch up to four parallel planning agents to create a detailed plan. Once the plans are in, all plans are anonymized and the "The Judge" will critique and choose the best plan OR the best parts from all of them. Finally, it will output a final-plan, which you can review and refine. I even created a nice UI for you to review and refine your plans. This Skill has been tested on Linux only, but should work on other platforms. Please report any bugs! Links in the comments.

69,010 görüntüleme • 5 ay önce

With all the buzz around the Codex App, OpenAI Developers quietly snuck out a new CLI update (0.94.0) as well. And boy is it an important update! Codex Plan mode is now officially released to the general audience! I am very excited about this one as it has a really strong prompt that is unlike any other plan mode I've personally used. Codex Plan mode doesn't necessarily just ask you 3 questions up front. It goes, collects context, asks questions, collects more context, asks more questions (sometimes), and then writes an incredibly high quality plan. It is my favorite implementation of plan mode thus far. It also comes with Codex's own version of "AskUserQuestion!" Although, it only works in Plan mode for now. They really need to allow people to use it in Code mode as well, but one win at a time. npm i -g @openai/codex Below is a demo of how it works. Let me know what you think!

With all the buzz around the Codex App, OpenAI Developers quietly snuck out a new CLI update (0.94.0) as well. And boy is it an important update! Codex Plan mode is now officially released to the general audience! I am very excited about this one as it has a really strong prompt that is unlike any other plan mode I've personally used. Codex Plan mode doesn't necessarily just ask you 3 questions up front. It goes, collects context, asks questions, collects more context, asks more questions (sometimes), and then writes an incredibly high quality plan. It is my favorite implementation of plan mode thus far. It also comes with Codex's own version of "AskUserQuestion!" Although, it only works in Plan mode for now. They really need to allow people to use it in Code mode as well, but one win at a time. npm i -g @openai/codex Below is a demo of how it works. Let me know what you think!

57,121 görüntüleme • 5 ay önce