Codex, Claude Code, and Cursor are all great tools. They're also much more different than you think. I did a comparison of the three, but not in the usual way. I went deep on how they differ philosophically.

240,317 views • 10 days ago • via X (Twitter)

Related Videos

I just compared Claude Code vs Codex vs Cursor CLI.

The task was to build a Next.js app with Tailwind 4 and shadcn components to collect customer feedback and showcase it with a widget. I gave all three the same prompt and let them go for 30 minutes to see what they came up with.

Claude Code with Opus 4.1

Even though I told it to set up the app in the existing project folder, it tried to create a directory for it. After I interrupted and told it not to do that, it built a demo form and landing page with no errors. I had to ask it to make the demo interactive so users could submit a testimonial and preview it. The landing page looked AI-generated and was pretty basic, but it worked and it was done in a fraction of the time the others took.

Total tokens used: 33k

Codex with GPT-5

At the end of the 30 minutes I just could not get Codex to produce a working app. It got stuck in a loop of not being able to set up Tailwind 4 and, despite many, MANY attempts, I ended up with a "failed to compile" error.

Total tokens used: 102k

Cursor Agent with GPT-5

This was the slowest agent by far, and a couple of times I actually thought it had got stuck in a loop and was close to Ctrl+C'ing to cancel it. The TUI is really nice though, especially how it shows diffs, and it did eventually build a working app (after one or two slight errors that needed fixing). The demo was interactive and it had a very minimal design that looked bare but also a lot less like an "AI generated" app than the Opus 4.1 design. It also wasn't too chatty and just did what it needed to do! Code quality was on a par with Opus 4.1, but it did use 5.5x as many tokens to get there. Still cheaper than Opus on a direct comparison, but not when you factor in a Claude Code Max subscription.

Total tokens: 188k

I'll be able to do a proper comparison and record some videos when I'm back from holiday, but for now Opus is still the more capable model out of the box and Claude Code is the more complete CLI product.

It will be interesting to see how Cursor evolve their CLI with commands and subagents, though, because I think with GPT-5 they have a real shot at providing competition for Claude Code if they can optimise output to get similar quality with fewer tokens.

Jump to 0:40 in the video to see the two apps. Which do you think is which? ;)

Ian Nuttall

194,803 views • 10 months ago
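
The token totals quoted in the comparison above can be put side by side with a quick calculation. This is only a minimal sketch, assuming the rounded totals from the post (33k, 102k, 188k); the dictionary keys and variable names are illustrative and not part of any tool's API.

```python
# Token totals as quoted in the post (rounded to the nearest thousand).
tokens = {
    "Claude Code (Opus 4.1)": 33_000,
    "Codex (GPT-5)": 102_000,
    "Cursor Agent (GPT-5)": 188_000,
}

# Use the Claude Code run as the baseline for comparison.
baseline = tokens["Claude Code (Opus 4.1)"]

# Ratio of each tool's token usage to the baseline run.
ratios = {name: count / baseline for name, count in tokens.items()}

for name, ratio in ratios.items():
    print(f"{name}: {tokens[name]:,} tokens, {ratio:.1f}x the Claude Code total")
```

With these rounded inputs the Cursor Agent run comes out at roughly 5.7x the Claude Code total, in the same range as the 5.5x quoted in the post; the gap is plausibly down to rounding in the reported totals.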

Cursor vs Claude Code (day 3 of 30)

today I re-tested Conductor and I don't want to overhype this but... what the actual fook 😮

this thing is amazing! closest thing to the "perfect workflow" I have seen in a long time, and it's something I could see myself using full-time

it's so minimal and the polar opposite of Cursor but it has all the essentials

→ Claude Code + Codex subscriptions (= unlimited usage)
→ great integration with Claude Code's plan mode
→ multiple tabs with multiple agents at the same time
→ can use multiple models, not just one
→ great file tree and diff viewer
→ best git worktree support I've ever seen

there's honestly so much more, like a dedicated "review" button (which you can pair to your favorite review model, e.g. gpt-5.2-extra-high), buttons to create and merge pull requests, to launch your dev server, ...

this solves literally EVERY fault I can think of when using Claude Code and/or Codex, combining both into one easy UX that not only does everything the tools can do, but actually adds useful stuff on top

just 6 months ago I couldn't have dreamt about any of this

and the wildest part is that IT IS FREE (I'd literally pay for this UX lol)

first time I'm genuinely enjoying myself since starting this challenge

not saying it's a full Cursor replacement yet, it obviously isn't for a lot of people (no browser, no debug mode, no code indexing, etc) ... but this is the closest thing I've seen yet 👀

follow for day 4!

Robin Ebers | AI Coding for Founders

40,431 views • 5 months ago