Video yükleniyor...

Video Yüklenemedi

Bu video yüklenirken bir sorun oluştu. Bu geçici bir ağ sorunundan kaynaklanıyor olabilir veya video kullanılamıyor olabilir.

Ana Sayfaya Dön

Matt Maher tested frontier models in Cursor v. other harnesses. Cursor boosted model performance by 11% on average: Gemini: 52% → 57% GPT-5.4: 82% → 88% Opus: 77% → 93% His benchmark measures how well models implement a 100-feature PRD. Cursor consistently outperformed.

edwin

30,628 subscribers

911,490 görüntüleme • 3 ay önce •via X (Twitter)

Bilim & Teknoloji

Anya Rossi• Live Now

Private livecam show

0 Yorum

Yorum bulunmuyor

Orijinal gönderinin yorumları burada görünecek

Benzer Videolar

GPT 4.5 in Cursor! We've found it surprisingly effective in cases where all other models fail.

GPT 4.5 in Cursor! We've found it surprisingly effective in cases where all other models fail.

Cursor

584,823 görüntüleme • 1 yıl önce

You can now run three frontier models at once and select your orchestrator model directly inside Perplexity Computer. Model Council automatically runs GPT-5.4, Claude Opus 4.6 and Gemini 3.1 Pro simultaneously. Three frontier models. One workflow. Best answer wins.

You can now run three frontier models at once and select your orchestrator model directly inside Perplexity Computer. Model Council automatically runs GPT-5.4, Claude Opus 4.6 and Gemini 3.1 Pro simultaneously. Three frontier models. One workflow. Best answer wins.

Computer

84,295 görüntüleme • 3 ay önce

I’ve been using GLM 5.2 in Cursor with MagicPath and really like it so far. Cursor Settings → Models: Add your Fireworks key under “OpenAI API Key” and enable it Base URL: Model: accounts/fireworks/models/glm-5p2 Restart Cursor. Done.

I’ve been using GLM 5.2 in Cursor with MagicPath and really like it so far. Cursor Settings → Models: Add your Fireworks key under “OpenAI API Key” and enable it Base URL: Model: accounts/fireworks/models/glm-5p2 Restart Cursor. Done.

Pietro Schirano

36,538 görüntüleme • 7 gün önce

You can now delegate tasks to Cursor directly from Notion. It's built on the Cursor SDK, so every cloud agent runs on the same models, harness, and runtime that power Cursor. @Cursor on any spec or assign it a task to open a PR your whole team can review.

You can now delegate tasks to Cursor directly from Notion. It's built on the Cursor SDK, so every cloud agent runs on the same models, harness, and runtime that power Cursor. @Cursor on any spec or assign it a task to open a PR your whole team can review.

Cursor

296,701 görüntüleme • 4 gün önce

Try out GPT-V in Cursor! It's pretty good for building/modifying components!

Try out GPT-V in Cursor! It's pretty good for building/modifying components!

Aman Sanger

195,264 görüntüleme • 2 yıl önce

Setup MCP on Cursor with Google Docs in less than 2 mins!! I used Cursor to to create PRDs in Google Docs Here's how you can do it too: - Go to the MCP directory - Search for Google Docs and grab your sse url - Paste the url and set up MCP in Cursor - Use Cursor Agent to authenticate and create PRD Check out the 100+ tools available at

Setup MCP on Cursor with Google Docs in less than 2 mins!! I used Cursor to to create PRDs in Google Docs Here's how you can do it too: - Go to the MCP directory - Search for Google Docs and grab your sse url - Paste the url and set up MCP in Cursor - Use Cursor Agent to authenticate and create PRD Check out the 100+ tools available at

Soham

151,139 görüntüleme • 1 yıl önce

AI has its PhD and now it’s on the job market. Introducing the AI Productivity Index (APEX), a benchmark that measures how well we’ve automated the most valuable industries in the world. Most benchmarks study abstract capabilities. APEX evaluates model performance on real deliverables across law, finance, consulting, and medicine. The models most capable of doing work today, according to APEX: 🥇 GPT 5 🥈 Grok 4 🥉 Gemini 2.5 Flash Other findings: - GPT 5 demonstrates the strongest performance across all 4 domains - Some cheaper models outperform more expensive models from the same provider (e.g. Gemini 2.5 Flash vs. Gemini 2.5 Pro) - The best open source model, Qwen (7th), performs only 2% behind Grok 4 overall

AI has its PhD and now it’s on the job market. Introducing the AI Productivity Index (APEX), a benchmark that measures how well we’ve automated the most valuable industries in the world. Most benchmarks study abstract capabilities. APEX evaluates model performance on real deliverables across law, finance, consulting, and medicine. The models most capable of doing work today, according to APEX: 🥇 GPT 5 🥈 Grok 4 🥉 Gemini 2.5 Flash Other findings: - GPT 5 demonstrates the strongest performance across all 4 domains - Some cheaper models outperform more expensive models from the same provider (e.g. Gemini 2.5 Flash vs. Gemini 2.5 Pro) - The best open source model, Qwen (7th), performs only 2% behind Grok 4 overall

Brendan (can/do)

451,298 görüntüleme • 8 ay önce

Conductor now supports Cursor with Composer 2.5 With native support for 3 harnesses, we now have the holy trinity (Claude, Codex, Cursor) and have fulfilled the prophecy of the 4C's Conductor, Claude, Codex, Cursor

Conductor now supports Cursor with Composer 2.5 With native support for 3 harnesses, we now have the holy trinity (Claude, Codex, Cursor) and have fulfilled the prophecy of the 4C's Conductor, Claude, Codex, Cursor

matt palmer

27,803 görüntüleme • 19 gün önce

The strongest models are gated and access is granted only to a select few. Hermes Agent now exposes MoA presets as virtual models, giving you capabilities beyond the publicly available frontier: 8% higher than Opus 4.8 and 11% higher than GPT 5.5 on our upcoming benchmark.

The strongest models are gated and access is granted only to a select few. Hermes Agent now exposes MoA presets as virtual models, giving you capabilities beyond the publicly available frontier: 8% higher than Opus 4.8 and 11% higher than GPT 5.5 on our upcoming benchmark.

Nous Research

1,630,605 görüntüleme • 2 gün önce

How fast is serv-swift? Runs roughly 9× faster than frontier models like GPT 5.4 Here's the proof 👇

How fast is serv-swift? Runs roughly 9× faster than frontier models like GPT 5.4 Here's the proof 👇

OpenServ

26,102 görüntüleme • 2 ay önce

Opus 4.5 in-a-loop builds Spiking Neural Net simulations in Cursor that is all :)

Opus 4.5 in-a-loop builds Spiking Neural Net simulations in Cursor that is all :)

echo.hive

42,700 görüntüleme • 5 ay önce

baby cursor already useful: came up with new ideas for stop + queuing made with Cursor + Gemini 2.5 Pro MAX

baby cursor already useful: came up with new ideas for stop + queuing made with Cursor + Gemini 2.5 Pro MAX

Ryo Lu

45,868 görüntüleme • 1 yıl önce

GPT-5.4 is special LisanBench: GPT-5.4 vs Opus 4.6 vs Gemini 3.1 Pro

GPT-5.4 is special LisanBench: GPT-5.4 vs Opus 4.6 vs Gemini 3.1 Pro

Lisan al Gaib

265,203 görüntüleme • 3 ay önce

I tested Gemini Pro 2.5 as my main coding model for 40+ hours. Here're 2 documents that are working brilliantly well with Gemini. "App flow document + App flowchart." This made my Cursor workflow 10x better. Here's why it is working: ↓

I tested Gemini Pro 2.5 as my main coding model for 40+ hours. Here're 2 documents that are working brilliantly well with Gemini. "App flow document + App flowchart." This made my Cursor workflow 10x better. Here's why it is working: ↓

CJ Zafir

85,260 görüntüleme • 1 yıl önce

Cursor AI MCP Integration: 🖱️ Cursor ID Reads Console Logs/Errors Auto 📸 Can see your Website 🔍 Analyse Selected Browser Elements 📝 Debug 3x Fast Cursor Here is a step by step Guide on how to do it 👇

Cursor AI MCP Integration: 🖱️ Cursor ID Reads Console Logs/Errors Auto 📸 Can see your Website 🔍 Analyse Selected Browser Elements 📝 Debug 3x Fast Cursor Here is a step by step Guide on how to do it 👇

Mervin Praison

41,565 görüntüleme • 1 yıl önce

I tested Github Copilot's latest "Cursor killer" features, and the results were... not as I expected Here's my in-depth review of Copilot vs Cursor:

I tested Github Copilot's latest "Cursor killer" features, and the results were... not as I expected Here's my in-depth review of Copilot vs Cursor:

Steve (Builder.io)

41,291 görüntüleme • 1 yıl önce

I was using Opus 4.6 in Cursor and am already out of credits

I was using Opus 4.6 in Cursor and am already out of credits

Karan

25,283 görüntüleme • 4 ay önce

Create datasets, run evals, and even train models directly in Cursor with the Hugging Face plugin. Here's Ben Burtenshaw to show you how:

Create datasets, run evals, and even train models directly in Cursor with the Hugging Face plugin. Here's Ben Burtenshaw to show you how:

edwin

17,119 görüntüleme • 3 ay önce

✦ | Cursor Keychain Commissions | ✦ Live2D Cursor models are now available! Accepting: Two Slots Please read all details in the description! If you would like to join the waitlist, DM me or comment! #Live2dCommissions | #Live2DShowcase | #Live2D

✦ | Cursor Keychain Commissions | ✦ Live2D Cursor models are now available! Accepting: Two Slots Please read all details in the description! If you would like to join the waitlist, DM me or comment! #Live2dCommissions | #Live2DShowcase | #Live2D

Venus VT🪐🌟 | VGen Live2D

53,871 görüntüleme • 1 yıl önce

my upcoming cursor clone tutorial just got the feature to clone from github, a breeze to implement with Inngest and Convex 😎

my upcoming cursor clone tutorial just got the feature to clone from github, a breeze to implement with Inngest and Convex 😎

Code With Antonio

36,874 görüntüleme • 6 ay önce