
Matt Turck
@mattturck • 122,390 subscribers
VC at @FirstMarkCap. I'm a VC, so of course I have a podcast: https://t.co/Usnenl0zfI
Shorts
Videos
0:25
Sensitive content
This media may contain sensitive content.

State of Enterprise AI 2026: Aaron Levie on Tokenmaxxing, The Rise of Headless, and AI-Proofing Your Job 00:00 Intro 01:18 Silicon Valley engineering vs. everyone else 05:35 Are enterprise CIOs actually bullish on AI? 08:51 Tokenmaxxing & why your AI bill is about to explode 11:34 The myth of falling token costs and AI spend escaping IT budgets 17:37 The $5B startup hiding in AI compute 18:14 The mosaic of models inside every enterprise 21:28 Why coding works and the rest of knowledge work doesn't 25:53 The Bob and Sally problem: access control breaks agents 30:31 Will enterprise AI really take 10 years to roll out 32:24 The capability overhang: why faster models slow diffusion 34:23 Data is the bottleneck (it always was) 39:02 The rise of internal forward-deployed engineers 41:23 Why the AI doomers are wrong about jobs 43:43 Headless software is inevitable 46:14 What replaces per-seat pricing 47:37 How Box itself is going headless 49:42 How the org chart actually evolves 1:00:33 Future-proofing yourself as an enterprise employee 1:06:40 Are we all just going to work for OpenAI and Anthropic? 1:07:11 Where startups can still win as the labs move up
Matt Turck48,371 görüntüleme • 6 gün önce

Me after unstacking the dishwasher when my wife has been running the household 24/7 for the last 15+ years
Matt Turck1,289,804 görüntüleme • 4 ay önce

Why AI Progress Suddenly Feels Real - my conversation with Yann Dubois, who co-leads the Post-Training Frontiers team at OpenAI 00:00 - Intro 01:30 - Why recent AI progress feels like a step function 04:13 - Model reliability & the emotional rollercoaster of shipping GPT-5.5 07:33 - How OpenAI structures vertical and horizontal teams 09:49 - Improving model efficiency and test-time compute 12:32 - Yann's journey from Switzerland to OpenAI 15:37 - Reasoning in 2026: Real-world utility vs verifiable rewards 18:34 - GPT-5.5 Thinking vs Pro: Scaling test-time compute 20:09 - How reasoning models become more efficient 23:23 - Pre-training scaling and overcoming the data wall 27:03 - Multimodal data, synthetic data, and embodied AI 31:05 - Demystifying mid-training and post-training 37:21 - Does RL create new capabilities in AI? 38:53 - The challenges and frontier of scaling RL 43:09 - Is building AI models a craft or a strict science 48:21 - How AI models generalize across different domains 54:18 - How reinforcement learning cures AI hallucinations 56:04 - Negative generalization and conflicting instructions 58:05 - Can RL scale to law, medicine, and the broader economy? 1:00:19 - The evaluation bottleneck and Model as a Judge 1:04:21 - Continuous AI progress & continual learning 1:08:49 - Will foundation models eat the agent harness 1:11:23 - Why startups should focus on the last mile of AI
Matt Turck98,543 görüntüleme • 13 gün önce

.Ramp Labs (the AI unit of Ramp) has been *cooking* with agentic innovation Here's Alex L discussing and demo'ing code self-maintaining software and the concept of AI software factories #DataDrivenNYC ______________ 00:04 - Intro 01:11 - The shift from writing code to code maintenance 01:59 - Introducing Ramp Inspect, the background coding agent 03:05 - The first experiment: Nightly AI code automation 04:23 - The limits of stateless monitoring in large observability surfaces 05:47 - Using Datadog monitors to give the AI state and focus 07:23 - Real-world example: AI autonomously fixing an authentication bug 08:14 - How to control noise and implement an AI triage pattern 09:27 - The old vs. new paradigm for continuous code observability 10:21 - Key learnings on building autonomous AI software factories
Matt Turck66,621 görüntüleme • 29 gün önce

How GPT-5 thinks, with OpenAI VP of Research Jerry Tworek 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road to OpenAI: Growing up in Poland, Dropping out of School, Trading 20:32 - Working on Robotics and Rubik's Cube Solving 23:02 - A Day in the Life: Talking to Researchers 24:06 - How Research Priorities Are Determined 26:53 - OpenAI's Culture of Transparency 29:32 - Balancing Research with Shipping Fast 31:52 - Using OpenAI's Own Tools Daily 32:43 - Pre-Training Plus RL: The Modern AI Stack 35:10 - Reinforcement Learning 101: Training Dogs 40:17 - The Evolution of Deep Reinforcement Learning 42:09 - When GPT-4 Seemed Underwhelming at First 45:39 - How RLHF Made GPT-4 Actually Useful 48:02 - Unsupervised vs Supervised Learning 49:59 - GRPO and How DeepSeek Accelerated US Research 53:05 - What It Takes to Scale Reinforcement Learning 55:36 - Agentic AI and Long-Horizon Thinking 59:19 - Alignment as an RL Problem 1:01:11 - Winning ICPC World Finals Without Specific Training 1:05:53 - Applying RL Beyond Math and Coding 1:09:15 - The Path from Here to AGI 1:12:23 - Pure RL vs Language Models
Matt Turck451,229 görüntüleme • 7 ay önce

New York, the city where everyone is a venture capitalist
Matt Turck1,737,385 görüntüleme • 2 yıl önce

Claude Cowork, Mythos, and the Future of Software: my conversation with Felix Rieseberg, who leads Cowork at Anthropic 00:00 Intro 01:53 Claude Mythos Preview and the “step-function change” 06:16 Why Anthropic is treating Mythos differently 11:19 The real story behind Claude Cowork’s “10-day” build 12:42 Why Anthropic realized Claude Code needed a non-technical version 15:44 What Claude Cowork actually is 17:03 Under the hood: virtual machines, tools, skills 18:36 Where Cowork’s memory actually lives 19:26 How Cowork connects to files, apps, and the internet 20:45 Why Felix thinks the local computer is under-appreciated 24:49 Trust: how do you get users comfortable with AI agents? 28:45 What UX actually means for AI agents 31:27 Anthropic Cowork's roadmap is only one month long 34:12 Building 100 prototypes 35:10 If execution is free, what becomes the bottleneck? 37:25 Does it come down to taste? 40:12 The hardest part of building Claude Cowork 41:43 Advice for founders building AI agents 44:21 SaaSpocalypse: what’s left for software startups 49:30 Where AI agents are going next 51:20 Regulated industries and enterprise adoption 54:15 Hot takes: what's underrated, overrated, and what Felix would build today
Matt Turck79,883 görüntüleme • 1 ay önce

Failing to Understand the Exponential, Again? My conversation with Julian Schrittwieser - Julian Schrittwieser (Anthropic, AlphaGo Zero, MuZero) - on Move 37, Scaling RL, Nobel Prize for AI, and the AI frontier: 00:00 - Cold open: “We’re not seeing any slowdown.” 00:32 - Intro — Meet Julian 01:09 - The “exponential” from inside frontier labs 04:46 - 2026–2027: agents that work a full day; expert-level breadth 08:58 - Benchmarks vs reality: long-horizon work, GDP-Val, user value 10:26 - Move 37 — what actually happened and why it mattered 13:55 - Novel science: AlphaCode/AlphaTensor → when does AI earn a Nobel? 16:25 - Discontinuity vs smooth progress (and warning signs) 19:08 - Does pre-training + RL get us there? (AGI debates aside) 20:55 - Sutton’s “RL from scratch”? Julian’s take 23:03 - Julian’s path: Google → DeepMind → Anthropic 26:45 - AlphaGo (learn + search) in plain English 30:16 - AlphaGo Zero (no human data) 31:00 - AlphaZero (one algorithm: Go, chess, shogi) 31:46 - MuZero (planning with a learned world model) 33:23 -Lessons for today’s agents: search + learning at scale 34:57 - Do LLMs already have implicit world models? 39:02 - Why RL on LLMs took time (stability, feedback loops) 41:43 - Compute & scaling for RL — what we see so far 42:35 - Rewards frontier: human prefs, rubrics, RLVR, process rewards 44:36 - RL training data & the “flywheel” (and why quality matters) 48:02 - RL & Agents 101 — why RL unlocks robustness 50:51 - Should builders use RL-as-a-service? Or just tools + prompts? 52:18 - What’s missing for dependable agents (capability vs engineering) 53:51 - Evals & Goodhart — internal vs external benchmarks 57:35 - Mechanistic interpretability & “Golden Gate Claude” 1:00:03 - Safety & alignment at Anthropic — how it shows up in practice 1:03:48 - Jobs: human–AI complementarity (comparative advantage) 1:06:33 - Inequality, policy, and the case for 10× productivity → abundance 1:09:24 - Closing thoughts
Matt Turck235,526 görüntüleme • 7 ay önce

Deeply thoughtful conversation with Zico Kolter, board member at OpenAI and head of the machine learning department at Carnegie Mellon University, about AI safety, AI security, agents and frontier AI 00:00 Intro 01:32 OpenAI board role and Safety & Security Committee 03:53 How OpenAI reviews major model releases 05:33 OpenAI’s preparedness framework explained 09:46 Are frontier AI models getting safer? 12:33 Why AI safety does not come from scale 15:23 The four categories of AI risk 19:38 Doomerism vs accelerationism in AI 24:11 The six-month AI pause debate 26:20 AI safety as a global effort 28:04 How Zico Kolter got into machine learning 31:05 OpenAI in the early days 34:14 Why Carnegie Mellon became an AI powerhouse 38:43 What Gray Swan does in AI security 40:44 AI safety vs AI security 43:15 The GCG jailbreak paper 49:19 How AI labs responded to jailbreak research 50:19 State-of-the-art AI defenses 52:32 State-of-the-art AI attacks 54:22 Why AI agents expand the attack surface 58:39 Are AI agents ready for production? 59:40 Mechanistic interpretability explained 1:02:31 Will AI be safer in two years? 1:03:46 Reinforcement learning and self-improving models 1:08:09 Do post-transformer architectures matter 1:09:29 Best research directions in AI now 1:11:00 Zico Kolter’s Intro to Modern AI course 1:14:53 Why modern AI is simpler than people think
Matt Turck32,460 görüntüleme • 27 gün önce

Thanksgiving-week treat: an epic conversation on Frontier AI with Lukasz Kaiser -co-author of “Attention Is All You Need” (Transformers) and leading research scientist at OpenAI working on GPT-5.1-era reasoning models. 00:00 – Cold open and intro 01:29 – “AI slowdown” vs a wild week of new frontier models 08:03 – Low-hanging fruit, infra, RL training and better data 11:39 – What is a reasoning model, in plain language 17:02 – Chain-of-thought and training the thinking process with RL 21:39 – Łukasz’s path: from logic and France to Google and Kurzweil 24:20 – Inside the Transformer story and what “attention” really means 28:42 – From Google Brain to OpenAI: culture, scale and GPUs 32:49 – What’s next for pre-training, GPUs and distillation 37:29 – Can we still understand these models? Circuits, sparsity and black boxes 39:42 – GPT-4 → GPT-5 → GPT-5.1: what actually changed 42:40 – Post-training, safety and teaching GPT-5.1 different tones 46:16 – How long should GPT-5.1 think? Reasoning tokens and jagged abilities 47:43 – The five-year-old’s dot puzzle that still breaks frontier models 52:22 – Generalization, child-like learning and whether reasoning is enough 53:48 – Beyond Transformers: ARC, LeCun’s ideas and multimodal bottlenecks 56:10 – GPT-5.1 Codex Max, long-running agents and compaction 1:00:06 – Will foundation models eat most apps? The translation analogy and trust 1:02:34 – What still needs to be solved, and where AI might go next
Matt Turck167,758 görüntüleme • 6 ay önce

NVIDIA's New Moat & Why China is "Semiconductor Pilled” — loved my conversation with Dylan Patel (Dylan Patel). Raw, incredibly insightful, and... hilarious. 00:00 - Intro 01:16 - Nvidia acquires Groq: A pivot to specialization 07:09 - Why AI models might need "wide" compute, not just fast 10:06 - Is the CUDA moat dead? (Open source vs. Nvidia) 17:49 - The startup landscape: Etched, Cerebras, and 1% odds 22:51 - Geopolitics: China's "semiconductor-pilled" culture 35:46 - Huawei's vertical integration is terrifying 39:28 - The $100B AI revenue reality check 41:12 - US Onshoring: Why total self-sufficiency is a fantasy 44:55 - Can the US actually build fabs? (The delay problem) 48:33 - The CapEx Bubble: Is $500B spending irrational? 54:53 - Energy Crisis: Why gas turbines will power AI, not nuclear 57:06 - The "AI uses all the water" myth (Hamburger comparison) 1:03:40 - Circular Debt? Debunking the Nvidia-CoreWeave risk 1:07:24 - Claude Code & the software singularity 1:10:23 - The death of the Junior Analyst role 1:11:14 - Model predictions: Opus 4.5 and the RL gap 1:14:37 - San Francisco Lore: Living with roommates Dwarkesh Patel & Sholto Douglas
Matt Turck96,634 görüntüleme • 3 ay önce

Literally the #1 reason why America has produced so many incredible startups
Matt Turck502,104 görüntüleme • 2 yıl önce

While Silicon Valley obsesses over AGI, Mistral AI is betting that big enterprises and sovereign nations will want to own, not rent, their intelligence My conversation with co-founder & CTO Timothee Lacroix (Timothee Lacroix), for his first US podcast ever Was reminded during this conversation that Mistral is barely 2.5 years old - remarkable 00:00 — Intro 01:27 — Mistral vs. The World: From Research Lab to Sovereign Power 03:48 — Inside Mistral Compute: Building an 18,000 GPU Cluster 08:42 — The Trillion-Dollar Question: Competing Without a Big Tech Parent 10:37 — The Reality of Enterprise AI: Escaping "POC Purgatory" 15:06 — Why Mistral Hires Forward Deployed Engineers (FDEs) 16:57 — The Contrarian Take: Why "Agents" are just "Workflows" 19:35 — Trust & Autonomy: The Truth About Agent Reliability 21:26 — The Missing Stack: Governance and Versioning for AI 26:24 — When Will AI Actually Work? (The 2026 Timeline) 30:33 — Beyond Chat: The "Banger" Sovereign Use Cases 35:46 — Mistral 3 Architecture: Mixture of Experts vs. Dense 43:12 — Synthetic Data & The Post-Training Bottleneck 45:12 — Reasoning Models: Why "Thinking" is Just Tool Use 46:22 — Launching DevStral 2 and the Vibe CLI 50:49 — Engineering Lessons: How to Build Frontier AI Efficiently 56:08 — Are Enterprises Ready for AGI? & The Future of Intelligence
Matt Turck60,215 görüntüleme • 3 ay önce

My latest conversation with the always incredible Benedict Evans: OpenAI’s moat problem, the rise of ephemeral and improvised sofware, OpenClaw & agents. 00:00 Intro 01:06 OpenAI's Focus Shift 03:12 ChatGPT usage: a "mile wide, inch deep" 09:03 Why better models do not solve the real problem 13:58 Why AI product teams are strategy takers, not strategy setters 15:38 Do agents help create defensibility? 20:06 OpenClaw and the "Desktop Linux" moment for AI 25:52 Why "everyone will build their own software" is completely wrong 28:09 Improvised software vs. institutionalized software 29:23 Why there will be more software, not less 36:15 Are we heading toward value destruction before value creation? 38:03 Circular revenue, leverage, and AI bubble dynamics 38:53 Big Tech's Trillion-Dollar CapEx Crisis & Financial Gravity 45:23 Why AI job exposure charts can be misleading 52:15 How Fortune 500 Execs are actually deploying AI today 56:45 The White Space: What this means for founders and investors
Matt Turck36,490 görüntüleme • 2 ay önce

Sonnet 4.5 & the AI Plateau Myth — epic conversation with Sholto Douglas of Anthropic 0:00 - Intro 1:09 - What's Behind The Rapid Pace of AI Releases at Anthropic 2:49 - Opus, Sonnet, and Haiku Model Tiers 4:14 - Sholto's Story: From Australian Fencer to AI Researcher 12:01 - The YouTube Effect: Mastery Through Observation 16:16 - Breaking Into AI Research Without Traditional Academic Signals 18:29 - DeepMind, Gemini, and Building Inference Stacks 23:05 - Why Anthropic? Culture and Mission Differences Amongst AI Research Labs 25:08 - What Is "Taste" in AI Research? 31:46 - This Week's Big Launch: Sonnet 4.5, Best Coding Model in the World 36:40 - From 7 Hours to 30 Hours: The Long-Running AI Agent Breakthrough 38:41 - How AI Agents Self-Correct and Maintain Coherence 43:13 - The Role of Memory in Extended Coding Sessions 47:42 - Pre-Training vs. RL: Textbooks vs. Worked Problems 52:11 - Test-Time Compute & Reinforcement Learning 55:55 - Why RL Finally Started Working on LLMs in 2024 59:38 - The Path to AGI 1:02:05 - Are We Hitting a Plateau in AI? So Many Low Hanging Fruits 1:03:41 - Beyond Coding: GDPVal & Impact Economic Sectors 1:05:47 - Preparing for 10-100x Individual Leverage & The Upcoming Robotics Explosion
Matt Turck93,678 görüntüleme • 8 ay önce

Every agent needs its own computer: my conversation with Ivan Burazin, CEO of Daytona, about sandboxes and the emerging agent stack. 00:00 Intro 02:13 What is an AI agent sandbox? 03:17 Security risks of running agents locally 05:17 Stateful vs. stateless hyperscalers 07:04 The history of cloud IDEs and the end of localhost 09:45 Do all AI agents need a sandbox? 12:26 Sandbox use cases: RL evals & background agents 14:10 Unpacking the emerging AI Agent Stack 16:20 The unsolved problem of agent memory and learning 19:37 Where sandboxes fit in the agent harness 21:35 OpenAI, Anthropic, and agent SDKs 23:06 Ivan's founder journey: From CodeAnywhere to Daytona 26:59 GTM strategies and building developer communities 33:48 Why customer support is your best GTM strategy 35:34 Leveraging Twitter during the AI super cycle 40:50 The technical anatomy of a sandbox 41:53 Why fast spin-up speeds maximize GPU efficiency 46:09 Firecracker, QEMU, and isolation primitives 49:58 Why sandbox snapshots and state forking matter 51:40 Why Daytona built a custom scheduler from scratch 55:24 The challenge of long-running stateful sandboxes 58:10 The build your own sandbox trap 1:01:03 Why AI agents might trigger a global CPU shortage 1:02:46 The future of the AI Agent Stack
Matt Turck11,641 görüntüleme • 20 gün önce