Matt Turck's banner

Matt Turck

@mattturck • 141,897 subscribers

VC at @FirstMarkCap. Host: MAD Podcast; Organizer: Data Driven NYC, Author: MAD Landscape.

Shorts

Hang it in the Louvre.

Hang it in the Louvre.

318,272 просмотров

me pretending to do work while my agents run 24/7 in the background

me pretending to do work while my agents run 24/7 in the background

425,395 просмотров

“Don’t worry, there will still be great jobs even when AI automates everything” The jobs:

“Don’t worry, there will still be great jobs even when AI automates everything” The jobs:

1,028,817 просмотров

Nike: just do it Red Bull: just did it

Nike: just do it Red Bull: just did it

3,147,386 просмотров

VCs when they hear that someone is leaving OpenAI to start a new company

VCs when they hear that someone is leaving OpenAI to start a new company

1,830,274 просмотров

Every fundraising pitch right now

Every fundraising pitch right now

957,045 просмотров

VCs tweeting vs VCs investing

VCs tweeting vs VCs investing

452,504 просмотров

VCs tweeting vs VCs investing

VCs tweeting vs VCs investing

615,492 просмотров

VCs when asked why they don’t just start a company themselves

VCs when asked why they don’t just start a company themselves

313,141 просмотров

When your VC is exhausted from attending the board meeting

When your VC is exhausted from attending the board meeting

18,877 просмотров

Pre-revenue AI founder when they receive a term sheet below $100M valuation

Pre-revenue AI founder when they receive a term sheet below $100M valuation

236,400 просмотров

Could somebody clarify whether folks think we should read that article or no?lol

Could somebody clarify whether folks think we should read that article or no?lol

62,236 просмотров

VC adding value by hyping the company on social media

VC adding value by hyping the company on social media

35,382 просмотров

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Chills - Thierry Henry on growing up playing in the suburbs of Paris (and why 54 World Cup players are from there)

Chills - Thierry Henry on growing up playing in the suburbs of Paris (and why 54 World Cup players are from there)

709,979 просмотров • 14 дней назад

The cafeteria at the Meta office on Monday

Sensitive content

This media may contain sensitive content.

The cafeteria at the Meta office on Monday

5,984,517 просмотров • 1 год назад

Me after unstacking the dishwasher when my wife has been running the household 24/7 for the last 15+ years

Me after unstacking the dishwasher when my wife has been running the household 24/7 for the last 15+ years

1,290,505 просмотров • 6 месяцев назад

Inside Nemotron and NVIDIA's AI lab: my conversation with Bryan Catanzaro (Bryan Catanzaro). NVIDIA is a chip company. So why does it put hundreds of researchers on building AI models - and then give them away for free? We go deep into the Nemotron models, what it takes to build a top AI lab, and the future of frontier AI. 01:33 - Is open source AI catching the frontier? 05:29 - Do closed labs blocking distillation slow open source down? 07:42 - Is the US falling behind China? 10:30 - Why companies actually choose open models 12:39 - A "crazy" 2008 bet: machine learning on GPUs 15:33 - Working with Andrew Ng and Dario Amodei at Baidu 17:41 - Coming back to NVIDIA: DLSS and the birth of Megatron 21:55 - The real reason NVIDIA builds its own models 24:28 - Is Moore's Law really dead? 33:37 - The Nemotron family: Nano, Super, Ultra 35:09 - Built for agents: why NVIDIA bets on speed 36:02 - How you train a 550B model in 4 bits 39:25 - Hybrid Mamba-Transformer, explained simply 42:31 - Mixture of experts, and why NVIDIA built NVL72 around it 47:26 - Why a 1-million-token context window matters 49:26 - Multi-token prediction: how the model predicts 5 tokens at once 52:47 - Multi-teacher distillation: teaching one model from many 58:01 - Where reinforcement learning goes next 01:00:16 - Inside NVIDIA's research org: "the mission is the boss" 01:04:03 - How NVIDIA decides who gets the GPUs 01:10:53 - Why NVIDIA still feels entrepreneurial after 33 years 01:12:58 - Why Bryan doesn't believe in the singularity 01:17:50 - The AI backlash 01:19:18 - The controversial case: open AI is safer than closed

Inside Nemotron and NVIDIA's AI lab: my conversation with Bryan Catanzaro (Bryan Catanzaro). NVIDIA is a chip company. So why does it put hundreds of researchers on building AI models - and then give them away for free? We go deep into the Nemotron models, what it takes to build a top AI lab, and the future of frontier AI. 01:33 - Is open source AI catching the frontier? 05:29 - Do closed labs blocking distillation slow open source down? 07:42 - Is the US falling behind China? 10:30 - Why companies actually choose open models 12:39 - A "crazy" 2008 bet: machine learning on GPUs 15:33 - Working with Andrew Ng and Dario Amodei at Baidu 17:41 - Coming back to NVIDIA: DLSS and the birth of Megatron 21:55 - The real reason NVIDIA builds its own models 24:28 - Is Moore's Law really dead? 33:37 - The Nemotron family: Nano, Super, Ultra 35:09 - Built for agents: why NVIDIA bets on speed 36:02 - How you train a 550B model in 4 bits 39:25 - Hybrid Mamba-Transformer, explained simply 42:31 - Mixture of experts, and why NVIDIA built NVL72 around it 47:26 - Why a 1-million-token context window matters 49:26 - Multi-token prediction: how the model predicts 5 tokens at once 52:47 - Multi-teacher distillation: teaching one model from many 58:01 - Where reinforcement learning goes next 01:00:16 - Inside NVIDIA's research org: "the mission is the boss" 01:04:03 - How NVIDIA decides who gets the GPUs 01:10:53 - Why NVIDIA still feels entrepreneurial after 33 years 01:12:58 - Why Bryan doesn't believe in the singularity 01:17:50 - The AI backlash 01:19:18 - The controversial case: open AI is safer than closed

56,250 просмотров • 16 дней назад

State of AI compute 2026: my conversation with stephen balaban of Lambda on the neocloud boom, data centers, GPUs and what's ahead 00:00 — Cold open 01:21 — Why GPU compute was never a commodity 02:45 — The H100 price index and what it gets wrong 04:02 — The real moat: technology or financing? 05:57 — Winner-take-all, or room for many neoclouds 06:48 — Are we overbuilding or underbuilding AI compute? 09:26 — What if AI gets 10x more compute-efficient? 10:44 — The real bottleneck: land, power, and shell 11:38 — The backlash against data centers — and the misinformation 15:00 — Opening the hood: from photons to tokens 17:11 — Extracting more value from the same chip 19:26 — Frontier inference and distributed training, explained 23:26 — What actually drives compute cost 25:21 — Lambda's chip stack and the NVIDIA relationship 26:17 — A multi-silicon world? CUDA, CUDNN, and NVIDIA's real moat 28:59 — Networking, storage, and the one-click cluster 34:46 — Renting vs. owning, and full vertical integration 36:24 — How global is Lambda? Does location still matter? 38:44 — The financing stack: off-take agreements, SPVs, and credit 41:16 — Why a 2023 GPU leases for more today 42:36 — A futures market for compute? 43:54 — Origin story: facial recognition, Perceptio, and Apple 47:03 — The Lambda hat and Dream Scope 48:59 — The $60K bet that became a cloud business 52:00 — Holding the team together through the hard times 54:30 — Bringing on a new CEO; Stephen as CTO 57:33 — Matching xAI on high-velocity deployment 59:29 — "AI won't write software — it will become the software" 01:01:30 — Neural software vs. vibe coding 01:04:25 — Do agents change the compute layer 01:06:14 — Self-assembling software inside Lambda 01:08:18 — Gigawatt-scale AI factories 01:08:57 — One person, one GPU 01:12:04 — Hot takes: overrated and underrated in AI

State of AI compute 2026: my conversation with stephen balaban of Lambda on the neocloud boom, data centers, GPUs and what's ahead 00:00 — Cold open 01:21 — Why GPU compute was never a commodity 02:45 — The H100 price index and what it gets wrong 04:02 — The real moat: technology or financing? 05:57 — Winner-take-all, or room for many neoclouds 06:48 — Are we overbuilding or underbuilding AI compute? 09:26 — What if AI gets 10x more compute-efficient? 10:44 — The real bottleneck: land, power, and shell 11:38 — The backlash against data centers — and the misinformation 15:00 — Opening the hood: from photons to tokens 17:11 — Extracting more value from the same chip 19:26 — Frontier inference and distributed training, explained 23:26 — What actually drives compute cost 25:21 — Lambda's chip stack and the NVIDIA relationship 26:17 — A multi-silicon world? CUDA, CUDNN, and NVIDIA's real moat 28:59 — Networking, storage, and the one-click cluster 34:46 — Renting vs. owning, and full vertical integration 36:24 — How global is Lambda? Does location still matter? 38:44 — The financing stack: off-take agreements, SPVs, and credit 41:16 — Why a 2023 GPU leases for more today 42:36 — A futures market for compute? 43:54 — Origin story: facial recognition, Perceptio, and Apple 47:03 — The Lambda hat and Dream Scope 48:59 — The $60K bet that became a cloud business 52:00 — Holding the team together through the hard times 54:30 — Bringing on a new CEO; Stephen as CTO 57:33 — Matching xAI on high-velocity deployment 59:29 — "AI won't write software — it will become the software" 01:01:30 — Neural software vs. vibe coding 01:04:25 — Do agents change the compute layer 01:06:14 — Self-assembling software inside Lambda 01:08:18 — Gigawatt-scale AI factories 01:08:57 — One person, one GPU 01:12:04 — Hot takes: overrated and underrated in AI

70,916 просмотров • 1 месяц назад

New York, the city where everyone is a venture capitalist

New York, the city where everyone is a venture capitalist

1,737,397 просмотров • 2 лет назад

How GPT-5 thinks, with OpenAI VP of Research Jerry Tworek 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road to OpenAI: Growing up in Poland, Dropping out of School, Trading 20:32 - Working on Robotics and Rubik's Cube Solving 23:02 - A Day in the Life: Talking to Researchers 24:06 - How Research Priorities Are Determined 26:53 - OpenAI's Culture of Transparency 29:32 - Balancing Research with Shipping Fast 31:52 - Using OpenAI's Own Tools Daily 32:43 - Pre-Training Plus RL: The Modern AI Stack 35:10 - Reinforcement Learning 101: Training Dogs 40:17 - The Evolution of Deep Reinforcement Learning 42:09 - When GPT-4 Seemed Underwhelming at First 45:39 - How RLHF Made GPT-4 Actually Useful 48:02 - Unsupervised vs Supervised Learning 49:59 - GRPO and How DeepSeek Accelerated US Research 53:05 - What It Takes to Scale Reinforcement Learning 55:36 - Agentic AI and Long-Horizon Thinking 59:19 - Alignment as an RL Problem 1:01:11 - Winning ICPC World Finals Without Specific Training 1:05:53 - Applying RL Beyond Math and Coding 1:09:15 - The Path from Here to AGI 1:12:23 - Pure RL vs Language Models

How GPT-5 thinks, with OpenAI VP of Research Jerry Tworek 00:00 - Intro 01:01 - What Reasoning Actually Means in AI 02:32 - Chain of Thought: Models Thinking in Words 05:25 - How Models Decide How Long to Think 07:24 - Evolution from o1 to o3 to GPT-5 11:00 - The Road to OpenAI: Growing up in Poland, Dropping out of School, Trading 20:32 - Working on Robotics and Rubik's Cube Solving 23:02 - A Day in the Life: Talking to Researchers 24:06 - How Research Priorities Are Determined 26:53 - OpenAI's Culture of Transparency 29:32 - Balancing Research with Shipping Fast 31:52 - Using OpenAI's Own Tools Daily 32:43 - Pre-Training Plus RL: The Modern AI Stack 35:10 - Reinforcement Learning 101: Training Dogs 40:17 - The Evolution of Deep Reinforcement Learning 42:09 - When GPT-4 Seemed Underwhelming at First 45:39 - How RLHF Made GPT-4 Actually Useful 48:02 - Unsupervised vs Supervised Learning 49:59 - GRPO and How DeepSeek Accelerated US Research 53:05 - What It Takes to Scale Reinforcement Learning 55:36 - Agentic AI and Long-Horizon Thinking 59:19 - Alignment as an RL Problem 1:01:11 - Winning ICPC World Finals Without Specific Training 1:05:53 - Applying RL Beyond Math and Coding 1:09:15 - The Path from Here to AGI 1:12:23 - Pure RL vs Language Models

451,229 просмотров • 9 месяцев назад

Epic (sound on)

Epic (sound on)

1,846,819 просмотров • 3 лет назад

Why AI Progress Suddenly Feels Real - my conversation with Yann Dubois, who co-leads the Post-Training Frontiers team at OpenAI 00:00 - Intro 01:30 - Why recent AI progress feels like a step function 04:13 - Model reliability & the emotional rollercoaster of shipping GPT-5.5 07:33 - How OpenAI structures vertical and horizontal teams 09:49 - Improving model efficiency and test-time compute 12:32 - Yann's journey from Switzerland to OpenAI 15:37 - Reasoning in 2026: Real-world utility vs verifiable rewards 18:34 - GPT-5.5 Thinking vs Pro: Scaling test-time compute 20:09 - How reasoning models become more efficient 23:23 - Pre-training scaling and overcoming the data wall 27:03 - Multimodal data, synthetic data, and embodied AI 31:05 - Demystifying mid-training and post-training 37:21 - Does RL create new capabilities in AI? 38:53 - The challenges and frontier of scaling RL 43:09 - Is building AI models a craft or a strict science 48:21 - How AI models generalize across different domains 54:18 - How reinforcement learning cures AI hallucinations 56:04 - Negative generalization and conflicting instructions 58:05 - Can RL scale to law, medicine, and the broader economy? 1:00:19 - The evaluation bottleneck and Model as a Judge 1:04:21 - Continuous AI progress & continual learning 1:08:49 - Will foundation models eat the agent harness 1:11:23 - Why startups should focus on the last mile of AI

Why AI Progress Suddenly Feels Real - my conversation with Yann Dubois, who co-leads the Post-Training Frontiers team at OpenAI 00:00 - Intro 01:30 - Why recent AI progress feels like a step function 04:13 - Model reliability & the emotional rollercoaster of shipping GPT-5.5 07:33 - How OpenAI structures vertical and horizontal teams 09:49 - Improving model efficiency and test-time compute 12:32 - Yann's journey from Switzerland to OpenAI 15:37 - Reasoning in 2026: Real-world utility vs verifiable rewards 18:34 - GPT-5.5 Thinking vs Pro: Scaling test-time compute 20:09 - How reasoning models become more efficient 23:23 - Pre-training scaling and overcoming the data wall 27:03 - Multimodal data, synthetic data, and embodied AI 31:05 - Demystifying mid-training and post-training 37:21 - Does RL create new capabilities in AI? 38:53 - The challenges and frontier of scaling RL 43:09 - Is building AI models a craft or a strict science 48:21 - How AI models generalize across different domains 54:18 - How reinforcement learning cures AI hallucinations 56:04 - Negative generalization and conflicting instructions 58:05 - Can RL scale to law, medicine, and the broader economy? 1:00:19 - The evaluation bottleneck and Model as a Judge 1:04:21 - Continuous AI progress & continual learning 1:08:49 - Will foundation models eat the agent harness 1:11:23 - Why startups should focus on the last mile of AI

100,864 просмотров • 1 месяц назад

Why AI Can Now Make Discoveries - my conversation with Dan Roberts, Lead of the Foundations of Reinforcement Learning team at OpenAI 00:00 Intro: AI's wild week in mathematics 01:21 What OpenAI's Foundations of RL team does 03:08 Dan's journey: from black holes and quantum gravity to frontier AI 07:04 Are AI systems becoming useful for real science 08:21 The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic 08:52 Why the OpenAI result was an act of exploration 10:25 OpenAI vs. DeepMind: informal reasoning vs. formal proof 12:13 RL 101: learning by doing, not just watching 15:10 Why reinforcement learning works 15:58 How RL breaks: sparse feedback and long-horizon tasks 17:03 RLHF: how human feedback shaped early language models 18:48 Move 37, self-play, and the search for novel strategies 22:16 Explore vs. exploit in scientific discovery 24:49 Why RL may now be "the cake," not the cherry on top 25:46 Why RL started working with large language models 27:29 Is RL "sucking supervision through a straw"? 28:47 Why language may be the grounding layer for intelligence 31:46 A contrarian take on the Bitter Lesson 32:41 What test-time compute actually is 34:50 How RL gives models the ability to think 35:40 Verifiable rewards, math, coding, and the messy real world 38:00 What physics can teach us about AI 42:08 Is there a thermodynamics of AI? 43:08 From Erdős problems to Einstein-level AI 45:16 Is AI already doing original science? 45:51 How far are we from AI automating AI research 47:41 Why Dan is excited about the future of science

Why AI Can Now Make Discoveries - my conversation with Dan Roberts, Lead of the Foundations of Reinforcement Learning team at OpenAI 00:00 Intro: AI's wild week in mathematics 01:21 What OpenAI's Foundations of RL team does 03:08 Dan's journey: from black holes and quantum gravity to frontier AI 07:04 Are AI systems becoming useful for real science 08:21 The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic 08:52 Why the OpenAI result was an act of exploration 10:25 OpenAI vs. DeepMind: informal reasoning vs. formal proof 12:13 RL 101: learning by doing, not just watching 15:10 Why reinforcement learning works 15:58 How RL breaks: sparse feedback and long-horizon tasks 17:03 RLHF: how human feedback shaped early language models 18:48 Move 37, self-play, and the search for novel strategies 22:16 Explore vs. exploit in scientific discovery 24:49 Why RL may now be "the cake," not the cherry on top 25:46 Why RL started working with large language models 27:29 Is RL "sucking supervision through a straw"? 28:47 Why language may be the grounding layer for intelligence 31:46 A contrarian take on the Bitter Lesson 32:41 What test-time compute actually is 34:50 How RL gives models the ability to think 35:40 Verifiable rewards, math, coding, and the messy real world 38:00 What physics can teach us about AI 42:08 Is there a thermodynamics of AI? 43:08 From Erdős problems to Einstein-level AI 45:16 Is AI already doing original science? 45:51 How far are we from AI automating AI research 47:41 Why Dan is excited about the future of science

64,952 просмотров • 1 месяц назад

Failing to Understand the Exponential, Again? My conversation with Julian Schrittwieser - Julian Schrittwieser (Anthropic, AlphaGo Zero, MuZero) - on Move 37, Scaling RL, Nobel Prize for AI, and the AI frontier: 00:00 - Cold open: “We’re not seeing any slowdown.” 00:32 - Intro — Meet Julian 01:09 - The “exponential” from inside frontier labs 04:46 - 2026–2027: agents that work a full day; expert-level breadth 08:58 - Benchmarks vs reality: long-horizon work, GDP-Val, user value 10:26 - Move 37 — what actually happened and why it mattered 13:55 - Novel science: AlphaCode/AlphaTensor → when does AI earn a Nobel? 16:25 - Discontinuity vs smooth progress (and warning signs) 19:08 - Does pre-training + RL get us there? (AGI debates aside) 20:55 - Sutton’s “RL from scratch”? Julian’s take 23:03 - Julian’s path: Google → DeepMind → Anthropic 26:45 - AlphaGo (learn + search) in plain English 30:16 - AlphaGo Zero (no human data) 31:00 - AlphaZero (one algorithm: Go, chess, shogi) 31:46 - MuZero (planning with a learned world model) 33:23 -Lessons for today’s agents: search + learning at scale 34:57 - Do LLMs already have implicit world models? 39:02 - Why RL on LLMs took time (stability, feedback loops) 41:43 - Compute & scaling for RL — what we see so far 42:35 - Rewards frontier: human prefs, rubrics, RLVR, process rewards 44:36 - RL training data & the “flywheel” (and why quality matters) 48:02 - RL & Agents 101 — why RL unlocks robustness 50:51 - Should builders use RL-as-a-service? Or just tools + prompts? 52:18 - What’s missing for dependable agents (capability vs engineering) 53:51 - Evals & Goodhart — internal vs external benchmarks 57:35 - Mechanistic interpretability & “Golden Gate Claude” 1:00:03 - Safety & alignment at Anthropic — how it shows up in practice 1:03:48 - Jobs: human–AI complementarity (comparative advantage) 1:06:33 - Inequality, policy, and the case for 10× productivity → abundance 1:09:24 - Closing thoughts

Failing to Understand the Exponential, Again? My conversation with Julian Schrittwieser - Julian Schrittwieser (Anthropic, AlphaGo Zero, MuZero) - on Move 37, Scaling RL, Nobel Prize for AI, and the AI frontier: 00:00 - Cold open: “We’re not seeing any slowdown.” 00:32 - Intro — Meet Julian 01:09 - The “exponential” from inside frontier labs 04:46 - 2026–2027: agents that work a full day; expert-level breadth 08:58 - Benchmarks vs reality: long-horizon work, GDP-Val, user value 10:26 - Move 37 — what actually happened and why it mattered 13:55 - Novel science: AlphaCode/AlphaTensor → when does AI earn a Nobel? 16:25 - Discontinuity vs smooth progress (and warning signs) 19:08 - Does pre-training + RL get us there? (AGI debates aside) 20:55 - Sutton’s “RL from scratch”? Julian’s take 23:03 - Julian’s path: Google → DeepMind → Anthropic 26:45 - AlphaGo (learn + search) in plain English 30:16 - AlphaGo Zero (no human data) 31:00 - AlphaZero (one algorithm: Go, chess, shogi) 31:46 - MuZero (planning with a learned world model) 33:23 -Lessons for today’s agents: search + learning at scale 34:57 - Do LLMs already have implicit world models? 39:02 - Why RL on LLMs took time (stability, feedback loops) 41:43 - Compute & scaling for RL — what we see so far 42:35 - Rewards frontier: human prefs, rubrics, RLVR, process rewards 44:36 - RL training data & the “flywheel” (and why quality matters) 48:02 - RL & Agents 101 — why RL unlocks robustness 50:51 - Should builders use RL-as-a-service? Or just tools + prompts? 52:18 - What’s missing for dependable agents (capability vs engineering) 53:51 - Evals & Goodhart — internal vs external benchmarks 57:35 - Mechanistic interpretability & “Golden Gate Claude” 1:00:03 - Safety & alignment at Anthropic — how it shows up in practice 1:03:48 - Jobs: human–AI complementarity (comparative advantage) 1:06:33 - Inequality, policy, and the case for 10× productivity → abundance 1:09:24 - Closing thoughts

235,526 просмотров • 8 месяцев назад

State of Enterprise AI 2026: Aaron Levie on Tokenmaxxing, The Rise of Headless, and AI-Proofing Your Job 00:00 Intro 01:18 Silicon Valley engineering vs. everyone else 05:35 Are enterprise CIOs actually bullish on AI? 08:51 Tokenmaxxing & why your AI bill is about to explode 11:34 The myth of falling token costs and AI spend escaping IT budgets 17:37 The $5B startup hiding in AI compute 18:14 The mosaic of models inside every enterprise 21:28 Why coding works and the rest of knowledge work doesn't 25:53 The Bob and Sally problem: access control breaks agents 30:31 Will enterprise AI really take 10 years to roll out 32:24 The capability overhang: why faster models slow diffusion 34:23 Data is the bottleneck (it always was) 39:02 The rise of internal forward-deployed engineers 41:23 Why the AI doomers are wrong about jobs 43:43 Headless software is inevitable 46:14 What replaces per-seat pricing 47:37 How Box itself is going headless 49:42 How the org chart actually evolves 1:00:33 Future-proofing yourself as an enterprise employee 1:06:40 Are we all just going to work for OpenAI and Anthropic? 1:07:11 Where startups can still win as the labs move up

State of Enterprise AI 2026: Aaron Levie on Tokenmaxxing, The Rise of Headless, and AI-Proofing Your Job 00:00 Intro 01:18 Silicon Valley engineering vs. everyone else 05:35 Are enterprise CIOs actually bullish on AI? 08:51 Tokenmaxxing & why your AI bill is about to explode 11:34 The myth of falling token costs and AI spend escaping IT budgets 17:37 The $5B startup hiding in AI compute 18:14 The mosaic of models inside every enterprise 21:28 Why coding works and the rest of knowledge work doesn't 25:53 The Bob and Sally problem: access control breaks agents 30:31 Will enterprise AI really take 10 years to roll out 32:24 The capability overhang: why faster models slow diffusion 34:23 Data is the bottleneck (it always was) 39:02 The rise of internal forward-deployed engineers 41:23 Why the AI doomers are wrong about jobs 43:43 Headless software is inevitable 46:14 What replaces per-seat pricing 47:37 How Box itself is going headless 49:42 How the org chart actually evolves 1:00:33 Future-proofing yourself as an enterprise employee 1:06:40 Are we all just going to work for OpenAI and Anthropic? 1:07:11 Where startups can still win as the labs move up

49,373 просмотров • 1 месяц назад

.Ramp Labs (the AI unit of Ramp) has been *cooking* with agentic innovation Here's Alex L discussing and demo'ing code self-maintaining software and the concept of AI software factories #DataDrivenNYC ______________ 00:04 - Intro 01:11 - The shift from writing code to code maintenance 01:59 - Introducing Ramp Inspect, the background coding agent 03:05 - The first experiment: Nightly AI code automation 04:23 - The limits of stateless monitoring in large observability surfaces 05:47 - Using Datadog monitors to give the AI state and focus 07:23 - Real-world example: AI autonomously fixing an authentication bug 08:14 - How to control noise and implement an AI triage pattern 09:27 - The old vs. new paradigm for continuous code observability 10:21 - Key learnings on building autonomous AI software factories

.Ramp Labs (the AI unit of Ramp) has been cooking with agentic innovation Here's Alex L discussing and demo'ing code self-maintaining software and the concept of AI software factories #DataDrivenNYC ______________ 00:04 - Intro 01:11 - The shift from writing code to code maintenance 01:59 - Introducing Ramp Inspect, the background coding agent 03:05 - The first experiment: Nightly AI code automation 04:23 - The limits of stateless monitoring in large observability surfaces 05:47 - Using Datadog monitors to give the AI state and focus 07:23 - Real-world example: AI autonomously fixing an authentication bug 08:14 - How to control noise and implement an AI triage pattern 09:27 - The old vs. new paradigm for continuous code observability 10:21 - Key learnings on building autonomous AI software factories

67,511 просмотров • 2 месяцев назад

Claude Cowork, Mythos, and the Future of Software: my conversation with Felix Rieseberg, who leads Cowork at Anthropic 00:00 Intro 01:53 Claude Mythos Preview and the “step-function change” 06:16 Why Anthropic is treating Mythos differently 11:19 The real story behind Claude Cowork’s “10-day” build 12:42 Why Anthropic realized Claude Code needed a non-technical version 15:44 What Claude Cowork actually is 17:03 Under the hood: virtual machines, tools, skills 18:36 Where Cowork’s memory actually lives 19:26 How Cowork connects to files, apps, and the internet 20:45 Why Felix thinks the local computer is under-appreciated 24:49 Trust: how do you get users comfortable with AI agents? 28:45 What UX actually means for AI agents 31:27 Anthropic Cowork's roadmap is only one month long 34:12 Building 100 prototypes 35:10 If execution is free, what becomes the bottleneck? 37:25 Does it come down to taste? 40:12 The hardest part of building Claude Cowork 41:43 Advice for founders building AI agents 44:21 SaaSpocalypse: what’s left for software startups 49:30 Where AI agents are going next 51:20 Regulated industries and enterprise adoption 54:15 Hot takes: what's underrated, overrated, and what Felix would build today

Claude Cowork, Mythos, and the Future of Software: my conversation with Felix Rieseberg, who leads Cowork at Anthropic 00:00 Intro 01:53 Claude Mythos Preview and the “step-function change” 06:16 Why Anthropic is treating Mythos differently 11:19 The real story behind Claude Cowork’s “10-day” build 12:42 Why Anthropic realized Claude Code needed a non-technical version 15:44 What Claude Cowork actually is 17:03 Under the hood: virtual machines, tools, skills 18:36 Where Cowork’s memory actually lives 19:26 How Cowork connects to files, apps, and the internet 20:45 Why Felix thinks the local computer is under-appreciated 24:49 Trust: how do you get users comfortable with AI agents? 28:45 What UX actually means for AI agents 31:27 Anthropic Cowork's roadmap is only one month long 34:12 Building 100 prototypes 35:10 If execution is free, what becomes the bottleneck? 37:25 Does it come down to taste? 40:12 The hardest part of building Claude Cowork 41:43 Advice for founders building AI agents 44:21 SaaSpocalypse: what’s left for software startups 49:30 Where AI agents are going next 51:20 Regulated industries and enterprise adoption 54:15 Hot takes: what's underrated, overrated, and what Felix would build today

80,519 просмотров • 3 месяцев назад

Literally the #1 reason why America has produced so many incredible startups

Literally the #1 reason why America has produced so many incredible startups

502,104 просмотров • 2 лет назад

Thanksgiving-week treat: an epic conversation on Frontier AI with Lukasz Kaiser -co-author of “Attention Is All You Need” (Transformers) and leading research scientist at OpenAI working on GPT-5.1-era reasoning models. 00:00 – Cold open and intro 01:29 – “AI slowdown” vs a wild week of new frontier models 08:03 – Low-hanging fruit, infra, RL training and better data 11:39 – What is a reasoning model, in plain language 17:02 – Chain-of-thought and training the thinking process with RL 21:39 – Łukasz’s path: from logic and France to Google and Kurzweil 24:20 – Inside the Transformer story and what “attention” really means 28:42 – From Google Brain to OpenAI: culture, scale and GPUs 32:49 – What’s next for pre-training, GPUs and distillation 37:29 – Can we still understand these models? Circuits, sparsity and black boxes 39:42 – GPT-4 → GPT-5 → GPT-5.1: what actually changed 42:40 – Post-training, safety and teaching GPT-5.1 different tones 46:16 – How long should GPT-5.1 think? Reasoning tokens and jagged abilities 47:43 – The five-year-old’s dot puzzle that still breaks frontier models 52:22 – Generalization, child-like learning and whether reasoning is enough 53:48 – Beyond Transformers: ARC, LeCun’s ideas and multimodal bottlenecks 56:10 – GPT-5.1 Codex Max, long-running agents and compaction 1:00:06 – Will foundation models eat most apps? The translation analogy and trust 1:02:34 – What still needs to be solved, and where AI might go next

Thanksgiving-week treat: an epic conversation on Frontier AI with Lukasz Kaiser -co-author of “Attention Is All You Need” (Transformers) and leading research scientist at OpenAI working on GPT-5.1-era reasoning models. 00:00 – Cold open and intro 01:29 – “AI slowdown” vs a wild week of new frontier models 08:03 – Low-hanging fruit, infra, RL training and better data 11:39 – What is a reasoning model, in plain language 17:02 – Chain-of-thought and training the thinking process with RL 21:39 – Łukasz’s path: from logic and France to Google and Kurzweil 24:20 – Inside the Transformer story and what “attention” really means 28:42 – From Google Brain to OpenAI: culture, scale and GPUs 32:49 – What’s next for pre-training, GPUs and distillation 37:29 – Can we still understand these models? Circuits, sparsity and black boxes 39:42 – GPT-4 → GPT-5 → GPT-5.1: what actually changed 42:40 – Post-training, safety and teaching GPT-5.1 different tones 46:16 – How long should GPT-5.1 think? Reasoning tokens and jagged abilities 47:43 – The five-year-old’s dot puzzle that still breaks frontier models 52:22 – Generalization, child-like learning and whether reasoning is enough 53:48 – Beyond Transformers: ARC, LeCun’s ideas and multimodal bottlenecks 56:10 – GPT-5.1 Codex Max, long-running agents and compaction 1:00:06 – Will foundation models eat most apps? The translation analogy and trust 1:02:34 – What still needs to be solved, and where AI might go next

167,926 просмотров • 7 месяцев назад

NVIDIA's New Moat & Why China is "Semiconductor Pilled” — loved my conversation with Dylan Patel (Dylan Patel). Raw, incredibly insightful, and... hilarious. 00:00 - Intro 01:16 - Nvidia acquires Groq: A pivot to specialization 07:09 - Why AI models might need "wide" compute, not just fast 10:06 - Is the CUDA moat dead? (Open source vs. Nvidia) 17:49 - The startup landscape: Etched, Cerebras, and 1% odds 22:51 - Geopolitics: China's "semiconductor-pilled" culture 35:46 - Huawei's vertical integration is terrifying 39:28 - The $100B AI revenue reality check 41:12 - US Onshoring: Why total self-sufficiency is a fantasy 44:55 - Can the US actually build fabs? (The delay problem) 48:33 - The CapEx Bubble: Is $500B spending irrational? 54:53 - Energy Crisis: Why gas turbines will power AI, not nuclear 57:06 - The "AI uses all the water" myth (Hamburger comparison) 1:03:40 - Circular Debt? Debunking the Nvidia-CoreWeave risk 1:07:24 - Claude Code & the software singularity 1:10:23 - The death of the Junior Analyst role 1:11:14 - Model predictions: Opus 4.5 and the RL gap 1:14:37 - San Francisco Lore: Living with roommates Dwarkesh Patel & Sholto Douglas

NVIDIA's New Moat & Why China is "Semiconductor Pilled” — loved my conversation with Dylan Patel (Dylan Patel). Raw, incredibly insightful, and... hilarious. 00:00 - Intro 01:16 - Nvidia acquires Groq: A pivot to specialization 07:09 - Why AI models might need "wide" compute, not just fast 10:06 - Is the CUDA moat dead? (Open source vs. Nvidia) 17:49 - The startup landscape: Etched, Cerebras, and 1% odds 22:51 - Geopolitics: China's "semiconductor-pilled" culture 35:46 - Huawei's vertical integration is terrifying 39:28 - The $100B AI revenue reality check 41:12 - US Onshoring: Why total self-sufficiency is a fantasy 44:55 - Can the US actually build fabs? (The delay problem) 48:33 - The CapEx Bubble: Is $500B spending irrational? 54:53 - Energy Crisis: Why gas turbines will power AI, not nuclear 57:06 - The "AI uses all the water" myth (Hamburger comparison) 1:03:40 - Circular Debt? Debunking the Nvidia-CoreWeave risk 1:07:24 - Claude Code & the software singularity 1:10:23 - The death of the Junior Analyst role 1:11:14 - Model predictions: Opus 4.5 and the RL gap 1:14:37 - San Francisco Lore: Living with roommates Dwarkesh Patel & Sholto Douglas

96,793 просмотров • 5 месяцев назад

While Silicon Valley obsesses over AGI, Mistral AI is betting that big enterprises and sovereign nations will want to own, not rent, their intelligence My conversation with co-founder & CTO Timothee Lacroix (Timothee Lacroix), for his first US podcast ever Was reminded during this conversation that Mistral is barely 2.5 years old - remarkable 00:00 — Intro 01:27 — Mistral vs. The World: From Research Lab to Sovereign Power 03:48 — Inside Mistral Compute: Building an 18,000 GPU Cluster 08:42 — The Trillion-Dollar Question: Competing Without a Big Tech Parent 10:37 — The Reality of Enterprise AI: Escaping "POC Purgatory" 15:06 — Why Mistral Hires Forward Deployed Engineers (FDEs) 16:57 — The Contrarian Take: Why "Agents" are just "Workflows" 19:35 — Trust & Autonomy: The Truth About Agent Reliability 21:26 — The Missing Stack: Governance and Versioning for AI 26:24 — When Will AI Actually Work? (The 2026 Timeline) 30:33 — Beyond Chat: The "Banger" Sovereign Use Cases 35:46 — Mistral 3 Architecture: Mixture of Experts vs. Dense 43:12 — Synthetic Data & The Post-Training Bottleneck 45:12 — Reasoning Models: Why "Thinking" is Just Tool Use 46:22 — Launching DevStral 2 and the Vibe CLI 50:49 — Engineering Lessons: How to Build Frontier AI Efficiently 56:08 — Are Enterprises Ready for AGI? & The Future of Intelligence

While Silicon Valley obsesses over AGI, Mistral AI is betting that big enterprises and sovereign nations will want to own, not rent, their intelligence My conversation with co-founder & CTO Timothee Lacroix (Timothee Lacroix), for his first US podcast ever Was reminded during this conversation that Mistral is barely 2.5 years old - remarkable 00:00 — Intro 01:27 — Mistral vs. The World: From Research Lab to Sovereign Power 03:48 — Inside Mistral Compute: Building an 18,000 GPU Cluster 08:42 — The Trillion-Dollar Question: Competing Without a Big Tech Parent 10:37 — The Reality of Enterprise AI: Escaping "POC Purgatory" 15:06 — Why Mistral Hires Forward Deployed Engineers (FDEs) 16:57 — The Contrarian Take: Why "Agents" are just "Workflows" 19:35 — Trust & Autonomy: The Truth About Agent Reliability 21:26 — The Missing Stack: Governance and Versioning for AI 26:24 — When Will AI Actually Work? (The 2026 Timeline) 30:33 — Beyond Chat: The "Banger" Sovereign Use Cases 35:46 — Mistral 3 Architecture: Mixture of Experts vs. Dense 43:12 — Synthetic Data & The Post-Training Bottleneck 45:12 — Reasoning Models: Why "Thinking" is Just Tool Use 46:22 — Launching DevStral 2 and the Vibe CLI 50:49 — Engineering Lessons: How to Build Frontier AI Efficiently 56:08 — Are Enterprises Ready for AGI? & The Future of Intelligence

63,839 просмотров • 5 месяцев назад

Deeply thoughtful conversation with Zico Kolter, board member at OpenAI and head of the machine learning department at Carnegie Mellon University, about AI safety, AI security, agents and frontier AI 00:00 Intro 01:32 OpenAI board role and Safety & Security Committee 03:53 How OpenAI reviews major model releases 05:33 OpenAI’s preparedness framework explained 09:46 Are frontier AI models getting safer? 12:33 Why AI safety does not come from scale 15:23 The four categories of AI risk 19:38 Doomerism vs accelerationism in AI 24:11 The six-month AI pause debate 26:20 AI safety as a global effort 28:04 How Zico Kolter got into machine learning 31:05 OpenAI in the early days 34:14 Why Carnegie Mellon became an AI powerhouse 38:43 What Gray Swan does in AI security 40:44 AI safety vs AI security 43:15 The GCG jailbreak paper 49:19 How AI labs responded to jailbreak research 50:19 State-of-the-art AI defenses 52:32 State-of-the-art AI attacks 54:22 Why AI agents expand the attack surface 58:39 Are AI agents ready for production? 59:40 Mechanistic interpretability explained 1:02:31 Will AI be safer in two years? 1:03:46 Reinforcement learning and self-improving models 1:08:09 Do post-transformer architectures matter 1:09:29 Best research directions in AI now 1:11:00 Zico Kolter’s Intro to Modern AI course 1:14:53 Why modern AI is simpler than people think

Deeply thoughtful conversation with Zico Kolter, board member at OpenAI and head of the machine learning department at Carnegie Mellon University, about AI safety, AI security, agents and frontier AI 00:00 Intro 01:32 OpenAI board role and Safety & Security Committee 03:53 How OpenAI reviews major model releases 05:33 OpenAI’s preparedness framework explained 09:46 Are frontier AI models getting safer? 12:33 Why AI safety does not come from scale 15:23 The four categories of AI risk 19:38 Doomerism vs accelerationism in AI 24:11 The six-month AI pause debate 26:20 AI safety as a global effort 28:04 How Zico Kolter got into machine learning 31:05 OpenAI in the early days 34:14 Why Carnegie Mellon became an AI powerhouse 38:43 What Gray Swan does in AI security 40:44 AI safety vs AI security 43:15 The GCG jailbreak paper 49:19 How AI labs responded to jailbreak research 50:19 State-of-the-art AI defenses 52:32 State-of-the-art AI attacks 54:22 Why AI agents expand the attack surface 58:39 Are AI agents ready for production? 59:40 Mechanistic interpretability explained 1:02:31 Will AI be safer in two years? 1:03:46 Reinforcement learning and self-improving models 1:08:09 Do post-transformer architectures matter 1:09:29 Best research directions in AI now 1:11:00 Zico Kolter’s Intro to Modern AI course 1:14:53 Why modern AI is simpler than people think

32,922 просмотров • 2 месяцев назад