Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Karpathy told Dwarkesh that a 1 billion parameter model, trained on clean data, could hit the intelligence of today's 1.8 trillion parameter frontier. That is a 1,800x compression claim. The math behind it is more defensible than it sounds. When researchers at frontier labs look at random samples from...

507,662 Aufrufe • vor 2 Monaten •via X (Twitter)

0 Kommentare

Keine Kommentare verfügbar

Kommentare vom Original-Post werden hier angezeigt

Ähnliche Videos

The most overlooked part of the SpaceX IPO thesis is the model and most people are completely missing it (Save this) Everyone has been focused on the Anthropic compute deal and the Colossus revenue because those are numbers you can put in a spreadsheet. Six months ago, xAI was competing reasonably well on model performance but was not clearly on the frontier. Then SpaceX exercised its option to acquire Cursor for $60 billion, the largest startup acquisition in history just days after completing the largest IPO in history at $75 billion. Cursor is a team of 700 to 800 people, was on track to exit 2026 at up to $10 billion in revenue, had millions of professional developers using it daily, and had already built a team with the genuine potential to compete at the frontier, the one thing holding them back was compute. SpaceX just gave them the largest GPU cluster in the world to work with. Grok 4.3, a 1.5 trillion parameter model, is currently training with Cursor's proprietary coding data being injected directly into pre-training, not just fine tuning which is a fundamentally more powerful integration than anything the market is currently modeling. The prior version, Grok 4, was already on the Pareto frontier as of 10 to 12 days ago, the most intelligent 500 billion parameter model in the world, sitting alongside Google Gemini, Anthropic, and OpenAI as one of only four systems at the true frontier. Composer 2.5, the previous Cursor model was Pareto dominant in coding tasks just before the acquisition closed, meaning SpaceX inherited a model that was already best-in-class in the highest-value AI use case in the market. The AWS parallel is the one everyone keeps missing. Bezos built data center capacity for Black Friday, sat on idle infrastructure the rest of the year, and monetized it into what was at the time the most profitable technology business in history and investors hated it in 2009 and 2010 because he was burning free cash flow on capacity that had no obvious revenue yet. SpaceX is in exactly that position, it built Colossus for xAI's own training needs, is monetizing excess capacity to Anthropic at $1.25 billion per month across 220,000 Nvidia GPUs, and has reportedly secured up to 20% of Nvidia's early Vera Rubin allocation, giving it the most powerful and scarcest GPU infrastructure in the world during the critical window when those chips are hardest to get. The $60 billion Cursor acquisition closed at a moment when SpaceX had essentially unlimited compute, a team already at the frontier, and a product with deep enterprise distribution, three things no other model lab had simultaneously when it was at this stage. The market is pricing the compute business conservatively and ignoring the model call option entirely, and coding is the fastest path to AGI, once you are on the Pareto frontier with that compute, revenue scales fast. Anthropic went from negligible revenue to $30 billion annualized in under 18 months and that is the existence proof. Bullish on SpaceXAI and Elon Musk

Milk Road AI

68,768 Aufrufe • vor 7 Tagen

Sam Altman just handed every startup founder a one-question autopsy. Altman: “If you’re building something on GPT-4 that a reasonable observer would say we’re going to steamroll you.” Not might. Not could. Going to. He said it with the calm of someone describing weather. Because to him it is weather. The model improves. Whatever was built on the old version’s weaknesses gets washed away. That is not strategy. That is erosion. And most founders are building on the erosion line. They find a gap in the current model. They wrap a product around it. They raise money. They hire. They scale. Then OpenAI releases the next version and the gap closes and the product has no reason to exist anymore. Altman: “When we just do our fundamental job, which is make the model better with every crank, then you get the ‘OpenAI killed my startup’ meme.” He is telling you directly. They are not hunting you. They are not even thinking about you. They are just improving the model. You happen to be standing where the improvement lands. That is the part founders refuse to hear. OpenAI does not need to compete with you. It just needs to keep doing exactly what it was already doing and your entire company disappears as a side effect. You are not a competitor. You are a temporary symptom of incomplete intelligence. The moment the intelligence completes you become nothing. Then Brad Lightcap delivered the cleanest diagnostic ever spoken in venture capital. Lightcap: “Ask if a 100x improvement in the model is something they’re excited about.” One question. The entire investment thesis reduced to a single binary. Does the next model make your company more powerful or does it make your company pointless. There is no middle ground. Lightcap: “We know the companies that come to us saying, ‘We want the next model. When is it coming out? I want to be the first to try it.’” These companies built something that feeds on intelligence. The smarter the model gets the more their product can do. They are not threatened by progress. They are starving for it. Then there are the companies Lightcap never hears from. The ones who go quiet when a new model drops. The ones who read the release notes like a death sentence. The ones privately praying the next generation takes longer because every improvement shrinks the ground beneath them. If you are hoping the model stays roughly where it is you have already told the market everything it needs to know about your company. You are not building on intelligence. You are building on the absence of it. Altman: “95% of the world should be betting on the latter category.” The latter category is simple. Assume the model keeps getting better at the pace it has been getting better. Build for that world. Not the world where GPT-4 is the ceiling. The world where GPT-4 is the floor and the ceiling has not been built yet. Then Altman told a story that should be framed on the wall of every startup in the country. A medical AI company came to him that morning. They were not complaining about the model. They were not worried about being replaced. They were demanding it improve faster. Altman: “Here’s how many people are dying every day you delay.” That is what alignment with the trajectory looks like. A company so deeply built on intelligence improving that every day the model stays the same is a day someone dies who did not have to. They are not building on a flaw. They are building on a future that has not arrived fast enough. That is the difference. The wrapper startup patches what the model cannot do today. The real company builds what the model will unlock tomorrow. One is running from the train. The other is laying the track. Altman told you the train is not slowing down. Lightcap told you exactly how to know which side you are on. One question. Does a 100x smarter model make you more valuable or erase you. If you had to pause before answering you already did.

Dustin

39,109 Aufrufe • vor 2 Monaten

Chamath: Anthropic's Mythos Warning Is Theater @jason: “Chamath, is it the Boy who Cried Wolf, or is this the real deal now?” Chamath Palihapitiya: “I think it's mostly theater. In February of 2019 when Dario was still at OpenAI, they did the same thing with GPT-2. That was a 1.5 billion parameter model, which sounds like a total fart in the wind in 2026. But at that time, this model was supposed to be the end of days. And at the end of it, it was a huge nothingburger. If you actually think that Mythos is capable of doing what it says it can do, two things are true. One is, a very sophisticated hacker can probably do those things right now with Opus. And two, if these exploits are this easy to find, whether you use Opus or whether you use Mythos, the reality is you'd have to shut down the internet for about five years to patch them all. So when you see a large multi-trillion dollar GSIB bank, it's a bit of theater. Why? What do you think they can actually accomplish in two months? Do you actually think that if there's these vulnerabilities, it's all going to get fixed? Let's give them six months, let's give them nine months. So I do think that Sacks is right, that they have figured out a very clever go-to-market muscle here that activates hyper attention and hyper usage, and so I give them tremendous credit. But we've seen it before, we saw it when these folks were the principal architects at OpenAI, and we're now seeing the same playbook here. The reality is that capitalism moves forward, the funding needs moves forward, and the need for these guys to build adoption moves forward. And that's going to supersede what this is.”

The All-In Podcast

220,049 Aufrufe • vor 2 Monaten