Video wird geladen...

Video konnte nicht geladen werden

Beim Laden dieses Videos ist ein Problem aufgetreten. Dies könnte an einem vorübergehenden Netzwerkproblem liegen oder das Video ist möglicherweise nicht verfügbar.

Karpathy told Dwarkesh that a 1 billion parameter model, trained on clean data, could hit the intelligence of today's 1.8 trillion parameter frontier. That is a 1,800x compression claim. The math behind it is more defensible than it sounds. When researchers at frontier labs look at random samples from... their training corpus, they see stock ticker symbols, broken HTML, forum spam, autogenerated gibberish. Not Wikipedia. Not the Wall Street Journal. The actual pretraining dataset is mostly noise, and the model is burning parameters to vaguely remember all of it. One estimate pegs Llama 3's information compression at 0.07 bits per token. Well-structured English carries around 1.5 bits per token of real information. The trillion-parameter model is holding a roughly 5% resolution image of the internet it trained on. So when a lab ships a 1.8 trillion parameter model, the overwhelming majority of those weights are handling rough memorization. They are compression overhead for a noisy training set, taking up capacity that could be doing reasoning instead. Karpathy's proposal is to separate the two. Build a cognitive core: a small model that contains only the algorithms for reasoning and problem-solving, stripped of encyclopedic memorization. Pair it with external memory the model queries when it needs a fact. A 1 billion parameter reasoner plus retrieval beats a 1.8 trillion parameter model trying to do both. The data already supports this direction. GPT-4o runs at roughly 200 billion parameters and outperforms the original 1.8 trillion GPT-4. Inference costs for GPT-3.5 level performance fell 280x between 2022 and 2024, driven almost entirely by smaller, cleaner, better-architected models. The trend line is pointing where Karpathy says it should. The real implication for anyone tracking the AI trade: data quality is the actual constraint. The companies winning the next phase will be the ones who figured out what to train on, and what to throw away.show more

Aakash Gupta

276,165 subscribers

507,662 Aufrufe • vor 2 Monaten •via X (Twitter)

Wissenschaft & Technologie

Anya Rossi• Live Now

Private livecam show

0 Kommentare

Keine Kommentare verfügbar

Kommentare vom Original-Post werden hier angezeigt

Ähnliche Videos

ELON MUSK: "Grok 5 will be the largest model, a 6 trillion parameter model, whereas Grok 3 and 4 are based on a 3 trillion parameter model. Moreover, the 6 trillion parameters will have a much higher intelligence density per gigabyte. Its really going to feel Sentient."

ELON MUSK: "Grok 5 will be the largest model, a 6 trillion parameter model, whereas Grok 3 and 4 are based on a 3 trillion parameter model. Moreover, the 6 trillion parameters will have a much higher intelligence density per gigabyte. Its really going to feel Sentient."

DogeDesigner

322,011 Aufrufe • vor 7 Monaten

You'd think the race to AGI would mean training the biggest possible model. But parameter scaling had stalled for a long time after GPT-4's trillion+ parameters, and only now are models getting bigger again. What gives? Partially it’s RL scaling, as Dylan Patel explains. A 5T parameter model takes 5x longer to generate RL rollouts than a 1T model. Even if the bigger model is 2x more sample-efficient, the smaller model finishes RL faster, gets deployed to research sooner, and starts helping build the next model before the big one is even done training.

You'd think the race to AGI would mean training the biggest possible model. But parameter scaling had stalled for a long time after GPT-4's trillion+ parameters, and only now are models getting bigger again. What gives? Partially it’s RL scaling, as Dylan Patel explains. A 5T parameter model takes 5x longer to generate RL rollouts than a 1T model. Even if the bigger model is 2x more sample-efficient, the smaller model finishes RL faster, gets deployed to research sooner, and starts helping build the next model before the big one is even done training.

Dwarkesh Patel

65,123 Aufrufe • vor 2 Monaten

🇺🇸 ZUCK: META AI IS THE BIGGEST, BEST, IN THE WORLD!! Has anyone ever used it?... “Meta AI now has nearly 600 million monthly actives and as promised is on track to be the most used AI assistant in the world by the end of the year. Llama 3.3 is a new 70 billion parameter text model that performs about as well as our 405 billion parameter model but now it is easier and more efficient to run. So that is the last Lama 3.0 release. The next stop is Lama 4.0.” Source: Instagram

🇺🇸 ZUCK: META AI IS THE BIGGEST, BEST, IN THE WORLD!! Has anyone ever used it?... “Meta AI now has nearly 600 million monthly actives and as promised is on track to be the most used AI assistant in the world by the end of the year. Llama 3.3 is a new 70 billion parameter text model that performs about as well as our 405 billion parameter model but now it is easier and more efficient to run. So that is the last Lama 3.0 release. The next stop is Lama 4.0.” Source: Instagram

Mario Nawfal

482,099 Aufrufe • vor 1 Jahr

NVIDIA CEO: GROK 5 IS A 7 TRILLION PARAMETER RACE AGAINST TIME Jensen Huang is dialing in on the real challenge: not making bigger models, but training them fast without draining power or budgets. Grok 5 is right in the middle of that race. “The next frontier model. Elon already mentioned that the next version of Grok, Grok 5 I believe, is 7 trillion parameters. This one is 10, and the green represents Blackwell. In the case of Rubin, notice that the throughput is much higher, so it only takes one fourth as many of these systems to train the model within the one-month timeframe we have given here.” Source: Rohan Paul

NVIDIA CEO: GROK 5 IS A 7 TRILLION PARAMETER RACE AGAINST TIME Jensen Huang is dialing in on the real challenge: not making bigger models, but training them fast without draining power or budgets. Grok 5 is right in the middle of that race. “The next frontier model. Elon already mentioned that the next version of Grok, Grok 5 I believe, is 7 trillion parameters. This one is 10, and the green represents Blackwell. In the case of Rubin, notice that the throughput is much higher, so it only takes one fourth as many of these systems to train the model within the one-month timeframe we have given here.” Source: Rohan Paul

Mario Nawfal

76,354 Aufrufe • vor 5 Monaten

TWO BOXES THE SIZE OF A MAC MINI JUST RAN A 235 BILLION PARAMETER MODEL ON A DESK It is two NVIDIA DGX Spark units linked by a single cable. A year ago a model this size meant renting a GPU cluster by the hour. Now it sits next to your monitor for around $8,000. Here is the twist most people miss. Linking them does not create one shared 256GB memory pool. The model is split across both boxes, and that is the only reason a 235B model fits at all. It answers at roughly 10 tokens per second, and both chips sit at just 74 degrees while sipping around 50 watts. Every token stays on the desk. Nothing touches a cloud, and nothing leaves the room. The ceiling for what you can run at home just jumped from 70B to 235B. Bookmark this & Watch it run ↓

TWO BOXES THE SIZE OF A MAC MINI JUST RAN A 235 BILLION PARAMETER MODEL ON A DESK It is two NVIDIA DGX Spark units linked by a single cable. A year ago a model this size meant renting a GPU cluster by the hour. Now it sits next to your monitor for around $8,000. Here is the twist most people miss. Linking them does not create one shared 256GB memory pool. The model is split across both boxes, and that is the only reason a 235B model fits at all. It answers at roughly 10 tokens per second, and both chips sit at just 74 degrees while sipping around 50 watts. Every token stays on the desk. Nothing touches a cloud, and nothing leaves the room. The ceiling for what you can run at home just jumped from 70B to 235B. Bookmark this & Watch it run ↓

slash1s

100,761 Aufrufe • vor 9 Tagen

$A crypto project actually trained a 72B parameter AI model from scratch using decentralized GPU compute. Not fine-tuned, not a wrapper: trained from zero. The model benchmarks competitively against Meta's LLaMA 3 on reasoning tasks, and the entire training run cost a fraction of what centralized labs spend. If decentralized compute can produce frontier-class models, the moat around OpenAI and Anthropic is thinner than people think.$

A crypto project actually trained a 72B parameter AI model from scratch using decentralized GPU compute. Not fine-tuned, not a wrapper: trained from zero. The model benchmarks competitively against Meta's LLaMA 3 on reasoning tasks, and the entire training run cost a fraction of what centralized labs spend. If decentralized compute can produce frontier-class models, the moat around OpenAI and Anthropic is thinner than people think.

VirtualBacon

40,543 Aufrufe • vor 2 Monaten

The most overlooked part of the SpaceX IPO thesis is the model and most people are completely missing it (Save this) Everyone has been focused on the Anthropic compute deal and the Colossus revenue because those are numbers you can put in a spreadsheet. Six months ago, xAI was competing reasonably well on model performance but was not clearly on the frontier. Then SpaceX exercised its option to acquire Cursor for $60 billion, the largest startup acquisition in history just days after completing the largest IPO in history at $75 billion. Cursor is a team of 700 to 800 people, was on track to exit 2026 at up to $10 billion in revenue, had millions of professional developers using it daily, and had already built a team with the genuine potential to compete at the frontier, the one thing holding them back was compute. SpaceX just gave them the largest GPU cluster in the world to work with. Grok 4.3, a 1.5 trillion parameter model, is currently training with Cursor's proprietary coding data being injected directly into pre-training, not just fine tuning which is a fundamentally more powerful integration than anything the market is currently modeling. The prior version, Grok 4, was already on the Pareto frontier as of 10 to 12 days ago, the most intelligent 500 billion parameter model in the world, sitting alongside Google Gemini, Anthropic, and OpenAI as one of only four systems at the true frontier. Composer 2.5, the previous Cursor model was Pareto dominant in coding tasks just before the acquisition closed, meaning SpaceX inherited a model that was already best-in-class in the highest-value AI use case in the market. The AWS parallel is the one everyone keeps missing. Bezos built data center capacity for Black Friday, sat on idle infrastructure the rest of the year, and monetized it into what was at the time the most profitable technology business in history and investors hated it in 2009 and 2010 because he was burning free cash flow on capacity that had no obvious revenue yet. SpaceX is in exactly that position, it built Colossus for xAI's own training needs, is monetizing excess capacity to Anthropic at $1.25 billion per month across 220,000 Nvidia GPUs, and has reportedly secured up to 20% of Nvidia's early Vera Rubin allocation, giving it the most powerful and scarcest GPU infrastructure in the world during the critical window when those chips are hardest to get. The $60 billion Cursor acquisition closed at a moment when SpaceX had essentially unlimited compute, a team already at the frontier, and a product with deep enterprise distribution, three things no other model lab had simultaneously when it was at this stage. The market is pricing the compute business conservatively and ignoring the model call option entirely, and coding is the fastest path to AGI, once you are on the Pareto frontier with that compute, revenue scales fast. Anthropic went from negligible revenue to $30 billion annualized in under 18 months and that is the existence proof. Bullish on SpaceXAI and Elon Musk

The most overlooked part of the SpaceX IPO thesis is the model and most people are completely missing it (Save this) Everyone has been focused on the Anthropic compute deal and the Colossus revenue because those are numbers you can put in a spreadsheet. Six months ago, xAI was competing reasonably well on model performance but was not clearly on the frontier. Then SpaceX exercised its option to acquire Cursor for $60 billion, the largest startup acquisition in history just days after completing the largest IPO in history at $75 billion. Cursor is a team of 700 to 800 people, was on track to exit 2026 at up to $10 billion in revenue, had millions of professional developers using it daily, and had already built a team with the genuine potential to compete at the frontier, the one thing holding them back was compute. SpaceX just gave them the largest GPU cluster in the world to work with. Grok 4.3, a 1.5 trillion parameter model, is currently training with Cursor's proprietary coding data being injected directly into pre-training, not just fine tuning which is a fundamentally more powerful integration than anything the market is currently modeling. The prior version, Grok 4, was already on the Pareto frontier as of 10 to 12 days ago, the most intelligent 500 billion parameter model in the world, sitting alongside Google Gemini, Anthropic, and OpenAI as one of only four systems at the true frontier. Composer 2.5, the previous Cursor model was Pareto dominant in coding tasks just before the acquisition closed, meaning SpaceX inherited a model that was already best-in-class in the highest-value AI use case in the market. The AWS parallel is the one everyone keeps missing. Bezos built data center capacity for Black Friday, sat on idle infrastructure the rest of the year, and monetized it into what was at the time the most profitable technology business in history and investors hated it in 2009 and 2010 because he was burning free cash flow on capacity that had no obvious revenue yet. SpaceX is in exactly that position, it built Colossus for xAI's own training needs, is monetizing excess capacity to Anthropic at $1.25 billion per month across 220,000 Nvidia GPUs, and has reportedly secured up to 20% of Nvidia's early Vera Rubin allocation, giving it the most powerful and scarcest GPU infrastructure in the world during the critical window when those chips are hardest to get. The $60 billion Cursor acquisition closed at a moment when SpaceX had essentially unlimited compute, a team already at the frontier, and a product with deep enterprise distribution, three things no other model lab had simultaneously when it was at this stage. The market is pricing the compute business conservatively and ignoring the model call option entirely, and coding is the fastest path to AGI, once you are on the Pareto frontier with that compute, revenue scales fast. Anthropic went from negligible revenue to $30 billion annualized in under 18 months and that is the existence proof. Bullish on SpaceXAI and Elon Musk

Milk Road AI

68,768 Aufrufe • vor 7 Tagen

You can smell a big model. Not the parameter count. Not the benchmark score. It's that feeling when something is actually reasoning. Not just pattern matching. We call it "big model smell."

You can smell a big model. Not the parameter count. Not the benchmark score. It's that feeling when something is actually reasoning. Not just pattern matching. We call it "big model smell."

Arena.ai

111,988 Aufrufe • vor 2 Monaten

Jensen Huang said Grok 5 will be 7 Trillion parameter model. On time to train, with the training window fixed at 1 month, the new Vera-Rubin GPU system of Nvidia needs 1/4 the number of systems compared with Blackwell to train the same frontier model. “factory throughput” improves by about 10x over Blackwell, and Blackwell itself was about 10x over Hopper. i.e. overall Rubin is roughly 100x Hopper in factory throughput per watt, which matters hugely, because a 1 GW, $50B data center is power-limited and revenue scales with throughput per watt. --- From Nvidia YT channel

Jensen Huang said Grok 5 will be 7 Trillion parameter model. On time to train, with the training window fixed at 1 month, the new Vera-Rubin GPU system of Nvidia needs 1/4 the number of systems compared with Blackwell to train the same frontier model. “factory throughput” improves by about 10x over Blackwell, and Blackwell itself was about 10x over Hopper. i.e. overall Rubin is roughly 100x Hopper in factory throughput per watt, which matters hugely, because a 1 GW, $50B data center is power-limited and revenue scales with throughput per watt. --- From Nvidia YT channel

Rohan Paul

165,770 Aufrufe • vor 5 Monaten

$Introducing HRM-Text. An ultra-lean 1B-parameter reasoning language model designed to deliver strong general performance with a fraction of the data, compute, and infrastructure. Trained on just 40B structured tokens, HRM-Text achieves competitive performance while using ~1/1000 of the training data of comparable models. The kicker? The full model trains in roughly one day on a $1,000 budget. This opens the door to a new generation of AI that is powerful, accessible, and radically easier to adapt. Theories and research concepts once deemed too expensive to test are officially back in the game. Sapient Intelligence invites you to help us shape a new paradigm for general intelligence.$

Introducing HRM-Text. An ultra-lean 1B-parameter reasoning language model designed to deliver strong general performance with a fraction of the data, compute, and infrastructure. Trained on just 40B structured tokens, HRM-Text achieves competitive performance while using ~1/1000 of the training data of comparable models. The kicker? The full model trains in roughly one day on a $1,000 budget. This opens the door to a new generation of AI that is powerful, accessible, and radically easier to adapt. Theories and research concepts once deemed too expensive to test are officially back in the game. Sapient Intelligence invites you to help us shape a new paradigm for general intelligence.

Sapient Intelligence

510,154 Aufrufe • vor 1 Monat

one half of this keynote sells you the cloud forever, the other half shows the chip that lets you stop renting it this is the other half, AMD's CEO holding the chip in her hand 00:00 - Lisa Su introduces the Ryzen AI Halo, a system built for local AI 00:29 - the line that matters, it runs models up to 200 billion parameters locally, not connected to anything 00:42 - a 200 billion parameter model, the tier of the top paid AI plans, on a desktop that fits in your hand so the cloud wants $200 a month, forever, for access you never own this is the box that runs the same class of model with nothing leaving the room that is the whole point of my breakdown, the $200 a month was never the intelligence, it was the meter and the meter just became optional most people will see a spec demo the part that matters is what it lets you stop paying for full breakdown below

one half of this keynote sells you the cloud forever, the other half shows the chip that lets you stop renting it this is the other half, AMD's CEO holding the chip in her hand 00:00 - Lisa Su introduces the Ryzen AI Halo, a system built for local AI 00:29 - the line that matters, it runs models up to 200 billion parameters locally, not connected to anything 00:42 - a 200 billion parameter model, the tier of the top paid AI plans, on a desktop that fits in your hand so the cloud wants $200 a month, forever, for access you never own this is the box that runs the same class of model with nothing leaving the room that is the whole point of my breakdown, the $200 a month was never the intelligence, it was the meter and the meter just became optional most people will see a spec demo the part that matters is what it lets you stop paying for full breakdown below

John Doe

25,960 Aufrufe • vor 9 Tagen

Sam Altman just handed every startup founder a one-question autopsy. Altman: “If you’re building something on GPT-4 that a reasonable observer would say we’re going to steamroll you.” Not might. Not could. Going to. He said it with the calm of someone describing weather. Because to him it is weather. The model improves. Whatever was built on the old version’s weaknesses gets washed away. That is not strategy. That is erosion. And most founders are building on the erosion line. They find a gap in the current model. They wrap a product around it. They raise money. They hire. They scale. Then OpenAI releases the next version and the gap closes and the product has no reason to exist anymore. Altman: “When we just do our fundamental job, which is make the model better with every crank, then you get the ‘OpenAI killed my startup’ meme.” He is telling you directly. They are not hunting you. They are not even thinking about you. They are just improving the model. You happen to be standing where the improvement lands. That is the part founders refuse to hear. OpenAI does not need to compete with you. It just needs to keep doing exactly what it was already doing and your entire company disappears as a side effect. You are not a competitor. You are a temporary symptom of incomplete intelligence. The moment the intelligence completes you become nothing. Then Brad Lightcap delivered the cleanest diagnostic ever spoken in venture capital. Lightcap: “Ask if a 100x improvement in the model is something they’re excited about.” One question. The entire investment thesis reduced to a single binary. Does the next model make your company more powerful or does it make your company pointless. There is no middle ground. Lightcap: “We know the companies that come to us saying, ‘We want the next model. When is it coming out? I want to be the first to try it.’” These companies built something that feeds on intelligence. The smarter the model gets the more their product can do. They are not threatened by progress. They are starving for it. Then there are the companies Lightcap never hears from. The ones who go quiet when a new model drops. The ones who read the release notes like a death sentence. The ones privately praying the next generation takes longer because every improvement shrinks the ground beneath them. If you are hoping the model stays roughly where it is you have already told the market everything it needs to know about your company. You are not building on intelligence. You are building on the absence of it. Altman: “95% of the world should be betting on the latter category.” The latter category is simple. Assume the model keeps getting better at the pace it has been getting better. Build for that world. Not the world where GPT-4 is the ceiling. The world where GPT-4 is the floor and the ceiling has not been built yet. Then Altman told a story that should be framed on the wall of every startup in the country. A medical AI company came to him that morning. They were not complaining about the model. They were not worried about being replaced. They were demanding it improve faster. Altman: “Here’s how many people are dying every day you delay.” That is what alignment with the trajectory looks like. A company so deeply built on intelligence improving that every day the model stays the same is a day someone dies who did not have to. They are not building on a flaw. They are building on a future that has not arrived fast enough. That is the difference. The wrapper startup patches what the model cannot do today. The real company builds what the model will unlock tomorrow. One is running from the train. The other is laying the track. Altman told you the train is not slowing down. Lightcap told you exactly how to know which side you are on. One question. Does a 100x smarter model make you more valuable or erase you. If you had to pause before answering you already did.

Sam Altman just handed every startup founder a one-question autopsy. Altman: “If you’re building something on GPT-4 that a reasonable observer would say we’re going to steamroll you.” Not might. Not could. Going to. He said it with the calm of someone describing weather. Because to him it is weather. The model improves. Whatever was built on the old version’s weaknesses gets washed away. That is not strategy. That is erosion. And most founders are building on the erosion line. They find a gap in the current model. They wrap a product around it. They raise money. They hire. They scale. Then OpenAI releases the next version and the gap closes and the product has no reason to exist anymore. Altman: “When we just do our fundamental job, which is make the model better with every crank, then you get the ‘OpenAI killed my startup’ meme.” He is telling you directly. They are not hunting you. They are not even thinking about you. They are just improving the model. You happen to be standing where the improvement lands. That is the part founders refuse to hear. OpenAI does not need to compete with you. It just needs to keep doing exactly what it was already doing and your entire company disappears as a side effect. You are not a competitor. You are a temporary symptom of incomplete intelligence. The moment the intelligence completes you become nothing. Then Brad Lightcap delivered the cleanest diagnostic ever spoken in venture capital. Lightcap: “Ask if a 100x improvement in the model is something they’re excited about.” One question. The entire investment thesis reduced to a single binary. Does the next model make your company more powerful or does it make your company pointless. There is no middle ground. Lightcap: “We know the companies that come to us saying, ‘We want the next model. When is it coming out? I want to be the first to try it.’” These companies built something that feeds on intelligence. The smarter the model gets the more their product can do. They are not threatened by progress. They are starving for it. Then there are the companies Lightcap never hears from. The ones who go quiet when a new model drops. The ones who read the release notes like a death sentence. The ones privately praying the next generation takes longer because every improvement shrinks the ground beneath them. If you are hoping the model stays roughly where it is you have already told the market everything it needs to know about your company. You are not building on intelligence. You are building on the absence of it. Altman: “95% of the world should be betting on the latter category.” The latter category is simple. Assume the model keeps getting better at the pace it has been getting better. Build for that world. Not the world where GPT-4 is the ceiling. The world where GPT-4 is the floor and the ceiling has not been built yet. Then Altman told a story that should be framed on the wall of every startup in the country. A medical AI company came to him that morning. They were not complaining about the model. They were not worried about being replaced. They were demanding it improve faster. Altman: “Here’s how many people are dying every day you delay.” That is what alignment with the trajectory looks like. A company so deeply built on intelligence improving that every day the model stays the same is a day someone dies who did not have to. They are not building on a flaw. They are building on a future that has not arrived fast enough. That is the difference. The wrapper startup patches what the model cannot do today. The real company builds what the model will unlock tomorrow. One is running from the train. The other is laying the track. Altman told you the train is not slowing down. Lightcap told you exactly how to know which side you are on. One question. Does a 100x smarter model make you more valuable or erase you. If you had to pause before answering you already did.

Dustin

39,109 Aufrufe • vor 2 Monaten

GROK 5. the first 7 trillion parameter model

GROK 5. the first 7 trillion parameter model

🍓🍓🍓

39,839 Aufrufe • vor 5 Monaten

"One of the biggest misconceptions" Cerebras CFO Bob Komin pushes back on the small-models narrative. "We serve all models, and there is no limit to the size of the models that we can serve. Today, we're serving trillion parameter models. We're serving trillion parameter models that are internal for OpenAI today. We are currently running OpenAI 5.4 and 5.5 with them."

"One of the biggest misconceptions" Cerebras CFO Bob Komin pushes back on the small-models narrative. "We serve all models, and there is no limit to the size of the models that we can serve. Today, we're serving trillion parameter models. We're serving trillion parameter models that are internal for OpenAI today. We are currently running OpenAI 5.4 and 5.5 with them."

Deirdre Bosa

83,925 Aufrufe • vor 1 Monat

Blender AI Real-Time Motion Capture Plugin — connect a 1080P camera or upload videos. It runs locally with a 1-billion-parameter model and requires 8GB of VRAM for real-time processing. It supports both real-time capture and video uploads. The full-parameter version currently supports NVIDIA CUDA and requires DX11 or higher.

Blender AI Real-Time Motion Capture Plugin — connect a 1080P camera or upload videos. It runs locally with a 1-billion-parameter model and requires 8GB of VRAM for real-time processing. It supports both real-time capture and video uploads. The full-parameter version currently supports NVIDIA CUDA and requires DX11 or higher.

CYANPUPPETS

53,996 Aufrufe • vor 3 Monaten

Blender AI Real-Time Motion Capture Plugin — connect a 1080P camera or upload videos. It runs locally with a 1-billion-parameter model and requires 8GB of VRAM for real-time processing. It supports both real-time capture and video uploads. The full-parameter version currently supports NVIDIA CUDA and requires DX11 or higher.

Blender AI Real-Time Motion Capture Plugin — connect a 1080P camera or upload videos. It runs locally with a 1-billion-parameter model and requires 8GB of VRAM for real-time processing. It supports both real-time capture and video uploads. The full-parameter version currently supports NVIDIA CUDA and requires DX11 or higher.

CYANPUPPETS

26,492 Aufrufe • vor 4 Monaten

Chamath: Anthropic's Mythos Warning Is Theater @jason: “Chamath, is it the Boy who Cried Wolf, or is this the real deal now?” Chamath Palihapitiya: “I think it's mostly theater. In February of 2019 when Dario was still at OpenAI, they did the same thing with GPT-2. That was a 1.5 billion parameter model, which sounds like a total fart in the wind in 2026. But at that time, this model was supposed to be the end of days. And at the end of it, it was a huge nothingburger. If you actually think that Mythos is capable of doing what it says it can do, two things are true. One is, a very sophisticated hacker can probably do those things right now with Opus. And two, if these exploits are this easy to find, whether you use Opus or whether you use Mythos, the reality is you'd have to shut down the internet for about five years to patch them all. So when you see a large multi-trillion dollar GSIB bank, it's a bit of theater. Why? What do you think they can actually accomplish in two months? Do you actually think that if there's these vulnerabilities, it's all going to get fixed? Let's give them six months, let's give them nine months. So I do think that Sacks is right, that they have figured out a very clever go-to-market muscle here that activates hyper attention and hyper usage, and so I give them tremendous credit. But we've seen it before, we saw it when these folks were the principal architects at OpenAI, and we're now seeing the same playbook here. The reality is that capitalism moves forward, the funding needs moves forward, and the need for these guys to build adoption moves forward. And that's going to supersede what this is.”

Chamath: Anthropic's Mythos Warning Is Theater @jason: “Chamath, is it the Boy who Cried Wolf, or is this the real deal now?” Chamath Palihapitiya: “I think it's mostly theater. In February of 2019 when Dario was still at OpenAI, they did the same thing with GPT-2. That was a 1.5 billion parameter model, which sounds like a total fart in the wind in 2026. But at that time, this model was supposed to be the end of days. And at the end of it, it was a huge nothingburger. If you actually think that Mythos is capable of doing what it says it can do, two things are true. One is, a very sophisticated hacker can probably do those things right now with Opus. And two, if these exploits are this easy to find, whether you use Opus or whether you use Mythos, the reality is you'd have to shut down the internet for about five years to patch them all. So when you see a large multi-trillion dollar GSIB bank, it's a bit of theater. Why? What do you think they can actually accomplish in two months? Do you actually think that if there's these vulnerabilities, it's all going to get fixed? Let's give them six months, let's give them nine months. So I do think that Sacks is right, that they have figured out a very clever go-to-market muscle here that activates hyper attention and hyper usage, and so I give them tremendous credit. But we've seen it before, we saw it when these folks were the principal architects at OpenAI, and we're now seeing the same playbook here. The reality is that capitalism moves forward, the funding needs moves forward, and the need for these guys to build adoption moves forward. And that's going to supersede what this is.”

The All-In Podcast

220,049 Aufrufe • vor 2 Monaten

NeurochainAI is the all-in-one platform for developers to build AI dApps including GPU DePIN, Data Layer, and AI model layer. But what does it actually mean? AI is a $1.8 trillion industry, and NeurochainAI is tapping into 1% of that - as an infrastructure of interconnected GPUs, AI models, and data. Even simpler? Think of us as Ethereum for AI🧠🌐

NeurochainAI is the all-in-one platform for developers to build AI dApps including GPU DePIN, Data Layer, and AI model layer. But what does it actually mean? AI is a $1.8 trillion industry, and NeurochainAI is tapping into 1% of that - as an infrastructure of interconnected GPUs, AI models, and data. Even simpler? Think of us as Ethereum for AI🧠🌐

Neurochain.AI

105,278 Aufrufe • vor 2 Jahren

Last fall, when submitting the state budget for 2025, the Russian government promised a deficit of 1.2 trillion rubles ($14,2 billion). In reality, the deficit will be several times higher. Currently, the deficit is almost 5 trillion ($59,5 billion), and by the end of the year, it could reach 6-8 trillion ($95 billion). Now, when submitting the state budget for 2026, the deficit is estimated at 4.6 trillion rubles ($54,7 billion) already (a prognosis three times worse than for 2025).

Last fall, when submitting the state budget for 2025, the Russian government promised a deficit of 1.2 trillion rubles ($14,2 billion). In reality, the deficit will be several times higher. Currently, the deficit is almost 5 trillion ($59,5 billion), and by the end of the year, it could reach 6-8 trillion ($95 billion). Now, when submitting the state budget for 2026, the deficit is estimated at 4.6 trillion rubles ($54,7 billion) already (a prognosis three times worse than for 2025).

Anton Gerashchenko

173,165 Aufrufe • vor 9 Monaten