Загрузка видео...

Не удалось загрузить видео

На главную

Microsoft CEO Satya Nadella's new interivew: Explains how the next AI moat will not be the model you use, but the learning loop only your company can run. He is really asking what happens to the firm when intelligence becomes something you can rent. For a century, companies protected...

74,452 просмотров • 1 день назад •via X (Twitter)

Комментарии: 0

Нет доступных комментариев

Здесь появятся комментарии из оригинального поста

Похожие видео

The creator of High Bandwidth Memory said something that reframes the entire AI investment thesis, AI equals memory (Save this). Most people still think about AI hardware through a training lens. During training, the bottleneck is raw compute, GPUs stay near 100% utilization crunching through billions of gradient updates. Inference is a completely different problem. When a model generates a response, it produces tokens one at a time and at every single step, the entire model has to be loaded from memory into the processor to generate just one token. The GPU cores sit there, waiting for data to arrive. This is what engineers mean when they say inference is memory bound, the bottleneck is not how many calculations you can do per second but rather how fast you can move data from memory to the chip. Adding more GPUs does not fix a memory bandwidth problem, it just gives you more processors starving for the same data. Modern LLMs use a KV cache, a data structure that stores the conversation's context so the model does not have to recompute it from scratch on each step. The KV cache is what gives a model its memory of the conversation. It grows with every token and for long documents or deep reasoning chains, it can dwarf the model weights themselves in memory consumption. This means memory directly determines how long a context the model can hold, how many users you can serve simultaneously, how fast it responds and how cheaply you can run it. A memory constrained model is not just slower but rather qualitatively worse, it forgets earlier parts of the conversation, truncates context and hallucinates more because it literally cannot hold the relevant information long enough to use it. The world now spends more on inference than training, and every ChatGPT query, every Claude document analysis, every API call is an inference workload. Inference economics, cost per token, latency, context length, concurrent users are memory problems first and compute problems second. The companies that control memory bandwidth and supply are not suppliers to the AI trade but rather are the AI trade. Long Micron! Follow me Melvin for more AI, semis and the next big market themes.

Melvin

47,148 просмотров • 5 дней назад

Dario Amodei just told software engineers exactly how long they have. Six to twelve months. Amodei: “I have engineers within Anthropic who say I don’t write any code anymore. I just let the model write the code, I edit it, I do the things around it.” The people building the most powerful AI in history have already stopped writing code. That is not a forecast. That is the current working condition inside the lab closest to the frontier. Amodei: “We might be six to 12 months away from when the model is doing most, maybe all, of what SWEs do end-to-end.” The tech industry spent a decade making software engineers its highest-paid, most protected class. That era has a last day now. When a model can execute an entire software build end-to-end, the ability to write syntax stops being a skill. It becomes a credential for a job that no longer exists. Amodei: “And then it’s a question of how fast does that loop close.” That is the sentence everyone skipped. The code was never the hard part. The hard part was everything around it. The model just learned everything around it. Writing the code is already nearly gone. Testing is next. Deployment is next. When all three collapse into a single autonomous execution loop, the machine no longer needs a human in the chain at all. The corporation or sovereign state that closes that loop first does not gain a competitive advantage. It gains a category of speed that biological engineers cannot match, track, or reverse. That is not disruption. That is replacement at a systems level. Amodei is not describing a future disruption. He is describing the current state of his own building. The loop is already closing. The only question is whether you are inside it or outside it when it seals.

Dustin

315,019 просмотров • 3 месяцев назад

Chamath: AI advantage may come less from models than from private inputs. "When labs can build similar models, the real win comes from one unique ingredient in order to monetize it well. Here is a basic thing about machine learning that is worth knowing: if you take 1,000 of the same inputs and give them to Facebook, Microsoft, Google, and Amazon, they will all come up with the same machine learning model. But if you have one extra thing, one little ingredient that all of those other companies do not have, your output can be markedly different. It is like giving two great chefs three ingredients, but giving the third chef one extra ingredient. That person has the ability to do something very special. Right now, we are in a world where everybody is crawling the open web. We are going to move to a world where, as everybody gets sophisticated enough and information is widely available, somebody is going to say, “You know what? This site, I am not going to allow anybody else to access. It is only for me, only for my models.” Those models will become better. So we have to let that play out a little bit. It is going to be a really interesting arms race. The next wave of M&A, for example, could be companies like Google, Microsoft, and Facebook looking at these companies and saying, “Can they be viable inputs to my large language models or to my other machine learning and AI models?” --- A company with unique workflows, transactions, medical records, industrial logs, legal archives, design files, or user behavior can turn boring private data into a compounding advantage. Some startups may never become great public companies on their own, yet still become valuable because they own a data stream that makes a larger AI system sharper, more differentiated, or harder to copy. That turns acquisition strategy upside down: the buyer may not be purchasing revenue, brand, or even software, but a private ingredient for intelligence. ---- From "iConnections" YouTube channel, (link in comment)

Rohan Paul

143,134 просмотров • 1 месяц назад

Sam Altman just handed every startup founder a one-question autopsy. Altman: “If you’re building something on GPT-4 that a reasonable observer would say we’re going to steamroll you.” Not might. Not could. Going to. He said it with the calm of someone describing weather. Because to him it is weather. The model improves. Whatever was built on the old version’s weaknesses gets washed away. That is not strategy. That is erosion. And most founders are building on the erosion line. They find a gap in the current model. They wrap a product around it. They raise money. They hire. They scale. Then OpenAI releases the next version and the gap closes and the product has no reason to exist anymore. Altman: “When we just do our fundamental job, which is make the model better with every crank, then you get the ‘OpenAI killed my startup’ meme.” He is telling you directly. They are not hunting you. They are not even thinking about you. They are just improving the model. You happen to be standing where the improvement lands. That is the part founders refuse to hear. OpenAI does not need to compete with you. It just needs to keep doing exactly what it was already doing and your entire company disappears as a side effect. You are not a competitor. You are a temporary symptom of incomplete intelligence. The moment the intelligence completes you become nothing. Then Brad Lightcap delivered the cleanest diagnostic ever spoken in venture capital. Lightcap: “Ask if a 100x improvement in the model is something they’re excited about.” One question. The entire investment thesis reduced to a single binary. Does the next model make your company more powerful or does it make your company pointless. There is no middle ground. Lightcap: “We know the companies that come to us saying, ‘We want the next model. When is it coming out? I want to be the first to try it.’” These companies built something that feeds on intelligence. The smarter the model gets the more their product can do. They are not threatened by progress. They are starving for it. Then there are the companies Lightcap never hears from. The ones who go quiet when a new model drops. The ones who read the release notes like a death sentence. The ones privately praying the next generation takes longer because every improvement shrinks the ground beneath them. If you are hoping the model stays roughly where it is you have already told the market everything it needs to know about your company. You are not building on intelligence. You are building on the absence of it. Altman: “95% of the world should be betting on the latter category.” The latter category is simple. Assume the model keeps getting better at the pace it has been getting better. Build for that world. Not the world where GPT-4 is the ceiling. The world where GPT-4 is the floor and the ceiling has not been built yet. Then Altman told a story that should be framed on the wall of every startup in the country. A medical AI company came to him that morning. They were not complaining about the model. They were not worried about being replaced. They were demanding it improve faster. Altman: “Here’s how many people are dying every day you delay.” That is what alignment with the trajectory looks like. A company so deeply built on intelligence improving that every day the model stays the same is a day someone dies who did not have to. They are not building on a flaw. They are building on a future that has not arrived fast enough. That is the difference. The wrapper startup patches what the model cannot do today. The real company builds what the model will unlock tomorrow. One is running from the train. The other is laying the track. Altman told you the train is not slowing down. Lightcap told you exactly how to know which side you are on. One question. Does a 100x smarter model make you more valuable or erase you. If you had to pause before answering you already did.

Dustin

39,109 просмотров • 2 месяцев назад

The AI industry is optimizing for a definition of intelligence that does not exist. Andrew Ng just said it out loud. Ng: “AGI, to me, should be less about AI that already knows everything under the sun. That seems very challenging, doesn’t seem practical.” The human brain is not the most powerful economic asset in history because of what it holds. It is powerful because of what it can pick up. Ng: “The amazing thing about the human brain is its plasticity, or its ability to learn.” That same biological hardware that earns a PhD in quantum physics could have been trained on chess, surgery, or rewriting global supply chains from scratch. Ng: “That same human brain, just given different training, could have been a chess master, or could have been amazing at playing tennis.” General intelligence is not omniscience. It is the structural capacity to master whatever you point it at. Ng: “It is through learning that we then gain these incredibly specialized intelligences.” The winner is not whoever builds the biggest model. It is whoever builds the most adaptable one. The AI that walks into a domain it has never touched and executes before a human analyst finishes reading the brief. Ng: “What makes the human brain so valuable for economic tasks, is its ability to just learn to do whatever is needed.” Every corporation on earth pays for human labor because humans adapt. Not because they already know everything. AGI is the digitization of that exact capability. At machine speed. At infinite scale. Ng: “A lot of what makes the human brain so general is not that my brain or your brain already knows everything under the sun. It’s our ability to adapt, to learn a huge range of things.” The most powerful economic asset in history was never specialized knowledge. It was the raw capacity to acquire any knowledge, in any domain, on demand. The winning AI is not an encyclopedia. It is the force that makes encyclopedias irrelevant. And once that exists, the question stops being what the AI knows. It becomes what you can teach it before your competitor wakes up. Most people dominating this conversation have not understood that yet.

Dustin

19,802 просмотров • 4 месяцев назад

How can you solve complex tasks using a Large Language Model? Here is a 2-minute introduction to everything you need to know to 10x the quality of your results. Let's talk about three techniques, in order of complexity, starting with the easiest one: • In-Context Learning • Indexing + In-Context Learning • Fine-tuning In-Context Learning The team that trained GPT-3 found something they couldn't explain: You can condition a model using examples of how you want it to behave. I included an example prompt in the attached video. You can "teach" the model how you want it to interpret questions, select the correct answers, and format the results by giving a few examples. You can also give specific knowledge to the model that will be helpful when formulating answers. We call this approach "grounding the model." There's another example in the video. Indexing + In-Context Learning Unfortunately, there is a limit to how much data you can include in a prompt. We call this the "context size." One version of GPT-4 supports a context of approximately 6,000 words, while the other supports 25,000 words. Although this sounds like a lot, many applications need more than that. Imagine you wrote a book and want to build an application to answer any questions about your story. What happens if your book is longer than the context? That's where Indexing comes in. Using a model, you can turn every book passage into an embedding. These are vectors, numbers that "encode" the passage's text. You can then store these embeddings in a particular database that supports fast retrieval of these vectors. You can then turn any question into an embedding and search the database for the list of passages that are similar to that query. Instead of using the entire book to ask the model, you can now use the relevant passages as in-context information, effectively working around the context size limitation. Fine-tuning Fine-tuning can give you an extra boost to get reliable outputs from your LLM. It is, however, the most complex approach on the list. There are different approaches to fine-tuning a model with your data. A popular technique is to process your data with your LLM and use the outputs to train a new classifier that solves your specific task. Notice that here you aren't modifying the LLM. Instead, you are chaining it with your trained classifier. Another approach is to modify the parameters of the LLM using your data. Think of this as "rewiring" the model in a way that solves your particular task. The results and costs will vary depending on how many layers you want to fine-tune from the original model. Many companies think that fine-tuning is the solution to their problems. In my experience, many will benefit from exploring the other two approaches. I love explaining Machine Learning and Artificial Intelligence ideas. If you enjoy in-depth content like this, follow me Santiago so you don't miss what comes next.

Santiago

384,495 просмотров • 3 лет назад

Responsible AI should not feel like the path of most resistance. That was the strongest idea from my SAS Innovate conversation with Reggie Townsend, leading Data Ethics, Governance, and Social Impact at SAS Software. Reggie framed governance not as a compliance layer, but as a way to scale human judgment. Too often, we innovate first and govern later. Model selected. Agent deployed. Process built. Then governance arrives at the end and feels like friction. Bolt it on after the fact, and resistance is guaranteed. The opportunity: design responsible AI, so it becomes intuitive, action-oriented, and useful in the flow of work. That is what Reggie meant by making responsibility "irresistible." Second point: Use cases must lead. When everyone can access the same models, differentiation will not come from the technology. It will come from how leaders define outcomes, govern applications, and connect business value to institutional trust. The risk does not live in the model. The value does not live in the model. Both live in the use case. For CEOs and boards, this is the shift: from model-first oversight to outcome-first accountability. Better questions before scaling AI: - What human decision are we shaping? - What business outcome are we improving? - What risk are we containing? - What judgment are we extending? Responsible AI becomes strategic when it helps people make better decisions, faster, with greater confidence. Most leaders can't see where governance sits inside their AI operating model. SAS AI Navigator makes it visible: Design for the human. Not only the technology.

Sabine VanderLinden

835,838 просмотров • 1 месяц назад