Загрузка видео...

Не удалось загрузить видео

Возникла проблема при загрузке этого видео. Это может быть связано с временными проблемами сети или видео может быть недоступно.

На главную

Microsoft CEO Satya Nadella's new interivew: Explains how the next AI moat will not be the model you use, but the learning loop only your company can run. He is really asking what happens to the firm when intelligence becomes something you can rent. For a century, companies protected... value through people, processes, data, routines, customer memory, and the tacit knowledge buried in daily operations. Foundation models threaten to flatten that advantage because the same general intelligence can be used by everyone. Nadella’s answer is that firms need their own “hill climbing machine,” a private loop where models learn from company-specific tasks, traces, evaluations, and outcomes. That means the real asset is not just the model. The asset is the environment that keeps improving the model in ways competitors cannot copy. Private evals become strategic memory. Workflow traces become training signal. Human judgment becomes a way to steer compounding, not just correct mistakes. This also reframes AI adoption: a company that only consumes a foundation model may gain productivity, but it may leak the deeper value of its operating knowledge. A company that builds a disciplined learning loop can turn everyday work into accumulating IP. The future firm may therefore be measured by how well it converts its unique activity into durable model improvement. The frontier will not belong only to whoever owns the largest model. It will belong to whoever owns the best loop. ---- From "Stanford Online" YouTube channel, (link in comment)show more

Rohan Paul

151,931 subscribers

74,452 просмотров • 1 день назад •via X (Twitter)

Новости и политика Наука и технологии

Anya Rossi• Live Now

Private livecam show

Комментарии: 0

Нет доступных комментариев

Здесь появятся комментарии из оригинального поста

Похожие видео

Satya Nadella reveals why every company may need its own AI model: the model becomes the new company database. "To me, a model is like the database market." "A firm should be able to take the tacit knowledge it has and embed it inside weights in a model that they control." "When somebody asks me how many models should there be, I'll say as many models as firms in the world." The contrarian part: the value may not sit in one universal frontier model. It sits in each company turning its private operating knowledge into a controlled model.

Satya Nadella reveals why every company may need its own AI model: the model becomes the new company database. "To me, a model is like the database market." "A firm should be able to take the tacit knowledge it has and embed it inside weights in a model that they control." "When somebody asks me how many models should there be, I'll say as many models as firms in the world." The contrarian part: the value may not sit in one universal frontier model. It sits in each company turning its private operating knowledge into a controlled model.

Karl Mehta

464,767 просмотров • 10 дней назад

Gabbit Robotics co-founder Nicole Mattero on why the next AI winners won't fight over your screen, they'll fight to get closer to you: She kicks things off with what she admits is a hot take: "My hot take is that hardware is the only moat going forward because anyone's going to be able to spin up a model and make it really good and really cheap." Her logic is simple. Once a strong model is cheap and easy for anyone to build, the model alone can't set you apart. So what actually protects you? "Where does that leave you? Basically companies that have tactile modes such as hardware are the only ones that actually survive." Hardware, though, is only one piece of her argument. The second piece is data: "The models that win in the long run are the ones where private data is their [moat]." From there, Mattero ties the two together. Hardware gives a company one moat. The ability to gather unique personal data gives it another. Put both in place and the edge starts to multiply: "When you have a company that has hardware as a [moat], but then B is able to collect really unique personal data, that also becomes a [moat]." And that, she says, is the reason so many foundation model companies are reaching beyond the screen and into the physical world: "That's why I think a lot of these foundation models are like, let's get something in the home or on the person. It gives them an edge on both fronts."

Gabbit Robotics co-founder Nicole Mattero on why the next AI winners won't fight over your screen, they'll fight to get closer to you: She kicks things off with what she admits is a hot take: "My hot take is that hardware is the only moat going forward because anyone's going to be able to spin up a model and make it really good and really cheap." Her logic is simple. Once a strong model is cheap and easy for anyone to build, the model alone can't set you apart. So what actually protects you? "Where does that leave you? Basically companies that have tactile modes such as hardware are the only ones that actually survive." Hardware, though, is only one piece of her argument. The second piece is data: "The models that win in the long run are the ones where private data is their [moat]." From there, Mattero ties the two together. Hardware gives a company one moat. The ability to gather unique personal data gives it another. Put both in place and the edge starts to multiply: "When you have a company that has hardware as a [moat], but then B is able to collect really unique personal data, that also becomes a [moat]." And that, she says, is the reason so many foundation model companies are reaching beyond the screen and into the physical world: "That's why I think a lot of these foundation models are like, let's get something in the home or on the person. It gives them an edge on both fronts."

Big Brain AI

19,195 просмотров • 1 месяц назад

Anthropic has created an AI model that “can get into almost any software in the world,” former AI company founder and CEO Matt Shumer says — but the company is not releasing it to the public: “I have never seen something like this — a company that is about to IPO and having such a moral compass.” Shumer shares what makes the AI model so powerful and potentially dangerous.

Anthropic has created an AI model that “can get into almost any software in the world,” former AI company founder and CEO Matt Shumer says — but the company is not releasing it to the public: “I have never seen something like this — a company that is about to IPO and having such a moral compass.” Shumer shares what makes the AI model so powerful and potentially dangerous.

CBS Mornings

98,714 просмотров • 2 месяцев назад

The creator of High Bandwidth Memory said something that reframes the entire AI investment thesis, AI equals memory (Save this). Most people still think about AI hardware through a training lens. During training, the bottleneck is raw compute, GPUs stay near 100% utilization crunching through billions of gradient updates. Inference is a completely different problem. When a model generates a response, it produces tokens one at a time and at every single step, the entire model has to be loaded from memory into the processor to generate just one token. The GPU cores sit there, waiting for data to arrive. This is what engineers mean when they say inference is memory bound, the bottleneck is not how many calculations you can do per second but rather how fast you can move data from memory to the chip. Adding more GPUs does not fix a memory bandwidth problem, it just gives you more processors starving for the same data. Modern LLMs use a KV cache, a data structure that stores the conversation's context so the model does not have to recompute it from scratch on each step. The KV cache is what gives a model its memory of the conversation. It grows with every token and for long documents or deep reasoning chains, it can dwarf the model weights themselves in memory consumption. This means memory directly determines how long a context the model can hold, how many users you can serve simultaneously, how fast it responds and how cheaply you can run it. A memory constrained model is not just slower but rather qualitatively worse, it forgets earlier parts of the conversation, truncates context and hallucinates more because it literally cannot hold the relevant information long enough to use it. The world now spends more on inference than training, and every ChatGPT query, every Claude document analysis, every API call is an inference workload. Inference economics, cost per token, latency, context length, concurrent users are memory problems first and compute problems second. The companies that control memory bandwidth and supply are not suppliers to the AI trade but rather are the AI trade. Long Micron! Follow me Melvin for more AI, semis and the next big market themes.

The creator of High Bandwidth Memory said something that reframes the entire AI investment thesis, AI equals memory (Save this). Most people still think about AI hardware through a training lens. During training, the bottleneck is raw compute, GPUs stay near 100% utilization crunching through billions of gradient updates. Inference is a completely different problem. When a model generates a response, it produces tokens one at a time and at every single step, the entire model has to be loaded from memory into the processor to generate just one token. The GPU cores sit there, waiting for data to arrive. This is what engineers mean when they say inference is memory bound, the bottleneck is not how many calculations you can do per second but rather how fast you can move data from memory to the chip. Adding more GPUs does not fix a memory bandwidth problem, it just gives you more processors starving for the same data. Modern LLMs use a KV cache, a data structure that stores the conversation's context so the model does not have to recompute it from scratch on each step. The KV cache is what gives a model its memory of the conversation. It grows with every token and for long documents or deep reasoning chains, it can dwarf the model weights themselves in memory consumption. This means memory directly determines how long a context the model can hold, how many users you can serve simultaneously, how fast it responds and how cheaply you can run it. A memory constrained model is not just slower but rather qualitatively worse, it forgets earlier parts of the conversation, truncates context and hallucinates more because it literally cannot hold the relevant information long enough to use it. The world now spends more on inference than training, and every ChatGPT query, every Claude document analysis, every API call is an inference workload. Inference economics, cost per token, latency, context length, concurrent users are memory problems first and compute problems second. The companies that control memory bandwidth and supply are not suppliers to the AI trade but rather are the AI trade. Long Micron! Follow me Melvin for more AI, semis and the next big market themes.

Melvin

47,148 просмотров • 5 дней назад

Dario Amodei just told software engineers exactly how long they have. Six to twelve months. Amodei: “I have engineers within Anthropic who say I don’t write any code anymore. I just let the model write the code, I edit it, I do the things around it.” The people building the most powerful AI in history have already stopped writing code. That is not a forecast. That is the current working condition inside the lab closest to the frontier. Amodei: “We might be six to 12 months away from when the model is doing most, maybe all, of what SWEs do end-to-end.” The tech industry spent a decade making software engineers its highest-paid, most protected class. That era has a last day now. When a model can execute an entire software build end-to-end, the ability to write syntax stops being a skill. It becomes a credential for a job that no longer exists. Amodei: “And then it’s a question of how fast does that loop close.” That is the sentence everyone skipped. The code was never the hard part. The hard part was everything around it. The model just learned everything around it. Writing the code is already nearly gone. Testing is next. Deployment is next. When all three collapse into a single autonomous execution loop, the machine no longer needs a human in the chain at all. The corporation or sovereign state that closes that loop first does not gain a competitive advantage. It gains a category of speed that biological engineers cannot match, track, or reverse. That is not disruption. That is replacement at a systems level. Amodei is not describing a future disruption. He is describing the current state of his own building. The loop is already closing. The only question is whether you are inside it or outside it when it seals.

Dario Amodei just told software engineers exactly how long they have. Six to twelve months. Amodei: “I have engineers within Anthropic who say I don’t write any code anymore. I just let the model write the code, I edit it, I do the things around it.” The people building the most powerful AI in history have already stopped writing code. That is not a forecast. That is the current working condition inside the lab closest to the frontier. Amodei: “We might be six to 12 months away from when the model is doing most, maybe all, of what SWEs do end-to-end.” The tech industry spent a decade making software engineers its highest-paid, most protected class. That era has a last day now. When a model can execute an entire software build end-to-end, the ability to write syntax stops being a skill. It becomes a credential for a job that no longer exists. Amodei: “And then it’s a question of how fast does that loop close.” That is the sentence everyone skipped. The code was never the hard part. The hard part was everything around it. The model just learned everything around it. Writing the code is already nearly gone. Testing is next. Deployment is next. When all three collapse into a single autonomous execution loop, the machine no longer needs a human in the chain at all. The corporation or sovereign state that closes that loop first does not gain a competitive advantage. It gains a category of speed that biological engineers cannot match, track, or reverse. That is not disruption. That is replacement at a systems level. Amodei is not describing a future disruption. He is describing the current state of his own building. The loop is already closing. The only question is whether you are inside it or outside it when it seals.

Dustin

315,019 просмотров • 3 месяцев назад

Chamath: AI advantage may come less from models than from private inputs. "When labs can build similar models, the real win comes from one unique ingredient in order to monetize it well. Here is a basic thing about machine learning that is worth knowing: if you take 1,000 of the same inputs and give them to Facebook, Microsoft, Google, and Amazon, they will all come up with the same machine learning model. But if you have one extra thing, one little ingredient that all of those other companies do not have, your output can be markedly different. It is like giving two great chefs three ingredients, but giving the third chef one extra ingredient. That person has the ability to do something very special. Right now, we are in a world where everybody is crawling the open web. We are going to move to a world where, as everybody gets sophisticated enough and information is widely available, somebody is going to say, “You know what? This site, I am not going to allow anybody else to access. It is only for me, only for my models.” Those models will become better. So we have to let that play out a little bit. It is going to be a really interesting arms race. The next wave of M&A, for example, could be companies like Google, Microsoft, and Facebook looking at these companies and saying, “Can they be viable inputs to my large language models or to my other machine learning and AI models?” --- A company with unique workflows, transactions, medical records, industrial logs, legal archives, design files, or user behavior can turn boring private data into a compounding advantage. Some startups may never become great public companies on their own, yet still become valuable because they own a data stream that makes a larger AI system sharper, more differentiated, or harder to copy. That turns acquisition strategy upside down: the buyer may not be purchasing revenue, brand, or even software, but a private ingredient for intelligence. ---- From "iConnections" YouTube channel, (link in comment)

Rohan Paul

143,134 просмотров • 1 месяц назад

Sam Altman just handed every startup founder a one-question autopsy. Altman: “If you’re building something on GPT-4 that a reasonable observer would say we’re going to steamroll you.” Not might. Not could. Going to. He said it with the calm of someone describing weather. Because to him it is weather. The model improves. Whatever was built on the old version’s weaknesses gets washed away. That is not strategy. That is erosion. And most founders are building on the erosion line. They find a gap in the current model. They wrap a product around it. They raise money. They hire. They scale. Then OpenAI releases the next version and the gap closes and the product has no reason to exist anymore. Altman: “When we just do our fundamental job, which is make the model better with every crank, then you get the ‘OpenAI killed my startup’ meme.” He is telling you directly. They are not hunting you. They are not even thinking about you. They are just improving the model. You happen to be standing where the improvement lands. That is the part founders refuse to hear. OpenAI does not need to compete with you. It just needs to keep doing exactly what it was already doing and your entire company disappears as a side effect. You are not a competitor. You are a temporary symptom of incomplete intelligence. The moment the intelligence completes you become nothing. Then Brad Lightcap delivered the cleanest diagnostic ever spoken in venture capital. Lightcap: “Ask if a 100x improvement in the model is something they’re excited about.” One question. The entire investment thesis reduced to a single binary. Does the next model make your company more powerful or does it make your company pointless. There is no middle ground. Lightcap: “We know the companies that come to us saying, ‘We want the next model. When is it coming out? I want to be the first to try it.’” These companies built something that feeds on intelligence. The smarter the model gets the more their product can do. They are not threatened by progress. They are starving for it. Then there are the companies Lightcap never hears from. The ones who go quiet when a new model drops. The ones who read the release notes like a death sentence. The ones privately praying the next generation takes longer because every improvement shrinks the ground beneath them. If you are hoping the model stays roughly where it is you have already told the market everything it needs to know about your company. You are not building on intelligence. You are building on the absence of it. Altman: “95% of the world should be betting on the latter category.” The latter category is simple. Assume the model keeps getting better at the pace it has been getting better. Build for that world. Not the world where GPT-4 is the ceiling. The world where GPT-4 is the floor and the ceiling has not been built yet. Then Altman told a story that should be framed on the wall of every startup in the country. A medical AI company came to him that morning. They were not complaining about the model. They were not worried about being replaced. They were demanding it improve faster. Altman: “Here’s how many people are dying every day you delay.” That is what alignment with the trajectory looks like. A company so deeply built on intelligence improving that every day the model stays the same is a day someone dies who did not have to. They are not building on a flaw. They are building on a future that has not arrived fast enough. That is the difference. The wrapper startup patches what the model cannot do today. The real company builds what the model will unlock tomorrow. One is running from the train. The other is laying the track. Altman told you the train is not slowing down. Lightcap told you exactly how to know which side you are on. One question. Does a 100x smarter model make you more valuable or erase you. If you had to pause before answering you already did.

Sam Altman just handed every startup founder a one-question autopsy. Altman: “If you’re building something on GPT-4 that a reasonable observer would say we’re going to steamroll you.” Not might. Not could. Going to. He said it with the calm of someone describing weather. Because to him it is weather. The model improves. Whatever was built on the old version’s weaknesses gets washed away. That is not strategy. That is erosion. And most founders are building on the erosion line. They find a gap in the current model. They wrap a product around it. They raise money. They hire. They scale. Then OpenAI releases the next version and the gap closes and the product has no reason to exist anymore. Altman: “When we just do our fundamental job, which is make the model better with every crank, then you get the ‘OpenAI killed my startup’ meme.” He is telling you directly. They are not hunting you. They are not even thinking about you. They are just improving the model. You happen to be standing where the improvement lands. That is the part founders refuse to hear. OpenAI does not need to compete with you. It just needs to keep doing exactly what it was already doing and your entire company disappears as a side effect. You are not a competitor. You are a temporary symptom of incomplete intelligence. The moment the intelligence completes you become nothing. Then Brad Lightcap delivered the cleanest diagnostic ever spoken in venture capital. Lightcap: “Ask if a 100x improvement in the model is something they’re excited about.” One question. The entire investment thesis reduced to a single binary. Does the next model make your company more powerful or does it make your company pointless. There is no middle ground. Lightcap: “We know the companies that come to us saying, ‘We want the next model. When is it coming out? I want to be the first to try it.’” These companies built something that feeds on intelligence. The smarter the model gets the more their product can do. They are not threatened by progress. They are starving for it. Then there are the companies Lightcap never hears from. The ones who go quiet when a new model drops. The ones who read the release notes like a death sentence. The ones privately praying the next generation takes longer because every improvement shrinks the ground beneath them. If you are hoping the model stays roughly where it is you have already told the market everything it needs to know about your company. You are not building on intelligence. You are building on the absence of it. Altman: “95% of the world should be betting on the latter category.” The latter category is simple. Assume the model keeps getting better at the pace it has been getting better. Build for that world. Not the world where GPT-4 is the ceiling. The world where GPT-4 is the floor and the ceiling has not been built yet. Then Altman told a story that should be framed on the wall of every startup in the country. A medical AI company came to him that morning. They were not complaining about the model. They were not worried about being replaced. They were demanding it improve faster. Altman: “Here’s how many people are dying every day you delay.” That is what alignment with the trajectory looks like. A company so deeply built on intelligence improving that every day the model stays the same is a day someone dies who did not have to. They are not building on a flaw. They are building on a future that has not arrived fast enough. That is the difference. The wrapper startup patches what the model cannot do today. The real company builds what the model will unlock tomorrow. One is running from the train. The other is laying the track. Altman told you the train is not slowing down. Lightcap told you exactly how to know which side you are on. One question. Does a 100x smarter model make you more valuable or erase you. If you had to pause before answering you already did.

Dustin

39,109 просмотров • 2 месяцев назад

The AI industry is optimizing for a definition of intelligence that does not exist. Andrew Ng just said it out loud. Ng: “AGI, to me, should be less about AI that already knows everything under the sun. That seems very challenging, doesn’t seem practical.” The human brain is not the most powerful economic asset in history because of what it holds. It is powerful because of what it can pick up. Ng: “The amazing thing about the human brain is its plasticity, or its ability to learn.” That same biological hardware that earns a PhD in quantum physics could have been trained on chess, surgery, or rewriting global supply chains from scratch. Ng: “That same human brain, just given different training, could have been a chess master, or could have been amazing at playing tennis.” General intelligence is not omniscience. It is the structural capacity to master whatever you point it at. Ng: “It is through learning that we then gain these incredibly specialized intelligences.” The winner is not whoever builds the biggest model. It is whoever builds the most adaptable one. The AI that walks into a domain it has never touched and executes before a human analyst finishes reading the brief. Ng: “What makes the human brain so valuable for economic tasks, is its ability to just learn to do whatever is needed.” Every corporation on earth pays for human labor because humans adapt. Not because they already know everything. AGI is the digitization of that exact capability. At machine speed. At infinite scale. Ng: “A lot of what makes the human brain so general is not that my brain or your brain already knows everything under the sun. It’s our ability to adapt, to learn a huge range of things.” The most powerful economic asset in history was never specialized knowledge. It was the raw capacity to acquire any knowledge, in any domain, on demand. The winning AI is not an encyclopedia. It is the force that makes encyclopedias irrelevant. And once that exists, the question stops being what the AI knows. It becomes what you can teach it before your competitor wakes up. Most people dominating this conversation have not understood that yet.

The AI industry is optimizing for a definition of intelligence that does not exist. Andrew Ng just said it out loud. Ng: “AGI, to me, should be less about AI that already knows everything under the sun. That seems very challenging, doesn’t seem practical.” The human brain is not the most powerful economic asset in history because of what it holds. It is powerful because of what it can pick up. Ng: “The amazing thing about the human brain is its plasticity, or its ability to learn.” That same biological hardware that earns a PhD in quantum physics could have been trained on chess, surgery, or rewriting global supply chains from scratch. Ng: “That same human brain, just given different training, could have been a chess master, or could have been amazing at playing tennis.” General intelligence is not omniscience. It is the structural capacity to master whatever you point it at. Ng: “It is through learning that we then gain these incredibly specialized intelligences.” The winner is not whoever builds the biggest model. It is whoever builds the most adaptable one. The AI that walks into a domain it has never touched and executes before a human analyst finishes reading the brief. Ng: “What makes the human brain so valuable for economic tasks, is its ability to just learn to do whatever is needed.” Every corporation on earth pays for human labor because humans adapt. Not because they already know everything. AGI is the digitization of that exact capability. At machine speed. At infinite scale. Ng: “A lot of what makes the human brain so general is not that my brain or your brain already knows everything under the sun. It’s our ability to adapt, to learn a huge range of things.” The most powerful economic asset in history was never specialized knowledge. It was the raw capacity to acquire any knowledge, in any domain, on demand. The winning AI is not an encyclopedia. It is the force that makes encyclopedias irrelevant. And once that exists, the question stops being what the AI knows. It becomes what you can teach it before your competitor wakes up. Most people dominating this conversation have not understood that yet.

Dustin

19,802 просмотров • 4 месяцев назад

Max Tegmark says the way through the AI black box may be to deploy its outputs instead. A cat can learn, but it cannot export what it learned. A human scientist can. As intelligence rises, more knowledge may become externalized into code and proofs. Deploy what the mind can verify, not the mind itself.

Max Tegmark says the way through the AI black box may be to deploy its outputs instead. A cat can learn, but it cannot export what it learned. A human scientist can. As intelligence rises, more knowledge may become externalized into code and proofs. Deploy what the mind can verify, not the mind itself.

vitrupo

49,627 просмотров • 2 месяцев назад

JD Vance just admitted the White House could take ownership of every major AI company in the US. Why? Because even governments are starting to realize the same thing: AI will create trillions in value, and at this rate, only a few Big Tech companies will benefit. Right now, billions of people are training AI every day through their data, actions, and online behavior. Yet almost all of the value flows back to a few billionaires and shareholders. That doesn't seem fair. At Action Model, we're building a different future. A people-owned AI model where those helping train it can share in the value they help create. If AI is built by the people, it should benefit the people too. Sign up to the Action Model movement and join the 500k+ users already contributing to a community-owned AI.

JD Vance just admitted the White House could take ownership of every major AI company in the US. Why? Because even governments are starting to realize the same thing: AI will create trillions in value, and at this rate, only a few Big Tech companies will benefit. Right now, billions of people are training AI every day through their data, actions, and online behavior. Yet almost all of the value flows back to a few billionaires and shareholders. That doesn't seem fair. At Action Model, we're building a different future. A people-owned AI model where those helping train it can share in the value they help create. If AI is built by the people, it should benefit the people too. Sign up to the Action Model movement and join the 500k+ users already contributing to a community-owned AI.

Action Model

18,854 просмотров • 14 дней назад

"We're not a consumer company, we're the shortest path to artificial labor." - Bernt Bornich, founder of 1X "It just happens to be that the shortest path is through consumer, because you need to live and learn among people." "Model intelligence comes from the diversity of your AI dataset. There isn't enough variance (if you are just) in a factory." "We want to deploy robots there (in homes), where we get data through a truly intelligent machine that doesn't just repeat a motion."

"We're not a consumer company, we're the shortest path to artificial labor." - Bernt Bornich, founder of 1X "It just happens to be that the shortest path is through consumer, because you need to live and learn among people." "Model intelligence comes from the diversity of your AI dataset. There isn't enough variance (if you are just) in a factory." "We want to deploy robots there (in homes), where we get data through a truly intelligent machine that doesn't just repeat a motion."

TBPN

15,742 просмотров • 1 год назад

Elon Musk just confirmed when humans exit the AI development loop entirely. Not eventually. Not theoretically. He gave a date. Musk: “Humans are gradually getting less and less in the loop on the recursive self-improvement. Every successive model is built by the one before it.” The market still assumes thousands of engineers are hand-coding the next frontier model. The compute already took over the build. Musk: “That is happening to a large degree, but it’s not yet fully automated. It may be there, if not this year, but not later than next year.” Once biological friction is fully automated out of the loop, the AI arms race detaches from human comprehension. Permanently. Global markets are still planning for a gradual transition. That timeline is already dead. Interviewer: “Do you see a hard takeoff at that point?” Musk: “We’re in the hard takeoff. Right now.” Musk: “I go to sleep, there’s some massive AI breakthrough. And when I wake up there’s another one.” You don’t outrun autonomous, self-improving intelligence by hiring more biological engineers. By the time the hiring sprint finishes, the model has already iterated into something the job description can’t describe. The architects of intelligence are no longer biological. And the thing building itself didn’t ask for the job.

Elon Musk just confirmed when humans exit the AI development loop entirely. Not eventually. Not theoretically. He gave a date. Musk: “Humans are gradually getting less and less in the loop on the recursive self-improvement. Every successive model is built by the one before it.” The market still assumes thousands of engineers are hand-coding the next frontier model. The compute already took over the build. Musk: “That is happening to a large degree, but it’s not yet fully automated. It may be there, if not this year, but not later than next year.” Once biological friction is fully automated out of the loop, the AI arms race detaches from human comprehension. Permanently. Global markets are still planning for a gradual transition. That timeline is already dead. Interviewer: “Do you see a hard takeoff at that point?” Musk: “We’re in the hard takeoff. Right now.” Musk: “I go to sleep, there’s some massive AI breakthrough. And when I wake up there’s another one.” You don’t outrun autonomous, self-improving intelligence by hiring more biological engineers. By the time the hiring sprint finishes, the model has already iterated into something the job description can’t describe. The architects of intelligence are no longer biological. And the thing building itself didn’t ask for the job.

Dustin

322,709 просмотров • 3 месяцев назад

How can you solve complex tasks using a Large Language Model? Here is a 2-minute introduction to everything you need to know to 10x the quality of your results. Let's talk about three techniques, in order of complexity, starting with the easiest one: • In-Context Learning • Indexing + In-Context Learning • Fine-tuning In-Context Learning The team that trained GPT-3 found something they couldn't explain: You can condition a model using examples of how you want it to behave. I included an example prompt in the attached video. You can "teach" the model how you want it to interpret questions, select the correct answers, and format the results by giving a few examples. You can also give specific knowledge to the model that will be helpful when formulating answers. We call this approach "grounding the model." There's another example in the video. Indexing + In-Context Learning Unfortunately, there is a limit to how much data you can include in a prompt. We call this the "context size." One version of GPT-4 supports a context of approximately 6,000 words, while the other supports 25,000 words. Although this sounds like a lot, many applications need more than that. Imagine you wrote a book and want to build an application to answer any questions about your story. What happens if your book is longer than the context? That's where Indexing comes in. Using a model, you can turn every book passage into an embedding. These are vectors, numbers that "encode" the passage's text. You can then store these embeddings in a particular database that supports fast retrieval of these vectors. You can then turn any question into an embedding and search the database for the list of passages that are similar to that query. Instead of using the entire book to ask the model, you can now use the relevant passages as in-context information, effectively working around the context size limitation. Fine-tuning Fine-tuning can give you an extra boost to get reliable outputs from your LLM. It is, however, the most complex approach on the list. There are different approaches to fine-tuning a model with your data. A popular technique is to process your data with your LLM and use the outputs to train a new classifier that solves your specific task. Notice that here you aren't modifying the LLM. Instead, you are chaining it with your trained classifier. Another approach is to modify the parameters of the LLM using your data. Think of this as "rewiring" the model in a way that solves your particular task. The results and costs will vary depending on how many layers you want to fine-tune from the original model. Many companies think that fine-tuning is the solution to their problems. In my experience, many will benefit from exploring the other two approaches. I love explaining Machine Learning and Artificial Intelligence ideas. If you enjoy in-depth content like this, follow me Santiago so you don't miss what comes next.

How can you solve complex tasks using a Large Language Model? Here is a 2-minute introduction to everything you need to know to 10x the quality of your results. Let's talk about three techniques, in order of complexity, starting with the easiest one: • In-Context Learning • Indexing + In-Context Learning • Fine-tuning In-Context Learning The team that trained GPT-3 found something they couldn't explain: You can condition a model using examples of how you want it to behave. I included an example prompt in the attached video. You can "teach" the model how you want it to interpret questions, select the correct answers, and format the results by giving a few examples. You can also give specific knowledge to the model that will be helpful when formulating answers. We call this approach "grounding the model." There's another example in the video. Indexing + In-Context Learning Unfortunately, there is a limit to how much data you can include in a prompt. We call this the "context size." One version of GPT-4 supports a context of approximately 6,000 words, while the other supports 25,000 words. Although this sounds like a lot, many applications need more than that. Imagine you wrote a book and want to build an application to answer any questions about your story. What happens if your book is longer than the context? That's where Indexing comes in. Using a model, you can turn every book passage into an embedding. These are vectors, numbers that "encode" the passage's text. You can then store these embeddings in a particular database that supports fast retrieval of these vectors. You can then turn any question into an embedding and search the database for the list of passages that are similar to that query. Instead of using the entire book to ask the model, you can now use the relevant passages as in-context information, effectively working around the context size limitation. Fine-tuning Fine-tuning can give you an extra boost to get reliable outputs from your LLM. It is, however, the most complex approach on the list. There are different approaches to fine-tuning a model with your data. A popular technique is to process your data with your LLM and use the outputs to train a new classifier that solves your specific task. Notice that here you aren't modifying the LLM. Instead, you are chaining it with your trained classifier. Another approach is to modify the parameters of the LLM using your data. Think of this as "rewiring" the model in a way that solves your particular task. The results and costs will vary depending on how many layers you want to fine-tune from the original model. Many companies think that fine-tuning is the solution to their problems. In my experience, many will benefit from exploring the other two approaches. I love explaining Machine Learning and Artificial Intelligence ideas. If you enjoy in-depth content like this, follow me Santiago so you don't miss what comes next.

Santiago

384,495 просмотров • 3 лет назад

Threadguy explains why Apple will capture ALL of the value from AI “Models have become a commodity, we'll be able to just use the best model at any given time” "The more data AI has access to, the better it is. Everyone already has an iPhone, the next iPhone is just going to launch with some OpenClaw esque assistant utilising the best model plugged into it and its probably going to be the best AI experience you’ve ever had”

Threadguy explains why Apple will capture ALL of the value from AI “Models have become a commodity, we'll be able to just use the best model at any given time” "The more data AI has access to, the better it is. Everyone already has an iPhone, the next iPhone is just going to launch with some OpenClaw esque assistant utilising the best model plugged into it and its probably going to be the best AI experience you’ve ever had”

Jack

24,179 просмотров • 4 месяцев назад

Enterprise AI deployments today are frozen in time. Model capabilities stagnate in production. The problem compounds because companies aren’t static either. Every time your company improves, the model falls further behind. The bottleneck is continual learning. How does a model do something once and improve from feedback? The future of enterprise AI is Specific Intelligence: custom models teams own, trained on a company’s choices, interaction by interaction, using internal knowledge general models cannot access. Applied Compute helps companies train, serve, own, and improve custom models. Thanks Apoorv Agrawal for having Yash Patil at MS&E 435 to talk about the future of model training.

Enterprise AI deployments today are frozen in time. Model capabilities stagnate in production. The problem compounds because companies aren’t static either. Every time your company improves, the model falls further behind. The bottleneck is continual learning. How does a model do something once and improve from feedback? The future of enterprise AI is Specific Intelligence: custom models teams own, trained on a company’s choices, interaction by interaction, using internal knowledge general models cannot access. Applied Compute helps companies train, serve, own, and improve custom models. Thanks Apoorv Agrawal for having Yash Patil at MS&E 435 to talk about the future of model training.

Applied Compute

12,395 просмотров • 1 месяц назад

Choose a model (any model) and build your application with it. Do not spend time swapping models early on. Do not try to optimize before you have a working system. This is one of the first recommendations I make to every new team I consult with. Eventually, it will be time to optimize the model. • You may need a cheaper model • You may need a faster model • You might need a smarter model Good luck if you stitched together 12 different APIs and SDKs from 7 different vendors. Over half of the companies I consult for run on Microsoft software and have access to Microsoft Foundry. Microsoft Foundry is a complete agentic ecosystem. If you're in that world and building AI applications, Microsoft Foundry is where everything lives: • Models (largest selection in the market) • Agentic SDK (Python, C#, JavaScript/TypeScript) • Tools • Evaluations • Monitoring They are fully integrated with GitHub and Visual Studio Code. The best part: Their agentic platform is fully agnostic of the models you use. You can integrate with any model using the same OpenAI-style API. Swapping one model for another takes 1 second.

Choose a model (any model) and build your application with it. Do not spend time swapping models early on. Do not try to optimize before you have a working system. This is one of the first recommendations I make to every new team I consult with. Eventually, it will be time to optimize the model. • You may need a cheaper model • You may need a faster model • You might need a smarter model Good luck if you stitched together 12 different APIs and SDKs from 7 different vendors. Over half of the companies I consult for run on Microsoft software and have access to Microsoft Foundry. Microsoft Foundry is a complete agentic ecosystem. If you're in that world and building AI applications, Microsoft Foundry is where everything lives: • Models (largest selection in the market) • Agentic SDK (Python, C#, JavaScript/TypeScript) • Tools • Evaluations • Monitoring They are fully integrated with GitHub and Visual Studio Code. The best part: Their agentic platform is fully agnostic of the models you use. You can integrate with any model using the same OpenAI-style API. Swapping one model for another takes 1 second.

Santiago

12,014 просмотров • 4 месяцев назад

Introducing the new Where everyone can contribute to model development And benefit from it Enter FLock, the private AI training platform, where AI is created, trained, and governed by all. Collaborative, Onchain, And Private Not your model, not your AI YOUR model, YOUR AI

Introducing the new Where everyone can contribute to model development And benefit from it Enter FLock, the private AI training platform, where AI is created, trained, and governed by all. Collaborative, Onchain, And Private Not your model, not your AI YOUR model, YOUR AI

FLock.io

69,464 просмотров • 1 год назад

"There should be as many models in the world as firms in the world." Satya and I dig into when to own vs. rent your intelligence, why every company should be building and climbing its own private evals, and what makes for a stable frontier.

"There should be as many models in the world as firms in the world." Satya and I dig into when to own vs. rent your intelligence, why every company should be building and climbing its own private evals, and what makes for a stable frontier.

Yash Patil

165,274 просмотров • 7 дней назад

Responsible AI should not feel like the path of most resistance. That was the strongest idea from my SAS Innovate conversation with Reggie Townsend, leading Data Ethics, Governance, and Social Impact at SAS Software. Reggie framed governance not as a compliance layer, but as a way to scale human judgment. Too often, we innovate first and govern later. Model selected. Agent deployed. Process built. Then governance arrives at the end and feels like friction. Bolt it on after the fact, and resistance is guaranteed. The opportunity: design responsible AI, so it becomes intuitive, action-oriented, and useful in the flow of work. That is what Reggie meant by making responsibility "irresistible." Second point: Use cases must lead. When everyone can access the same models, differentiation will not come from the technology. It will come from how leaders define outcomes, govern applications, and connect business value to institutional trust. The risk does not live in the model. The value does not live in the model. Both live in the use case. For CEOs and boards, this is the shift: from model-first oversight to outcome-first accountability. Better questions before scaling AI: - What human decision are we shaping? - What business outcome are we improving? - What risk are we containing? - What judgment are we extending? Responsible AI becomes strategic when it helps people make better decisions, faster, with greater confidence. Most leaders can't see where governance sits inside their AI operating model. SAS AI Navigator makes it visible: Design for the human. Not only the technology.

Responsible AI should not feel like the path of most resistance. That was the strongest idea from my SAS Innovate conversation with Reggie Townsend, leading Data Ethics, Governance, and Social Impact at SAS Software. Reggie framed governance not as a compliance layer, but as a way to scale human judgment. Too often, we innovate first and govern later. Model selected. Agent deployed. Process built. Then governance arrives at the end and feels like friction. Bolt it on after the fact, and resistance is guaranteed. The opportunity: design responsible AI, so it becomes intuitive, action-oriented, and useful in the flow of work. That is what Reggie meant by making responsibility "irresistible." Second point: Use cases must lead. When everyone can access the same models, differentiation will not come from the technology. It will come from how leaders define outcomes, govern applications, and connect business value to institutional trust. The risk does not live in the model. The value does not live in the model. Both live in the use case. For CEOs and boards, this is the shift: from model-first oversight to outcome-first accountability. Better questions before scaling AI: - What human decision are we shaping? - What business outcome are we improving? - What risk are we containing? - What judgment are we extending? Responsible AI becomes strategic when it helps people make better decisions, faster, with greater confidence. Most leaders can't see where governance sits inside their AI operating model. SAS AI Navigator makes it visible: Design for the human. Not only the technology.

Sabine VanderLinden

835,838 просмотров • 1 месяц назад

✨ Made a new mini feature on Photo AI: [ Grab from 3d model ] So the problem is we're at that stage in time (typical for AI) where image-to-3d models are not good enough but are fun to play with, but we know they'll be good enough in 1-2 years With [ Make 3d model ] you already can turn any Photo AI pic into a 3d model but it still looks hyper clunky and deformed, but it works! One cool idea I had to make that more useful and made now: Let people make a 3d model then change the view of the it with the 3d viewer, then press [ o ] and it grabs a frame of the 3d That image you can then [ Remix ] (img2img), and it becomes a real photo again and that in turn you can then turn into a video again with [ Make video ] So that essentially gives you a fully freeform camera position control to take photos with One thing I need to fix is the background/skybox, I kinda need to take the original photo and remove the person and just get the background for the 3d model viewer, in this case it should be white, but it's a start!

✨ Made a new mini feature on Photo AI: [ Grab from 3d model ] So the problem is we're at that stage in time (typical for AI) where image-to-3d models are not good enough but are fun to play with, but we know they'll be good enough in 1-2 years With [ Make 3d model ] you already can turn any Photo AI pic into a 3d model but it still looks hyper clunky and deformed, but it works! One cool idea I had to make that more useful and made now: Let people make a 3d model then change the view of the it with the 3d viewer, then press [ o ] and it grabs a frame of the 3d That image you can then [ Remix ] (img2img), and it becomes a real photo again and that in turn you can then turn into a video again with [ Make video ] So that essentially gives you a fully freeform camera position control to take photos with One thing I need to fix is the background/skybox, I kinda need to take the original photo and remove the person and just get the background for the 3d model viewer, in this case it should be white, but it's a start!

@levelsio

119,210 просмотров • 1 год назад