Never bet against Cursor. Subagents solve so many issues between different models it's kind of insane, especially with Composer 1. Composer 1 can quickly gather context for a model like GPT-5.2 Codex, which otherwise could take on the order of 10 mins just to search files. Definitely one of...

ostyn

1,690 subscribers

19,631 views • 6 months ago • via X (Twitter)

Related videos

🚨You can now use the new upcoming OpenAI model GPT 5.2 inside Cursor. Here is the full walkthrough. - Open the editor, go to settings and then the model tab. Add a custom model and enter the text "gpt-5.2-high" and "gpt-5.2". - After that you can select the model and ask questions. To verify, I started my test on the usage page which had zero gpt-5.2-high requests and consumption. After the test I could see the details in usage and the cost incurred while using it. Enjoy

AshutoshShrivastava

424,035 views • 7 months ago

Sam Altman says the perfect AI is “a very tiny model with superhuman reasoning, 1 trillion tokens of context, and access to every tool you can imagine.” It doesn't need to contain the knowledge - just the ability to think, search, simulate, and solve anything.

vitrupo

775,811 views • 1 year ago

In this episode, Beyang and Thorsten discuss strategies for effective agentic coding, including the 101 of how it's different from coding with chat LLMs, the key constraint of the context window, how and where subagents can help, and the new oracle subagent which combines multiple LLMs.

00:38 Intros
03:20 How coding with agents is very different from coding with prior AI tools
10:31 Example: fix a simple issue
14:13 Example: debugging an issue with an MCP server
21:50 Example: unifying two build scripts
25:09 How the context window is a key constraint
31:01 Why it's best to focus on one thing at a time
33:09 Subagents and context windows
33:49 The codebase search subagent
38:33 General-purpose subagents
44:05 When to use subagents
46:44 The oracle subagent and o3
51:32 Multi-model agents

Amp — Research Preview

24,534 views • 1 year ago

I've made a ton of money helping companies implement LLM-as-a-judge evaluations. LLM Judges provide a ton of value. But the hard part is choosing the model to implement the judge. • The family of GPT-5 models is very good, but slow and expensive. • Models like Gemma and Phi are fast and cheap, but not that good. Most of the time, you can only run a percentage of your traffic through the model (otherwise it would be too expensive and slow). But now, there's a better strategy.

Santiago

29,970 views • 3 months ago

How can you solve complex tasks using a Large Language Model? Here is a 2-minute introduction to everything you need to know to 10x the quality of your results.

Let's talk about three techniques, in order of complexity, starting with the easiest one:

• In-Context Learning
• Indexing + In-Context Learning
• Fine-tuning

In-Context Learning

The team that trained GPT-3 found something they couldn't explain: You can condition a model using examples of how you want it to behave. I included an example prompt in the attached video. You can "teach" the model how you want it to interpret questions, select the correct answers, and format the results by giving a few examples. You can also give specific knowledge to the model that will be helpful when formulating answers. We call this approach "grounding the model." There's another example in the video.

Indexing + In-Context Learning

Unfortunately, there is a limit to how much data you can include in a prompt. We call this the "context size." One version of GPT-4 supports a context of approximately 6,000 words, while the other supports 25,000 words. Although this sounds like a lot, many applications need more than that. Imagine you wrote a book and want to build an application to answer any questions about your story. What happens if your book is longer than the context? That's where Indexing comes in. Using a model, you can turn every book passage into an embedding. These are vectors, numbers that "encode" the passage's text. You can then store these embeddings in a particular database that supports fast retrieval of these vectors. You can then turn any question into an embedding and search the database for the list of passages that are similar to that query. Instead of using the entire book to ask the model, you can now use the relevant passages as in-context information, effectively working around the context size limitation.

Fine-tuning

Fine-tuning can give you an extra boost to get reliable outputs from your LLM. It is, however, the most complex approach on the list. There are different approaches to fine-tuning a model with your data. A popular technique is to process your data with your LLM and use the outputs to train a new classifier that solves your specific task. Notice that here you aren't modifying the LLM. Instead, you are chaining it with your trained classifier. Another approach is to modify the parameters of the LLM using your data. Think of this as "rewiring" the model in a way that solves your particular task. The results and costs will vary depending on how many layers you want to fine-tune from the original model.

Many companies think that fine-tuning is the solution to their problems. In my experience, many will benefit from exploring the other two approaches.

I love explaining Machine Learning and Artificial Intelligence ideas. If you enjoy in-depth content like this, follow me Santiago so you don't miss what comes next.
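
The "Indexing + In-Context Learning" idea described in the post can be sketched roughly as follows. This is a minimal illustration, not the author's code: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the in-memory list stands in for a vector database, and the passages and all function names are hypothetical.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(word.strip(".,?!").lower() for word in text.split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(passages):
    """Embed every passage once; a real system would store these in a vector database."""
    return [(passage, embed(passage)) for passage in passages]


def retrieve(index, question, k=2):
    """Return the k passages whose embeddings are most similar to the question."""
    query = embed(question)
    ranked = sorted(index, key=lambda item: cosine(query, item[1]), reverse=True)
    return [passage for passage, _ in ranked[:k]]


def build_prompt(index, question, k=2):
    """Put only the relevant passages into the prompt instead of the whole book."""
    context = "\n".join(f"- {p}" for p in retrieve(index, question, k))
    return f"Answer using only these passages:\n{context}\n\nQuestion: {question}"


# Hypothetical book passages, invented for this example.
passages = [
    "Mara found the brass key under the lighthouse stairs.",
    "The storm lasted three days and flooded the harbor.",
    "Captain Ilves sold the boat to pay his debts.",
]
index = build_index(passages)
print(build_prompt(index, "Where did Mara find the key?", k=1))
```

The resulting prompt would then be sent to whichever LLM you use; swapping `embed` for a real embedding model changes the quality of retrieval but not the overall flow.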

Santiago

384,510 views • 3 years ago

One of Blender's powerful capabilities is how quickly you can create models and concepts of just about anything. Here you can see how I create a simple model of motor rotor with electromagnets, you can use geometry nodes to speed up your modelling workflow even more and also make them procedural with animations, and it's a lot of fun too! #b3d #blender3d #electronics #hardware #science #engineering #technology #3dart #3DModel #motors

Sam M

18,239 views • 1 year ago

Small Language Models (SLM) are the future of AI. "Small" (SLM) instead of "Large" (LLM). These small models are highly specialized models with superhuman abilities on specific tasks.

Here are two techniques to build these models:

• Spectrum
• Model Merging

I give you a short introduction in the attached video, but here is a quick summary:

Spectrum helps us identify the most relevant layers to solve one specific task. We can ignore everything else and focus on fine-tuning these layers. Using Spectrum, we can fine-tune models in a heartbeat.

Model Merging combines multiple models into a unique, much better model than any of the individual input models. You can also combine models specialized in different tasks and get a model with multiple abilities.

This is the state of the art of productizing models. It's what Arcee.ai's platform does behind the scenes. Arcee collaborated with me on this post and is sponsoring it.

There are three main steps to produce a model for your particular use case:

1. You create a dataset by uploading your data.
2. You train a model. At this step, Arcee uses Spectrum and Model Merging to produce a highly specialized model for your task.
3. You can deploy that model to any environment you want.

Three important notes:

• Training process is 2x faster and 2x cheaper than regular fine-tuning.
• Resultant models are smaller and have higher accuracy.
• They create these specialized models from open-source models.

Check this site so you can fully appreciate how this works:

If you want to fine-tune an open-source model, consider Arcee's platform. This is the state of the art.
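
To make the model merging idea concrete, here is a minimal sketch of one simple form of it: a weighted average of matching parameters across models with identical architectures. This is an assumption chosen for illustration; the post does not say which merging method Arcee uses, and real merges operate on tensors rather than the plain Python lists used here. The function name and the toy parameter dictionaries are hypothetical.

```python
def merge_state_dicts(state_dicts, weights=None):
    """Merge models by taking a weighted average of matching parameters.

    Each "state dict" maps a parameter name to a flat list of floats.
    """
    if not state_dicts:
        raise ValueError("need at least one model")
    if weights is None:
        # Default to a uniform average across all input models.
        weights = [1.0 / len(state_dicts)] * len(state_dicts)
    if len(weights) != len(state_dicts):
        raise ValueError("one weight per model is required")

    names = state_dicts[0].keys()
    if any(sd.keys() != names for sd in state_dicts):
        raise ValueError("models must share the same parameter names")

    merged = {}
    for name in names:
        # Group the i-th value of this parameter from every model together.
        columns = zip(*(sd[name] for sd in state_dicts))
        merged[name] = [
            sum(w * value for w, value in zip(weights, column))
            for column in columns
        ]
    return merged


# Two toy "models" with the same parameter names (values invented for the example).
model_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
model_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = merge_state_dicts([model_a, model_b])
print(merged)
```

Passing explicit `weights` lets one model dominate the merge, which is one way to bias the result toward a particular specialization.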

Santiago

164,162 views • 2 years ago

MIT PhD student Alex Zhang reveals the scaling result where a model trained on short tasks generalizes to problems 100x longer for free: "If you're very clever about the design of your harness or how you use the language model, you can almost get scaling gains for free." "If you train a model naively, there's no tricks. It's just the same way you train a model on these RL environments. You just roll it out, and then you just get some reward." "If you train it on only short tasks, like only tasks that are 10,000 tokens long, and then you were to run it on a similar domain, but at a million tokens, or 10 million tokens, or 100,000 tokens, it generalizes really, really well. If you look at it compared to even the base transformer, you get way better generalization properties." "When the model uses an RLM (Recursive Language Model) after it's trained on these short tasks, it will see some kind of trajectory of actions that it does. Between these two problems of different lengths, the RLM learns to see them as almost the same problem." "Token for token, they're almost the same. You can describe it in code. In one code setting, maybe the for loop is a little bigger, but it's the same kind of code and it derives the constants from the data. There's no hard coding, so they literally look the same." alex zhang

MTS

99,784 views • 12 days ago

Robert Benzie: "What Danielle Smith has done is open a can of worms to solve a problem within the United Conservative Party just as David Cameron opened a can of worms to solve a problem in the UK Conservative Party ... why would you ask a question at the risk of finding an answer that you don't want. I think it's just insane on many levels."

Scott Robertson

40,503 views • 2 months ago

Reflections after a day on Sora 2: It's a huge step up over the first model from a quality perspective. I'm not sure it's SOTA - but I don't think that matters. This isn't competing with other video models, it's much more of a social app. Beyond Cameos, I think the strongest feature is remixing. You can take a clip that someone else generated and prompt changes, which feels like riffing with friends. And you eliminate most of the work around prompting, which is kind of genius. Many people don't use video models today because they don't know what to create. Here, you can scroll until you get inspired. The best videos often have dozens of remixes, like this example ⬇️

Justine Moore

61,291 views • 10 months ago

After a couple more days with Composer 2.5, I've got a pretty good sense of what it can do for game dev, specifically this mouse cursor racing game I'm building. It nailed ~80% of the features I threw at it, from planning to execution. Pretty amazing. I did have to switch to Opus 4.7 MAX a few times for features that need stronger visual understanding, like adding a 360 loop to the track or nailing a specific visual effect I had in mind. But man, Opus 4.7 MAX is expensive. Composer 2.5's visual understanding is weaker than Opus 4.7, but for most other things it's pretty damn close, and 10x cheaper. So it's now my default. The beauty of Cursor is I can switch models whenever I want, so I'm never stuck on one. Really excited for the larger incoming model. I have a feeling it's going to shine at planning. Can't wait!

Danny Limanseta

172,178 views • 2 months ago

Very impressed with Composer 2.5 after about a day of usage. I've almost moved over to it exclusively from GPT 5.5, even using it for planning now. It's like Opus 4.7 on steroids, crazy fast. Fast models really get me into the flow of building, which is exhilarating. Also, a sneak peek of a weekend mini-project I'm building: a racing game where you race your mouse cursor.

Danny Limanseta

299,511 views • 2 months ago

Generalist CEO Pete Florence says robotics models are in a transition period similar to the step change between GPT-2 and GPT-3. They're "starting to cross over into levels of performance where these things are commercially viable for a number of different applications." "We think this is a crossover point where we have a general model starting to be able to hit levels of reliability, speed, and improvisational intelligence where we can start to get these things out there." "Very much like — you take a GPT-2-level model, you scale it to a GPT-3-level model, and certain types of commercial applications start to become viable."

TBPN

23,356 views • 1 month ago

I JUST DROPPED A NEW SONG! It's a remix for an upcoming video, but you can listen to the full song RIGHT NOW STREAM "Tears of the Kitchen"

JayMoji 🪐

30,912 views • 1 year ago

You can now fine-tune Llama 3 without writing a single line of code! We are moving at breakneck speed. I recorded a video to show you how to fine-tune any open-source model in a few minutes.

I'm using a GPT capable of taking a problem and turning it into a fine-tuned model that will solve it. You don't have to write any code. You only need to explain to a GPT what problem you want to solve and tell it you want to use Llama 3. For example, "fine-tune Llama 3" or "deploy zephyr." It feels like magic. The system will recommend a dataset and fine-tune the model for you.

I'm using Monster API, a platform that specializes in making fine-tuning and deploying open-source models easy and fast. Their stack is well-optimized to maximize fine-tuning efficiency using techniques like QLoRA and vLLM. They are behind the GPT.

Here is what you need to do:

1. Create an account at
2. Load the GPT with the link below

This is as simple as it gets. When you are done, you can click a button to deploy the model and start using it.

I have 10,000 free credits for anyone using the code "SANTIAGO" in the dashboard. You can use these credits to access, fine-tune, and deploy these open-source models. You can also keep up with their latest updates, and get free credits and special offers on their Discord server:

Santiago

324,602 views • 2 years ago

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

OpenAI

3,654,813 views • 2 months ago

Take a look at Midjourney's much-anticipated AI video generation model, V1, which just launched for its users at $10 a month. V1 is an image-to-video model, so if you upload an image, it will create a series of short videos based on the initial image. The launch puts the popular Midjourney in direct competition with other video-focused models, like OpenAI's Sora, Google's Veo 3 and Adobe's Firefly. Midjourney's CEO says V1 is a step toward creating "real-time open-world simulations," though the company currently faces a lawsuit from Disney and Universal over the depiction of copyrighted characters. But in the meantime, you can find all the details on what you can build, and how to get access, here:

TechCrunch

36,579 views • 1 year ago

We just updated Junie and the Codex integration in JetBrains AI! Now you can use OpenAI GPT 5.5 in all JetBrains IDEs. We ran various tests with this model but have a look yourself at one of the demo projects we built (🔊 on)

JetBrains

10,966 views • 3 months ago