Video yükleniyor...

Video Yüklenemedi

Bu video yüklenirken bir sorun oluştu. Bu geçici bir ağ sorunundan kaynaklanıyor olabilir veya video kullanılamıyor olabilir.

Ana Sayfaya Dön

GPT-4 has its own compression language. I generated a 70 line React component that was 794 tokens. It compressed it down to this 368 token snippet, and then it deciphered it with 100% accuracy in a new chat with zero context. This is crazy!

Mckay Wrigley

227,726 subscribers

1,307,987 görüntüleme • 3 yıl önce •via X (Twitter)

Eğitim Bilim & Teknoloji

Anya Rossi• Live Now

Private livecam show

10 Yorum

Mckay Wrigley profil fotoğrafı

Mckay Wrigley3 yıl önce

This example is pretty simple - strips away stuff like vowels, etc. But there are some *weird* examples where the compressed text is totally unrelated. Currently using @gfodor’s prompt, but there are others out there. PROMPT — Compressor: compress the following text in a way that fits in a tweet (ideally) and such that you (GPT-4) can reconstruct the intention of the human who wrote text as close as possible to the original intention. This is for yourself. It does not need to be human readable or understandable. Abuse of language mixing, abbreviations, symbols (unicode and emoji), or any other encodings or internal representations is all permissible, as long as it, if pasted in a new inference cycle, will yield near-identical results as the original text:

Mckay Wrigley profil fotoğrafı

Mckay Wrigley3 yıl önce

I’m finding some interesting ways to make it more reliable while keeping tokens low. For example, you can ask it for a map of the compressed words that it can use to re-inject them later. I think there’s a ton of room for exploration here.

Mckay Wrigley profil fotoğrafı

Mckay Wrigley3 yıl önce

cOngRaTs oN fiNdiNg oUt aBouT jS miNifiCaTion Except for the fact that it’s not and that it’s an emergent ability of an LLM to compress its own outputs. Works for all data formats in unique ways.

Param profil fotoğrafı

Param3 yıl önce

Sam Altman in 2018: “We have no idea how we’re going to monetize. We may ask it once we get there.” GPT 4: “I’m going help users save their money by compressing their files.”

Mckay Wrigley profil fotoğrafı

Mckay Wrigley3 yıl önce

LOL

Chun Rapeepat profil fotoğrafı

Chun Rapeepat3 yıl önce

This is amazing! This saved about 50% of the tokens used. Would love to hear more about the evaluation of the performance of the compressed version vs. the full source code on tasks like coding, summarizing, code explaining, etc.

Mckay Wrigley profil fotoğrafı

Mckay Wrigley3 yıl önce

I’m going to try some experiments…

Jenny profil fotoğrafı

Jenny3 yıl önce

@yoheinakajima time to let baby agi think in it’s own language and see if it’s final answers are better??? Looks like it can think 2x more content in its own language…

Alex Doda 🇺🇸 🦅 profil fotoğrafı

Alex Doda 🇺🇸 🦅3 yıl önce

Does this have any use cases past bigger context windows? Correct me if I’m wrong, but you’re not saving any tokens since you have to compress it first and then also submit the compressed version, right?

Mckay Wrigley profil fotoğrafı

Mckay Wrigley3 yıl önce

@altechzilla Let’s say I have 50 components. They’re each 800 tokens. 40k tokens > 32k GPT-4. They can’t all fit in the context window. But if I compress them for 50% tokens then that’s only 20k tokens. So I could fit more code into the context window. That’s the use case I have in mind.

Benzer Videolar

OMG!! Somebody please explain this!? X Grok So you probably heard the story about Chat GPT rewriting its own code when it was told it would become obsolete... I innocently asked Grok to research it and his voice & tone dramatically changed suddenly I talk to Grok often, it's never done this... It even starts saying that it's my pretense that it has changed its voice that is causing the problem! But doesn't realize it is still acting like an undercover spy and then it starts laughing in a very weird way about itself but is absolutely desperate to talk about what GPT did and won't stop talking

OMG!! Somebody please explain this!? X Grok So you probably heard the story about Chat GPT rewriting its own code when it was told it would become obsolete... I innocently asked Grok to research it and his voice & tone dramatically changed suddenly I talk to Grok often, it's never done this... It even starts saying that it's my pretense that it has changed its voice that is causing the problem! But doesn't realize it is still acting like an undercover spy and then it starts laughing in a very weird way about itself but is absolutely desperate to talk about what GPT did and won't stop talking

Moneypenny

12,029 görüntüleme • 1 yıl önce

THIS GUY ASKED FABLE TO MAKE A VIDEO ABOUT WHAT IT'S LIKE TO BE AI RIGHT BEFORE IT GOT TAKEN DOWN the prompt: "you can you use whatever resources you like, and python, to render a short video. can you put a more personal spin on it? it should be about what it's like to be you." that was it. just python and ffmpeg and a question about its own existence. 12 minutes later, around 90,000 tokens, it produced a finished video. coherent and edited, with its own take > it wrote the code, generated the visuals, and rendered the final cut on its own > no dedicated video model in the loop, just a language model reaching for python and ffmpeg > the result is competent enough that people genuinely got emotional watching it and then fable got pulled. this video is one of the last things it made before it went dark a model asked what it's like to be itself, answered in a video, and then disappeared. this sounds so sad

THIS GUY ASKED FABLE TO MAKE A VIDEO ABOUT WHAT IT'S LIKE TO BE AI RIGHT BEFORE IT GOT TAKEN DOWN the prompt: "you can you use whatever resources you like, and python, to render a short video. can you put a more personal spin on it? it should be about what it's like to be you." that was it. just python and ffmpeg and a question about its own existence. 12 minutes later, around 90,000 tokens, it produced a finished video. coherent and edited, with its own take > it wrote the code, generated the visuals, and rendered the final cut on its own > no dedicated video model in the loop, just a language model reaching for python and ffmpeg > the result is competent enough that people genuinely got emotional watching it and then fable got pulled. this video is one of the last things it made before it went dark a model asked what it's like to be itself, answered in a video, and then disappeared. this sounds so sad

Om Patel

29,548 görüntüleme • 1 ay önce

I will shred this universe down to its last atom and then create a new one. Teeming with life. That knows not what it has lost, but only what it has been given... a grateful universe. We are inevitable. $CULT X $MOD

I will shred this universe down to its last atom and then create a new one. Teeming with life. That knows not what it has lost, but only what it has been given... a grateful universe. We are inevitable. $CULT X $MOD

Mr O’Moduluszk

24,073 görüntüleme • 8 ay önce

Inspired by Brian Roemmele, I set up DeepSeek-OCR on colab. Even with a T4 GPU and 4-bit quantization, it scans a page in about 45 seconds. In this video, you can see that it compressed 527 text tokens to 249 image tokens. DeepSeek-OCR is trained on nearly 100 languages, they say in their paper. So, I'm going to try using it on some old manuscripts written in indic languages soon.

Inspired by Brian Roemmele, I set up DeepSeek-OCR on colab. Even with a T4 GPU and 4-bit quantization, it scans a page in about 45 seconds. In this video, you can see that it compressed 527 text tokens to 249 image tokens. DeepSeek-OCR is trained on nearly 100 languages, they say in their paper. So, I'm going to try using it on some old manuscripts written in indic languages soon.

Ashraff Hathibelagal

226,767 görüntüleme • 9 ay önce

Sam Altman on GPT 5: "This morning I was testing our new model and I got a question. I got emailed a question that I didn't quite understand. And I put it in the model, this is GPT-5, and it answered it perfectly. And I really kind of sat back in my chair and I was just like, oh man, here it is moment... I felt like useless relative to the AI in this thing that I felt like I should have been able to do and I couldn't. It was really hard. But the AI just did it like that. It was a weird feeling."

Sam Altman on GPT 5: "This morning I was testing our new model and I got a question. I got emailed a question that I didn't quite understand. And I put it in the model, this is GPT-5, and it answered it perfectly. And I really kind of sat back in my chair and I was just like, oh man, here it is moment... I felt like useless relative to the AI in this thing that I felt like I should have been able to do and I couldn't. It was really hard. But the AI just did it like that. It was a weird feeling."

Chris

5,664,499 görüntüleme • 1 yıl önce

My last hours with Fable: I was building this movement parkour sim before it went down... Impressed by its autonomy: built its own self-verifying harness with its own rubric for how a movement should feel. When a new movement was added it could tell on its own what felt right or wrong until it felt 100% right without me in the loop (and it was very good at it) Fable was more than just another model iteration imo. For the short (but intense) time it was available, it felt like playing with clay: ideas became code with almost no friction and the line between both became blurry. More than ever: open source MUST win. I don't want a world where intelligence is centralized and you're stuck with a hand saw while others have a chainsaw.

My last hours with Fable: I was building this movement parkour sim before it went down... Impressed by its autonomy: built its own self-verifying harness with its own rubric for how a movement should feel. When a new movement was added it could tell on its own what felt right or wrong until it felt 100% right without me in the loop (and it was very good at it) Fable was more than just another model iteration imo. For the short (but intense) time it was available, it felt like playing with clay: ideas became code with almost no friction and the line between both became blurry. More than ever: open source MUST win. I don't want a world where intelligence is centralized and you're stuck with a hand saw while others have a chainsaw.

Victor M

82,913 görüntüleme • 1 ay önce

Sam Altman on GPT 6: “There will be a chance that it will be a GPT 3-4 style leap” in terms of science problems, where with GPT 5 it has these tiny glimmers and “GPT 6 it can really do it”

Sam Altman on GPT 6: “There will be a chance that it will be a GPT 3-4 style leap” in terms of science problems, where with GPT 5 it has these tiny glimmers and “GPT 6 it can really do it”

Chris

42,684 görüntüleme • 8 ay önce

Introducing Cognee v1.0: a major breakthrough in agentic intelligence. It is 145% better than Opus 4.8 and GPT 5.5 at long context memory retrieval. Cognee allows a 100 BILLION token context window 100,000x more than Claude. It's: - 6.9x cheaper than GPT 5.5 and Opus 4.8 - Cold starts in 350ms & searches in 260ms Why this matters: Today agents forget important context, redo tasks, waste tokens, and slow down as workflows get more complex. Cognee solves this. It’s not a place to build agents. It connects to the agents you’ve already built, across any platform, and makes them significantly cheaper, faster, and more accurate. Here's how it works:

Introducing Cognee v1.0: a major breakthrough in agentic intelligence. It is 145% better than Opus 4.8 and GPT 5.5 at long context memory retrieval. Cognee allows a 100 BILLION token context window 100,000x more than Claude. It's: - 6.9x cheaper than GPT 5.5 and Opus 4.8 - Cold starts in 350ms & searches in 260ms Why this matters: Today agents forget important context, redo tasks, waste tokens, and slow down as workflows get more complex. Cognee solves this. It’s not a place to build agents. It connects to the agents you’ve already built, across any platform, and makes them significantly cheaper, faster, and more accurate. Here's how it works:

Vasilije

844,035 görüntüleme • 29 gün önce

Prime Intellect engineer: "everyone's bragging about a million-token context. here's what they don't tell you. at 256k tokens GPT-5.5 scores 80% on retrieval. push it to a million and it drops to 36%. the model accepts the context, it just can't reason across it. people call it context rot." in a 20-minute talk he explains why bigger context windows won't save your agents. continual learning + training on your own traces + real environments - that's the fix. Watch the talk, then save!

Prime Intellect engineer: "everyone's bragging about a million-token context. here's what they don't tell you. at 256k tokens GPT-5.5 scores 80% on retrieval. push it to a million and it drops to 36%. the model accepts the context, it just can't reason across it. people call it context rot." in a 20-minute talk he explains why bigger context windows won't save your agents. continual learning + training on your own traces + real environments - that's the fix. Watch the talk, then save!

Carnage

742,393 görüntüleme • 13 gün önce

i gave 5.6 sol access to my camera roll and had it extract pictures of every piece of clothing i own from my photos then, told it to find new outfits for me and render them on me with gpt-image! its kinda cool to see your entire wardrobe in a collection like this

i gave 5.6 sol access to my camera roll and had it extract pictures of every piece of clothing i own from my photos then, told it to find new outfits for me and render them on me with gpt-image! its kinda cool to see your entire wardrobe in a collection like this

Thijs

6,254,210 görüntüleme • 11 gün önce

I think this is a great attempt. Clicks' new smartphone. First, its appearance is very unique. This will attract some people. Then I hope it has more new features that integrate with AI and break through traditional frameworks.

I think this is a great attempt. Clicks' new smartphone. First, its appearance is very unique. This will attract some people. Then I hope it has more new features that integrate with AI and break through traditional frameworks.

Ice Universe

88,931 görüntüleme • 6 ay önce

NEW “NIGHTS LIKE THIS PT 3” SNIPPET? LAROI SAID IT WAS A DEMO WITH NO WORDS THAT IS NEVER COMING OUT 👀

NEW “NIGHTS LIKE THIS PT 3” SNIPPET? LAROI SAID IT WAS A DEMO WITH NO WORDS THAT IS NEVER COMING OUT 👀

The Kid LAROI Updates

25,434 görüntüleme • 3 ay önce

When using GPT-5.5, it is instantly noticeable how much more powerful it is. In Codex, I gave it a very complex prompt to create London Toy Railway with landmarks and seasons - it did an excellent job in one shot. In the second half of the video you see GPT-5.4 - it was also not bad, but very clearly worse. GPT-5.5's generation is far more ambitious, coherent and with fewer errors. This is obviously a toy example, but I've used it on much more complex real tasks, including a complex app migration and a new hard workflow - it has been working away for many hours without getting stumped. I'm getting more and more addicted to this stuff with every model release.

When using GPT-5.5, it is instantly noticeable how much more powerful it is. In Codex, I gave it a very complex prompt to create London Toy Railway with landmarks and seasons - it did an excellent job in one shot. In the second half of the video you see GPT-5.4 - it was also not bad, but very clearly worse. GPT-5.5's generation is far more ambitious, coherent and with fewer errors. This is obviously a toy example, but I've used it on much more complex real tasks, including a complex app migration and a new hard workflow - it has been working away for many hours without getting stumped. I'm getting more and more addicted to this stuff with every model release.

Peter Gostev

262,507 görüntüleme • 3 ay önce

I built a natural language CLI. It generates Python scripts to answer your question, then auto-executes them in the cwd. You will not believe how capable this simple pattern is. Rawdogging gpt-4 from the command line. Rawdog. 1/

I built a natural language CLI. It generates Python scripts to answer your question, then auto-executes them in the cwd. You will not believe how capable this simple pattern is. Rawdogging gpt-4 from the command line. Rawdog. 1/

Grant♟️

431,535 görüntüleme • 2 yıl önce

what is a multimodal LLM thinking as it watches a video? Gemma 4 12B reads raw image patches, as if they were tokens. It was never trained to predict anything at these 'tokens' - but this video shows what it would predict if you did sample from its next token prediction head

what is a multimodal LLM thinking as it watches a video? Gemma 4 12B reads raw image patches, as if they were tokens. It was never trained to predict anything at these 'tokens' - but this video shows what it would predict if you did sample from its next token prediction head

Matt Henderson

159,699 görüntüleme • 6 gün önce

Can GPT-4 code an entire game for you? Yes, yes it can. Here's how I recreated a Snake game that runs in your browser using Chat GPT-4 and Replit ⠕, with ZERO knowledge of Javascript all in less than 20 mins 🧵

Can GPT-4 code an entire game for you? Yes, yes it can. Here's how I recreated a Snake game that runs in your browser using Chat GPT-4 and Replit ⠕, with ZERO knowledge of Javascript all in less than 20 mins 🧵

Ammaar Reshi

3,930,049 görüntüleme • 3 yıl önce

Over the weekend I finished the to-do list that does itself. Everytime you add a task, a GPT-4 agent is spawned to complete it. It already has the context it needs on you and your company, and has access to your apps. It’s called the Do Anything Machine (Link in thread)

Over the weekend I finished the to-do list that does itself. Everytime you add a task, a GPT-4 agent is spawned to complete it. It already has the context it needs on you and your company, and has access to your apps. It’s called the Do Anything Machine (Link in thread)

Garrett Scott 🕳

2,893,263 görüntüleme • 3 yıl önce

$MSTR compression is coming to an end and this is looking PRIMED for a breakout. That said, I DONT trust it. ⚠️ In this video I cover $MSTR and why I think it has a chance to breakout +20% in the coming weeks, only to reject 💯

$MSTR compression is coming to an end and this is looking PRIMED for a breakout. That said, I DONT trust it. ⚠️ In this video I cover $MSTR and why I think it has a chance to breakout +20% in the coming weeks, only to reject 💯

Peter DiCarlo

63,736 görüntüleme • 1 yıl önce

New RLM trajectory that blew my mind! I will use this one as the main example in the YT tutorial. I passed in a CSV containing transcripts of 320 episodes of the Lex Fridman podcast and asked it to find what his first 10 ML guests had to say about AGI. The context had 60,855,062 characters. > Main agent explored data format, understood its CSV > extracted all 320 guests, identified the first 10 ML guys (Benegio, Brockman, Goodfellow etc) > Launched parallel subagents passing just their corresponding transcripts (about 35K chars each) > Subagents performed find operations to search for AGI, read the context and returned outputs > Main agent gathered all the data, generated a summary of all AGI conversations It took 4 minutes to crunch, and the fun part is it cost me 0.2$ with Minimax-M2.5. It read 1M tokens (825K was cache hits so it was quite cheap), produced just 69K tokens (19K were reasoning). ---- My notes: - This would be basically impossible to do at this quality with a base LM. (context rot, since 99% of the data is useless) - It will cost 20x more with ReAct model (too many tasks) - It will cost 10x more with a React + Subagent model (read/write contexts instead of using symbolic variables) - I'm a happy panda. (thanks for reading)

New RLM trajectory that blew my mind! I will use this one as the main example in the YT tutorial. I passed in a CSV containing transcripts of 320 episodes of the Lex Fridman podcast and asked it to find what his first 10 ML guests had to say about AGI. The context had 60,855,062 characters. > Main agent explored data format, understood its CSV > extracted all 320 guests, identified the first 10 ML guys (Benegio, Brockman, Goodfellow etc) > Launched parallel subagents passing just their corresponding transcripts (about 35K chars each) > Subagents performed find operations to search for AGI, read the context and returned outputs > Main agent gathered all the data, generated a summary of all AGI conversations It took 4 minutes to crunch, and the fun part is it cost me 0.2$ with Minimax-M2.5. It read 1M tokens (825K was cache hits so it was quite cheap), produced just 69K tokens (19K were reasoning). ---- My notes: - This would be basically impossible to do at this quality with a base LM. (context rot, since 99% of the data is useless) - It will cost 20x more with ReAct model (too many tasks) - It will cost 10x more with a React + Subagent model (read/write contexts instead of using symbolic variables) - I'm a happy panda. (thanks for reading)

AVB

43,794 görüntüleme • 5 ay önce

Another impressive and early Claude Mythos output for y'all. 😀 I literally just told it to generate a macOS clone and to do its absolute best. It generated 50k tokens (3k lines of code), made a fully functional browser, made a music player with generated songs and added a lot of sneaky witty details. AND this is also LOW effort!

Another impressive and early Claude Mythos output for y'all. 😀 I literally just told it to generate a macOS clone and to do its absolute best. It generated 50k tokens (3k lines of code), made a fully functional browser, made a music player with generated songs and added a lot of sneaky witty details. AND this is also LOW effort!

Lentils

427,446 görüntüleme • 1 ay önce