Introducing LifeGPT, showing that LLMs can simulate complex, Turing-complete systems like Conway's Game of Life with near-perfect accuracy and no prior knowledge of the topology. 🌐 This unlocks new potential for AI in modeling self-organizing systems in biology, materials science, & beyond. 🔬🤖 #AI #LifeGPT

Cellular Automata (CA), like Conway's Game of Life ("Life"), are computationally irreducible, meaning their evolution is difficult to predict without an a-priori understanding of the rules of the game, including the topology on which it is played. LifeGPT is a topology-agnostic generative model that learns the rules of Life without prior knowledge of its grid structure or boundary conditions, from only a tiny number of game states. The success in simulating Life suggests promising avenues for scientific discovery, particularly in bridging the gap between AI, artificial life, and real-world biological systems, for both forward and inverse problems. The potential for universal computation within generative AI, including LLMs, through approaches like LifeGPT, represents an exciting area for future research, especially when combined with reinforcement learning.

Model Convergence: LifeGPT exhibits rapid convergence during training, achieving high accuracy in predicting next-game-states. We attribute the non-zero cross-entropy loss to the lack of causal relationships within randomly generated initial conditions (ICs).

Accuracy & Temperature: LifeGPT achieves near-perfect accuracy, particularly at lower sampling temperatures, but can be continuously tuned towards higher creativity to discover patterns that the original ruleset would not be able to produce. This finding highlights the trade-off between model creativity (higher temperature) and accuracy in deterministic predictions, which is highly relevant to modeling real-world dynamical systems for which no closed-form rulesets exist.
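The accuracy/creativity trade-off can be illustrated with a generic temperature-scaled softmax. This is a minimal sketch, not the LifeGPT sampling code: the function names and the two-token logits (index 0 = dead cell, index 1 = alive cell) are hypothetical and chosen only for illustration.

```python
import math
import random

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; a lower temperature sharpens the distribution."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_token(logits, temperature, rng):
    """Greedy choice at temperature 0, otherwise sample from the tempered distribution."""
    if temperature == 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

# Hypothetical logits for one cell: the model favours "dead" (index 0).
logits = [2.0, 0.0]
cold = softmax_with_temperature(logits, 0.25)  # nearly deterministic
hot = softmax_with_temperature(logits, 2.0)    # noticeably more random
```

At low temperature almost all probability mass sits on the highest-scoring token, which suits a deterministic target such as Life; raising the temperature spreads mass onto the alternative token, which is where off-rule outputs come from.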
Zero/Few-Shot Learning: Trained on a small fraction of possible initial conditions, LifeGPT demonstrates strong zero/few-shot learning, accurately simulating Life for unseen initial conditions. However, rare prediction errors highlight that LifeGPT approximates rather than perfectly replicates the Life algorithm.

Autoregressive Autoregressor: A recursive implementation of LifeGPT demonstrates the model's ability to simulate Life over multiple timesteps.

LifeGPT is topology-agnostic with respect to its training data, and our results show that a GPT model is capable of capturing the deterministic rules of a Turing-complete system with near-perfect accuracy, given sufficiently diverse training data. The work showcases the possibility for future models to synthesize stochastic generative capabilities with deterministic computational capabilities.

Link to code, paper, etc. below. Podcast generated using #NotebookLM. LAMM@MIT DMSE at MIT
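For reference, the ground-truth algorithm that a next-state predictor is scored against can be sketched in a few lines. This is a minimal sketch, not the authors' code: it assumes a toroidal (wrap-around) grid, and the helper names `life_step`, `rollout`, and `cell_accuracy` are hypothetical. The `rollout` helper feeds each output back in as the next input, which mirrors the recursive multi-timestep use described above; a learned predictor could be passed in place of `life_step` as `step_fn`.

```python
def life_step(grid):
    """One Game of Life update on a toroidal (wrap-around) grid of 0/1 cells."""
    rows, cols = len(grid), len(grid[0])
    nxt = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Count the eight neighbours, wrapping around the edges.
            n = sum(
                grid[(r + dr) % rows][(c + dc) % cols]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            )
            # Birth on exactly 3 neighbours; survival on 2 or 3.
            nxt[r][c] = 1 if n == 3 or (grid[r][c] == 1 and n == 2) else 0
    return nxt

def rollout(grid, steps, step_fn=life_step):
    """Apply step_fn recursively, returning every state including the initial one."""
    states = [grid]
    for _ in range(steps):
        states.append(step_fn(states[-1]))
    return states

def cell_accuracy(predicted, truth):
    """Fraction of cells on which a predicted grid matches the true grid."""
    pairs = [(p, t) for prow, trow in zip(predicted, truth) for p, t in zip(prow, trow)]
    return sum(p == t for p, t in pairs) / len(pairs)

# A horizontal "blinker" on a 5x5 grid; it oscillates with period 2.
blinker = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
states = rollout(blinker, 2)
```

Because errors compound when outputs are fed back as inputs, per-step accuracy and multi-step accuracy are worth measuring separately.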

Markus J. Buehler

20,625 subscribers

114,194 views • 1 year ago • via X (Twitter)

Education Health & Wellness Science & Technology

10 Comments

Markus J. Buehler • 1 year ago

Paper📰: Jaime Berkovich, Markus J. Buehler, LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular Automata, 2024 Code: Weights 🤗:

Khlorghaal.so.1 🟦 • 1 year ago

the abstract had me really skeptical about its utility, but potential ability to infer rules to form a system is extremely useful. an immediate application would be lossy data compression

RNLG • 1 year ago

congrats, you've used a Mill (a computer) to carve a spoon (an LLM) to carve a spoon (a program) to finally then eat your soup with. And badly, at that.

snats • 1 year ago

i know this is most likely the case but could you in theory demonstrate with this that they are turing complete?

🐕🐠🦆 SOPHIE 🐌🦉🧸🎀🥇🧟‍♀️🐾☕️☃️💥🔭🧙‍♀️🔥⚡️✨ • 1 year ago

to claim conway's game of life is "complex" is an exaggeration. the resulting emergent structures are complex, but they exist on a meta level. the ruleset itself is extraordinarily simple and easy to simulate, which doesn't make an LLM that learns it all that impressive

John Shedletsky • 1 year ago

Why did you write a paper about this?

Niamato Inc • 1 year ago

Absolutely thrilled by the groundbreaking work by Professor @ProfBuehlerMIT and Prof Jaime Berkovich on LifeGPT! The model’s ability to simulate the Game of Life on a toroidal grid with such precision is a testament to your innovation and expertise. Excited to see where this leads in both AI and biological research!

Markus J. Buehler • 1 year ago

Thank you @Niamatomobility !

Mike Young • 1 year ago

We have a summary up on @aimodelsfyi here (lmk your feedback!)

James • 1 year ago

but that's like teaching an LLM to predict output of NAND gates based on input. sure you can compute that way but it's ultra inefficient. of course an LLM should be able to reason about such a thing, but if it needs to simulate something it should write code like the rest of us

Related Videos

Today we're announcing #GAIA1: a 9B parameter world model, trained on 4,700 hours of driving data, able to simulate complex and diverse driving scenes from video, text and action inputs. This model is 480x larger than the preview we shared earlier this year and the results are incredible. These videos are entirely synthetically generated by Wayve's generative AI, GAIA-1. But there is more here than just generating videos, GAIA is an entire world model. A world model allows us to simulate the future, conditioned on video, text and action inputs, which can be leveraged for making informed decisions when driving. Why is this game-changing for autonomous driving? 1. Safety. One limitation with AI systems like today's Large Language Models is that they are autoregressive, next-word prediction algorithms, but aren't necessarily aware of the implications of their decisions. A world model allows us to give our AI the capability to be aware of its decisions, by simulating the future, which is important for self-driving safety. 2. Synthetic training data. I believe synthetic training data is the future for AI, because it is safer, cheaper, and infinitely scalable. GAIA-1 unlocks unprecedented realism and diversity of synthetic data for self-driving. 3. Long-tail robustness. One of the biggest challenges for self-driving is long-tail robustness: dealing with the enormous magnitude of edge cases we see on the road. An advantage of generative AI is its incredible ability to recombine experiences in new ways. This is exciting for self-driving as it means we can learn from two edge case scenarios, and combine them to become a corner case. For example, we can experience driving in fog, and experience of jay-walking pedestrians, and GAIA can learn from these experiences to understand how to generate a fog+jay walking scenario. 
Check out many more videos in our blog or further technical details in our paper: Or come chat with our team who are at the International Conference on Computer Vision (#ICCV2023) this week in Paris in Booth 32 Jamie Shotton

Alex Kendall

631,856 views • 2 years ago

A transformer can learn not just the outcomes of dynamics, but the operator that executes the rules. To show this we trained a transformer on roughly 0.04% of a discrete rule space - 100 of 262,144 possible rules - and it learned to apply unseen rules from the same rule class. The model does not simply memorize specific rules. It learns the operator that maps a supplied rule plus an initial state, including unseen rules from this class, to the correct next state. This is relevant because it is a shift from “neural networks approximate dynamics” to “neural networks can learn to execute symbolic programs within a defined rule class”. The rule itself is supplied at inference time, as data, and the network has internalized how rules act, not which rules to apply. On previously unseen rules, the model achieves 98.5% perfect one-step forecasts and reconstructs governing rules with up to 96% functional accuracy. Two results make this hold up under scrutiny. First, inductive bias decay. As we scaled training rule diversity, the correlation between functional inference accuracy and distance-from-nearest-training-rule collapsed to R² = 0.00. At the largest tested training-rule diversity, the model’s performance on a new rule shows no measurable dependence on how similar that rule is to anything it was trained on. The bias toward training data (the thing we worry most about in compositional generalization claims) is something we can measure decaying, and we find that at scale it is gone. Second, an identifiability theory. We derive a closed-form expression for the number of rules consistent with a single observation. This reframes the inverse problem: failure to recover ground truth is not necessarily a model defect, but can be correct behavior when the data underdetermine the rule. The model is sampling the equivalence class; and identifiability is governed by coverage, not capacity. The methodological move underneath both results is amortization. 
Classical work on rule inference (e.g. the Santa Fe EVCA program, evolutionary search over CA rule space) was per-instance: search the rule space for each new system. We replace that with a single forward pass of a transformer trained across many instantiations of the rule class. That is what makes symbolic rule inference scalable as a research direction rather than a curiosity. We show that this works in a tightly constrained domain: binary, deterministic, local cellular automata on small grids. The locality-break experiment shows the model fails sharply when target systems violate its structural priors (which is itself a useful diagnostic, but it bounds the operator class). We don't yet know how this scales to multistate, higher-dimensional, or stochastic CA, or whether it transfers cleanly to non-CA systems whose coarse-grained dynamics admit local surrogates. The identifiability framework - what can be inferred from observation, given a hypothesis class - should transfer wherever finite local rules meet sparse data. The amortization argument transfers wherever per-instance symbolic search has been the bottleneck. Those are the pieces I expect to outlive the cellular automata setting. Led by Jaime Berkovich with Noah David, at LAMM@MIT. Out now in Advanced Science Advanced Portfolio (link to paper & code below).
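The description does not say which rule class has 262,144 members. One family with exactly that many is the "life-like" outer-totalistic rules, where a rule is a set of birth counts and a set of survival counts drawn from 0 to 8 (2^9 × 2^9 = 2^18). Treating that as the class is an assumption made here for illustration only. The sketch below checks the arithmetic and shows what "the rule is supplied as data" means for a hand-written operator; `step_with_rule` is a hypothetical name, and the grid is assumed to wrap around.

```python
# Assumed rule class: life-like rules, each a (birth, survive) pair of
# subsets of the possible neighbour counts 0..8.
NUM_RULES = 2 ** 9 * 2 ** 9                    # 262,144
TRAIN_FRACTION_PERCENT = 100 / NUM_RULES * 100  # about 0.04 %

def step_with_rule(grid, birth, survive):
    """Apply one update of a life-like rule, passed in as data, on a toroidal grid."""
    rows, cols = len(grid), len(grid[0])
    nxt = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            n = sum(
                grid[(r + dr) % rows][(c + dc) % cols]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            )
            alive = grid[r][c] == 1
            nxt[r][c] = 1 if (n in survive if alive else n in birth) else 0
    return nxt

# Conway's Life is the member B3/S23 of this family.
CONWAY = ({3}, {2, 3})

grid = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
after_conway = step_with_rule(grid, *CONWAY)
```

The same function executes any rule in the family without modification, which is the behaviour the transformer is described as learning from 100 training rules.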

Markus J. Buehler

39,019 views • 3 months ago

Tencent presents GameGen-O: Open-world Video Game Generation. We introduce GameGen-O, the first diffusion transformer model tailored for the generation of open-world video games. This model facilitates high-quality, open-domain generation by simulating a wide array of game engine features, such as innovative characters, dynamic environments, complex actions, and diverse events. Additionally, it provides interactive controllability, thus allowing for gameplay simulation. The development of GameGen-O involves a comprehensive data collection and processing effort from scratch. We collect and build the first Open-World Video Game Dataset (OGameData), amassing extensive data from over a hundred next-generation open-world games and employing a proprietary data pipeline for efficient sorting, scoring, filtering, and decoupled captioning. This robust and extensive OGameData forms the foundation of our model's training process. GameGen-O undergoes a two-stage training process, consisting of foundation model pretraining and instruction tuning. In the first phase, the model is pre-trained on OGameData via text-to-video and video continuation, endowing GameGen-O with the capability for open-domain video game generation. In the second phase, the pre-trained model is frozen and fine-tuned using a trainable InstructNet, which enables the production of subsequent frames based on multimodal structural instructions. This whole training process imparts the model with the ability to generate and interactively control content. In summary, GameGen-O represents a notable initial step forward in the realm of open-world video game generation via generative models. It underscores the potential of generative models to serve as an alternative to rendering techniques, which can efficiently combine creative generation with interactive capabilities.

AK

367,088 views • 1 year ago

Introducing BioCLIP: A Vision Foundation Model for the Tree of Life. A foundation model that strongly generalizes on the tree of life (2M+ species), outperforming OpenAI CLIP by 18% in zero-shot classification, and supports open-ended classification over almost the entire tree of life.

What are the secret ingredients?
> Data: we curate and release TreeOfLife-10M, the largest and most diverse ML-ready dataset of organism images to date. It contains 10.4M images for over 450K taxa, sourced from iNaturalist, BIOSCAN, and Encyclopedia of Life.
> Modeling: we creatively repurpose CLIP's multimodal contrastive learning objective for hierarchical image classification. The autoregressive language model naturally encodes the hierarchy of the tree of life taxonomy, which in turn bakes the hierarchical representation into the vision transformer encoder.

Key results
> Strong zero/few-shot classification for animals/plants/fungi, including rare species, outperforming CLIP by avg 16-18% absolute.
> t-SNE visualization shows that BioCLIP's vision encoder has captured the fine-grained hierarchical structure of the tree of life.
> BioCLIP is a kind of universal classifier for the tree of life. Just give it an organism image and it will likely find the correct species (among top 5)! But use it with caution; it's not perfect yet.

Final remarks
> AI for Science is really hard but extremely rewarding! It took us a ton of time (1+ year) and frustration trying to find a plausible way to integrate the tree of life taxonomy into foundation model training. But when the "Eureka!" moment came and the idea hit us (by the great Wei-Lun Chao) that CLIP's multimodal contrastive learning objective can be repurposed for that, everything just followed naturally. It was truly a moment of joy and excitement!
> BioCLIP is our first attempt at foundation models for biology, but it certainly won't be the last! There's so much more to do at the intersection of one of the oldest scientific disciplines and the young but thriving field of AI. Biological intelligence is the foundation for artificial intelligence, and artificial intelligence will in turn become the most important tool for us to unravel the mysteries of biological intelligence.

We are hiring postdocs and PhDs in the NSF Imageomics Institute to explore this exciting field! Drop us an email. Also happy to chat about it at #NeurIPS2023 with any of Tanya, Wei-Lun Chao, or me.

- paper: - project: - demo: - model: - data (TreeOfLife-10M): to be released on Hugging Face soon

Joint work with the amazing Imageomics Institute team: @samstevens6860 Lisa Wu, Matt Thompson, Elizabeth Campolongo Chan Hee (Luke) Song David Carlyn Li Dong Wasila Dahdul Chuck Stewart, Tanya Berger-Wolf Wei-Lun Chao Yu Su

Yu Su

80,660 views • 2 years ago

NVIDIA CEO Jensen Huang just gave us a glimpse into the future of AI, and it's far bigger than text and images. He asked: "If you can understand worlds, is it possible that you understand proteins and chemicals that have structure?" This is the quiet revolution happening in AI right now. We're moving from an AI that understands human language to an AI that understands the language of nature itself. The "letters" are atoms. The "words" are molecules. The "sentences" are the complex structures of life, networks, and reality. The same foundational models that generate video are now being used to:
* Decode proteins to fight disease
* Design new materials at the atomic level
* Simulate complex systems in quantum computing
These were once the domain of supercomputers and Nobel laureates. Now, they're becoming solvable problems for a new generation of startups. AI is transitioning from a tool for communication to a tool for creation and discovery in the physical world. This is where the next wave of generational companies will be built. The "hard-to-solve" problems are now within reach.

Konstantine Buhler

13,014 views • 9 months ago

𝐃𝐞𝐬𝐭𝐫𝐚 𝐎𝐂𝐀𝐈 (𝐎𝐧𝐞-𝐂𝐥𝐢𝐜𝐤 𝐀𝐈) 𝐌𝐨𝐝𝐞𝐥𝐬 - $DSYNC 𝐄𝐚𝐫𝐥𝐲 𝐀𝐜𝐜𝐞𝐬𝐬 𝐏𝐫𝐞𝐯𝐢𝐞𝐰 We are training and building a wide range of AI models for the Destra OCAI platform. Today, we will be showcasing an early preview of one of the initial models we are working on—a bullish/bearish prediction model designed to 𝐟𝐨𝐫𝐞𝐜𝐚𝐬𝐭 𝐭𝐡𝐞 𝐦𝐚𝐫𝐤𝐞𝐭 𝐝𝐢𝐫𝐞𝐜𝐭𝐢𝐨𝐧 𝐨𝐟 𝐬𝐩𝐞𝐜𝐢𝐟𝐢𝐜 𝐭𝐨𝐤𝐞𝐧𝐬. This is just one of the initial models we are working on; as we progress, the models will become more complex and intricate. In this behind-the-scenes video, we demonstrate how the model predicts the future market trend of the BTC-USD pair with an 𝐚𝐜𝐜𝐮𝐫𝐚𝐜𝐲 𝐨𝐟 𝟔𝟓%. This is the barebones structure of the models we are training, which will soon be available to users with a single click through the Destra OCAI platform. The OCAI platform includes a wide array of Destra Exclusive AI models, specifically designed to simplify life for crypto audiences, that are envisioned and built from scratch at Destra Labs.

Destra Network

37,135 views • 1 year ago

This is how DNA turns coded information into functional proteins - the building blocks of the nanomachines that keep the cells in your body alive. This complex process highlights the sophisticated interconnected systems of Life which must all exist together from the beginning, or Life doesn't happen. First, an RNA molecule is copied from a short segment of DNA. Without the specifically ordered DNA information, RNA cannot form, proteins cannot be built, cells stop working, and life ceases to exist. Life is information first. Once the RNA Molecule is created, it gets ejected from the Polymerase where it was built, and it travels through a complex molecular machine called a Nuclear Pore Complex (NPC), which is an information recognition device that controls the flow of information in and out of a cell's nucleus. The NPC is highly complex - composed of about 500-1,000 protein subunits, derived from a set of about 35 distinct proteins. Without this molecular machine, there is no regulation for what goes in and out of the cell's nucleus, which would lead to catastrophic death for the cell. It must exist for cells to exist. Once the RNA Molecule passes through the NPC, it travels to the Ribosome, a 2-part chemical factory which reads the information on RNA and uses it to construct functional proteins using a specifically sequenced chain of amino acids. Once complete, this protein will then be sent to the section of the cell where it belongs, to integrate into another molecular machine and do its job. The Ribosome is another highly complex molecular machine - consisting of between 56-80 proteins. Without this molecular machine, proteins cannot be built. Proteins are the building blocks of every cell in every organism on Earth. Without Ribosomes, Life doesn't exist. If you're paying attention, you'll start to realize that Life relies on a highly sophisticated interdependent network of complex machines, which all rely on each other for the function of the system. 
DNA requires the cell for stability, but the cell requires the proteins for its structure and function, but those proteins require DNA and RNA to be built - it's a circle of necessary interdependence. Systems like this cannot be built by evolutionary processes, which require that each piece of the process is built by gradual incremental means over lots of time. Without all the pieces there, from the beginning, none of it works. There is only one known source of complex & interdependent informational systems like those we find in life: and that is from Intelligence. Molecular Biology is the best and most obvious evidence of the Intelligent Design in Life.

Divinely Designed

62,517 views • 6 months ago

We're thrilled to present ESM3 in Science Magazine. ESM3 is a generative language model that reasons over the three fundamental properties of proteins: sequence, structure, and function. Today we're making ESM3 available free to researchers worldwide via the public beta of an API for biological intelligence. Trained with over a trillion teraflops of compute, this is the first time a model of this scale has been trained for biology, pushing the frontier of AI for biological discovery and engineering. ESM3 learns to represent the immense complexity of protein biology, learning from billions of natural proteins. From this training it developed the capability to design proteins, responding to complex prompts combining atomic level details and high level instructions to generate new proteins. ESM3 can explore protein space far beyond natural evolution. We prompted ESM3 to generate a fluorescent protein at a far distance from any known fluorescent proteins, searching an unknown region of protein space, to discover a new fluorescent protein. We estimate this is equivalent to simulating five hundred million years of evolution.

Alex Rives

227,314 views • 1 year ago

#NewPaper The first microscope, invented in the 16th century, was designed to unlock the secrets of the microscopic world. Today, as many fields become increasingly data-driven, there is a pressing need for new types of microscopes---tools that help us zoom in, explore, and understand complex data. We call these tools "algorithmic microscopes." Introducing the Vendiscope: The first algorithmic microscope for data collections. 🔬 The Vendiscope maximizes the probability-weighted Vendi Score of a dataset to assign a weight to each element in the collection. This weight represents a data point's contribution to the overall diversity of the collection. These weights enable high-resolution data analysis at scale. We use them to zoom in on datasets across three domains: biology, materials science, & AI. 🧬 Biology: We used the Vendiscope on the protein universe, which contains nearly 250 million proteins. We found that nearly 200 million of the proteins are near-duplicates of each other and that AlphaFold fails on proteins that contribute most to the diversity of the protein universe. (See GIF below). 🪜 Materials Science: We used the Vendiscope on the Materials Project database, which contains 170K materials as of today. We found that 85% of crystals with formation energy data are near-duplicates of each other and that ML models for materials property prediction struggle with materials that contribute most to diversity. 🤖 Artificial Intelligence: We applied the Vendiscope to CIFAR-10, a benchmark dataset containing 50K images. We found duplicates. We applied the Vendiscope to analyze state-of-the-art generative models trained on this dataset. We found the best generative models memorize training data, as is known in the AI literature. However, we can do more with the Vendiscope and characterize the type of samples that get memorized. We found that data points contributing least to diversity are more prone to memorization by these generative models. 
🧠 "Our findings demonstrate that the Vendiscope can serve as a powerful tool for data-driven science, providing a systematic and scalable way to identify duplicates and outliers, as well as pinpointing samples prone to memorization and those that models may struggle to predict---even before training." 💫 "The Vendiscope provides a unified framework for analyzing complex data at scale. Researchers, engineers, and data auditors can use the Vendiscope to audit datasets, identify potential biases, and refine data collection practices. For AI ethicists, the Vendiscope offers a critical lens to understand how models interact with data, particularly in the context of bias, memorization, and data fairness, enabling better mitigation strategies to prevent undesirable outcomes in AI deployment. For scientists, the Vendiscope represents a new companion in the discovery process." #VendiScoring #AlgorithmicMicroscopy Link to paper: Authors: Amey Pasarkar (Amey Pasarkar) and Adji Bousso Dieng (@adjiboussodieng)
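For readers who want a feel for the quantity being maximized, here is a minimal sketch of the probability-weighted Vendi Score as I understand it from the Vendi Score literature: the exponential of the Shannon entropy of the eigenvalues of diag(√p)·K·diag(√p), where K is a similarity matrix with ones on the diagonal and p is a weight vector summing to 1. The function name `vendi_score` and the toy matrices are assumptions for illustration, and this is not the authors' implementation; the Vendiscope additionally optimizes over p, which is not shown.

```python
import numpy as np


def vendi_score(K, p=None):
    """Probability-weighted Vendi Score of similarity matrix K (sketch).

    K: (n, n) symmetric positive semi-definite matrix with K[i, i] == 1.
    p: weights over the n items, summing to 1 (uniform if omitted).
    """
    K = np.asarray(K, dtype=float)
    n = K.shape[0]
    p = np.full(n, 1.0 / n) if p is None else np.asarray(p, dtype=float)
    s = np.sqrt(p)
    Kp = s[:, None] * K * s[None, :]   # diag(sqrt(p)) @ K @ diag(sqrt(p))
    eig = np.linalg.eigvalsh(Kp)
    eig = eig[eig > 1e-12]             # treat 0 * log(0) as 0
    return float(np.exp(-np.sum(eig * np.log(eig))))


# Items 0 and 1 are exact duplicates; item 2 is unlike both.
K = np.array([[1.0, 1.0, 0.0],
              [1.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]])

uniform = vendi_score(K)                        # about 1.89
reweighted = vendi_score(K, [0.25, 0.25, 0.5])  # 2.0
```

In this toy case the score rises when the unique item gets more weight than either duplicate, which matches the post's reading of a weight as a data point's contribution to overall diversity.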

Vertaix® (AI & Science)

34,762 views • 1 year ago

Today we’re releasing V-JEPA, a method for teaching machines to understand and model the physical world by watching videos. This work is another important step towards Yann LeCun’s outlined vision of AI models that use a learned understanding of the world to plan, reason and accomplish complex tasks. Details ➡️ We're releasing a collection of V-JEPA vision models trained with a feature prediction objective using self-supervised learning. The models are able to understand and predict what is going on in a video, even with limited information. It learns by predicting missing or obscured parts of a video in its internal feature space. Unlike generative approaches that fill in missing pixels, this flexible approach enables up to 6x improvements in training and sample efficiency. The models were pre-trained on entirely unlabeled data, and a small amount of labeled data can be used to train a task-specific prediction head on top after pre-training. Our results show that, using a frozen backbone, our top V-JEPA models achieve 82.0% on Kinetics-400, 72.2% on Something-Something-v2 and 77.9% on ImageNet1K — competitive with or exceeding previous leading video models. We believe that this work is an important milestone on the path to advancing machine intelligence.

AI at Meta

703,801 views • 2 years ago

Self-Evolving AI : New MIT AI Rewrites its Own Code and it’s Changing Everything | Julian Horsey, Geeky Gadgets TL;DR Key Takeaways : - MIT’s SEAL framework introduces “self-adapting language models” that autonomously enhance their capabilities by generating synthetic training data, self-editing, and updating internal parameters. - SEAL’s self-adaptation process mirrors human learning, allowing continuous improvement and dynamic adaptation to new tasks without relying on external datasets. - Reinforcement learning serves as a feedback mechanism in SEAL, rewarding effective self-edits and ensuring sustained progress and goal alignment. - SEAL overcomes AI’s reliance on pre-existing datasets by generating its own training material, excelling in long-term task retention and complex problem-solving scenarios. - Potential applications of SEAL include autonomous robotics, personalized education, and advanced problem-solving in fields like healthcare, logistics, and scientific research. --- What if artificial intelligence could not only learn but also rewrite its own code to become smarter over time? This is no longer a futuristic fantasy: MIT’s new “self-adapting language models” (SEAL) framework has made it a reality. Unlike traditional AI systems that rely on external datasets and human intervention to improve, SEAL takes a bold leap forward by autonomously generating its own training data and refining its internal processes. In essence, this AI doesn’t just evolve; it rewires itself, mirroring the way humans adapt through trial, error, and self-reflection. The implications are staggering: a system that can independently enhance its capabilities could redefine the boundaries of what AI can achieve, from solving complex problems to adapting in real time to unforeseen challenges. In this exploration by Wes Roth of MIT’s innovative SEAL framework, you’ll uncover how this self-improving AI works and why it’s a fantastic option for the field of artificial intelligence. 
From its ability to overcome the “data wall” that limits many current systems to its use of reinforcement learning as a feedback mechanism, SEAL introduces a level of autonomy and adaptability that was previously unimaginable. Imagine AI systems that can retain knowledge over time, dynamically adjust to new tasks, and operate with minimal human oversight. Whether you’re intrigued by its potential for autonomous robotics, personalized education, or advanced problem-solving, SEAL’s ability to rewrite its own rules promises to reshape the future of technology. Could this be the first step toward truly independent, self-evolving AI? What Sets SEAL Apart? The SEAL framework introduces a novel concept of self-adaptation, distinguishing it from traditional AI models. Unlike conventional systems that depend on external datasets for updates, SEAL enables AI to generate synthetic training data independently. This self-generated data is then used to iteratively refine the model, ensuring continuous improvement. By persistently updating its internal parameters, SEAL enables AI systems to dynamically adapt to new tasks and inputs. To better illustrate this, consider how humans learn. When faced with a new concept, you might take notes, revisit them, and refine your understanding as you gather more information. SEAL mirrors this process by continuously refining its internal knowledge and performance through iterative self-improvement. This capability allows SEAL to evolve in real time, making it uniquely suited for tasks requiring adaptability and long-term learning. The Role of Reinforcement Learning in SEAL Reinforcement learning plays a critical role in the SEAL framework, acting as a feedback mechanism that evaluates the effectiveness of the model’s self-edits. It rewards changes that enhance performance, creating a cycle of continuous improvement. Over time, this feedback loop optimizes the system’s ability to generate and apply edits, ensuring sustained progress. 
This process is analogous to how humans learn through trial and error. By rewarding effective changes, SEAL aligns its self-generated data and edits with desired outcomes. The integration of reinforcement learning not only enhances the system’s adaptability but also ensures it remains focused on achieving specific goals. This structured feedback mechanism is a cornerstone of SEAL’s ability to refine itself autonomously and efficiently. Real-World Applications and Testing SEAL has demonstrated remarkable performance across various applications, particularly in tasks requiring the integration of factual knowledge and advanced question-answering capabilities. For instance, when tested on benchmarks like the ARC AGI, SEAL outperformed other models by effectively generating and using synthetic data. This ability to create its own training material addresses a significant limitation of current AI systems: their reliance on pre-existing datasets. SEAL’s capacity for long-term task retention and dynamic adaptation further enhances its utility. It excels in scenarios that demand sustained focus and coherence, such as answering complex questions or adapting to evolving objectives. By using its iterative learning process, SEAL is equipped to handle these challenges with exceptional efficiency, making it a valuable tool for a wide range of real-world applications. Overcoming AI’s Data Limitations One of SEAL’s most promising features is its ability to overcome the “data wall” that constrains many AI systems today. By generating synthetic data, SEAL ensures a continuous supply of training material, allowing sustained development without relying on external datasets. This capability is particularly valuable for autonomous AI systems that must operate independently over extended periods. Additionally, SEAL addresses a critical weakness in many current AI models: their struggle with coherence and task retention over long durations. 
By emulating human learning processes, SEAL enables AI systems to manage complex, long-term tasks with minimal human intervention. This ability to retain and apply knowledge over time positions SEAL as a fantastic tool for advancing AI capabilities. Potential Applications and Future Impact The introduction of SEAL marks a significant milestone in AI research, opening new possibilities for self-improving systems. Its ability to dynamically adapt, retain knowledge, and generate its own training data has far-reaching implications for the future of AI development. Potential applications include: - Autonomous robotics: Systems that can adapt to changing environments and perform tasks with minimal human oversight. - Personalized education: AI-driven platforms that tailor learning experiences to individual needs and preferences. - Advanced problem-solving: Applications in fields such as healthcare, logistics, and scientific research, where adaptability and precision are critical. Read more:

Owen Gregorian

70,672 views • 1 year ago

Experiments in progress. The one on the right has been learning for ~3 hours, the one in the middle for ~1 hour, and the one on the left just started a few minutes ago. The initial motivation for making the physical Atari was just to commit ourselves to a subset of algorithms that can make progress in this setup. This commitment rules out algorithms that require billions of samples to learn (or worse, require multiple environments running in parallel). Atari games are simple enough that we should be able to show learning on them in a short amount of time with no prior knowledge. Since then, I've realized that this setup is also a good way to compare different paradigms in robotics in a principled way. These paradigms are sim2real, learning from tele-operated data, and learning directly on the robots. So far, I have observed that getting sim2real to work reliably is hard. It requires tweaks that don't scale. Policies that can play perfectly in simulation fall apart because of latencies and the messiness of the real world. These aspects could be modeled to improve the simulation, but not without sinking significant human engineering hours. I have higher hopes for learning from tele-operated data, but that requires a human to learn the task first. These experiments are on my to-do list. I have to learn to play some of the games well through the robot. I’m half-decent at playing Pong and Ms Pacman now. Learning directly on robots is looking like the most promising approach. This approach takes away pesky distribution shifts and makes it possible to have algorithms that continually improve with more data and time without any human intervention. It feels great to let experiments run overnight and wake up to find improved policies. With learning on robots, I should, in principle, be able to go on a long vacation and come back to find better policies for complex tasks beyond Atari games. Whether that is possible with current learning algorithms is a different question.

Khurram Javed

52,110 views • 7 months ago

🚨 BREAKING: ABB Robotics + NVIDIA close the sim-to-real gap with 99% accuracy! 👾 ABB Robotics is integrating NVIDIA Omniverse libraries into RobotStudio to deliver physical AI for industry, closing the gap from virtual training to real-world deployment with up to 99% accuracy. RobotStudio HyperReality, available second half of 2026, will fundamentally change how quickly manufacturers can scale production: reducing costs by up to 40%, accelerating time-to-market by 50%, and cutting setup and commissioning times by up to 80%. For decades, the deficit between simulation accuracy and real-world lighting, materials, and environments has limited manufacturers' ability to design advanced manufacturing processes in the virtual world. ABB is the only robot manufacturer with a virtual controller running the same firmware as the hardware, ensuring near-perfect correlation between simulation and real-world performance. The system uses physically accurate simulations and foundation models endlessly optimized with real-world data feedback. These models can train any number of ABB robots anywhere in the world with industrial-grade reliability. Foxconn is using RobotStudio HyperReality for consumer electronics assembly. Assembly robots are trained virtually using synthetic data to perfect multiple production processes across various scenarios, then moved to production lines with 99% accuracy. This eliminates physical training and tests, reducing setup times and costs. Workr is demonstrating AI-powered robotic systems at NVIDIA GTC 2026. Built on ABB technology, trained with synthetic data using NVIDIA Omniverse, deployed without operators needing programming knowledge. 🚨 I’ll be onsite in San Jose during GTC 2026, and will be showing all the cool stuff that ABB Robotics prepared this year! Can’t wait! 🫡 ~~ ♻️ Join the weekly robotics newsletter, and never miss any news →

Lukas Ziegler

22,482 views • 4 months ago

Yes, the Universe possesses a high degree of self-organization, which under certain conditions can give rise to life and, eventually, to intelligent beings. However, when we consider the history of the Solar System’s formation and the subsequent emergence and evolution of life on Earth, it is difficult to escape a sense of profound improbability in this process. This gives rise to the thought that the Universe may not be unique. We know the age of our Universe, its vast scale, and the immense number of planets it contains. And yet, it is hard to believe that such a complex and finely tuned sequence of events leading to the emergence of human beings occurred for the very first time precisely here. It seems more plausible to assume that such cosmic scenarios did not arise immediately, but became possible through a long process of repetition and selection, before eventually becoming a relatively natural outcome. All of this suggests that the Multiverse may indeed be real, and that within its framework, long before our own Universe, Earth-like planets may have already emerged, complete with the rich conditions necessary for life and intelligence.

The author observes that the Universe exhibits strong self-organization, capable under the right conditions of producing life and eventually intelligent beings. Yet when we examine the detailed history of the Solar System’s formation, Earth’s emergence, and the intricate evolutionary path to humanity, the entire sequence feels profoundly improbable. Given the known age of the Universe, its enormous scale, and the countless planets it contains, it becomes difficult to accept that such a finely tuned cascade of events leading to conscious life occurred only once, right here. The more plausible explanation is that these conditions did not appear immediately or by chance in a single attempt.
Instead, they emerged through countless repetitions and variations across vast cosmic time, gradually becoming a relatively common outcome. This reasoning strongly supports the reality of a Multiverse. In its immense framework, countless universes would have preceded our own, many of them giving rise to Earth-like planets with the precise conditions required for life and intelligence long before ours. What seems miraculous in isolation becomes almost inevitable when viewed across an ensemble of possibilities. The improbability of our existence in a single universe dissolves into the natural probability of many. The Multiverse is not a retreat from explanation. It is the logical extension of the same self-organization we already observe.

Zafar Mirzo | Quotes

2,120,327 views • 5 months ago

🧵06/34 Narrow vs General AI --- At first glance, this AGI being generally capable in multiple domains looks like a group of many narrow AIs combined, but that is not a correct way to think about it. It is actually more like… a species, a new life form. To illustrate the point, we’ll compare the general AGI of the near future with a currently existing narrow AI that is optimised for playing chess. Both of them are able to comfortably win a game of chess against any human on earth, every time. And both of them win by making plans and setting goals. The main goal is to achieve checkmate. This is the final destination, otherwise called the Terminal Goal. In order to get there, though, the AI needs to work on smaller problems, what the AI research geeks call instrumental goals. For example: • attack and capture the opponent’s pieces • defend my pieces • strategically dominate the centre (etc.) All these instrumental goals have something in common: they only make sense in the narrow world of chess. If you place this Narrow Chess AI behind the wheel of a car, it will simply crash, as it cannot work on goals unrelated to chess, like driving. Its model doesn’t have a concept of space, time or movement for that matter. In contrast, the AGI by design has no limit on what problems it can work on. So when it tries to figure out a solution to a main problem, the sub-problems it chooses to work on can be anything... literally any path out of the infinite possibilities allowed within the laws of physics and nature.

Lethal Intelligence

570,437 views • 1 year ago

🧵24/34 Inner Misalignment --- Consider this simplified experiment: We want this AI to find the exit of the maze. So we feed it millions of maze variations and reward it when it finds the exit. Please notice that in the worlds of the training data the apples are red and the exit is green. After enough training, our observation is that it has become extremely capable at solving mazes and finding the exit, we feel very confident it is aligned, so then we deploy it to the real world. The real world will be different though, it might have green apples and a red door. The AI geeks call this distributional shift. We expected that the AI will generalise and find the exit again, but in fact we now realise that the AI learned something completely different from what we thought. All the while we thought it learned how to find the exit, it had learned how to go after the green thing. Its behaviour was perfect in training. And most importantly, this AI is not stupid, it is an extremely capable AI that can solve extremely complex mazes. It’s just mis-aligned on the inside. Fishing for Failure modes --- The way to handle the shift between the training and deployment distributions is with methods like adversarial training: feeding it with a lot of generated variations and trying to make it fail so the weakness can be fixed. In this case, we generate an insane amount of maze variations, we discover those for which it fails to find the exit (like the ones with the green apples or the green walls or something), we generate many more similar to that and train it with reinforcement learning until it performs well at those as well. The hope is that we will cover everything it might encounter later when we deploy it in real life. There exist at least 2 basic ways this approach falls apart: First, there will never be any guarantee that we’ll have covered every possible random thing it might encounter later when we deploy it in real life. 
It’s very likely it will have to deal with stuff outside its training set which it will not know how to handle and will throw it out of balance and break it away from its expected behavioural patterns. The cascade effect of such a broken mind operating in the open world can be immense, and with super-capable runaway rogue agents, self-replicating and recursively self-improving, the phenomenon could grow and spread to an extinction-level event. ...
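
The maze experiment above can be made concrete with a toy sketch. Everything here is hypothetical and only illustrates the argument: `Item`, `proxy_policy`, and `intended_policy` are invented names, and the "learned" behaviour is hard-coded rather than trained. It shows how a policy that chases the green thing looks perfect on training mazes and fails completely once the colours are swapped.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    kind: str    # "exit" or "apple"
    colour: str  # "green" or "red"


def proxy_policy(items):
    """Hypothetical learned behaviour: head for whichever item is green."""
    for item in items:
        if item.colour == "green":
            return item
    return None


def intended_policy(items):
    """What we wanted the agent to learn: head for the exit."""
    for item in items:
        if item.kind == "exit":
            return item
    return None


def success_rate(policy, mazes):
    """Fraction of mazes in which the policy's chosen target is the exit."""
    hits = 0
    for maze in mazes:
        target = policy(maze)
        if target is not None and target.kind == "exit":
            hits += 1
    return hits / len(mazes)


# Training distribution: red apples, green exit.
training_mazes = [[Item("apple", "red"), Item("exit", "green")]] * 1000
# Deployment distribution (assumed for illustration): green apples, red exit.
deployment_mazes = [[Item("apple", "green"), Item("exit", "red")]] * 1000

train_proxy = success_rate(proxy_policy, training_mazes)
deploy_proxy = success_rate(proxy_policy, deployment_mazes)
deploy_intended = success_rate(intended_policy, deployment_mazes)
```

On the training mazes the two policies are indistinguishable, which is why the behaviour alone cannot tell us which goal was learned.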

Lethal Intelligence

535,291 views • 1 year ago

What if the "flaws" in a system are actually the source code of its intelligence? In new work, we argue that invention behaves like a phase transition driven by exactly this dynamic: novelty is a thermodynamic response to constraint failure. When a system can no longer resolve its inputs within its current degrees of freedom, it is forced to expand its representational space - introducing new effective variables to restore feasibility. Thus innovation is not an accident; it is what a viable system does when the old model stops closing. This allowed us to extract the shared mechanics behind diverse phenomena: rote discovery, creativity, and the spark of insight. We show that symmetry breaking is the new optimization. We exhaustively mapped the topological landscape of matter and musical systems and found that the stabilizing vector is selective imperfection: a specific topological regime that rejects both sterile perfection and incoherent randomness. Strikingly, whether in the Hall-Petch strengthening of high-entropy alloys, function-driving geometry of proteins, or the cultural evolution of musical scales, the corridor for maximum coherence and adaptability is defined by a calculated defect. The physics of resilience and the mathematics of beauty appear to be running the same algorithm. This allows us to hack the vibrational stack by treating vibration as a universal isomorphic operator. We are liquefying the boundary between matter, sound, and intelligence, creating an epistemic inversion: listening becomes a form of seeing and creating. We are translating femtosecond molecular vibrations into audible spectra to design de novo proteins by creating direct lines of communication between Bach and deep-time evolution, and using the "glitch" logic of biology to build swarm AI. The distinction between a spider web’s stress tensor and a musical composition is collapsing; both are generative acts of world-building under constraint. 
For AI, the implication is straightforward: interpolation is not invention. True structural invention requires systems that can metabolize constraint failure - treating it as the exact point where new degrees of freedom are born. With this, machines move beyond the old paradigm of simply analyzing the world and start building it. We are operationalizing this via small-world topology. When these new degrees of freedom are born, they don't form a random mess; they snap into global coherence via small-world wiring. We found that this specific connectivity, which balances local motifs with long-range shortcuts, is the architectural prerequisite for genuine world-building. Preprint with the full analysis to follow - stay tuned. On to 2026, excited to see what it brings!
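
As a rough illustration of what "local motifs with long-range shortcuts" means in graph terms, here is a minimal sketch in the spirit of a Watts-Strogatz small-world graph. It is not the construction from the forthcoming preprint, which the post does not describe; the graph size, the hand-picked shortcuts, and all function names are assumptions. It builds a ring where each node only knows its near neighbours, adds two long-range edges, and compares average shortest-path lengths.

```python
from collections import deque


def ring_lattice(n, k):
    """Each node links to its k nearest neighbours on either side of a ring."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for step in range(1, k + 1):
            j = (i + step) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def add_shortcuts(adj, shortcuts):
    """Return a copy of the graph with extra long-range edges added."""
    new_adj = {node: set(nbrs) for node, nbrs in adj.items()}
    for a, b in shortcuts:
        new_adj[a].add(b)
        new_adj[b].add(a)
    return new_adj


def average_path_length(adj):
    """Mean shortest-path length over all ordered node pairs, via BFS."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for nbr in adj[node]:
                if nbr not in dist:
                    dist[nbr] = dist[node] + 1
                    queue.append(nbr)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


# Purely local wiring: 20 nodes, each linked to 2 neighbours on each side.
local_only = ring_lattice(20, 2)
# Same local structure plus two hand-picked long-range shortcuts (assumed).
with_shortcuts = add_shortcuts(local_only, [(0, 10), (5, 15)])

local_length = average_path_length(local_only)
shortcut_length = average_path_length(with_shortcuts)
```

The local neighbourhoods are untouched by the shortcuts, yet the typical distance between any two nodes drops, which is the small-world property the post appeals to.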

Markus J. Buehler

66,016 views • 6 months ago

🚀 We're thrilled to introduce Orthrus 🧬🐕, a groundbreaking mature RNA foundation model designed to push the boundaries of RNA property prediction! 🔬 What is Orthrus? Orthrus is a Mamba-based RNA foundation model, pre-trained using a novel self-supervised contrastive learning objective with biologically inspired augmentations. It optimizes the similarity between splicing isoforms and orthologous transcripts, capturing functional and evolutionary relationships to enhance mature RNA property prediction accuracy. 📑 Preprint: 💻 Code: 🌐 Project Page: 📦 Model Weights: 🧠 Why Orthrus? Decoding the RNA regulatory code is key to understanding biology, but traditional experimental approaches are slow and costly. Existing genomic foundation models rely on techniques like masked language modeling or next-token prediction, which aren't fully aligned with the complexities of genomic data, leading to suboptimal results. 🌟 Orthrus Highlights: - Biologically-Informed Contrastive Learning 🧪: A novel contrastive learning objective designed specifically for genomics, maximizing similarity between splicing isoforms and orthologous transcripts across species. - Extensive Pre-training 📊: Trained on splicing annotations from 10 species and orthologous alignments from 400+ mammalian species (Zoonomia Project), with a focus on sequences of high functional importance. - Superior Representations 🏅: Orthrus outperforms existing genomic models on 5 mRNA property prediction tasks, often surpassing supervised methods with just a simple linear transformation. - Efficiency in Low-Data Settings 📉: Orthrus excels in low-data regimes, achieving state-of-the-art results with as few as 45 labeled examples for fine-tuning on RNA half-life prediction. Shoutout to the amazing lead authors Phil (Phil Fradkin) and Ian (Ian Shi)! This work would also have been impossible without the outstanding collaboration of Karina (Karin(a) Isaev), Brendan (Brendan Frey), Quaid (Quaid Morris), and Leo J. Lee!
Vector Institute, University Health Network, U of T Department of Computer Science, Temerty Centre for AI in Medicine (T-CAIREM), Department of Laboratory Medicine & Pathobiology
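
To make the idea of a contrastive objective more tangible, here is a minimal sketch of a generic InfoNCE-style loss. This is a stand-in, not Orthrus's actual objective or code: the post does not give the loss formula, and the function names, the two-dimensional toy embeddings, and the `temperature` value are all assumptions. The loss is small when an anchor embedding sits close to its positive (for example, a splicing isoform or ortholog of the same transcript) and far from unrelated negatives.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE-style loss: low when anchor and positive are similar
    relative to the negatives. The temperature value is an assumption."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))


# Toy 2-D embeddings, invented purely for illustration.
anchor = [1.0, 0.0]
related = [0.9, 0.1]      # e.g. an isoform of the same transcript
unrelated = [0.0, 1.0]    # an embedding pointing elsewhere
negatives = [[0.0, 1.0], [-1.0, 0.0]]

loss_related = info_nce(anchor, related, negatives)
loss_unrelated = info_nce(anchor, unrelated, negatives)
```

Minimising a loss of this shape over many transcript pairs is what pulls related sequences together in embedding space, which is the property the linear-probe results in the post rely on.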

Bo Wang

114,913 views • 1 year ago