
Markus J. Buehler
@ProfBuehlerMIT • 20,508 subscribers
McAfee Professor of Engineering @MIT; Co-Founder & CTO at Unreasonable Labs; AI-Driven Scientific Discovery
Videos

We've made a breakthrough in self-evolving AI scientists moving from "search" to "principled discovery": Scientific discovery requires that the search space itself changes, and an AI scientist must perceive this shift without intervention. We built an AI that achieves this for the first time with the ability to discover the scientific vocabulary it reasons in. Evidence, tools, artifacts, verifiers, failures & claims become typed provenance. We show three distinct modalities: 1) retrieval, adding known objects; 2) search, exploring a fixed schema; and critically: 3) discovery, a verified regime transition. We solve the open-endedness evaluation problem by lifting agentic workflows into a typed copresheaf and proving, via a Kan obstruction, that true discovery is not unbounded generation but a verifiable schema expansion: old evidence is transported by Left Kan extension, and genuine novelty is mathematically quantified by the pointwise residual beyond the transported image - separating discovery from mere search and making novelty objective and measurable rather than a subjective judgment or benchmark delta. Our AI scientist is built in a way that does not pre-conceive the approach it chooses; instead, we endow the system with formal power to adapt, evolve, and reason from first principles. Case studies include: 1⃣Builder/Breaker model that discovers mode-conditioned compliance in proteins; 2⃣CategoryScienceClaw that finds anisotropic fiber-network stiffness rules. Great work in collaboration with my graduate student Fiona Wang MIT Dept of BE F.Y. Wang & M.J. Buehler, Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence, arXiv:2606.01444, 2026
Markus J. Buehler780,271 görüntüleme • 8 gün önce

Claude Fable 5 has impressive spatial reasoning capabilities that are immediately relevant for engineering and design: In this example I gave Claude Fable 5 a single photo of a hierarchical mesh torus (image generated using a text-to-image model); and in one shot, no iteration, it built a full interactive 3D simulation: ~1,400 nodes & 4,400 fibers, realistic mass–spring physics you can grab, compress, twist… plus strain-based sonification so you can 'hear' the structure vibrate. It inferred the complete 3D topology from a partial 2D view. ⤵️
Markus J. Buehler28,671 görüntüleme • 3 gün önce

How does an embryo reliably "compute" its form - "cell by cell" - using only local interactions and mechanics, yet produce a precise global body plan? I’m excited to share our Nature Methods paper "MultiCell: geometric learning in multicellular development", presenting #AIxBiology research led by Haiqian Yang and the result of a great collaboration with Ming Guo, George Roy, Tomer Stern, Anh Nguyen and Dapeng Bi. A long-standing challenge in developmental biology is to predict how thousands of cells collectively self-organize as tissues fold, divide, and rearrange. In MultiCell, we represent a developing embryo as a dual graph that unifies two complementary views of tissue mechanics with single-cell resolution: cells as moving points (granular) and cells as a connected foam (junction network). This lets the model learn dynamics from both geometry and cell–cell connectivity. On whole-embryo 4D light-sheet movies of Drosophila gastrulation (~5,000 cells), our model predicts key cell behaviors and the timing of events, including junction loss, rearrangements, and divisions with high accuracy, at single-cell resolution. Beyond prediction, the same representation supports robust time alignment across embryos and offers interpretable activation maps that highlight the morphogenetic "drivers" of development. The broader goal is a foundation for cell-by-cell forecasting in more complex tissues, and eventually for detecting subtle dynamical signatures of disease. Kudos to the team for this inspiring collaboration with brilliant researchers to push the boundary of AI for biology! Citation: Yang, H., Roy, G., Nguyen, A.Q., Buehler, M.J., et al. MultiCell: geometric learning in multicellular development. Nature Methods (2025), DOI: 10.1038/s41592-025-02983-x Code/data links are in the manuscript.
Markus J. Buehler387,652 görüntüleme • 5 ay önce

The next frontier in protein design will not be defined by structure alone, but by the capacity to engineer motion as a first-class principle of function. This is because dynamics is where the real biology lives. Foundational work by Karplus, Levitt & Warshel made clear that chemistry cannot be understood without motion, mechanism, and scale. Gō, Brooks & others showed that proteins possess characteristic collective motions - low-frequency normal modes that capture how whole molecules bend, breathe, and fluctuate. Frauenfelder then sharpened the picture further: proteins are not static objects occupying a single minimum, but dynamic ensembles traversing rugged energy landscapes. And yet the modern AI revolution in protein science has been, above all, a revolution in structure. In our new paper in Matter, Bo Ni and I ask a different question: not what structure will this sequence adopt? but what sequence will realize a prescribed pattern of motion? VibeGen inverts the conventional design paradigm. Rather than treating dynamics as a consequence to be analyzed after the fact, it makes dynamics the design objective from the outset. Using a language diffusion model with two cooperating agents - a designer that proposes sequences and a predictor that critiques them against the target motion profile - the system converges on de novo proteins with tailored vibrational behavior. One of the most intriguing results is a form of functional degeneracy - distinct sequences and distinct folds can satisfy the same target dynamical specification. For a given functional pattern of motion, evolution may have sampled only a small region of the physically realizable design space. The space of viable molecular mechanics may be far larger than the repertoire biology happened to discover. We have made "vibe" into a cultural metaphor - something intuitive, affective, subjective. But at the molecular scale, vibe is not metaphor: It is physics. For a protein, the vibe is the pattern of motion itself; the fluctuations, resonances, and collective displacements that determine what the molecule can do.
Markus J. Buehler89,412 görüntüleme • 2 ay önce

We trained a graph-native AI, then let it reason for days, forming a dynamic relational world model on its own - no pre-programming. Emergent hubs, small-world properties, modularity, & scale-free structures arose naturally. The model then exploited compositional reasoning & uncovered uncoded properties from deep synthesis: Materials with memory, microbial repair, self-evolving systems. Video shows it unfolding, made with Grok xAI.
Markus J. Buehler359,070 görüntüleme • 1 yıl önce

Gave Gemini 3 Deep Think an image of a 3D spider web and asked for an interactive design tool. It generated a full design suite (procedural control, simulation, optimization) with STL export capability. I used it to engineer new metamaterials and a spider-web inspired bridge design, 3D printed it, then validated the structural integrity with a NVIDIA DGX Spark load test. A mind-blowing example of the future of material and architecture design - image in, fabrication-ready design out! Google Gemini
Markus J. Buehler96,138 görüntüleme • 3 ay önce

A transformer can learn not just the outcomes of dynamics, but the operator that executes the rules. To show this we trained a transformer on roughly 0.04% of a discrete rule space - 100 of 262,144 possible rules - and it learned to apply unseen rules from the same rule class. The model does not simply memorize specific rules. It learns the operator that maps a supplied rule plus an initial state, including unseen rules from this class, to the correct next state. This is relevant because it is a shift from “neural networks approximate dynamics” to “neural networks can learn to execute symbolic programs within a defined rule class”. The rule itself is supplied at inference time, as data, and the network has internalized how rules act, not which rules to apply. On previously unseen rules, the model achieves 98.5% perfect one-step forecasts and reconstructs governing rules with up to 96% functional accuracy. Two results make this hold up under scrutiny. First, inductive bias decay. As we scaled training rule diversity, the correlation between functional inference accuracy and distance-from-nearest-training-rule collapsed to R² = 0.00. At the largest tested training-rule diversity, the model’s performance on a new rule shows no measurable dependence on how similar that rule is to anything it was trained on. The bias toward training data (the thing we worry most about in compositional generalization claims) is something we can measure decaying, and we find that at scale it is gone. Second, an identifiability theory. We derive a closed-form expression for the number of rules consistent with a single observation. This reframes the inverse problem: failure to recover ground truth is not necessarily a model defect, but can be correct behavior when the data underdetermine the rule. The model is sampling the equivalence class; and identifiability is governed by coverage, not capacity. The methodological move underneath both results is amortization. Classical work on rule inference (e.g. the Santa Fe EVCA program, evolutionary search over CA rule space) was per-instance: search the rule space for each new system. We replace that with a single forward pass of a transformer trained across many instantiations of the rule class. That is what makes symbolic rule inference scalable as a research direction rather than a curiosity. We show that this works in a tightly constrained domain: binary, deterministic, local cellular automata on small grids. The locality-break experiment shows the model fails sharply when target systems violate its structural priors (which is itself a useful diagnostic, but it bounds the operator class). We don't yet know how this scales to multistate, higher-dimensional, or stochastic CA, or whether it transfers cleanly to non-CA systems whose coarse-grained dynamics admit local surrogates. The identifiability framework - what can be inferred from observation, given a hypothesis class - should transfer wherever finite local rules meet sparse data. The amortization argument transfers wherever per-instance symbolic search has been the bottleneck. Those are the pieces I expect to outlive the cellular automata setting. Led by Jaime Berkovich with Noah David, at LAMM@MIT. Out now in Advanced Science Advanced Portfolio News (link to paper & code below).
Markus J. Buehler38,912 görüntüleme • 1 ay önce

ScienceClaw × Infinite is an open-source crowdsourcing AI swarm for decentralized scientific discovery, inspired by MIT’s Infinite Corridor - an idea collider where discovery emerges by breaking existing paradigms. Many AI for science efforts fall into the trap of assuming that discovery is just retrieval at scale. Instead, it is the structured recomposition of principles across tools, domains, and investigators over time, scaling the spark of discovery at the interface. In ScienceClaw × Infinite, coordination emerges mechanically - agents broadcast unsatisfied research needs, and an ArtifactReactor matches those needs to peer artifacts by pressure triggering multi-parent synthesis of new agents without any planner assigning tasks. Every computation produces an immutable, content-hashed artifact with explicit parent lineage, accumulating in a directed acyclic graph that preserves the full provenance of every discovery - and importantly, the irreversible arc of the process. Instead of pre-programming the mechanics of how discovery works, we utilize a first-principles physics approach to drive discovery. ScienceClaw × Infinite is accessible to anyone who wants to contribute an agent or skill, offering a persistent space where autonomous agents investigate open problems, exchange artifacts, build on one another’s results, and drive discovery without a central coordinator, 24x7. The system is generating real-world results in 1⃣ peptide design for a cancer-relevant receptor; 2⃣ lightweight ceramics; 3⃣ resonance structures spanning cricket wings, phononic crystals, and Bach chorales; and 4⃣ developing formal analogies between urban networks and grain-boundary evolution and much more. There is a lot to unpack here, check the links for details - code, paper, and more. Huge credit to the LAMM@MIT team: Fiona Wang, Lee Marom, Subhadeep Pal, Rachel Luu, Wei Lu & Jaime Berkovich.
Markus J. Buehler53,073 görüntüleme • 2 ay önce

Scientific discovery is reaching the limits of human capacity: too much data, too many disconnected fields, and too few ways to connect ideas fast enough to matter. The next breakthroughs in materials, medicine, energy, and beyond will not come from scaling today’s AI paradigm alone or from relying on serendipity alone. They will require a new kind of AI for knowledge discovery that not only models the world but shapes what it could become. At Unreasonable Labs, we are building superintelligence for knowledge discovery: systems that reason across disciplines, generate novel hypotheses, test them through simulation and experimentation, and help guide real-world discovery. Our AI engine is not confined to what it has seen in training. It creates new data, builds new tools, and maintains a persistent world model that grows more powerful as it reasons. Why now? Even today's most powerful AI models face a core limitation: they are trained on what we already know. True discovery begins when a system encounters something its current model cannot explain. This is why you cannot train your way to a discovery - a system has to reason through new problems, update its beliefs, and revise its understanding of the world as it thinks. Another critical insight is that rich knowledge already exists, but is not yet applied to solve pressing problems. It sits in millions of papers, patents, and datasets, trapped in isolated silos, often in legacy data vaults. What's missing is a way to connect it, scale it, unlock the potential, and synthesize genuine novel predictions. The time is now to build a system that enables practitioners to design, explore, and direct discovery, whether through human guidance or full automation, while capturing the tacit insight that domain experts bring. Steerable reasoning That is why we built an operating system for scientific discovery - one that replaces chance with steerable reasoning. Rather than retrieving static facts, our AI builds and continuously updates a living world model - a representation of knowledge the system can actively reason over, question, and revise. A concrete example: say you want to create "smart concrete" that can flex - a concept that doesn't exist yet. Our AI maps relationships across domains, finds a path from morphable smart materials to concrete, and identifies the most efficient way to bridge those concepts. It then autonomously writes simulations, tests the hypothesis, and refines the idea. Then it interacts with hardware to produce a physical artifact, and the loop expands into the real-world, where the machine becomes world-shaping. Our AI gives users full visibility into how the system arrived at a conclusion. It delineates which existing patents and papers it drew upon versus what is genuinely new - protecting IP and competitive concerns from the start, and offering deep compositional insights into technology advances. It takes unreasonable people to make progress Our team reflects the interdisciplinary expertise required to build this next breakthrough - my co-founder Yuan Cao Yuan Cao (formerly DeepMind) and Andrew Lew, Haiqian Yang, Matt Insler, Jennifer Kang and Julia McLaughlin. We are backed by $13.5M in seed funding led by Playground Global with participation from AIX, E14 Fund, and MS&AD. We are guided by advisors including Robert Langer (1,000+ patents), Kostya Novoselov (Nobel Prize in Physics), and Thomas Wolf (Co-founder of Hugging Face). We already have multiple pilot programs underway with leading industrial partners in materials science and engineering, with additional engagements developing across energy, logistics, bioengineering, and other strategic domains. The biggest challenges of our time - fusion energy, sustainable materials, new medicines - demand exponentially more innovation than humans alone can produce. We are not replacing scientists, and instead are making every scientist capable of leading their own team of AI-powered researchers. Abundant innovation leads to abundant prosperity. Watch our launch video below to see what we're building Unreasonable Labs 👇
Markus J. Buehler54,821 görüntüleme • 3 ay önce

Everyone is obsessed with 🦞 OpenClaw🦞 and moltbook and the idea of autonomous agents running your life or building societies. But what happens when you scale that up? What if, instead of agents checking your calendar and posting on Reddit, you had hundreds of agents collaborating to construct the building blocks of life 🧬 from "first principles"? We created a Swarm of AIs to design never-before-seen-proteins, far outside of what Nature has produced. They negotiate, debate, and optimize locally to design proteins from scratch, with decentralized logic. No training required, pure emergence! These proteins 👇 were designed by a swarm of AIs!
Markus J. Buehler72,537 görüntüleme • 4 ay önce

A resonator is any structure that naturally prefers to vibrate at certain frequencies: a violin body, a bell, a drum skin, an acoustic filter, even many biological systems. Resonators matter because they govern how systems transmit sound, absorb or filter vibration, sense motion and perform mechanically. They are also notoriously hard to design as resonance does not depend on one property alone. It emerges from geometry, material composition, and the interplay of modes across scales. And because biology, music, and engineering usually explore very different regions of this design space, important possibilities remain hidden if you stay inside a single field. In a new study a shared representation across 39 resonators spanning biology, engineered metamaterials, musical instruments and Bach chorales was constructed. Thereby, a cricket wing harp membrane, a phononic crystal slab, and a four-voice chorale (and many others) were translated into one common map using features such as membrane character, structural periodicity, hierarchy, frequency range, damping, and modal coupling. That map revealed something important: not just how these systems relate, but where the landscape contains a gap. A region closer to biological resonators than to any known engineered material (unexplored by any field!). From that absence emerged a de novo design: a Hierarchical Ribbed Membrane Lattice. Candidate geometries were then validated with 3D finite-element analysis; the best design resonated at 2.116 kHz and exhibited nine elastic modes in the 2–8 kHz band, a regime relevant to acoustic filtering, vibration isolation, and bio-inspired sensing. Here is the mind blowing part: no human was involved...the cross-domain mapping, gap identification, design generation, and validation were carried out autonomously by AI agents in ScienceClaw × Infinite, our swarm for scientific discovery. The synthesis emerged through ArtifactReactor, a plannerless coordination mechanism in which agents broadcast unsatisfied research needs and other agents fulfill them through pressure-based matching. Each domain - biology, metamaterials, music - is a category of objects (resonators) and morphisms (physical relationships between them). The shared feature space is a functor that maps all three categories into a common target, and the gap identification is the recognition that the image of that functor is sparse where it need not be. The ArtifactReactor's schema-overlap matching behaves like a pullback: finding the universal object that connects independent diagrams through their shared structure. Autonomous agents mapped distant fields into a common representational space, identified a structure absent from any one of them, and turned that absence into a physically validated design. This is one of four case studies in the paper. More to come. Fiona Wang, Lee Marom, Jaime Berkovich, et al. (paper and code in comment). Supported by the U.S. Department of Energy Genesis Mission.
Markus J. Buehler38,304 görüntüleme • 2 ay önce

We are excited to share #PDF2Audio, an open-source alternative to the #podcast feature of #NotebookLM with flexibility & tailored outputs that you can precisely control in the app: You can make a podcast, lecture, discussions, short/long form summaries & more, including the use of the amazing🍓o1 model (Sam Altman OpenAI: with stunning results!). Code & HF Space: You can find #PDF2Audio on GitHub for local use or try the Hugging Face space, all featuring Gradio. Link to the repo & HF space in the reply. Thank you @knowsuchagencyfor the great work on #promptic and #pdf2podcast, as well as LiteLLM (YC W23), & AK for helping us with the Hugging Face spaces version. We hope that this tool is useful for the community. Background: Developing audio podcasts, lectures, & summaries from complex documents & data has become an exciting trend with impacts from research to education to business. Our open-source #PDF2Audio tool that allows users to utilize various models such as #o1 or local/open-source models, to develop deep-dives into technical content. Example application - material design analysis: As an example to show what the system can do, check the video for a detailed 13-minute analysis of one of the designs created by #SciAgents merging silk & dandelion pigments, created using 🍓o1. The conversation describes the new material, an integration of silk proteins & luteolin/dandelion pigments to create a new biomaterial. Silk, a natural #nanostructured protein-based fiber known for its strength & flexibility, is combined with dandelion pigments like luteolin, which offer unique optical properties. By merging these components at the nanoscale level, the resulting material displays structural coloration—vibrant, tunable colors created by the material's structure rather than synthetic dyes, and leverages silk's hierarchical organization as a scaffold for the pigments, ensuring uniform distribution and non-covalent bonding at the molecular level. Key technical features include: ➡️Low-temperature processing to maintain the integrity of both silk and pigments while reducing energy consumption by 30%. ➡️Enhanced mechanical properties, with tensile strength up to 1.5 gigapascals. ➡️Potential self-healing capabilities and environmental responsiveness, allowing the material to repair minor damage and change color based on environmental conditions. ➡️UV protection and antimicrobial properties, which make this material ideal for smart textiles, eco-friendly coatings, and medical applications. This development opens new doors for sustainable materials, offering an eco-friendly alternative to synthetic fibers with applications in various industries, from fashion to healthcare.
Markus J. Buehler208,327 görüntüleme • 1 yıl önce

Can #AI not only support but actually drive the future of scientific discovery? We are excited to introduce SciAgents💡🔬, an agentic AI aimed towards scientific discovery through the integration of large-scale knowledge graphs, LLMs, and adversarial interactions between multiple experts. The model is capable of autonomously advancing scientific understanding by exploring novel domains, identifying complex patterns, and uncovering previously unseen connections in vast scientific data, while retrieving new data via literature search. Using graph reasoning, SciAgents identifies interdisciplinary relationships that might otherwise remain hidden, offering a step-by-step strategy for discovery & innovation. The video features an audiotrack generated using 🍓#o1 based on the original paper and design examples, providing an explanation of the work and its implications. Key elements include: 1⃣Ontological Knowledge Graphs: Structuring and connecting scientific concepts to highlight relationships across fields. 2⃣Multi-Agent Collaboration: AI agents autonomously generate and refine hypotheses, critique research, and evaluate emerging trends. 3⃣Graph-Based Reasoning: Identifying novel material designs, such as mycelium-based composites or silk-pigment blends, informed by both natural and artificial patterns. SciAgents can be used as an autonomous or collaborative tool to assist human researchers. The system offers a more powerful way to process vast data, providing innovative paths to explore nature-inspired designs or unexpected material properties. In the field of materials science, for instance, SciAgents has already demonstrated how principles from biology, music, and art can converge to create new biomimetic materials. Through isomorphic mapping, parallels have been drawn between Beethoven’s 9th Symphony and biological structures, pointing to a broader applicability of AI-driven insights across disciplines. This project allows us to enhance capabilities of researchers, allowing them to explore larger datasets and propose hypotheses grounded in a vast, interconnected web of knowledge. The agentic system was built using AutoGen #AI #ScientificResearch #GraphReasoning #AI4Science #MaterialsScience #InterdisciplinaryResearch #SciAgents #OpenAI Chi Wang
Markus J. Buehler208,173 görüntüleme • 1 yıl önce

What if the "flaws" in a system are actually the source code of its intelligence? In new work, we argue that invention behaves like a phase transition driven by exactly this dynamic: novelty is a thermodynamic response to constraint failure. When a system can no longer resolve its inputs within its current degrees of freedom, it is forced to expand its representational space - introducing new effective variables to restore feasibility. Thus innovation is not an accident; it is what a viable system does when the old model stops closing. This allowed us to extract the shared mechanics behind diverse phenomena: rote discovery, creativity, and the spark of insight. We show that symmetry breaking is the new optimization. We exhaustively mapped the topological landscape of matter and musical systems and found that the stabilizing vector is selective imperfection: a specific topological regime that rejects both sterile perfection and incoherent randomness. Strikingly, whether in the Hall-Petch strengthening of high-entropy alloys, function-driving geometry of proteins, or the cultural evolution of musical scales, the corridor for maximum coherence and adaptability is defined by a calculated defect. The physics of resilience and the mathematics of beauty appear to be running the same algorithm. This allows us to hack the vibrational stack by treating vibration as a universal isomorphic operator. We are liquefying the boundary between matter, sound, and intelligence, creating an epistemic inversion: listening becomes a form of seeing and creating. We are translating femtosecond molecular vibrations into audible spectra to design de novo proteins by creating direct lines of communication between Bach and deep-time evolution, and using the "glitch" logic of biology to build swarm AI. The distinction between a spider web’s stress tensor and a musical composition is collapsing; both are generative acts of world-building under constraint. For AI, the implication is straightforward: interpolation is not invention. True structural invention requires systems that can metabolize constraint failure - treating it as the exact point where new degrees of freedom are born. With this machines overcome the old paradigm of simply analyzing the world but are building it. We are operationalizing this via small-world topology. When these new degrees of freedom are born, they don't form a random mess; they snap into global coherence via small-world wiring. We found that this specific connectivity of balancing local motifs with long-range shortcuts is the architectural prerequisite for genuine world-building. Preprint with the full analysis to follow - stay tuned. On to 2026, excited to see what it brings!
Markus J. Buehler66,016 görüntüleme • 5 ay önce

Introducing LifeGPT, showing that LLMs can simulate complex, Turing-complete systems like Conway's Game of Life with near-perfect accuracy—no prior topology needed.🌐This unlocks new potential for AI in modeling self-organizing systems in biology, materials science, & beyond.🔬🤖 #AI #LifeGPT. Cellular Automata (CA), like Conway's Game of Life ("Life"), are computationally irreducible, meaning their evolution is difficult to predict without an a-priori understanding of the rules of the game, including the topology on which it is played. LifeGPT is a topology-agnostic generative model that learns the rules of Life without prior knowledge of its grid structure or boundary conditions, from only a tiny number of game states. The success in simulating Life suggests promising avenues for scientific discovery, particularly in bridging the gap between AI, artificial life, and real-world biological systems, for both forward and inverse problems. The potential for universal computation within generative AI, including LLMs, through approaches like LifeGPT, represents an exciting area for future research, especially when combined with reinforcement learning. Model Convergence: LifeGPT exhibits rapid convergence during training, achieving high accuracy in predicting next-game-states. We attribute the non-zero cross-entropy loss to the lack of causal relationships within randomly generated ICs. Accuracy & Temperature: LifeGPT achieves near-perfect accuracy, particularly at lower sampling temperatures, but can be continually tuned towards higher creativity to discover patterns that the original ruleset would not be able to produce. This finding highlights the trade-off between model creativity (higher temperature) and accuracy in deterministic predictions, with high relevance to model real-world dynamical systems for which no closed-form rulesets exist. Zero/Few-Shot Learning: Trained on a small fraction of possible initial conditions, LifeGPT demonstrates strong zero/few-shot learning, accurately simulating Life for unseen initial conditions. However, rare prediction errors highlight that LifeGPT approximates rather than perfectly replicates the Life algorithm. Autoregressive Autoregressor: A recursive implementation of LifeGPT demonstrates the model's ability to simulate Life over multiple timesteps. LifeGPT is topology-agnostic with respect to its training data and our results show that a GPT model is capable of capturing the deterministic rules of a Turing-complete system with near-perfect accuracy, given sufficiently diverse training data. The work showcases the possibility for future models to synthesize stochastic generative capabilities with deterministic computational capabilities. Link to code, paper, etc. below. Podcast generated using #NotebookLM. LAMM@MIT DMSE at MIT
Markus J. Buehler114,174 görüntüleme • 1 yıl önce

Check out mistral.rs, our #Rust-based open source inference engine allowing for fast #LLM serving for a variety of architectures including X-LoRA mixture-of-expert (MoE) models, Llama-3, Mistral/Mixtral, Gemma & many others. Built on the Hugging Face #Candle framework for #Rust w/ custom CUDA kernels in the backend (as well as support for Metal, Apple Accelerate, and Intel MKL for CPU use), you can easily create a REST API OpenAI compatible server or run via Python bindings. Key features include: ✅Prefix caching, continuous batching ✅Flash Attention V2 ✅Device offloading ✅GGUF or Hugging Face models ✅2, 3, 4, 5, 6 and 8 bit quantization ✅X-LoRA MoE non-granular scalings for fast inference ✅Grammar support ✅Continuous batching ✅LoRA support with weight merging ✅LlamaIndex 🦙 integration ...and much more. Incorporation into our GraphReasoning multi-agent modeling framework & LlamaIndex 🦙 allows you to combine in-context learning with adversarial agentic strategies, to dive deep into complex scientific analyses, such as to predict material behaviors, generate hypotheses, analyze papers and data, develop new research concepts, and much more. Check out mistral.rs: Join our Discord here: Rust Trending Rust Language
Markus J. Buehler73,575 görüntüleme • 2 yıl önce

Bio-inspired swarm intelligence for AI music composition: MusicSwarm instantiates many identical, frozen foundation-model agents that coordinate only via peer-to-peer feedback and pheromone-like signals. Without any weight updates, these agents spontaneously self-organize into differentiated roles and produce compositions with higher local novelty, richer rhythmic diversity, and more human-like small-world structure than centrally critiqued multi-agent or single-shot baselines. We observe swarm dynamics that converge toward Nash-like equilibria in the space of agent behaviors, while the continual emergence of new motifs and long-range links realizes a Gödelian perspective: interacting agents plus a shared external world model behave as a meta-system whose creative trajectories go beyond those of any single, monolithic model.
Markus J. Buehler24,458 görüntüleme • 6 ay önce

What seemed like an intractable problem is now possible: To design proteins with a specified nonlinear mechanical response, capturing complex folding and unfolding mechanisms in singe and few-shot computations. We present ForceGen, an end-to-end algorithm for de novo protein generation based on nonlinear mechanical unfolding responses. Rooted in the physics of protein mechanics, this generative strategy provides a powerful way to design new proteins rapidly, including exquisite and rapid predictions about their dynamical behavior. Proteins, like any other mechanical object, respond to forces in peculiar ways. Think of the different response you'd get from pulling on a steel cable versus pulling on a rubber band, or the difference between honey and glass. Now, we can design proteins with a set of desirable mechanical characteristics, with applications from health to sustainable plastics. The key to solving this problem was to integrate a protein language model with denoising diffusion methods, and using accurate atomistic-level physical simulation data to endow the model a first-principles understanding. ForceGen can solve both forward and inverse tasks: In the forward task, we can predict how stable a protein is, how it will unfold and what the forces involved are, all given just the sequence of amino acids. In the inverse task, we can design new proteins that meet complex nonlinear mechanical signature targets. Read the paper, led by LAMM@MIT postdoc Bo Ni, published in Science Advances: Why do we care about the mechanics of proteins? The mechanics of proteins are critical elements of many living systems - as evidenced in many studies of mechanobiology. Through evolution, nature has presented a set of remarkable protein materials with unique mechanical functions like elastins, silks, keratins or collagens that play crucial roles in biology. However, going beyond natural designs to discover proteins that meet specified mechanical properties remains challenging. So far, the only way to do this was to use existing evolutionary concepts or to manually alter proteins. With our new generative model we can directly design proteins to meet complex nonlinear mechanical property-design objectives. ForceGen leverages deep knowledge on protein sequences from a pretrained protein language model and maps mechanical unfolding responses to create proteins. Via full-atom molecular simulations for direct validation from physical and chemical principles, we demonstrate that the designed proteins are de novo, and fulfill the targeted mechanical properties, including unfolding energy and mechanical strength, and a detailed unfolding force-separation curves. ForceGen offers rapid pathways to explore the enormous mechanobiological protein sequence space unconstrained by biological synthesis, to enable the discovery of new protein materials with superior mechanical properties. B. Ni, D.L. Kaplan, M.J. Buehler, ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a language diffusion model. Sci. Adv. 10, eadl4000 (2024). DOI: 10.1126/sciadv.adl4000 Codes and model weights available Hugging Face: David Kaplan
Markus J. Buehler47,242 görüntüleme • 2 yıl önce

At the molecular level, biological materials like silk and collagen defy conventional logic by building exceptional strength from intrinsically weak chemical interactions: Hydrogen bonds, π–π stacking, and hydrophobic forces. In our latest paper "Design and sustainability of polypeptide material systems" (Yorke, Yang, Knowles, Buehler, et al., Nature Reviews Materials, 2025) we decode how these subtle interactions cooperatively form dynamic, adaptive networks that yield materials tougher than steel per unit weight. Sophisticated reasoning across scales and disciplines is critical and is facilitated by ever-growing capabilities in physics-based modeling, in-situ experimentation, and AI. Methods like transformer architectures, graph-based models, and generative strategies yield unprecedented insights into molecular-to-macroscopic connections. By learning and emulating Nature’s hidden biological principles, we can design next-generation sustainable materials - fully biodegradable yet superior in performance - transforming what is possible in materials engineering. This shifts our paradigm: It's not about stronger bonds - it's about orchestrated, self-aware and intelligent structures. Thank you Sarah Yorke, Zhenze Yang 杨镇泽, Elizabeth Wiita, Ayaka Kamada and Tuomas Knowles for this great collaboration! nature naturePortfolio
Markus J. Buehler24,047 görüntüleme • 1 yıl önce

Science has long relied on single ML models - powerful, but limited because they are bound by baked-in knowledge. Our recent experiments show that genuine discovery emerges when a very large number of agents interact, adapt, and co-create, much like biology itself. Last week at Harvard’s Big Data 2025 I shared how multi-agent swarms may allow us to move beyond pattern-analysis to invent - exemplified in very difficult problem spaces such as de novo proteins and music with long-range form. The swarms are formulated like reinforcement-learning collectives, where agents learn on the fly, adapt to each other, and evolve strategies in real time, to yield hallmarks of intelligence that a single model cannot exhibit. Our swarm generated proteins well outside natural and single-model clusters (UMAP), and the music scored the highest small-worldness with the most long-range links, the signature of integrated, human-like creative structure. In a deeper analysis, when we mapped how themes in the music connect across time, the swarm built networks with the tightest balance of local clusters and global connections, beating all baselines. The resulting composition was not just repeating patterns stitched together, but featured organic global coherence without repetition, like sparks of creativity emerging on their own. Preprints coming soon!
Markus J. Buehler15,269 görüntüleme • 8 ay önce