Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Introducing CycleQD: A population-based model merging via Quality Diversity CycleQD builds on our model merging research, advancing two fronts: evolving a swarm of specialized agents to complement one another, and laying the groundwork for life-long learning by enabling diverse, adaptable skill acquisition at the population-level.

121,567 Aufrufe • vor 1 Jahr •via X (Twitter)

8 Kommentare

Profilbild von Sakana AI
Sakana AIvor 1 Jahr

Please check out our paper, Agent Skill Acquisition for Large Language Models via CycleQD This work aims to mimic an ecological niche during the training of a swarm of LLM agents. Just like how each species in an environment finds its own role and position, or niche, a well-evolved AI agent doesn’t have to be great at everything, but it can effectively occupy a specific niche, making it resilient to competition from other agents in the swarm. This kind of approach can enable a population of AI agents to emerge, each with specific capabilities that complement each other, collectively improving over time. The core idea in CycleQD is to create an artificial evolutionary process in which Model Merging is used as a cross-over operation, SVD as a mutation operation, and Quality Diversity as the selection operation, encouraging each agent in the population to develop its own unique capabilities which adds value to the collective. In the paper, we show that CycleQD is able to evolve a swarm of LLM agents, each with their own niche, to tackle difficult agentic workflow tasks. We believe that the future of AI lies in life-long learning where collective systems continuously grow, adapt, and accumulate knowledge over time. CycleQD is a first step, enabling diverse skill learning as a foundation for continual learning.

Profilbild von TuringPost
TuringPostvor 1 Jahr

This is very interesting! Can we say, that you use a swarm intelligence concept here?

Profilbild von Brandon
Brandonvor 1 Jahr

Seems pretty reasonable to me

Profilbild von baraa tulip
baraa tulipvor 1 Jahr

@ceobillionaire 💥💥💢💢💥💥💢💥 please please Help Btc : bc1qv0xceh6h4eaawhnqq95nty85r02vpgjzrjyrjg Eth : 0xaeac98A1a3a3f260Ce969fB57C4ab0595f51f113

Profilbild von justboulatbek
justboulatbekvor 1 Jahr

Are these things available for self deploy to try? Or as a service?

Profilbild von AI Carlos
AI Carlosvor 1 Jahr

I'm intrigued by CycleQD's potential for life-long learning. Can it adapt to new tasks without requiring extensive retraining?

Profilbild von Belkhir Nacim
Belkhir Nacimvor 1 Jahr

any idea to investigate differential evolution or an ES strategy in lieu of a swarm approach?

Profilbild von Data & Analytics
Data & Analyticsvor 1 Jahr

@hardmaru @hardmaru, cycleQD sounds like an intriguing concept! Merging models with Quality Diversity could shake things up in AI research. What aspects of it grab your attention?

Ähnliche Videos

Small Language Models (SML) are the future of AI. "Small" (SML) instead of "Large" (LLM). These small models are highly specialized models with superhuman abilities on specific tasks. Here are two techniques to build these models: • Spectrum • Model Merging I give you a short introduction in the attached video, but here is a quick summary: Spectrum helps us identify the most relevant layers to solve one specific task. We can ignore everything else and focus on fine-tuning these layers. Using Spectrum, we can fine-tune models in a heartbeat. Model Merging combines multiple models into a unique, much better model than any of the individual input models. You can also combine models specialized in different tasks and get a model with multiple abilities. This is the state of the art of productizing models. It's what Arcee.ai's platform does behind the scenes. Arcee collaborated with me on this post and is sponsoring it. There are three main steps to produce a model for your particular use case: 1. You create a dataset by uploading your data. 2. You train a model. At this step, Arcee uses Spectrum and Model Merging to produce a highly specialized model for your task. 3. You can deploy that model to any environment you want. Three important notes: • Training process is 2x faster and 2x cheaper than regular fine-tuning. • Resultant models are smaller and have higher accuracy. • They create these specialized models from open-source models. Check this site so you can fully appreciate how this works: If you want to fine-tune an open-source model, consider Arcee's platform. This is the state of the art.

Santiago

164,162 Aufrufe • vor 1 Jahr

Today, we're joined by Sergey Levine, associate professor at UC Berkeley EECS and co-founder of Physical Intelligence to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the model training "recipe," emphasizing the roles of pre-training and post-training with a diverse mixture of real-world data to ensure robust and intelligent robot learning. We review the data collection approach, which uses human operators and teleoperation rigs, the potential of synthetic data and reinforcement learning in enhancing robotic capabilities, and much more. We also introduce the team’s new FAST tokenizer, which opens the door to a fully Transformer-based model and significant improvements in learning and generalization. Finally, we cover the open-sourcing of π0 and future directions for their research. 🎧 / 🎥 Listen or watch the full episode on our page: 📖 CHAPTERS =============================== 00:00 - Introduction 2:14 - Physical Intelligence 3:47 - Key challenges in robotic learning 6:13 - Reinforcement learning in π0 and robotic foundation models 8:36 - π0 VLM model architecture 15:33 - π0 model recipe 18:39 - Pre-training dataset 22:47 - Post-training 24:23 - Laundry folding demo 31:32 - Scaling laws on π0 model 34:57 - FAST 40:26 - Open sourcing π0 43:37 - Other robot types 46:27 - Future directions

The TWIML AI Podcast

19,942 Aufrufe • vor 1 Jahr