Introducing `Agent-1`: a breakthrough foundation model that can operate software like a human. This is the brain powering Personal Assistant. We’re already well above previous state-of-the-art, and we’re improving massively each week. More details:

Matt Shumer

367,810 subscribers

252,381 views • 2 years ago • via X (Twitter)

Science & Technology, Education

11 Comments

Matt Shumer • 2 years ago

First, why are we building this? Current hosted APIs are amazing, but operating software isn’t a task today’s models can handle reliably. Even the next generation of unreleased closed models isn’t up to the task (and trust me, we’ve tried).

Matt Shumer • 2 years ago

And with the complexity that comes with this type of task, costs are through the roof, and speed is an issue. So, we decided to build our own suite of models, with one purpose: to operate software reliably, quickly and cheaply.

Matt Shumer • 2 years ago

This requires a new type of AI model — the goal is to use its parameters ~very effectively~. Current models store lots of knowledge, leaving fewer parameters for reasoning. Instead, we aim to put all of the model's horsepower to work on dynamic reasoning.

Matt Shumer • 2 years ago

This approach enables our model to handle situations it was never trained for. Instead of relying on its knowledge of a particular site, it just figures out how to use it! And our software built on top of the model allows it to learn over time, without wasting model parameters.

Matt Shumer • 2 years ago

This is just the beginning. As the model rapidly improves, so will its reliability on more complex software. Our goal is to surpass human ability — an assistant that can operate any software and reliably accomplish complex goals on a user’s behalf.

Matt Shumer • 2 years ago

We’ll be deploying our newest iterations of Agent-1 into Personal Assistant in the coming weeks and months. We’re excited to see what you can accomplish!

Div Garg • 2 years ago

What's special about this again? @MultiON_AI already surpasses this out of the box with ~1 sec per action speed 🥱 & can self-learn to improve itself automatically. Full post on it tmrw 😎

Yohei • 2 years ago

Oh man, Google admin console isn’t easy to figure out, even for a human… (or is it just me?)

Matt Shumer • 2 years ago

Same here. This is why I chose it for a demo. If Agent-1 can use GCP, the sky is the limit :)

Soham Sarkar • 2 years ago

I use HyperWrite quite a bit. Right now the biggest problem with the assistant is the speed. I understand the technical limitations, having worked on agentic stuff as well, but I'm hoping you guys have figured out how to solve this :)

Matt Shumer • 2 years ago

The speed is my biggest pet peeve... a big incentive for us building these models was to make it faster. Our next goal is 1 second per 'action' (right now, it's 5-10s per). We think we'll be there within a month or two.
