AI at Meta

@AIatMeta • 824,596 subscribers

Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.

Shorts

825,380 views

259,478 views

1,093,288 views

3,570,504 views

1,224,266 views

1,004,918 views

677,777 views

442,508 views

195,586 views

333,395 views

260,403 views

54,084 views

169,023 views

164,519 views

102,008 views

124,955 views

96,119 views

50,762 views

20,814 views

Videos

sweetdream.ai

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Private Show

Join now for exclusive access

Free preview available • Premium content

6,939,033 views • 3 months ago

70,376 views • 9 days ago

2,263,195 views • 11 months ago

1,250,414 views • 7 months ago

1,088,498 views • 8 months ago

858,880 views • 8 months ago

2,264,759 views • 1 year ago

900,258 views • 11 months ago

1,617,603 views • 1 year ago

544,372 views • 8 months ago

899,553 views • 1 year ago

1,268,811 views • 2 years ago

1,234,668 views • 3 years ago

313,765 views • 9 months ago

531,586 views • 1 year ago

798,246 views • 2 years ago

703,801 views • 2 years ago

728,765 views • 2 years ago

309,942 views • 1 year ago

453,249 views • 1 year ago

Live Cam

AI at Meta

Shorts

Today we're releasing the Segment Anything Model (SAM) — a step toward the first foundation model for image segmentation. SAM is capable of one-click segmentation of any object from any photo or video + zero-shot transfer to other segmentation tasks ➡️

Announced by Mark Zuckerberg this morning — today we're releasing DINOv2, the first method for training computer vision models that uses self-supervised learning to achieve results matching or exceeding industry standards. More on this new work ➡️

Today we’re releasing Code Llama, a large language model built on top of Llama 2, fine-tuned for coding &amp; state-of-the-art for publicly available coding tools. Keeping with our open approach, Code Llama is publicly-available now for both research &amp; commercial use. More ⬇️

Today we're sharing details on AudioCraft, a new family of generative AI models built for generating high-quality, realistic audio &amp; music from text. AudioCraft is a single code base that works for music, sound, compression &amp; generation — all in the same place. More details ⬇️

Today we're releasing the Open Catalyst Demo to the public — this new service will allow researchers to accelerate work in material sciences by enabling them to simulate the reactivity of catalyst materials ~1000x faster than existing computational methods using AI. Demo ⬇️

Introducing ImageBind by Meta AI: the first AI model capable of binding data from six modalities at once. This breakthrough brings machines one step closer to the human ability to bind together information from many different senses. More on this new open source work ⬇️

Together with the Ego4D consortium, today we're releasing Ego-Exo4D, the largest ever public dataset of its kind to support research on video learning &amp; multimodal perception — including 1,400+ hours of videos of skilled human activities. Download ➡️

Our Segment Anything Models are helping advance flood monitoring and disaster response. See how USRA and USGS have fine-tuned SAM to automate a key bottleneck in real-time river mapping, enabling faster, scalable, and more cost-effective disaster preparedness:

SeamlessExpressive, a new AI translation model by research teams at Meta, enables high-quality speech translation that maintains the speaker's vocal style, tone and unique expressions in translated outputs. Try the demo with your own voice ➡️

🤖 New robotics research from Meta AI &amp; CMU Robotics Institute — RoboAgent can acquire a wide diversity of non-trivial skills + generalize them to hundreds of unseen scenarios — all w/ an order of magnitude less data than prior works in this space. More details ➡️

We’re continuing to see exciting results as we work with our Meta Movie Gen models, here are some more examples of what they can do 🧵

Today, we're sharing two major advancements in our work toward general-purpose embodied AI agents: VC-1 &amp; ASC. We're excited for how this work will help build toward a future where AI agents can assist humans in both the virtual &amp; physical world. Details ⬇️

DINOv2 by Meta AI is the first method for training computer vision models that uses self-supervised learning to achieve results matching or exceeding industry standards. More details and a demo ⬇️

The Meta Llama 3 Hackathon is this weekend in SF with @Cerebral_Valley! Get on the list ➡️ What to expect • Two days of building alongside the best hackers in AI • Hands on support from the Llama team • Talks from some of the top names in the industry

You can find more on Meta Sparsh, and more of our recently announced robotics AI research in this post ➡️

Videos

Watch Anya Live

Today Mark shared Meta’s vision for the future of personal superintelligence for everyone. Read his full letter here:

Introducing Meta Segment Anything Model 2 (SAM 2) — the first unified model for real-time, promptable object segmentation in images &amp; videos. SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experiences Details ➡️

We believe an open approach is the right one for the development of today's Al models. Today, we’re releasing Llama 2, the next generation of Meta’s open source Large Language Model, available for free for research &amp; commercial use. Details ➡️

New research from Meta FAIR: Large Concept Models (LCM) is a fundamentally different paradigm for language modeling that decouples reasoning from language representation, inspired by how humans can plan high-level thoughts to communicate.

Today we’re sharing two new advances in our generative AI research: Emu Video &amp; Emu Edit. Details ➡️ These new models deliver exciting results in high quality, diffusion-based text-to-video generation &amp; controlled image editing w/ text instructions. 🧵

Today we're sharing the next milestone in our Seamless Communication research — a new family of AI translation models that preserve expression and deliver near-real time streaming translations. More on this new work ➡️ More on the individual models 🧵

Today we’re releasing Code Llama, a large language model built on top of Llama 2, fine-tuned for coding & state-of-the-art for publicly available coding tools. Keeping with our open approach, Code Llama is publicly-available now for both research & commercial use. More ⬇️

Today we're sharing details on AudioCraft, a new family of generative AI models built for generating high-quality, realistic audio & music from text. AudioCraft is a single code base that works for music, sound, compression & generation — all in the same place. More details ⬇️

Together with the Ego4D consortium, today we're releasing Ego-Exo4D, the largest ever public dataset of its kind to support research on video learning & multimodal perception — including 1,400+ hours of videos of skilled human activities. Download ➡️

🤖 New robotics research from Meta AI & CMU Robotics Institute — RoboAgent can acquire a wide diversity of non-trivial skills + generalize them to hundreds of unseen scenarios — all w/ an order of magnitude less data than prior works in this space. More details ➡️

Today, we're sharing two major advancements in our work toward general-purpose embodied AI agents: VC-1 & ASC. We're excited for how this work will help build toward a future where AI agents can assist humans in both the virtual & physical world. Details ⬇️

Introducing Meta Segment Anything Model 2 (SAM 2) — the first unified model for real-time, promptable object segmentation in images & videos. SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experiences Details ➡️

We believe an open approach is the right one for the development of today's Al models. Today, we’re releasing Llama 2, the next generation of Meta’s open source Large Language Model, available for free for research & commercial use. Details ➡️

Today we’re sharing two new advances in our generative AI research: Emu Video & Emu Edit. Details ➡️ These new models deliver exciting results in high quality, diffusion-based text-to-video generation & controlled image editing w/ text instructions. 🧵