Introducing VL-JEPA: Vision-Language Joint Embedding Predictive Architecture for streaming,...

Pascale Fung

90,033 views • 5 months ago

Our vision is for AI that uses world models...

AI at Meta

309,704 views • 1 year ago

Today we’re releasing V-JEPA, a method for teaching machines...

AI at Meta

703,412 views • 2 years ago

3D-LLM: Injecting the 3D World into Large Language Models...

AK

249,494 views • 2 years ago

Introducing Jan-v2-VL, a multimodal agent built for long-horizon tasks....

👋 Jan

130,228 views • 6 months ago

LFM2-VL support with GGUF and llama.cpp 🥳 You can...

Maxime Labonne

19,947 views • 9 months ago

We release Action100M, the hero behind VL-JEPA. It is...

Delong Chen (陈德龙)

103,384 views • 4 months ago

Here's my conversation with Yann LeCun about...

Lex Fridman

1,021,936 views • 2 years ago

Jan-v2-VL-Max-Instruct is out on 💛 Our newest 30B vision-language...

👋 Jan

23,063 views • 5 months ago

MotionGPT: Human Motion as a Foreign Language paper page:...

AK

125,311 views • 2 years ago

Pretraining is essential for good performance on a wide...

RoboPapers

23,883 views • 3 months ago

Google presents AudioPaLM: A Large Language Model That Can...

AK

290,517 views • 3 years ago

We trained a foundation model on 18 million heart...

Alif Munim (d/acc)

590,179 views • 4 months ago

VLA-JEPA just dropped in LeRobot 🤖 What makes this...

LeRobot

287,409 views • 4 days ago

Start building with Gemini Embedding 2, our most capable...

Google AI Developers

30,483,382 views • 3 months ago

We raised $1.5m to launch the world’s first LLM...

Yoeven

93,403 views • 8 months ago

Today, every Nomic-Embed-Text embedding becomes multimodal. Introducing Nomic-Embed-Vision: -...

CalCo

103,204 views • 2 years ago

Check out our #ICRA2024 paper "Actor-Critic Model Predictive Control."...

Davide Scaramuzza

34,874 views • 2 years ago

Introducing DINOv3: a state-of-the-art computer vision model trained with...

AI at Meta

899,338 views • 10 months ago

Yay, finally! Introducing Vision Banana🍌 from Google DeepMind, our...

Songyou Peng

282,710 views • 1 month ago

📣 Microsoft Research releases Florence-VL, a new family of...

Gradio

14,371 views • 1 year ago