Introducing VL-JEPA: Vision-Language Joint Embedding Predictive Architecture for streaming,...

Pascale Fung
90,033 views • 5 months ago
Our vision is for AI that uses world models...

AI at Meta
309,704 views • 1 year ago
Today we’re releasing V-JEPA, a method for teaching machines...

AI at Meta
703,412 views • 2 years ago
3D-LLM: Injecting the 3D World into Large Language Models...

AK
249,494 views • 2 years ago
Introducing Jan-v2-VL, a multimodal agent built for long-horizon tasks....

👋 Jan
130,228 views • 6 months ago
LFM2-VL support with GGUF and llama.cpp 🥳 You can...

Maxime Labonne
19,947 views • 9 months ago
We release Action100M, the hero behind VL-JEPA. It is...

Delong Chen (陈德龙)
103,384 views • 4 months ago
Here's my conversation with Yann LeCun about...

Lex Fridman
1,021,936 views • 2 years ago
Jan-v2-VL-Max-Instruct is out on 💛 Our newest 30B vision-language...

👋 Jan
23,063 views • 5 months ago
MotionGPT: Human Motion as a Foreign Language paper page:...

AK
125,311 views • 2 years ago
Pretraining is essential for good performance on a wide...

RoboPapers
23,883 views • 3 months ago
Google presents AudioPaLM: A Large Language Model That Can...

AK
290,517 views • 3 years ago
We trained a foundation model on 18 million heart...

Alif Munim (d/acc)
590,179 views • 4 months ago
VLA-JEPA just dropped in LeRobot 🤖 What makes this...

LeRobot
287,409 views • 4 days ago
Start building with Gemini Embedding 2, our most capable...

Google AI Developers
30,483,382 views • 3 months ago
We raised $1.5m to launch the world’s first LLM...

Yoeven
93,403 views • 8 months ago
Today, every Nomic-Embed-Text embedding becomes multimodal. Introducing Nomic-Embed-Vision: -...

CalCo
103,204 views • 2 years ago
Check out our #ICRA2024 paper "Actor-Critic Model Predictive Control."...

Davide Scaramuzza
34,874 views • 2 years ago
Introducing DINOv3: a state-of-the-art computer vision model trained with...

AI at Meta
899,338 views • 10 months ago
Yay, finally! Introducing Vision Banana🍌 from Google DeepMind, our...

Songyou Peng
282,710 views • 1 month ago
📣 Microsoft Research releases Florence-VL, a new family of...

Gradio
14,371 views • 1 year ago