Introducing VL-JEPA: Vision-Language Joint Embedding Predictive Architecture for streaming,...

Pascale Fung

90,033 views • 5 months ago

Our vision is for AI that uses world models...

AI at Meta

309,704 views • 1 year ago

Today we’re releasing V-JEPA, a method for teaching machines...

AI at Meta

703,412 views • 2 years ago

3D-LLM: Injecting the 3D World into Large Language Models...

AK

249,494 views • 2 years ago

Introducing Jan-v2-VL, a multimodal agent built for long-horizon tasks....

👋 Jan

129,906 views • 6 months ago

LFM2-VL support with GGUF and llama.cpp 🥳 You can...

Maxime Labonne

19,947 views • 9 months ago

We release Action100M, the hero behind VL-JEPA. It is...

Delong Chen (陈德龙)

103,384 views • 4 months ago

Here's my conversation with Yann LeCun about...

Lex Fridman

1,021,936 views • 2 years ago

Jan-v2-VL-Max-Instruct is out on 💛 Our newest 30B vision-language...

👋 Jan

23,063 views • 5 months ago

MotionGPT: Human Motion as a Foreign Language paper page:...

AK

125,311 views • 2 years ago

We trained a foundation model on 18 million heart...

Alif Munim (d/acc)

590,156 views • 4 months ago

Pretraining is essential for good performance on a wide...

RoboPapers

23,883 views • 3 months ago

Google presents AudioPaLM: A Large Language Model That Can...

AK

290,517 views • 3 years ago

VLA-JEPA just dropped in LeRobot 🤖 What makes this...

LeRobot

261,554 views • 3 days ago

Start building with Gemini Embedding 2, our most capable...

Google AI Developers

30,483,382 views • 3 months ago

We raised $1.5m to launch the world’s first LLM...

Yoeven

93,403 views • 8 months ago

Today, every Nomic-Embed-Text embedding becomes multimodal. Introducing Nomic-Embed-Vision: -...

CalCo

103,204 views • 2 years ago

Check out our #ICRA2024 paper "Actor-Critic Model Predictive Control."...

Davide Scaramuzza

34,870 views • 2 years ago

Introducing DINOv3: a state-of-the-art computer vision model trained with...

AI at Meta

899,338 views • 9 months ago

Yay, finally! Introducing Vision Banana🍌 from Google DeepMind, our...

Songyou Peng

282,670 views • 1 month ago

📣 Microsoft Research releases Florence-VL, a new family of...

Gradio

14,371 views • 1 year ago