Introducing VL-JEPA: Vision-Language Joint Embedding Predictive Architecture for streaming,...

Pascale Fung's profile picture

Pascale Fung

90,033 görüntüleme • 5 ay önce

Our vision is for AI that uses world models...

AI at Meta's profile picture

AI at Meta

309,704 görüntüleme • 1 yıl önce

Today we’re releasing V-JEPA, a method for teaching machines...

AI at Meta's profile picture

AI at Meta

703,412 görüntüleme • 2 yıl önce

3D-LLM: Injecting the 3D World into Large Language Models...

AK's profile picture

AK

249,494 görüntüleme • 2 yıl önce

Introducing Jan-v2-VL, a multimodal agent built for long-horizon tasks....

👋 Jan's profile picture

👋 Jan

130,228 görüntüleme • 6 ay önce

LFM2-VL support with GGUF and llama.cpp 🥳 You can...

Maxime Labonne's profile picture

Maxime Labonne

19,947 görüntüleme • 9 ay önce

We release Action100M, the hero behind VL-JEPA. It is...

Delong Chen (陈德龙)'s profile picture

Delong Chen (陈德龙)

103,384 görüntüleme • 4 ay önce

Here's my conversation with Yann LeCun (Yann LeCun) about...

Lex Fridman's profile picture

Lex Fridman

1,021,936 görüntüleme • 2 yıl önce

Jan-v2-VL-Max-Instruct is out on 💛 Our newest 30B vision-language...

👋 Jan's profile picture

👋 Jan

23,063 görüntüleme • 5 ay önce

MotionGPT: Human Motion as a Foreign Language paper page:...

AK's profile picture

AK

125,311 görüntüleme • 2 yıl önce

Pretraining is essential for good performance on a wide...

RoboPapers's profile picture

RoboPapers

23,883 görüntüleme • 3 ay önce

Google presents AudioPaLM: A Large Language Model That Can...

AK's profile picture

AK

290,517 görüntüleme • 3 yıl önce

We trained a foundation model on 18 million heart...

Alif Munim (d/acc)'s profile picture

Alif Munim (d/acc)

590,179 görüntüleme • 4 ay önce

VLA-JEPA just dropped in LeRobot 🤖 What makes this...

LeRobot's profile picture

LeRobot

280,985 görüntüleme • 4 gün önce

Start building with Gemini Embedding 2, our most capable...

Google AI Developers's profile picture

Google AI Developers

30,483,382 görüntüleme • 3 ay önce

We raised $1.5m to launch the world’s first LLM...

Yoeven's profile picture

Yoeven

93,403 görüntüleme • 8 ay önce

Today, every Nomic-Embed-Text embedding becomes multimodal. Introducing Nomic-Embed-Vision: -...

CalCo's profile picture

CalCo

103,204 görüntüleme • 2 yıl önce

Check out our #ICRA2024 paper "Actor-Critic Model Predictive Control."...

Davide Scaramuzza's profile picture

Davide Scaramuzza

34,874 görüntüleme • 2 yıl önce

Introducing DINOv3: a state-of-the-art computer vision model trained with...

AI at Meta's profile picture

AI at Meta

899,338 görüntüleme • 10 ay önce

Yay, finally! Introducing Vision Banana🍌 from Google DeepMind, our...

Songyou Peng's profile picture

Songyou Peng

282,710 görüntüleme • 1 ay önce

📣 Microsoft Research releases Florence-VL, a new family of...

Gradio's profile picture

Gradio

14,371 görüntüleme • 1 yıl önce