We are releasing 4M-21 with a permissive license, including its source code and trained models. It's a pretty effective multimodal model that solves 10s of tasks & modalities. See the demo code, sample results, and the tokenizers of diverse modalities on the website. IMO, the multitask learning aspect of...
69,241 views • 2 years ago • via X (Twitter)
6 comments

Amir Zamir • 2 years ago
shoutout to @roman__bachmann, @oguzhanthefatih, @dmizrahi_ who led the work, along with @aligarjani, @mingfei_gao, David Griffiths, @hujm99, @afshin_dn, @zamir_ar.

Shikun Liu • 2 years ago
Great work! And thanks for open-sourcing to the community. :)

Isaac Kargar • 2 years ago
No audio input?

Amir Zamir • 2 years ago
It’s a matter of data. Otherwise IMO the method will work as-is.

Lele • 2 years ago
massive

Puneet (LinkedIn Top Voice | AI and Data speaker) • 2 years ago
@zamir_ar You guys are nailing it! #multimodal #framework



