Video wird geladen...
Video konnte nicht geladen werden
We are releasing 4M-21 with a permissive license, including its source code and trained models. It's a pretty effective multimodal model that solves 10s of tasks & modalities. See the demo code, sample results, and the tokenizers of diverse modalities on the website. IMO, the multitask learning aspect of... show more
69,241 Aufrufe • vor 2 Jahren •via X (Twitter)
6 Kommentare

Amir Zamirvor 2 Jahren
shoutout to @roman__bachmann, @oguzhanthefatih, @dmizrahi_ who led the work, along with @aligarjani, @mingfei_gao, David Griffiths, @hujm99, @afshin_dn, @zamir_ar.

Shikun Liuvor 2 Jahren
Great work! And thanks for open-sourcing to the community. :)

Isaac Kargarvor 2 Jahren
No audio input?

Amir Zamirvor 2 Jahren
It’s a matter of data. Otherwise IMO the method will work as-is.

Lelevor 2 Jahren
massive

Puneet (Linkedin Top Voice | AI and Data speaker)vor 2 Jahren
@zamir_ar You guys are nailing it! #multimodal #framework



