The new 🤗 transformers release includes a very powerful Multimodel Large Language Model (MLLM) by Microsoft called KOSMOS-2! 🤩 The highlight of KOSMOS-2 is grounding: the model is *incredibly* accurate! 🌎 Play with the demo here 👉 But how does this model work? Let's take a look! 👀🧶

143,816 views • 2 years ago • via X (Twitter)

11 Comments

merve · 2 years ago

Grounding helps machine learning models relate to real-world examples. Adding grounding makes models more accurate and more robust at inference time. It also helps reduce the so-called "hallucinations" in language models.

merve · 2 years ago

In Kosmos-2, the model is grounded to perform the following tasks and is evaluated on them 👇
- multimodal grounding & phrase grounding, e.g. localizing an object through a natural language query
- multimodal referring, e.g. describing object characteristics & location
- perception-language tasks
- language understanding and generation
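
To make "localizing an object" concrete, here is a minimal sketch of how grounded output can be turned into boxes. It assumes details that are not stated in this thread: that the model emits location tokens of the form `<patch_index_NNNN>` in top-left/bottom-right pairs, and that each index is a row-major cell on a 32×32 grid mapped to the cell center. The helper names are hypothetical, and the edge cases the real post-processing handles are left out.

```python
import re

GRID = 32  # assumed number of location bins per image side


def patch_index_to_point(idx, grid=GRID):
    """Return the normalized (x, y) center of row-major grid cell `idx`."""
    row, col = divmod(idx, grid)
    return (col + 0.5) / grid, (row + 0.5) / grid


def decode_boxes(text, grid=GRID):
    """Extract normalized (x1, y1, x2, y2) boxes from adjacent location-token pairs."""
    pairs = re.findall(r"<patch_index_(\d+)><patch_index_(\d+)>", text)
    boxes = []
    for upper_left, lower_right in pairs:
        x1, y1 = patch_index_to_point(int(upper_left), grid)
        x2, y2 = patch_index_to_point(int(lower_right), grid)
        boxes.append((x1, y1, x2, y2))
    return boxes


# Illustrative string in the assumed output format
sample = (
    "<grounding> An image of<phrase> a snowman</phrase>"
    "<object><patch_index_0044><patch_index_0863></object>"
)
print(decode_boxes(sample))
```

Multiplying each normalized coordinate by the image width or height gives pixel coordinates for drawing the box.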

merve · 2 years ago

The dataset used for grounding, called GRiT, is also available on Hugging Face Hub 👉 Thanks to the transformers integration, you can use KOSMOS-2 with a few lines of code 🤩 See below! 👇
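
The snippet attached to the original post did not survive here, so the following is only a rough sketch of what those few lines might look like, not the author's code. The checkpoint name `microsoft/kosmos-2-patch14-224`, the `<grounding>` prompt prefix, and the function name `caption_with_grounding` are assumptions not taken from this thread; running it requires `transformers`, `torch`, and `Pillow`, and it downloads the model weights on first use.

```python
MODEL_ID = "microsoft/kosmos-2-patch14-224"  # assumed checkpoint name
PROMPT = "<grounding>An image of"  # assumed prompt that asks for grounded output


def caption_with_grounding(image_path, max_new_tokens=64):
    """Caption an image and return (caption, entities) with normalized boxes."""
    # Heavy dependencies are imported lazily so this module imports without them.
    from PIL import Image
    from transformers import AutoProcessor, Kosmos2ForConditionalGeneration

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = Kosmos2ForConditionalGeneration.from_pretrained(MODEL_ID)

    image = Image.open(image_path).convert("RGB")
    inputs = processor(text=PROMPT, images=image, return_tensors="pt")

    generated_ids = model.generate(
        pixel_values=inputs["pixel_values"],
        input_ids=inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        image_embeds_position_mask=inputs["image_embeds_position_mask"],
        max_new_tokens=max_new_tokens,
    )
    raw_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]

    # Strips the location tokens and collects the grounded phrases with their boxes.
    caption, entities = processor.post_process_generation(raw_text)
    return caption, entities
```

Calling `caption_with_grounding("photo.jpg")` (a hypothetical local file) would return the cleaned caption together with the grounded phrases and their bounding boxes.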

merve · 2 years ago

also big kudos to @ydshieh for implementing this in transformers ✨

Rainmaker · 2 years ago

Can Machine Learning beat the market? Check out this post on my free Substack where I share code and commentary for an XGBoost model and a Random Forest model that both deliver powerful performances.

merve · 2 years ago

multimodal* 🥲

Luis C · 2 years ago

@Microsoft Wow, it's very fast on an A40

iamrobotbear (bk) · 2 years ago

@Microsoft License?

Vlad · 2 years ago

@ClementDelangue @Microsoft 🤔 I could use this to improve. I'm using the Blip model and it's not bad, but this looks like it could give more accurate results.

SkalskiP · 2 years ago

@Microsoft Yup KOSMOS-2 is awesome!

Risichad 🦾 · 2 years ago

@Microsoft AI can understand a video at 3fps then !!!