Video yükleniyor...
Video Yüklenemedi
LLava just hit 3800 stars on Github. It's a multimodal Large Language-and-Vision Assistant that can understand images and text. LLava can even handle memes (the same ones GPT-4 demo'ed at launch) and set a new SOTA on Science QA. It also supports LLaMA-2, LoRA training with academia GPUs, higher... show more
143,527 görüntüleme • 2 yıl önce •via X (Twitter)
11 Yorum

Github: Demo:

By:@imhaotian,@ChunyuanLi,@QingyangWu1,@yong_jae_lee

The rise of these models and the speed of which they are entering the market makes me think we are soon only going to interact with LLM’s

Absolutely, or LLM-assisted websites. The equivalent of intercom on every website.

So good.

Definitely want to play with this soon

Let me know how it goes, about to pip install it

Great find. Looking very promising.

@readwise save thread

@memdotai mem it

@AlphaSignalAI Saved! Here's the compiled thread: 🪄 AI-generated summary: "LLava is a multimodal Large Language-and-Vision Assistant that can understand images and text, and even handle memes. It has achieved a new SOTA on Science QA and supports LoRA...
