Video yükleniyor...
Video Yüklenemedi
Today, every Nomic-Embed-Text embedding becomes multimodal. Introducing Nomic-Embed-Vision: - a high quality, unified embedding space for image, text, and multimodal tasks - outperforms both OpenAI CLIP and text-embedding-3-small - open weights and code to enable indie hacking, research, and experimentation - released in collaboration with MongoDB, LlamaIndex 🦙, ,... show more
103,204 görüntüleme • 2 yıl önce •via X (Twitter)
11 Yorum

Existing text-image embedding models, including OpenAI’s CLIP, dramatically underperform specialized text encoders on text retrieval tasks. This forces developers to deploy several embedding models and store several vector indices for multimodal applications. With Nomic-Embed-Vision, developers can use a single vector space to power both their text-text and text-image retrieval tasks.

We’ve been honored by the reception of Nomic-Embed-Text, which has grown into one of the most downloaded models on @huggingface. We designed Nomic-Embed-Vision to be compatible with Nomic-Embed-Text out of the box, making it easy for developers using Nomic-Embed-Text to extend their applications with multimodal features. Put simply, any vector created using Nomic-Embed-Text can be used to query vectors created by Nomic-Embed-Vision, and vice versa.

We are releasing Nomic-Embed-Vision under a CC-BY-NC-4.0 license. This will enable researchers and hackers to continue experimenting with our models, as well as enable Nomic to continue releasing great models in the future. As Nomic releases future models, we intend to apply Apache-2.0 licenses to the less recent models in our catalogue. You can download the model on @huggingface here!

We also worked with @LangChainAI and @llama_index to ensure day 1 compatibility with the model orchestration frameworks developers love:

If you want to use Nomic-Embed-Vision or Nomic-Embed-Text in production, we recommend using our @awscloud marketplace offering, and storing the vectors in a @MongoDB Atlas vector store:

You can also access the model through our python client and in our Nomic Embedding API.

To learn more about how we built Nomic-Embed-Vision, check out our blog post, and keep an eye out for our forthcoming technical report:

Nomic Embed was trained on @digitalocean compute, with early experiments made possible by a generous compute grant from @LambdaAPI.

I got early access and built this using Nomic-Embed-Vision! Check out a museum collection of 250,000 works of art!

@Teknium1 this is really cool, what can I use to visualize/plot the cluster after computing the embeddings?

@Teknium1 You can use

