I vibe coded a visual PDF search app with ColQwen2. This is how it works: - Store PDF files as images in a Weaviate AI Database vector database - Embed images and text with a multimodal late-interaction model (ColQwen2) - Generate token-wise (and summed) similarity maps to highlight image patches with high similarity Now I need to refactor the messy vibe-coded project. In the meantime, you can check out the Notebook this demo is based on to try it out yourself: