Loading video...
Video Failed to Load
[1/n] Do distinct large models admit a simple map that aligns their embedding spaces? We show that across multimodal contrastive models—trained on different data and architectures—an orthogonal map aligns image embeddings. Strikingly, the same map also aligns text embeddings.
36,956 views • 3 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
