Loading video...

Video Failed to Load

Go Home

For years it’s been an open question — how much is a language model learning and synthesizing information, and how much is it just memorizing and reciting? Introducing OLMoTrace, a new feature in the Ai2 Playground that begins to shed some light. 🔦

177,422 views • 1 year ago •via X (Twitter)

11 Comments

Ai2's profile picture
Ai21 year ago

OLMoTrace connects phrases or even whole sentences in the language model’s output back to verbatim matches in its training data. It does this by searching billions of documents and trillions of tokens in real time and highlighting where it finds compelling matches.

Ai2's profile picture
Ai21 year ago

OLMoTrace is useful for fact checking✅, understanding hallucinations🎃, tracing reasoning capabilities🧠, or just generally helping you see where an LLMs response may have come from.

Ai2's profile picture
Ai21 year ago

This new feature is made possible by our commitment to fully open models, with everything from model weights, recipes, code, and training data freely available. Openness, transparency, and traceability are key to establishing trust in AI, and we hope this serves as a step in that direction. 💫

Ai2's profile picture
Ai21 year ago

Try OLMoTrace in the Ai2 Playground today:

Ai2's profile picture
Ai21 year ago

Learn more about how OLMoTrace works on our blog:

Yang's profile picture
Yang1 year ago

Want to learn how practical AI skills and automations for your business and work? Check out our 50+ step-by-step video tutorials 100% FREE 20+ hours of Ai and Automation goodness absolutely free 🥳

Qian's profile picture
Qian1 year ago

Excellent work! Will OLMoTrace be open-sourced in the future?

Ai2's profile picture
Ai21 year ago

It already is! Check out the GitHub repo here: 🙌

Raghav Sharma's profile picture
Raghav Sharma1 year ago

Great 👍

Eitan Turok's profile picture
Eitan Turok1 year ago

very cool Since this uses exact string matching with n-grams, then any typos when talking to the chatbot will really ruin your chances of finding a match in the training data...

dentro.de/ai's profile picture
dentro.de/ai1 year ago

Now I'm trying to visualise how that would work for pictures 😅

Related Videos