
Transluce
@TransluceAI • 9,290 subscribers
Open and scalable technology for understanding AI systems.
Videos

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇
Transluce • 196,707 views • 1 year ago