
Transluce
@TransluceAI • 9,290 subscribers
Open and scalable technology for understanding AI systems.
Videos

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇
Transluce196,707 Aufrufe • vor 1 Jahr
Keine weiteren Inhalte verfügbar