📣 Introducing SWE-PolyBench: A new open-source multilingual benchmark for evaluating #AI coding agents
SWE-PolyBench is the first benchmark to evaluate AI coding agents' ability to understand complex codebases, helping advance AI performance in the real world.
Learn more. 👉
10,866 views • 1 year ago • via X (Twitter)
3 Comments

Anthony Jia Sides • 1 year ago
Lol

BlockseBlock • 1 year ago
SWE-PolyBench is a great step for AI coding agents

solitarycyclist90 • 1 year ago
Why do techies feel they all need to dress up like Steve Jobs when presenting?

