📣 Introducing SWE-PolyBench: A new open-source multilingual benchmark for evaluating #AI coding agents. SWE-PolyBench is the first benchmark to evaluate AI coding agents' ability to understand complex codebases, helping advance AI performance in the real world. Learn more. 👉
10,866 views • 1 year ago • via X (Twitter)
Comments: 3

Anthony Jia Sides • 1 year ago
Lol

BlockseBlock • 1 year ago
SWE-PolyBench is a great step for AI coding agents

solitarycyclist90 • 1 year ago
Why do techies feel they all need to dress up like Steve Jobs when presenting?


