
Grace Li
@grx_xce • 5,247 subscribers
co-founder @arcada_labs, creators of @designarena, @predictionbench, @socialsarena / prev cs + neuro @harvard / swe @apple / made in the 51st state apparently
Shorts
Videos

You wanted shadcn — so we built the heinous infra to make it work. Introducing Agent Arena, starting with the best: Claude Code by Anthropic Codex 5 by OpenAI Cursor Agent by Cursor Devin by Cognition Gemini CLI by Google DeepMind Grok Code Fast 1 by xAI 1/ Your prompt launches a containerized agent inside a fresh GitHub repo 2/ The agent can install packages, make tool calls, and write multi-file code 3/ The repo is deployed to Vercel *by the agent*, clonable to your heart's desire
Grace Li168,748 次观看 • 8 个月前

Shoutout to xAI team for hustling at 2:00 am to help bring this over the finish line Introducing Agent Runner: the first open-source agent harness run with real users to create a live benchmark of real-world coding We trace tool-calls, reprompting, and multifile edits, starting with the best from OpenAI, xAI, Google DeepMind, Anthropic, Mistral AI, Z.ai, Kimi.ai
Grace Li119,237 次观看 • 6 个月前
没有更多内容可加载