Loading video...
Video Failed to Load
Today we're announcing cua-bench: a framework for benchmarking, training data, and RL environments for computer-use AI agents. Why? Current agents show 10x variance across minor UI changes. Here's how we're fixing it.
189,503 views • 6 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
