正在加载视频...
视频加载失败
When evaluated against the WebVoyager benchmark, which tests agent performance on end-to-end real world web tasks, Project Mariner achieved a state-of-the-art result of 83.5% working as a single agent setup.
0 条评论
暂无评论
原始帖子的评论将显示在这里
正在加载视频...
暂无评论
原始帖子的评论将显示在这里