正在加载视频...
视频加载失败
Continuous-video agents (computer use, robotics, static scenes) burn compute re-ingesting pixels that didn't move. VLMaxxing teaches a frozen video VLM to skip the reruns. 54 fps perception on Gemma 4 26B, training-free, no accuracy drift. w/ JF Bastien (arXiv 2605.03351)
0 条评论
暂无评论
原始帖子的评论将显示在这里
