
Yujia Qin
@TsingYoga • 5,618 subscribers
ByteDance Seed, Agent, Previously Tsinghua Univ.
Videos

Introducing UI-TARS-1.5, a vision-language model that beats OpenAI Operator and Claude 3.7 on GUI Agent and Game Agent tasks. We've open-sourced a small-size version model for research purposes, more details can be found in our blog. TARS learns solely from a screen, but generalizes beyond a screen! Blog: Model: App:
Yujia Qin85,137 просмотров • 1 год назад
Больше нет контента для загрузки