Leonard Tang's banner
Leonard Tang's profile picture

Leonard Tang

@leonardtang_3,654 subscribers

co-founder & ceo @haizelabs

Shorts

First came pre-training scaling; then came inference-time scaling. Now comes judge-time scaling. Despite progress in AI through scaled inference-time compute, AI remains unreliable in open-ended, non-verifiable domains. The key limitation is not generation—it is evaluation. Therefore, the next big leap for AI comes from better judging. In service of this future, today we release Verdict, a library for scaling judge-time compute.

First came pre-training scaling; then came inference-time scaling. Now comes judge-time scaling. Despite progress in AI through scaled inference-time compute, AI remains unreliable in open-ended, non-verifiable domains. The key limitation is not generation—it is evaluation. Therefore, the next big leap for AI comes from better judging. In service of this future, today we release Verdict, a library for scaling judge-time compute.

111,291 次观看

Videos

没有更多内容可加载