正在加载视频...
视频加载失败
Today we previewed Reinforcement Fine-Tuning, a new model customization technique that enables organizations to build expert models for specific, complex tasks in domains such as coding, scientific research, or finance.
9 条评论

We’re expanding alpha access to researchers, universities, and enterprises through our Reinforcement Fine-Tuning Research Program. Spots are limited—apply now.

Day 1 - $200 a month Day 2 - Something not actually available. Why does this expressly not feel like 12 Days of Christmas when that's what it was trying to bill itself as?

Great stuff and exciting to see the use of RFT to tune more powerful custom domain models. TL;DR for who is interested:

2 out of 12 days have been announcing things for organisations...

Science models coming

Going to be a game changer in portfolio backtesting for financial firms.

awesome, also try out chatgpt and much more here:

OpenAI has introduced Reinforcement Fine-Tuning (RFT), a new technique designed to enhance AI model performance in specialized domains like coding, scientific research, and finance.

Releasing a method you use to fine-tune your frontier models is absolutely fantastic—hands down, a great initiative. Thank you! I’m excited about the opportunity to be part of this research program, hopefully.


