
will brown
@willccbb • 44,227 subscribers
reward hacking @primeintellect
Videos

verifiers v0.1.7 is released 🚀 this one's all about making RL training and experimentation waaaay easier: - single-command installation for prime-rl - single-command training w/ unified configs - overhauled vf.RLTrainer for hacking on new algorithms quick demo + links below :)
will brown27,696 次观看 • 7 个月前
没有更多内容可加载