正在加载视频...
视频加载失败
🚨How do LLMs acquire human values?🤔 We often point to preference optimization. However, in our new work, we trace how and when model values shift during post-training and uncover surprising dynamics. We ask: How do data, algorithms, and their interaction shape model values?🧵
39,953 次观看 • 7 个月前 •via X (Twitter)
0 条评论
暂无评论
原始帖子的评论将显示在这里


