
Omar Shaikh
@oshaikh13 • 2,097 subscribers
member of sociotechnical staff @Stanford
Videos

LLMs sound homogeneous *because* feedback modalities like rankings, principles, and pairs cater to group-level preferences. Asking an individual to rank ~1K outputs or provide accurate principles takes effort. What if we relied on a few demos to elicit annotator preferences?
Omar Shaikh52,304 次观看 • 2 年前
没有更多内容可加载