Kevin Wang's banner

Kevin Wang

@kevin_wang3290 • 1,174 subscribers

Research @ OpenAI (RL/reasoning) | prev CS @Princeton '22–'25, research @princeton_rl + @princeton_nlp, quant intern @citsecurities

Shorts

1/ While most RL methods use shallow MLPs (~2–5 layers), we show that scaling up to 1000-layers for contrastive RL (CRL) can significantly boost performance, ranging from doubling performance to 50x on a diverse suite of robotic tasks. Webpage+Paper+Code:

1/ While most RL methods use shallow MLPs (~2–5 layers), we show that scaling up to 1000-layers for contrastive RL (CRL) can significantly boost performance, ranging from doubling performance to 50x on a diverse suite of robotic tasks. Webpage+Paper+Code:

155,134 görüntüleme