Adithya S K's banner
Adithya S K's profile picture

Adithya S K

@adithya_s_k13,474 subscribers

Scaling RL @huggingface 🤗 • Founded @cognitivelab_ai Prev : Research @MSFTResearch • ML @apple • AI Resident @lossfunk • 22

Shorts

Introducing RL Environment Creator Skill Now any one can create RL environments $ npx skills add adithya-s-k/RL_Envs_101 > You can create environments across multiple frameworks like OpenEnv, OpenReward, Verifiers, NemoGym ... > the repo has live working examples of environments that your coding agent can reference > The skill is design to first understand what type of model you are training and create an environment while keeping that in mind ps. There’s a lot more to building RL environments that can be used for training. One major aspect is the data, which this skill can’t directly solve. However, the skill will help with implementing tools, rewards, and other components of an RL environment, making it easier to go from idea to implementation quickly across different frameworks. Let me know if you’d be interested in a detailed, end-to-end blog/tutorial on building an environment and actually training a model for a useful use case.

Introducing RL Environment Creator Skill Now any one can create RL environments $ npx skills add adithya-s-k/RL_Envs_101 > You can create environments across multiple frameworks like OpenEnv, OpenReward, Verifiers, NemoGym ... > the repo has live working examples of environments that your coding agent can reference > The skill is design to first understand what type of model you are training and create an environment while keeping that in mind ps. There’s a lot more to building RL environments that can be used for training. One major aspect is the data, which this skill can’t directly solve. However, the skill will help with implementing tools, rewards, and other components of an RL environment, making it easier to go from idea to implementation quickly across different frameworks. Let me know if you’d be interested in a detailed, end-to-end blog/tutorial on building an environment and actually training a model for a useful use case.

46,438 次观看

throwback to the time where i flew a VTOL on campus during my 2nd Sem on Btech i miss building hardware stuff with all the stuff going on with AI is hard to dedicate time to building cool hardware projects any AI x Hardware weeked project ideas you guys would suggest?

throwback to the time where i flew a VTOL on campus during my 2nd Sem on Btech i miss building hardware stuff with all the stuff going on with AI is hard to dedicate time to building cool hardware projects any AI x Hardware weeked project ideas you guys would suggest?

41,311 次观看

Videos

没有更多内容可加载