Hrishbh Dalal's banner
Hrishbh Dalal's profile picture

Hrishbh Dalal

@HrishbhDalal1,265 subscribers

AI Researcher and Freelancer in Germany Reach out to me for collaborations on Projects with AI and ML 🤝 Working with AI and LLMs to create an army of Minions

Shorts

What if we could teach an AI to master the strategic game of 2048 through pure reinforcement learning? I did exactly that with "Agent 2048" - fine-tuning Qwen 7B model using GRPO to develop spatial reasoning and merge strategies with zero prior gameplay SFTdata! Thanks to Hugging Face and Unsloth AI for their easy to use implementation kalomaze and will brown you might like this :)

What if we could teach an AI to master the strategic game of 2048 through pure reinforcement learning? I did exactly that with "Agent 2048" - fine-tuning Qwen 7B model using GRPO to develop spatial reasoning and merge strategies with zero prior gameplay SFTdata! Thanks to Hugging Face and Unsloth AI for their easy to use implementation kalomaze and will brown you might like this :)

30,851 просмотров

Videos

Больше нет контента для загрузки