Loading video...
Video Failed to Load
still experimenting with LoRA based on the Thinking Machines configuration and just implemented it in colab. In this notebook I set up a fine tune of Qwen/Qwen3-0.6B on the OpenR1-Math dataset with lora rank of 1. with this setup you can get the same reward accuracy as full fine-tuning,... show more
25,624 views • 8 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
