Loading video...
Video Failed to Load
Ever wondered how training dynamics differ between LLMs 🖋️ and Vision 👁️ models? We explore this and close the gap between VMs and LLMs in our #NeurIPS2024 paper "TrAct: Making First-layer Pre-Activations Trainable". Paper📜 Video🎥
20,875 views • 1 year ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
