正在加载视频...
视频加载失败
Ever wondered how training dynamics differ between LLMs 🖋️ and Vision 👁️ models? We explore this and close the gap between VMs and LLMs in our #NeurIPS2024 paper "TrAct: Making First-layer Pre-Activations Trainable". Paper📜 Video🎥
20,875 次观看 • 1 年前 •via X (Twitter)
0 条评论
暂无评论
原始帖子的评论将显示在这里
