正在加载视频...
视频加载失败
🚨 BREAKING: A research lab just released a 15B model that generates multilingual talking human videos with synced audio, beats every competitor in human evaluation, and runs in 38 seconds on one GPU. It's called daVinci-MagiHuman. The key insight is that every other model in this category stacks cross-attention,... show more
0 条评论
暂无评论
原始帖子的评论将显示在这里

