
Kevin Lin
@KevinQHLin • 2,361 subscribers
multimodal x agent x next postdoc @UniofOxford visiting @Stanford phd @NUSingapore | ex @Meta @Microsoft
Shorts
Videos

🌟Introducing🎻Violin — an Open-source Video Translation Skill. 📹Video is the dominant medium on the internet, yet most high-quality content (lecture, talk, podcast) is locked behind a single language, leaving global audiences behind. So we built Violin: a video skill that combines speech recognition, LLM translation, and speech synthesis into one seamless pipeline. 🌐 Demo: 📝 Blog: 🔗 GitHub: ✨Key Features: 🎙️High-quality multilingual ASR & Translation & TTS. 🗣️Personalize translation & voice (turn an academic talk into something children can follow). 💬Chat with the video — ask any questions grounded in the video. 🧩Support Web app, CLI, and Agent skill 🍃Fully open-source under MIT. ❤️Built with the wonderful Shang Zhu and advised by James Zou ! All features powered by Together AI . Try it and let us know what you think! 🎻
Kevin Lin135,914 次观看 • 21 天前

Thanks AK for sharing our work!! 🤔Today’s video generation models (e.g., Veo3, SoRA) are great at realism, but they still struggle to convey structured knowledge and logical teaching. 🌟Code2Video🌟takes a different path: starting from Python Manim code, it renders project-level programs into educational videos—bridging coding, visualization, and knowledge! 📷 Code: 🏠 Website: 📄 arXiv: We want to share our gratitude to Grant Sanderson and @manim_community !!! Thanks to the great team Anno Yanzhe Chen and Mike Shou ! #VIDEO #education #Sora2
Kevin Lin29,691 次观看 • 8 个月前
没有更多内容可加载