Video wird geladen...
Video konnte nicht geladen werden
Experimenting with the magic of open-source! Whisper for text translation, XTTS for audio, and Video-retalker for seamless mouth sync in a short video Its not perfect, but I think we're close with open source #buildinpublic #opensource
80,941 Aufrufe • vor 2 Jahren •via X (Twitter)
10 Kommentare

Luis Cvor 2 Jahren
For reference these were my runs on @replicate: Curious to see what you all come up with!

Michael Aubry — BasedLabs.aivor 2 Jahren
Do you know how d-id or heygen works? Wondering if there is a good oss model

Bilawal Sidhuvor 2 Jahren
this is awesome, but gosh i wish it worked better for beards

Luis Cvor 2 Jahren
Likewise. There seems to be limits when using the video-retalking model

APvor 2 Jahren
Really cool! @artificialguybr

Luis Cvor 2 Jahren
@artificialguybr Thanks! It was cool enough that I just had to share it

⟁ndrew Vvor 2 Jahren
Pretty damn good for a first pass, impressive!

Luis Cvor 2 Jahren
Right? All these models have been open source for a while too, just needed to chain them together

Crypto Industryvor 2 Jahren
The magic of open source is real, and you're the tech wizard making it happen! 🧙♀️

Luis Cvor 2 Jahren
Thanks! Im just testing out ideas tbh
