Introducing Dialog 1.0 - Ultra-emotional AI Text-To-Speech model
- Outperforms Elevenlabs on expressiveness and quality 3 to 1
- <1% error rate
- Supports 30+ languages
- Best in class voice cloning
- Low latency: 303ms TTFA (Time to First Audio)
Experience it for yourself on
Read more below ⬇️
196,152 views • 1 year ago •via X (Twitter)
11 Comments

• 2/5 It beats Elevenlabs in human preference testing by 3:1. Here are the results from 100 independent evaluators across 60 samples (we cloned the same voice and used the same prompts)

• 3/5 When users expressed a preference, they chose expressiveness and pacing more than any other factor

• 4/5 And it’s fast, really fast. PlayDialog has lower latency than most other models on the market today, allowing a wide range of use cases like:
📚 Narrations and audiobooks
🎙️ AI-generated podcasts
🎵 AI announcers and DJs
💬 AI customer support agents
🗣️ Voice agents

• 5/5 Read full details on our blog. You can use these voices in our AI Voiceover Studio and through our Text-to-Speech API. What are you waiting for? We’re excited to see what you build!

Our speech-to-text models are the most accurate on the market with top rankings across industry benchmarks.
- The highest accuracy rates: up to 95%
- Up to 30% fewer hallucinations than other leaders
- Low latency: 63 minutes converts in 35 seconds
Try via API for free today 👇

Congrats! Any plan to open source?

👀

remarkably lifelike

Thanks everyone! We’re excited for this. Do share your experiences and feedback! ❤️

Very impressive. Great job

Excited to play around with this!


