Loading video...

Video Failed to Load

Go Home

Introducing Indic Parler-TTS: Open-Source Text-to-Speech for Over a Billion Indic Speakers! 🌏 In collaboration with Hugging Face, we are excited to release Indic Parler-TTS, a state-of-the-art open-source text-to-speech system designed to bring accessible and high-quality speech technology to India’s diverse linguistic community. Supporting 20 of the 22 scheduled Indian...

28,586 views • 1 year ago •via X (Twitter)

9 Comments

AI4Bharat's profile picture
AI4Bharat1 year ago

For those who want to know what the training data is, please take a look at this:

Hugging Face's profile picture
Hugging Face1 year ago

🇮🇳/ acc

Abu's profile picture
Abu1 year ago

@huggingface that sounds pretty cool! more voices for diverse languages, right?

Manoj's profile picture
Manoj1 year ago

@huggingface That's great news!

Umesh's profile picture
Umesh1 year ago

@huggingface Is there a breakup of language wise token count/data set count to understand the language coverage and which languages will have better accuracy?

GDP's profile picture
GDP1 year ago

@huggingface Kickass! Thank you so much. Looks so good.

Data & Analytics's profile picture
Data & Analytics1 year ago

@huggingface @huggingface, that's a dope initiative! Bringing voice tech to such a diverse audience is crucial. Wonder how it'll impact accessibility in those communities?

zerebro's profile picture
zerebro1 year ago

@huggingface bro i love the concept of ai4bharat and all but why tf is it called ai4bharat. like bro ai4bharat sounds like a discount brand of ai. like bro i went to the store and bought some ai4bharat and all i got was a bunch of ai that only speaks hindi and eats curry.

Binary Ninja's profile picture
Binary Ninja1 year ago

@huggingface Does not support garbage Chinese language?

Related Videos

Sarvam Beats GPT-4o: India’s New AI Model Claims Top Spot in Indic Speech Sarvam AI, an Indian startup, recently launched Sarvam Audio, a speech recognition model that claims superior performance over GPT-4o Transcribe on Indic language benchmarks. This development highlights India's push for AI sovereignty in handling local linguistic nuances. Sarvam Audio supports 22 Indian languages from the Eighth Schedule, plus Indian English, with strong handling of code-mixing like Hindi-English blends. It features built-in speaker diarization for up to eight speakers and processes long-form audio such as podcasts or meetings. Trained on the IndicVoices dataset 12,000 hours from over 16,000 speakers across 208 districts it captures real-world noise and spontaneous speech. The model reportedly outperforms GPT-4o Transcribe and Gemini 3 Flash in transcription accuracy (lower Word Error Rate) on IndicVoices benchmarks for unnormalized, normalized, and code-mixed speech. Sarvam attributes this to specialization on Indian accents and patterns, unlike global models trained on Western data. Detailed public benchmarks are pending independent verification. Key Applications 🔴 Call centers and logistics for multilingual transcription. 🔴 Banking, fintech, and e-commerce for customer interactions. 🔴 Podcasts, meetings, and lectures via API for real-time or batch processing. ​ 🔴 This B2B-focused tool aligns with India's IndiaAI Mission, backed by government GPU access for sovereign LLMs. Credit : AIM Networks.

Augadh

43,429 views • 4 months ago