
Pratyush Kumar
@pratykumar • 31,050 subscribers
Building @SarvamAI
Videos

Drop 14a/14: Over the past few days, we’ve been putting our models, products, and research out there. The positive feedback has helped more people agree with our long held belief that #IndiaCan be a builder in this space. Today, we are rolling out Indus - a chat interface to experience Sarvam 105B. Here is what all you can do:
Pratyush Kumar500,241 просмотров • 3 месяцев назад

Drop 5/14: Introducing Bulbul V3, our latest text-to-speech model. It raises the bar for how human it sounds, while being super robust. In an independent third-party human listening study, Bulbul V3 delivers the highest listener preference, and low error rates across use-cases and languages. See details in our blog, but first watch the video.
Pratyush Kumar421,873 просмотров • 4 месяцев назад

Drop 9/14: Today we are introducing Sarvam Studio, our product to help creators go multilingual. One piece of content, every corner of India. With AI video dubbing, Studio generates high-fidelity dubs in 11 Indian languages. In an expert study, participants preferred Sarvam Studio for overall quality and production readiness. With agentic document translation, Studio excels in contextually translating long-form content across genres. Our evaluations demonstrate that readers strongly preferred the output from Studio across different genres.
Pratyush Kumar255,554 просмотров • 4 месяцев назад

Drop 3/14: Our Conversational Agents platform, Samvaad, now powers over a million minutes of interactions daily. Every month our TAM estimates keep increasing as we discover new use cases, not thought possible earlier - population scale outreaches, hybrid onboarding journeys over phone and WhatsApp, 24*7 sales assistants, ... Here is a real conversation between a merchant and B2B marketplace support agent.
Pratyush Kumar102,507 просмотров • 4 месяцев назад

Drop 11/14: We have been releasing Sarvam’s models and products - and yes, there is more to come. But today, we are excited to share the real and diverse impact our work is driving at scale. First up, preserving our cultural heritage. We have been working with Ekatra Foundation and Navajivan Trust (founded by Mahatma Gandhi) to digitize Gujarati documents from the early 19th and 20th centuries. Through this collaboration, we built out a whole product - Sarvam Akshar. Powered by the Sarvam Vision model, Akshar delivers state-of-the-art accuracy enabling reliable digitization of complex, real-world documents. Akshar sets the stage for our effort to leverage foundational models to preserve India’s cultural heritage - a direction we will double down on. Read more in our blog:
Pratyush Kumar59,945 просмотров • 4 месяцев назад

I go around saying we should be builders across the 'full stack' Sarvam - from compute, to data, to models, to apps. And in come Unnikrishnan Nambiar and Varun and say we should compose our own music for model launches! And so here we go with our first Sarvam sound track. India's most ambitious decade to build is here; come join the build. And yes for your ringtone upgrade, go here:
Pratyush Kumar58,973 просмотров • 4 месяцев назад

Thanks Lex Fridman for the engaging chat with Narendra Modi. As PM Modi mentioned, India is a country with many langs. Making the podcast available in more langs will widen its reach. 🧵 with snippets in 9 langs. Happy to share full versions with you. Built with love by Sarvam
Pratyush Kumar160,557 просмотров • 1 год назад

In this demo, we show real-time voice translation between Indian languages. The user can speak in their preferred language, and the system translates and responds in another, preserving both meaning and natural delivery. Speech recognition, translation, and expressive text-to-speech work together seamlessly.
Pratyush Kumar15,206 просмотров • 4 месяцев назад

Beyond just transcription, Saaras V3 excels in three areas. It can automatically detect the language of the audio. It now comes with word-level time-stamps. And it also supports diarisation for multi-speaker audios. Try out your polyglot skills in the blog.
Pratyush Kumar15,368 просмотров • 4 месяцев назад

On to healthcare now. We partnered with the National Health Authority of India (National Health Authority (NHA)) to strengthen the awareness-to-enrollment loop for the Ayushman Vay Vandana Yojana. Within just 6 days, the campaign reached 1.4 million citizens across 6 states and 5 languages, driving an average 40%+ increase in daily enrollments and accelerating access to healthcare benefits at scale. Dr. Sunil Kumar Barnwal (Sunil Kumar Barnwal) shares his experience on working with us.
Pratyush Kumar13,920 просмотров • 4 месяцев назад

In Tamil Nadu, the vision takes shape through Digital Sangam, a Sovereign AI Research Park. It is a landmark public-private partnership of the State with Sarvam and IIT Madras. At its core will be a 20MW AI-optimized data center, which will power both research and e-governance workloads. Tamil Nadu will leverage its lead in building state data centers and digitisation, to make every aspect of governance AI-ready. This will further enhance the state’s abilities to provide welfare at population-scale, while cementing it as a preferred choice for advanced manufacturing.
Pratyush Kumar12,378 просмотров • 4 месяцев назад
Больше нет контента для загрузки