
LiveKit
@livekit • 9,725 subscribers
Open source framework and cloud platform for building voice, video, and physical AI agents. https://t.co/OWLvFH82oN
Videos

Introducing Agents UI, an open-source shadcn component library for building polished React frontends for your voice agents. Audio visualizers. Media controls. Session management tools. Chat transcripts. All wired to LiveKit Agents. Install via the shadcn CLI and own the code.
LiveKit182,387 просмотров • 3 месяцев назад

How can a voice agent tell when you’re actually interrupting it? VAD is too sensitive—laughs, “mm-hmm,” or a sneeze shouldn’t stop the agent. We trained an audio model for adaptive interruption handling so agents can distinguish real interruptions from noise.
LiveKit43,832 просмотров • 2 месяцев назад

Gemini 3.1 Flash Live just dropped and it's available with LiveKit today. This is the first Gemini 3 native audio model on the Live API. Better instruction following, improved tool calling, reduced speaker drift, and support for 70+ languages. Audio in, audio out. No text conversion in between.
LiveKit40,218 просмотров • 2 месяцев назад

Voice agents do not sound robotic because they are slow. They sound robotic because the model writes like an essay and then reads it out loud. We just shared a post on making STT to LLM to TTS sound human. Make the model more human by including ums, sos, real pauses, and even laughter tags. Tiny rhythm changes can make a huge difference.
LiveKit45,870 просмотров • 3 месяцев назад

Today we’re launching our first homegrown AI model: an open source turn detection model for building voice agents. Instead of relying solely on voice activity detection (VAD), which only considers when a user is speaking, our model also considers what has and is being said in the context of a conversation and predicts when a user is finished expressing their thoughts before the agent responds. Conversations with AI voice agents using this new model flow much more naturally without constant interruptions from the AI— check it out (more videos, details, and code in the thread):
LiveKit126,804 просмотров • 1 год назад

Voice cloning is now available on LiveKit Inference. We’re launching with Inworld AI and Cartesia. Clone a voice once and use it across multiple TTS providers, with automatic fallback to the same voice if a provider fails mid-call. Free to create and available on all paid plans today.
LiveKit10,778 просмотров • 1 месяц назад

Introducing LiveKit Inference — a new cloud service that gives you access to the most popular voice AI models with just your LiveKit API key. We manage rate limits for you, report on usage, and consolidate billing. All LiveKit Cloud plans now include free monthly inference credits. A single string update allows you to call models from: AssemblyAI Deepgram Google DeepMind Inworld AI OpenAI Rime
LiveKit37,072 просмотров • 8 месяцев назад
Больше нет контента для загрузки