
Baseten
@baseten • 10,614 subscribers
Inference is everything.

DeepSeek-V3 dropped today and the LLM world just got turned upside down. Again. Early indicators are that this model completely transforms the closed and open-source model landscapes. TL;DR: OSS is now SOTA/top 3 again.
Here are the key details to know:
- Open source and licensed for commercial use
- Beats Llamas, Qwens, GPT-4o, Sonnet 3.5
- MoE w/ 671B params, 37B active per token
- 128K-token context window
- Distilled o3-style reasoning
Deeper dive in 🧵
This is one of the first models that needs the horsepower of H200 GPUs, so we’re getting them ready to go. If you’re interested in running DeepSeek-V3, reach out to us about a dedicated deployment on H200s.
h/t zhyncs for putting us on this early, Dhruv Singal for getting it running on H200s, and Philip Kiely for the demo!
Baseten • 69,598 views • 1 year ago

🚀 Our "technical" marketer might not be looped in, but today is our biggest launch day yet. We're introducing two new products to serve the inference lifecycle: Model APIs and Training. Model APIs are frontier models running on the Baseten Inference Stack, purpose-built for production. Baseten Training (Beta) provides infra and tooling without limitations for AI models destined for production. Huge shoutout to the many partners and customers we've worked with as we built these two new products—more details below.
Baseten • 35,207 views • 1 year ago

🚀 New Generally Available Whisper drop: The fastest, most accurate, and most cost-effective transcription with over 1000x real-time factor for production AI workloads.
🚀 Our new Generally Available Whisper implementation delivers:
🏎️ Over 1000x real-time factor
✨ The lowest word error rate
💪 Production-grade reliability
🧩 Custom scaling and hardware per processing step
👉 See how in our blog.
Reach out to get record-breaking performance for your mission-critical AI workloads!
Baseten • 15,474 views • 1 year ago