OpenBMB

@OpenBMB • 8,298 subscribers

OpenBMB (Open Lab for Big Model Base) aims to build foundation models and systems towards AGI. Connect with us: https://t.co/N9pevTnoOa

Shorts

🎙️ MiniCPM-o 4.5: Full-duplex interaction in motion. Watch the 9B model track and identify fruit price tags in a dynamic live stream. Unlike traditional reactive systems, MiniCPM-o 4.5 processes continuous video and audio inputs to see, listen, and respond simultaneously, without mutual blocking.

🚀 This end-to-end architecture enables low-latency, proactive feedback even while the device is moving, bridging the gap between static vision-language tasks and real-world live interaction.

Try the demo and share your feedback with us! Hugging Face 👉

#MiniCPMo45 #MLLM #EdgeAI #OpenSource

25,215 views

Videos

🚀 Excited to announce the technical report of MiniCPM-o 4.5! MiniCPM-o 4.5 transitions #AI interaction from traditional turn-based processing to a real-time, native full-duplex stream-based paradigm.

🌊 The Omni-Flow Framework
Instead of traditional VAD-based workarounds, we introduce the Omni-Flow framework. This unified stream paradigm aligns video, audio, and text on a synchronized millisecond timeline.
• Native Full-Duplex: Simultaneous perception and response.
• Proactive Interaction: Natively manages turn-taking without external VAD and supports proactive reminding.

📉 9B Scale, SOTA Performance
MiniCPM-o 4.5 demonstrates SOTA multimodal intelligence at its scale:
• Multimodal Benchmarks: Comparable to #Gemini 2.5 Flash on MMBench EN (87.6) and MathVista (80.1).
• Streaming Evaluation: 54.4% win rate on LiveSports-3K-CC, surpassing specialized models.

💻 The Ultimate Edge AI: Fully Functional without a Network Connection
We are providing one-click installers for Windows (12 GB VRAM, RTX 5070) and macOS (M1-M5 Max / M5 Pro).
• Local API Support: Deploy your own inference server to integrate native full-duplex into custom apps.
• Free Access: We are offering free community API services for exploration.
• 100% Private: Your data never leaves your machine.
Deploy in under 10 minutes. 🛠️👇

👐 Join the Open Future
The weights are open. The protocol is public.
📄 Technical Report:
💻 GitHub:
🤗 HuggingFace:
🌐 Web Demo:

#MiniCPMo #OpenSourceAI #EdgeAI #MachineLearning #ComputerVision #LLM

OpenBMB

146,678 views • 1 month ago
