Poseidon's banner

Poseidon

@psdnai • 25,668 subscribers

Real-world data for physical AI

Shorts

AI was trained on the open internet, but the data that matters most lives in the real world. Introducing early access to Numo, an app built to collect the next generation of AI training data. Starting with voice data collection in Bengali, Hindi, Tamil, and Telugu. Details ↴

AI was trained on the open internet, but the data that matters most lives in the real world. Introducing early access to Numo, an app built to collect the next generation of AI training data. Starting with voice data collection in Bengali, Hindi, Tamil, and Telugu. Details ↴

1,240,896 görüntüleme

Voice AI has an evaluation problem. Models look strong on public benchmarks, then collapse on real-world audio. Introducing a recipe-driven evaluation framework for low-resource languages, real-world audio, and production failure modes. Details ↓

Voice AI has an evaluation problem. Models look strong on public benchmarks, then collapse on real-world audio. Introducing a recipe-driven evaluation framework for low-resource languages, real-world audio, and production failure modes. Details ↓

34,448 görüntüleme

Bengali is spoken by ~280M people. Yet when we tested frontier models on Bengali transcript review, they rarely agreed on what was incorrect. The results point to a broader challenge for AI in low-resource languages ↓

Bengali is spoken by ~280M people. Yet when we tested frontier models on Bengali transcript review, they rarely agreed on what was incorrect. The results point to a broader challenge for AI in low-resource languages ↓

16,919 görüntüleme

Most teams collecting voice data optimize for volume over quality, partly because they’re measuring quality wrong. To help evaluate quality we created the Poseidon Score. When applied, single-speaker audio scored well while multi-speaker conversations scored worse. Why? ↓

Most teams collecting voice data optimize for volume over quality, partly because they’re measuring quality wrong. To help evaluate quality we created the Poseidon Score. When applied, single-speaker audio scored well while multi-speaker conversations scored worse. Why? ↓

28,515 görüntüleme

Poseidon needs voice data and reliable ground truth in low-resource languages to benchmark against. To ensure LLM transcript accuracy, we worked with linguists to audit Bengali outputs. For a language spoken by 280M people, the gaps we found point to a deeper issue: data ↓

Poseidon needs voice data and reliable ground truth in low-resource languages to benchmark against. To ensure LLM transcript accuracy, we worked with linguists to audit Bengali outputs. For a language spoken by 280M people, the gaps we found point to a deeper issue: data ↓

22,705 görüntüleme

Conversational voice data exposes a subtle failure mode in many ASR pipelines. The metric says the data is bad, but human reviewers say the audio is clear. Often the issue isn’t transcription quality itself, but that the evaluation stack wasn’t built for turn-taking, silence, and multi-speaker structure. Learn how to solve this in our latest blog:

Conversational voice data exposes a subtle failure mode in many ASR pipelines. The metric says the data is bad, but human reviewers say the audio is clear. Often the issue isn’t transcription quality itself, but that the evaluation stack wasn’t built for turn-taking, silence, and multi-speaker structure. Learn how to solve this in our latest blog:

11,202 görüntüleme

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Real-world AI can’t be trained on scraped data. It needs real voices. Live now: 🔱 Users may earn Poseidon points, and from time to time, partner rewards for audio that helps train AI on accents, noise, and conversations.

Real-world AI can’t be trained on scraped data. It needs real voices. Live now: 🔱 Users may earn Poseidon points, and from time to time, partner rewards for audio that helps train AI on accents, noise, and conversations.

444,914 görüntüleme • 10 ay önce

AI is moving beyond the browser and into the real world. The bottleneck? Data. Today we’re announcing a $15M seed round led by a16z crypto to build infra that collects, curates, and licenses high-quality data for physical AI. Incubated by and built on @StoryProtocol.

AI is moving beyond the browser and into the real world. The bottleneck? Data. Today we’re announcing a $15M seed round led by a16z crypto to build infra that collects, curates, and licenses high-quality data for physical AI. Incubated by and built on @StoryProtocol.

292,009 görüntüleme • 1 yıl önce

Coming soon: Poseidon App 🔱 AI needs better audio data – this starts with you. Share voice samples to help AI understand accents, noise, and real conversations. Users may earn Poseidon points and, from time to time, additional partner rewards. Stay tuned.

Coming soon: Poseidon App 🔱 AI needs better audio data – this starts with you. Share voice samples to help AI understand accents, noise, and real conversations. Users may earn Poseidon points and, from time to time, additional partner rewards. Stay tuned.

200,079 görüntüleme • 10 ay önce

You can’t train a Tesla robot to hold a coffee cup with scraped internet data. You need countless angles of egocentric video from real homes, hands, and lighting. Poseidon coordinates and curates this data at scale, with IP cleared on Story.

You can’t train a Tesla robot to hold a coffee cup with scraped internet data. You need countless angles of egocentric video from real homes, hands, and lighting. Poseidon coordinates and curates this data at scale, with IP cleared on Story.

64,825 görüntüleme • 1 yıl önce

Daha fazla içerik yok.