moondream

@moondreamai · 12,166 subscribers

Production vision AI that runs everywhere. Open models, fine-tuning, cloud, and fast inference for real-world deployment.

Shorts

Moondream doesn’t just OCR, it understands. Full JSON receipts, instant Q&A. Open, tiny, blazingly fast.

280,476 views

We’re introducing Segmentation. SVG masks from prompt, points, or box. SOTA on benchmarks.

153,358 views

Announcing MLX native Mac support for Moondream 3. Moondream running on Mac, Linux, Windows (free!): pip install moondream-station Details:

70,340 views

SOTA object detection with powerful visual reasoning: "skateboarder wearing checkered shirt".

60,524 views

Moondream 3: world's best open-vocab object detection, packed in a fast, open model.

52,595 views

Moondream 3 Preview adds frontier-level visual reasoning while retaining Moondream 2's blazing speed. No compromises.

44,461 views

Moondream doesn’t just see the game, it understands it. “Who’s winning?” “Fallen player?” “Player with the ball?” “Main sponsor?” It spots them all in a single frame. Smart. Fast. Free →

23,556 views

Moondream 3 can parse complex parking signs in one step. Prompt: "extract sign details" → JSON of each rule + transcription. No OCR stack, no regex: just vision that understands structure. ⚡️Fast, cheap, grounded vision AI.

21,862 views

Moondream segmenting delivers pixel-accurate vector masks for arms, packages, and conveyor bins. A free, open-source model with state-of-the-art segmentation benchmarks, built for real-world automation.

15,141 views

SOTA segmenting through prompts. Player blocking the shot, a jersey, armband, Steph's face. No sweat.

13,727 views

Need to redact video content? Moondream lets you redact content with simple prompting.

25,565 views
