Overview

Moondream is an open-source family of Vision Language Models (VLMs) built for powerful, efficient visual reasoning. Our newest release, Moondream 3 Preview, is a mixture-of-experts model with grounded visual reasoning, a 32k context window, and native support for multiple vision skills—like pointing, counting, and object detection—all designed with a deployment-friendly ethos.

Moondream 3 Preview is now the default model for our cloud API and local processing with Moondream Station – get started here.
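For a quick taste of the cloud API, here is a minimal sketch assuming the `moondream` Python client (`pip install moondream`); the API key and image path are placeholders, and exact method and response-key names should be verified against the client documentation.

```python
import moondream as md  # moondream Python client (pip install moondream)
from PIL import Image

# Placeholder API key; create one in the Moondream cloud console.
model = md.vl(api_key="your-api-key")

# Placeholder image path.
image = Image.open("photo.jpg")

# Ask a free-form question about the image (visual question answering).
answer = model.query(image, "What is happening in this image?")["answer"]
print(answer)
```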

Key stats

  • 9B total params, 2B active params (maintaining inference speeds similar to our previous models)
  • 32k context window (up from 2k)

Model Skills

Moondream has built-in vision-specific skills that make it easy to generate specific types of vision output, such as bounding boxes or 2D points (see the sketch after this list). The skills are:

  • Object Detection
  • Pointing and Counting
  • Visual Question Answering
  • Captioning
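
As a sketch of how these skills map to code, the example below again assumes the `moondream` Python client; the object names, prompts, and image path are placeholders, and the response keys shown should be checked against the client docs for your installed version.

```python
import moondream as md
from PIL import Image

# Cloud client; a local model file can be used instead, e.g. md.vl(model="path/to/model.mf").
model = md.vl(api_key="your-api-key")
image = Image.open("street.jpg")  # placeholder image

# Object detection: bounding boxes for every instance of a named object.
objects = model.detect(image, "car")["objects"]

# Pointing and counting: one 2D point per instance; counting is len(points).
points = model.point(image, "person")["points"]
print(f"Detected {len(objects)} cars and counted {len(points)} people")

# Visual question answering: free-form questions about the image.
answer = model.query(image, "What color is the nearest car?")["answer"]
print(answer)

# Captioning: a natural-language description of the whole image.
caption = model.caption(image)["caption"]
print(caption)
```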

Performance Benchmarks

Here are some early benchmark results. We show Moondream 3 Preview alongside several top frontier models for comparison. Moondream also produces answers in a fraction of the time of these larger models. We'll publish more complete results later, including inference times, to make this clearer.

| Task | Moondream 3 Preview | GPT-5 | Gemini 2.5 Flash | Claude 4 Sonnet |
| --- | --- | --- | --- | --- |
| **Object Detection** | | | | |
| refcocog | 88.6 | 49.8 | 75.1 | 26.2 |
| refcoco+ | 81.8 | 46.3 | 70.2 | 23.4 |
| refcoco | 91.1 | 57.2 | 75.8 | 30.1 |
| **Counting** | | | | |
| CountBenchQA | 93.2 | 89.3 | 81.2 | 90.1 |
| **Document Understanding** | | | | |
| ChartQA | 86.6 | 85* | 79.5 | 74.3* |
| DocVQA | 88.3 | 89* | 94.2 | 89.5* |
| **Hallucination (higher is better)** | | | | |
| POPE | 89.0 | 88.4 | 88.1 | 84.6 |

License

Copyright (c) 2025 M87 Labs, Inc. This distribution includes Model Weights licensed under the Business Source License 1.1 with an Additional Use Grant (No Third-Party Service). Commercial hosting or rehosting requires an agreement with contact@m87.ai.

