Skip to main content

Create object-detection datasets (YOLO) using Moondream

Project description

MoonLabel

MoonLabel Logo

An object-detection and image-caption dataset tool.

Powered by Moondream VLM

PyPI version


Overview

MoonLabel is both a Python library and a tiny web UI to generate object-detection and image-caption datasets quickly.

  1. Use the library to auto-label folders of images and export YOLO, COCO, VOC, or Captions.
  2. Or launch the UI, switch between Detection/Caption, and export with one click.

Backends supported: Moondream Cloud, Moondream Station, or fully local (Hugging Face).

Demo

https://github.com/user-attachments/assets/a2dfc6b6-c83d-4296-986b-ac221e10fc3b


Features

  • 📦 Library + UImoonlabel package with an optional web UI.
  • 🌐 FastAPI server — Served by a single moonlabel-ui command.
  • ⚛️ Modern frontend — React, TypeScript, TailwindCSS, Vite.
  • 🖼️ Object detection — Choose between Moondream Cloud, the open-source Hugging Face model, or the native Moondream Station app.
  • 📝 Image caption datasets — Export captions as captions.jsonl alongside images.
  • GPU-accelerated & offline — Local and Station modes automatically use available hardware acceleration (CUDA / MPS).

Install

  • Library only (Cloud/Station by default):
pip install moonlabel
  • Library + UI server:
pip install "moonlabel[ui]"
  • Local inference (Hugging Face) extras:
pip install "moonlabel[local]"
  • Both UI and local inference:
pip install "moonlabel[ui,local]"

Quick Start (UI)

pip install "moonlabel[ui]"
moonlabel-ui    # opens http://localhost:8342

Choose backend in Settings:

  • Moondream Cloud: paste API key
  • Moondream Station: set endpoint (default http://localhost:2020/v1)
  • Local (Hugging Face): install local extras and select Local

In the Home page:

  • Use the top toggle to select Detection or Caption dataset.
  • For Caption, choose length (short/normal/long) and generate/export.

Quick Start (Library)

from moonlabel import create_dataset

# Cloud
create_dataset("/path/to/images", objects=["person"], api_key="YOUR_API_KEY")

# Station
create_dataset("/path/to/images", objects=["car"], station_endpoint="http://localhost:2020/v1")

# Local (after: pip install "moonlabel[local]")
create_dataset("/path/to/images", objects=["bottle"])  # no key needed

By default this exports YOLO. Choose formats via export_format:

# YOLO (default)
create_dataset("/path/to/images", objects=["person"], export_format="yolo")

# COCO
create_dataset("/path/to/images", objects=["person", "car"], export_format="coco")

# Pascal VOC
create_dataset("/path/to/images", objects=["cat", "dog"], export_format="voc")

# Image Captioning
create_dataset("/path/to/images", export_format="caption")
# With length (short|normal|long)
create_dataset("/path/to/images", export_format="caption", caption_length="normal")

Output layouts:

  • YOLO: images/, labels/, classes.txt
  • COCO: images/, annotations/instances.json, classes.txt
  • VOC: images/, annotations/*.xml, classes.txt
  • Caption: images/, annotations/captions.jsonl

Moondream Station Mode

The backend can connect to a running Moondream Station instance for fast, native, on-device inference.

  1. Download, install, and run Moondream Station.
  2. Ensure the endpoint matches your Station configuration (default: http://localhost:2020/v1).

Local Mode (Hugging Face)

The backend can run fully offline using the open-source vikhyatk/moondream2 checkpoint.

  1. pip install "moonlabel[local]"
  2. In the UI, select Local (no API key required).

The first detection will trigger a one-off model download to ~/.cache/huggingface/; subsequent runs reuse the cached weights.

GPU / Device selection

The backend chooses the best device automatically in the following order: CUDA → Apple Silicon (MPS) → CPU.

Override via environment variable before launching the backend:

# Force GPU
export MOONDREAM_DEVICE=cuda

# Force Apple Silicon
export MOONDREAM_DEVICE=mps

# CPU only
export MOONDREAM_DEVICE=cpu

Project Structure

moonlabel/
├── src/moonlabel/             # Python package (library + server)
│   ├── dataset.py             # create_dataset API
│   ├── infer.py               # Moondream wrapper (cloud/station/local)
│   ├── types.py               # shared types 
│   ├── utils.py               # helpers 
│   ├── yolo.py                # YOLO label writer
│   ├── coco.py                # COCO writer
│   ├── voc.py                 # Pascal VOC XML writer
│   └── server/                # FastAPI app + static assets
│       ├── api.py
│       ├── cli.py             # moonlabel-ui entrypoint (port 8342)
│       └── static/            # embedded UI build (no npm for users)
├── ui/                        # Frontend source (for maintainers)
│   └── dist/                  # Built files to embed
├── scripts/embed_ui.py        # Copies ui/dist → src/moonlabel/server/static
├── Makefile                   # make ui-build, ui-embed, release
└── pyproject.toml

Roadmap / TODOs

Below are planned enhancements and upcoming features. Contributions welcome!

  • Local Hugging Face model support – Offline inference with optional GPU acceleration.
  • Moondream Station integration – Native Mac/Linux app support for on-device inference.
  • Batch uploads – Label multiple images in one go, with progress tracking.
  • Additional export formats – COCO JSON and Pascal VOC alongside YOLO.

License

This project is licensed under the terms of the Apache License 2.0. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

moonlabel-0.1.5.tar.gz (143.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

moonlabel-0.1.5-py3-none-any.whl (144.1 kB view details)

Uploaded Python 3

File details

Details for the file moonlabel-0.1.5.tar.gz.

File metadata

  • Download URL: moonlabel-0.1.5.tar.gz
  • Upload date:
  • Size: 143.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for moonlabel-0.1.5.tar.gz
Algorithm Hash digest
SHA256 bedf77fb3aabfcb149aa78c84c6d2eaf75de66cf91985670a2a8a46d6607905c
MD5 356140e989c349890613ee731222ad9e
BLAKE2b-256 2cfd8898489caa1b6dfb36b98c0a4fe7770a5252b2fc60a9d0d5ebff4f9ff0aa

See more details on using hashes here.

File details

Details for the file moonlabel-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: moonlabel-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 144.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for moonlabel-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 fb80f67908869c1ff68de5f9a63d151cee3cf74e8014760a3f8afda6e36cc801
MD5 6db4429448451d067034a2bdc914ee5f
BLAKE2b-256 350f6f90feaca03cc298eed8f6ffda168f20b55da13d57d17d054faa88c81c7e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page