RTSP → object detection → ONVIF Profile-M metadata producer

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

rtsp-to-onvif-m-adapter

Reads one RTSP stream, runs a pluggable object detector on captured frames, and emits per-frame detections as ONVIF Profile-M metadata — as onvif-mj JSON or tt:MetadataStream XML. One stream per process; run several to scale out.

Conformance

The XML output validates against the official ONVIF metadatastream.xsd (XSD 1.1) in the test suite (tests/test_compliance.py). Canonical output shape: schema/onvif-mj.example.json.

A non-normative JSON Schema for the payload is provided at schema/onvif-mj.schema.json — an inference from the XSD and this implementation, not an official ONVIF artifact (the XSD stays authoritative). It is deliberately open: you can add optional fields (e.g. a ReID descriptor) without breaking validation, and define your own schema to constrain them. Every payload the test suite produces is validated against it.

from datetime import datetime, timezone
from onvif_m import (BoundingBox, ClassCandidate, DetectedObject,
                     build_frame, build_payload, to_xml_string, from_pixel_bbox)

obj = DetectedObject(
    object_id=0,
    bbox=from_pixel_bbox(64, 48, 192, 240, width=640, height=480),  # pixels -> ONVIF coords
    classes=[ClassCandidate("Human", 0.94)],
)
frame = build_frame(datetime.now(timezone.utc), source="cam-7", objects=[obj])
payload = build_payload([frame])     # onvif-mj JSON  -> {"Frame": [...]}
xml = to_xml_string(payload)         # tt:MetadataStream XML

Coordinates use the ONVIF normalized frame: [-1, 1], origin center, y-up. from_pixel_bbox converts top-left pixel boxes.

Install

Requires Python 3.11+ and ffmpeg on PATH. Create a virtualenv, then install.

Linux / macOS

python -m venv venv && . venv/bin/activate
pip install -e ".[capture,detect,mqtt]"

Windows (PowerShell)

python -m venv .venv
.\.venv\Scripts\Activate.ps1
pip install -e ".[capture,mqtt]"
# .[detect] installs CPU-only torch; for an NVIDIA GPU install CUDA wheels instead:
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121

.[detect] pulls CPU-only torch from PyPI on every platform. For GPU acceleration, install the matching CUDA wheels as shown (cu121 for NVIDIA 40-series; see docs/development.md for other GPUs).

Run

# one RTSP URL at a time
python -m onvif_m rtsp://host/stream --detector torchvision --sink file --output-root ./out
python -m onvif_m rtsp://host/stream --detector mock --sink stdout          # dry-run wiring
python -m onvif_m rtsp://host/stream --sink mqtt --mqtt-host broker.local

--device {auto,cpu,cuda,mps} selects the accelerator. --health-port N serves /healthz. SIGINT/SIGTERM stops cleanly. Scaling to many streams is left to the user — run one process per stream.

Output

Two independent axes, each repeatable and also settable via environment variable:

Option	Values	Env
`--format`	`json`, `xml`	`ONVIF_M_FORMAT`
`--sink`	`file`, `stdout`, `mqtt`	`ONVIF_M_SINK`

Every selected sink writes every selected format. Examples:

python -m onvif_m rtsp://host/stream --format json --format xml --sink file --sink mqtt
ONVIF_M_FORMAT=json,xml ONVIF_M_SINK=stdout python -m onvif_m rtsp://host/stream

file: atomic sidecars under <output-root>/<name>/ (.meta.json / .meta.xml).
stdout: one line per payload per format.
mqtt: topic <prefix>/<onvif-mj|onvif-xml>/VideoAnalytics/<ProfileToken>[/<Module>].

Extending

Three things are pluggable: post-processors, publishers (sinks), and detectors. There are three ways to plug in, depending on how you install.

Post-processors

A post-processor runs after detection and before the metadata is built:

class PostProcessor:                                 # onvif_m.pipeline.PostProcessor
    def process(self, objects, frame): ...           # -> list[DetectedObject]

This is the hook for ReID / tracking (reassign a stable object_id, which flows to ONVIF @ObjectId), histogram or attribute tagging, face blurring, or filtering. From the CLI, point --processor at any importable module:factory — your own file on the path works from a PyPI install:

# my_hooks.py in the working directory (or any installed module)
python -m onvif_m rtsp://host/stream --processor my_hooks:HumansOnly

examples/processors.py is a copy-paste template (it ships in the source tree, not the wheel, so reference your own module when installed from PyPI).

Plugins from PyPI (entry points)

A separately-installed package can register publishers, detectors, or post-processors so they work from the CLI by name. Declare entry points in the plugin package's pyproject.toml:

[project.entry-points."onvif_m.publishers"]
s3 = "my_pkg:S3Publisher"          # -> --sink s3

[project.entry-points."onvif_m.detectors"]
yolo11 = "my_pkg:Yolo11Detector"   # -> --detector yolo11

[project.entry-points."onvif_m.processors"]
reid = "my_pkg:ReID"               # -> --processor reid

Each entry point is a zero-arg factory (a class works) returning a Publisher / Detector / PostProcessor. After pip install my-pkg, the names appear in the CLI:

pip install my-onvif-plugins
python -m onvif_m rtsp://host/stream --detector yolo11 --sink s3 --sink file

Library API (full control)

For custom wiring, drive the pipeline directly — this plugs in anything, including custom publishers/detectors without packaging them:

from onvif_m.capture import RtspCaptureSource
from onvif_m.pipeline import Camera, run_camera
from onvif_m.detect import create_detector
from onvif_m.publish import FilePublisher, MultiPublisher

run_camera(
    Camera("cam"),
    RtspCaptureSource("rtsp://host/stream"),
    create_detector(backend="torchvision"),
    MultiPublisher([FilePublisher("./out"), MyWebhookPublisher()]),
    processors=[MyReID()],
)

Note: arbitrary descriptors (e.g. ReID embeddings) have no ONVIF metadata field, so emitting those needs a schema extension.

Biometric suppression

suppress_biometrics (default on) is a detector-side flag: the detector loads no face/body submodels and emits no HumanFace/HumanBody metadata. It does not blur, mask, or alter the image — this tool never writes or modifies the source frame at all. Image redaction, if needed, is a downstream concern.

Example use: surveying a field for non-people objects (vehicles, animals, equipment) while ensuring no biometric data is computed or cascaded to downstream systems.

Design

docs/schema.md — output format and consumer guide.
ARCHITECTURE.md — pipeline, detector/publisher plugins, the post-processor hook, biometric suppression.
REQUIREMENTS.md; open backlog in TODO.md.
schema/onvif/README.md — vendored ONVIF XSDs.

Develop

pip install -e ".[dev]"
pytest -q          # incl. the ONVIF XSD compliance suite (self-skips offline)
ruff check . && mypy

Test matrix, scripted test servers (MQTT, RTSP), and benchmarking are in docs/development.md; see CONTRIBUTING.md and docs/releasing.md.

The default detector is torchvision (BSD-3). Ultralytics YOLOv8 is AGPL-3.0 and opt-in only.

Device & performance

The detector runs on CPU, Apple Silicon (MPS), or NVIDIA (CUDA) — --device {auto,cpu,cuda,mps} (auto = cuda > mps > cpu). Benchmark a combination:

python -m onvif_m.bench --model ssdlite320_mobilenet_v3_large --device auto

Per-frame latency at 640×480 (Apple figures measured on an Apple M2 Max):

Model	CPU (M2 Max)	MPS (M2 Max)	CPU (x86)	CUDA (RTX PRO 4500)
`ssdlite320_mobilenet_v3_large` (default, light)	48 ms	64 ms	28 ms	13 ms (77 fps)
`retinanet_resnet50_fpn` (heavy, accurate)	970 ms	95 ms	536 ms	17 ms (59 fps)

Measured on a Windows 11 reference box (Intel CPU + NVIDIA RTX 4070 Laptop, 8 GB, 50 W; torch 2.5.1+cu121), 640×480, 50–200 runs:

Model	CPU	CUDA (RTX 4070)
`ssdlite320_mobilenet_v3_large` (light)	83 ms (12 fps)	135 ms (7.4 fps)
`retinanet_resnet50_fpn` (heavy)	1584 ms (0.6 fps)	82 ms (12 fps)

Guidance: the light default runs in real time on CPU. For heavy/accurate models use a GPU — CUDA is ~19× faster than CPU on the RTX 4070 here, MPS ~10× on the M2 Max. A tiny model is launch/transfer-bound on a GPU (notably on a power-limited laptop GPU under Windows/WDDM, where the light default is actually faster on CPU), so pair the light default with CPU and reserve the GPU for heavy models.

Windows 11 + NVIDIA CUDA is a tested platform (pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121 for a CUDA build; ffmpeg must be on PATH).

License

Apache-2.0. Vendored ONVIF XSDs under schema/onvif/ are upstream ONVIF artifacts under ONVIF's terms.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

scottrfrancis

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Jun 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

onvif_m_producer-0.1.0.tar.gz (116.4 kB view details)

Uploaded Jun 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

onvif_m_producer-0.1.0-py3-none-any.whl (29.9 kB view details)

Uploaded Jun 24, 2026 Python 3

File details

Details for the file onvif_m_producer-0.1.0.tar.gz.

File metadata

Download URL: onvif_m_producer-0.1.0.tar.gz
Upload date: Jun 24, 2026
Size: 116.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for onvif_m_producer-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`3d5bdc501432e909e7fd502e9dc0496a17621fa3c7544991e0c0c38270b64e43`
MD5	`7599d0fc3b776cba2269832f93313a12`
BLAKE2b-256	`a62f9322ad64fd360e1d56e84ae60094c40e6c8f40c5190f7b466dd8defea890`

See more details on using hashes here.

Provenance

The following attestation bundles were made for onvif_m_producer-0.1.0.tar.gz:

Publisher: release.yml on scottrfrancis/rtsp-to-onvif-m-adapter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: onvif_m_producer-0.1.0.tar.gz
- Subject digest: 3d5bdc501432e909e7fd502e9dc0496a17621fa3c7544991e0c0c38270b64e43
- Sigstore transparency entry: 1943615713
- Sigstore integration time: Jun 24, 2026
Source repository:
- Permalink: scottrfrancis/rtsp-to-onvif-m-adapter@bac18b92556d6f8b5625602d0dba227509a4b0ef
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/scottrfrancis
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@bac18b92556d6f8b5625602d0dba227509a4b0ef
- Trigger Event: push

File details

Details for the file onvif_m_producer-0.1.0-py3-none-any.whl.

File metadata

Download URL: onvif_m_producer-0.1.0-py3-none-any.whl
Upload date: Jun 24, 2026
Size: 29.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for onvif_m_producer-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6fea289f7c26ecb78003b93a9976cb2ff6b17cf0ce49e9cf391ea6fbb4fb0a89`
MD5	`63d57716d7074c5cfb77a0aae9cc828f`
BLAKE2b-256	`7a5b8c78260894ee5b0e7149c50841525e5946305f1bc32a479c82e492e7cf6f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for onvif_m_producer-0.1.0-py3-none-any.whl:

Publisher: release.yml on scottrfrancis/rtsp-to-onvif-m-adapter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: onvif_m_producer-0.1.0-py3-none-any.whl
- Subject digest: 6fea289f7c26ecb78003b93a9976cb2ff6b17cf0ce49e9cf391ea6fbb4fb0a89
- Sigstore transparency entry: 1943615839
- Sigstore integration time: Jun 24, 2026
Source repository:
- Permalink: scottrfrancis/rtsp-to-onvif-m-adapter@bac18b92556d6f8b5625602d0dba227509a4b0ef
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/scottrfrancis
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@bac18b92556d6f8b5625602d0dba227509a4b0ef
- Trigger Event: push

onvif-m-producer 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

rtsp-to-onvif-m-adapter

Conformance

Install

Run

Output

Extending

Post-processors

Plugins from PyPI (entry points)

Library API (full control)

Biometric suppression

Design

Develop

Device & performance

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance