A permissive-license-aware framework for serving modern computer vision models locally and over Cloudflare Tunnel.

These details have not been verified by PyPI

Project links

Project description

VisionServeX

Local-first computer vision API gateway — secure, private, and honest.
Serve modern CV models on your machine. Local-only by default. No data retained.

Python 3.10+ v1.0.0rc3 ruff

What is VisionServeX?

VisionServeX is an open-source, permissive-license-aware Python framework for running modern computer vision models locally and exposing them through a stable HTTP API. It works as a local model gateway: start it once, call any supported model through one clean API.

Privacy-first design:

Binds to 127.0.0.1 by default — nothing leaves your machine.
Images are decoded in memory for inference and never written to disk by default.
No data is retained between requests by default.
Log redaction removes tokens, base64, and API keys from all output.

⚠️ No end-to-end encryption claimed. VisionServeX cannot provide E2E encryption in the cryptographic sense — the inference server must see plaintext image tensors to run models. We provide local-first processing, no-retention defaults, optional encryption-at-rest for job metadata, and auth for public mode. See docs/privacy.md.

Quickstart (CPU, 5 minutes)

pip install 'visionservex[server,hf,rfdetr]'

visionservex getting-started      # personalized guide
visionservex pull rfdetr-nano     # fast COCO detection, CPU-capable
visionservex serve                 # http://127.0.0.1:8080

curl -F "image=@image.jpg" -F "model_id=rfdetr-nano" \
     http://127.0.0.1:8080/detect | jq

Python Client

from visionservex import Client, VisionModel

# Direct inference (local, no server needed)
result = VisionModel("dfine-s").predict("image.jpg")

# Via local gateway
client = Client("http://127.0.0.1:8080")
result = client.detect("rfdetr-nano", "image.jpg")
result = client.grounded_segment("grounded-sam2", "image.jpg", prompt="car, person")
result = client.classify("swinv2-tiny", "image.jpg")

What works today

Family	Models	Task	Status	Install
Mock	`mock-*`	All	stable	base
RF-DETR	`rfdetr-nano/small/…`	detect	beta	`[rfdetr]`
RF-DETR-Seg	`rfdetr-seg-nano/small/…`	segment	beta	`[rfdetr]`
D-FINE	`dfine-n/s/m/l/x`	detect	beta	`[hf]`
Grounding DINO	`grounding-dino-tiny/…`	open-vocab detect	beta	`[hf]`
SwinV2	`swinv2-tiny/small/base/large`	classify	beta	`[hf]`
SAM v1	`sam-vit-base/large/huge`	foundation segment	beta	`[hf]`
SAM 2	`sam2-hiera-tiny/small/…`	foundation segment	beta	`[hf]`
Grounded SAM	`grounded-sam`	grounded segment	beta	`[hf]`
Grounded-SAM2	`grounded-sam2`	grounded segment	beta	`[hf]`
OneFormer	`oneformer-swin-large/…`	semantic/panoptic	beta	`[hf]`
RTMPose	`rtmpose-s/m/…`	pose	docker/manual	`openmmlab`
RTMDet-R/R2	`rtmdet-r/r2`	OBB	docker/manual	`openmmlab`
ONNX export	SwinV2	—	working	`[onnx]`
TensorRT	—	—	dry-run only	—

GPU: CUDA verified on RTX 5080 for 6+ model families. Run visionservex gpu smoke-test on your hardware.
MPS: Implemented, unverified (no Apple Silicon test hardware).
VRAM safety: Desktop GPU guard reserves buffer for GUI/system stability. GPU tests run serially by default.

"beta" means: CPU-verified, CUDA-verified when noted, CLI/Python/gateway tested. No known regressions. May have edge cases.

Security and Privacy

# Check your current security posture
visionservex security audit --json

# Switch to public-mode configuration
visionservex security mode cloudflare_private --apply

# Generate an API key for development
visionservex gateway token

# Verify log redaction works
visionservex security test-redaction

# See what temp files exist
visionservex privacy inspect-cache
visionservex privacy cleanup --dry-run

Security modes:

Mode	Binding	Auth	Notes
`local_private`	127.0.0.1	Optional	Default, safest
`lan_private`	LAN	Required	TLS recommended
`cloudflare_private`	127.0.0.1 + tunnel	Required	Cloudflare Access recommended
`production_multi_user`	127.0.0.1 + proxy	Required	Encrypted job store, audit logs

Safe Cloudflare Tunnel

export VISIONSERVEX_AUTH__ENABLED=true
export VISIONSERVEX_AUTH__API_KEY=$(visionservex gateway token 2>&1 | grep "API key:" | awk '{print $NF}')

visionservex tunnel config --domain api.yourdomain.com --out tunnel.yaml
visionservex serve &
visionservex tunnel run tunnel.yaml --i-understand-this-is-public

Installation

pip install visionservex                   # base (no heavy deps)
pip install 'visionservex[server]'         # + HTTP API server
pip install 'visionservex[hf]'             # + HF Transformers (D-FINE, GD, SwinV2, SAM, SAM2, OneFormer)
pip install 'visionservex[rfdetr]'         # + RF-DETR and RF-DETR-Seg
pip install 'visionservex[server,hf,rfdetr]'  # full recommended

OpenMMLab (RTMPose, RTMDet-R): Docker sidecar or pip install openmim && mim install mmengine mmcv mmpose. See docs/openmmlab_expert_models.md.

Syntax Contract

All 222 documented CLI/Python/API examples are covered and verified. No example is allowed to silently fail or return a raw traceback.

visionservex syntax audit             # verify 222 examples, failing must be 0
visionservex validation run release   # run full CI test suite

Documentation


Beginner quickstart	5-minute guide
Local gateway	Gateway commands and Python client
Security	Threat model, modes, configuration
Privacy	No E2E claim, retention policy, encryption
Threat model	What we protect and what we don't
Model zoo	All 68 models with status table
Model downloads	Download system, auto-pull
OpenMMLab expert	RTMPose, RTMDet-R, Co-DINO, InternImage
Cloudflare Tunnel	Public mode safely
GPU validation	CPU/CUDA/MPS status
Benchmarks	Latency numbers
Syntax contract	222 verified examples
Troubleshooting	Common errors
About	Author, citation

License and model licenses

Apache-2.0. See LICENSE and NOTICE.

Each integrated model retains its own upstream license. Review model, checkpoint, and dataset licenses before commercial use. See docs/model_licenses.md.

What remains before v1.0.0 final

OpenMMLab checkpoint auto-download (currently CHECKPOINT_REQUIRED structured error)
RTMPose / RTMDet-R2 real end-to-end inference via sidecar (checkpoints needed)
MPS verification on Apple Silicon hardware
TensorRT real engine build (currently dry-run)

Citation

@software{sajjadi2026visionservex,
  author = {Arash Sajjadi},
  title  = {{VisionServeX: A permissive-license-aware framework for local CV model serving}},
  year   = {2026},
  url    = {https://github.com/arashsajjadi/VisionServeX},
  note   = {Developed under the supervision of Prof. Mark Eramian, University of Saskatchewan.}
}

Author: Arash Sajjadi — PhD Candidate, Department of Computer Science, University of Saskatchewan
Supervision: Prof. Mark Eramian, Computer Vision Lab
(This project is not an official product of the University of Saskatchewan.)

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.11.0

Jun 8, 2026

3.10.1

Jun 8, 2026

3.10.0

Jun 8, 2026

3.9.1

Jun 8, 2026

3.9.0

Jun 8, 2026

3.8.1

Jun 8, 2026

3.8.0

Jun 8, 2026

3.7.0

Jun 7, 2026

3.3.0

Jun 7, 2026

3.2.0

Jun 7, 2026

3.1.0

Jun 7, 2026

3.0.0

Jun 7, 2026

2.60.0

Jun 7, 2026

2.59.0

Jun 7, 2026

2.58.0

May 21, 2026

2.57.0

May 21, 2026

2.56.0

May 21, 2026

2.55.0

May 21, 2026

2.54.0

May 20, 2026

2.53.0

May 20, 2026

2.52.0

May 20, 2026

2.51.0

May 20, 2026

2.50.1

May 20, 2026

2.50.0

May 20, 2026

2.49.0

May 20, 2026

2.48.0

May 20, 2026

2.47.3

May 20, 2026

2.47.2

May 20, 2026

2.47.1

May 20, 2026

2.47.0

May 20, 2026

2.45.0

May 19, 2026

2.44.0

May 19, 2026

2.43.0

May 19, 2026

2.42.0

May 19, 2026

2.41.0

May 19, 2026

2.40.0

May 19, 2026

2.39.0

May 19, 2026

2.38.1

May 19, 2026

2.38.0

May 19, 2026

2.37.0

May 19, 2026

2.36.0

May 19, 2026

2.35.0

May 19, 2026

2.34.0

May 19, 2026

2.33.0

May 19, 2026

2.32.0

May 19, 2026

2.31.0

May 19, 2026

2.30.0

May 19, 2026

2.29.0

May 19, 2026

2.28.0

May 18, 2026

2.27.1

May 18, 2026

2.27.0

May 18, 2026

2.26.0

May 18, 2026

2.25.2

May 18, 2026

2.25.1

May 18, 2026

2.25.0

May 18, 2026

2.24.1

May 18, 2026

2.24.0

May 18, 2026

2.23.0

May 18, 2026

2.22.0

May 18, 2026

2.21.0

May 18, 2026

2.20.0

May 18, 2026

2.19.0

May 18, 2026

2.18.0

May 18, 2026

2.17.0

May 18, 2026

2.16.0

May 18, 2026

2.15.0

May 17, 2026

2.14.0

May 17, 2026

2.13.1

May 17, 2026

2.13.0

May 17, 2026

2.12.0

May 17, 2026

2.11.0

May 16, 2026

2.10.0

May 16, 2026

2.9.0

May 16, 2026

2.8.0

May 16, 2026

2.7.0

May 16, 2026

2.6.0

May 16, 2026

2.5.0

May 16, 2026

2.4.0

May 16, 2026

2.3.0

May 16, 2026

2.2.0

May 16, 2026

2.1.1

May 16, 2026

2.1.0

May 16, 2026

2.0.1

May 16, 2026

2.0.0

May 16, 2026

1.9.0

May 16, 2026

1.8.1

May 16, 2026

1.8.0

May 16, 2026

1.7.1

May 16, 2026

1.7.0

May 16, 2026

1.6.0

May 16, 2026

1.5.0

May 16, 2026

1.4.0

May 16, 2026

1.3.0

May 16, 2026

1.2.0

May 16, 2026

1.1.0

May 15, 2026

1.0.0

May 15, 2026

This version

1.0.0rc3 pre-release

May 15, 2026

1.0.0rc2 pre-release

May 15, 2026

1.0.0rc1 pre-release

May 15, 2026

0.9.0

May 15, 2026

0.8.0

May 15, 2026

0.7.0

May 15, 2026

0.5.0

May 15, 2026

0.4.0

May 15, 2026

0.3.0

May 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visionservex-1.0.0rc3.tar.gz (146.4 kB view details)

Uploaded May 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

visionservex-1.0.0rc3-py3-none-any.whl (180.7 kB view details)

Uploaded May 15, 2026 Python 3

File details

Details for the file visionservex-1.0.0rc3.tar.gz.

File metadata

Download URL: visionservex-1.0.0rc3.tar.gz
Upload date: May 15, 2026
Size: 146.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for visionservex-1.0.0rc3.tar.gz
Algorithm	Hash digest
SHA256	`54935e9660a22f1c15499cf17e019930c9313d91e137b28c420601958afceaa0`
MD5	`2ff9a819a39e111138307155b1b7522f`
BLAKE2b-256	`937fcb451feeeacaf058c8e9e94b71868c1539ce26ca9c22c01460390db20734`

See more details on using hashes here.

Provenance

The following attestation bundles were made for visionservex-1.0.0rc3.tar.gz:

Publisher: publish.yml on arashsajjadi/VisionServeX

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: visionservex-1.0.0rc3.tar.gz
- Subject digest: 54935e9660a22f1c15499cf17e019930c9313d91e137b28c420601958afceaa0
- Sigstore transparency entry: 1549942356
- Sigstore integration time: May 15, 2026
Source repository:
- Permalink: arashsajjadi/VisionServeX@e2f14e50351454bc7489f0d263d8116800cedcbc
- Branch / Tag: refs/tags/v1.0.0rc3
- Owner: https://github.com/arashsajjadi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e2f14e50351454bc7489f0d263d8116800cedcbc
- Trigger Event: push

File details

Details for the file visionservex-1.0.0rc3-py3-none-any.whl.

File metadata

Download URL: visionservex-1.0.0rc3-py3-none-any.whl
Upload date: May 15, 2026
Size: 180.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for visionservex-1.0.0rc3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`84be0d46a210f66a0b6f9b582d5109c3970de232565803d002b4ac08ee98de79`
MD5	`3bb9f5993c7a9c785c7294bb429c0b01`
BLAKE2b-256	`e182a09f7f16d100a6632b500d408d9fac77b12420f2f3d6891a1d54e9b30d4d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for visionservex-1.0.0rc3-py3-none-any.whl:

Publisher: publish.yml on arashsajjadi/VisionServeX

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: visionservex-1.0.0rc3-py3-none-any.whl
- Subject digest: 84be0d46a210f66a0b6f9b582d5109c3970de232565803d002b4ac08ee98de79
- Sigstore transparency entry: 1549942507
- Sigstore integration time: May 15, 2026
Source repository:
- Permalink: arashsajjadi/VisionServeX@e2f14e50351454bc7489f0d263d8116800cedcbc
- Branch / Tag: refs/tags/v1.0.0rc3
- Owner: https://github.com/arashsajjadi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e2f14e50351454bc7489f0d263d8116800cedcbc
- Trigger Event: push

visionservex 1.0.0rc3

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

VisionServeX

What is VisionServeX?

Quickstart (CPU, 5 minutes)

Python Client

What works today

Security and Privacy

Safe Cloudflare Tunnel

Installation

Syntax Contract

Documentation

License and model licenses

What remains before v1.0.0 final

Citation

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance