Model-agnostic generative vision abstractions (image/video) for the Abstract ecosystem

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

lpalbou

These details have not been verified by PyPI

Project description

AbstractVision

Model-agnostic generative vision API (images, optional video) for Python and the Abstract* ecosystem.

What you get

A small orchestration API: VisionManager
A packaged capability registry (“what models can do”): VisionModelCapabilitiesRegistry backed by vision_model_capabilities.json
Optional artifact-ref outputs (small JSON refs): LocalAssetStore and RuntimeArtifactStoreAdapter
Built-in backends (execution engines): src/abstractvision/backends/
- OpenAI-compatible HTTP: openai_compatible.py
- Local Diffusers: huggingface_diffusers.py
- Local stable-diffusion.cpp / GGUF: stable_diffusion_cpp.py
CLI/REPL for manual testing: abstractvision
Self-contained local Playground UI/API: playground/vision_playground.html (docs: playground/README.md)

How it fits together (diagram)

flowchart LR
  Caller[Python / CLI / AbstractCore] --> VM[VisionManager]
  VM --> BE[VisionBackend]
  BE --> VM
  VM -->|optional| Store[MediaStore]
  Store --> Ref[Artifact ref dict]
  VM -->|no store| Asset["GeneratedAsset (bytes + mime)"]

Status (current backend support)

Development status: Alpha (0.x). The public API is stable-by-design, but breaking changes may still happen and will be called out in CHANGELOG.md.
Built-in backends implement: text_to_image and image_to_image.
Video (text_to_video, image_to_video) is supported only via the OpenAI-compatible backend when endpoints are configured.
multi_view_image is part of the public API (VisionManager.generate_angles) but no built-in backend implements it yet.

Details: docs/reference/backends.md.

Installation

pip install abstractvision

The base install is lightweight. It includes the shared API, capability registry, artifact helpers, CLI, AbstractCore plugin entry point, and the stdlib OpenAI-compatible HTTP backend. Local inference runtimes are explicit extras.

Optional extras:

Extra	Use
`abstractvision[openai]`	Official OpenAI provider intent marker; no SDK dependency today.
`abstractvision[openai-compatible]`	Generic local/remote OpenAI-shaped endpoint intent marker; stdlib-only today.
`abstractvision[diffusers]`	Install Torch/Diffusers and related packages for local Diffusers generation.
`abstractvision[huggingface]`	Compatibility alias for callers that still request the historical Diffusers extra.
`abstractvision[sdcpp]`	Install `stable-diffusion-cpp-python` for the pip binding fallback.
`abstractvision[local]`	Convenience for both local backend dependency sets, including `diffusers` and `sdcpp`.
`abstractvision[all]`	All runtime backend dependencies, without contributor tooling.
`abstractvision[apple]` / `abstractvision[all-apple]`	Native macOS Python profile: Diffusers/Torch MPS plus stable-diffusion.cpp bindings.
`abstractvision[gpu]`	GPU Diffusers/Torch profile. Install a CUDA/ROCm-enabled PyTorch wheel when needed.
`abstractvision[all-gpu]`	Full GPU-relevant local vision profile: Diffusers plus stable-diffusion.cpp bindings.
`abstractvision[abstractcore]`	Compatibility marker only; AbstractCore is still supplied by the host application.

stable-diffusion-cpp-python is currently constrained below 0.4.6 because that release's source distribution is missing vendored CMake files required by native Linux builds.

Contributor-only extras:

Extra	Use
`abstractvision[diffusers-dev]` / `abstractvision[huggingface-dev]`	Looser dependency pins for newer/unreleased Diffusers pipelines; install Diffusers `main` separately if needed.
`abstractvision[test]`	Local test dependencies.
`abstractvision[docs]`	Documentation build tooling.
`abstractvision[dev]`	Full contributor workflow: tests, docs, build, lint, formatting, and pre-commit. Do not use this as an application runtime profile.

Note (CUDA): on Windows/Linux, pip install "abstractvision[diffusers]" may install a CPU-only PyTorch build. If you want to use an NVIDIA GPU, install a CUDA-enabled PyTorch build first (see https://pytorch.org/get-started/locally/) and verify torch.cuda.is_available() is True.

AbstractCore is not installed by AbstractVision. When an AbstractCore application has AbstractVision installed in the same environment, AbstractCore can discover the plugin entry point and use the integration modules lazily.

If you hit “missing pipeline class” errors for newer model families, see docs/getting-started.md. In that case you may need Diffusers from source (main):

pip install -U "abstractvision[diffusers-dev]"
pip install -U "git+https://github.com/huggingface/diffusers@main"

For local development from a repo checkout:

pip install -e ".[dev]"

Usage

Start here:

Getting started: docs/getting-started.md
FAQ: docs/faq.md
API reference: docs/api.md
Architecture: docs/architecture.md
Docs index: docs/README.md

First local model (Diffusers / cross-platform)

Install the local runtime extra, pre-download the model outside the REPL, then select the Diffusers backend explicitly:

pip install "abstractvision[diffusers]"
huggingface-cli download runwayml/stable-diffusion-v1-5
export ABSTRACTVISION_BACKEND=diffusers
export ABSTRACTVISION_MODEL_ID=runwayml/stable-diffusion-v1-5
export ABSTRACTVISION_DIFFUSERS_DEVICE=auto
abstractvision repl

For a fresh cache, you can also permit the REPL to download missing files:

ABSTRACTVISION_DIFFUSERS_ALLOW_DOWNLOAD=1 abstractvision repl

More recommendations by VRAM: docs/getting-started.md.

Capability-driven model selection

from abstractvision import VisionModelCapabilitiesRegistry

reg = VisionModelCapabilitiesRegistry()
assert reg.supports("runwayml/stable-diffusion-v1-5", "text_to_image")

print(reg.list_tasks())
print(reg.models_for_task("text_to_image"))

Backend wiring + generation (artifact outputs)

The base install is import-light and does not install Torch/Diffusers. Heavy local backend modules are imported lazily (see src/abstractvision/backends/__init__.py). Install abstractvision[diffusers] for local Diffusers, or abstractvision[sdcpp] for the optional stable-diffusion.cpp python binding fallback.

from abstractvision import LocalAssetStore, VisionManager, VisionModelCapabilitiesRegistry, is_artifact_ref
from abstractvision.backends import OpenAICompatibleBackendConfig, OpenAICompatibleVisionBackend

reg = VisionModelCapabilitiesRegistry()

backend = OpenAICompatibleVisionBackend(
    config=OpenAICompatibleBackendConfig(
        base_url="http://localhost:1234/v1",
        api_key="YOUR_KEY",      # optional for local servers
        model_id="REMOTE_MODEL", # optional (server-dependent)
    )
)

vm = VisionManager(
    backend=backend,
    store=LocalAssetStore(),         # enables artifact-ref outputs
    model_id="zai-org/GLM-Image",    # optional: capability gating
    registry=reg,                   # optional: reuse loaded registry
)

out = vm.generate_image("a cinematic photo of a red fox in snow")
assert is_artifact_ref(out)
print(out)  # {"$artifact": "...", "content_type": "...", ...}

png_bytes = vm.store.load_bytes(out["$artifact"])  # type: ignore[union-attr]

When installed next to AbstractCore, AbstractVision is also discovered as a llm.vision capability plugin. The plugin defaults to the official OpenAI image endpoint (https://api.openai.com/v1) and reads OPENAI_API_KEY (or ABSTRACTVISION_API_KEY). Set OPENAI_BASE_URL only when you need to override that OpenAI-compatible base for the official OpenAI profile. Set ABSTRACTVISION_BACKEND=openai-compatible plus ABSTRACTVISION_BASE_URL for a local or remote compatible /v1 server. Set ABSTRACTVISION_MODEL_ID, OPENAI_IMAGE_MODEL_ID, or OPENAI_IMAGE_MODEL when you need an explicit image model (static default OpenAI model: gpt-image-1). AbstractVision does not query provider /models catalogs to discover or select image models automatically, but you can inspect them explicitly with abstractvision provider-models, VisionManager.list_provider_models(...), or the AbstractCore plugin method llm.vision.list_provider_models(...). After inspection, set the model env var explicitly for newer provider models when available to your account. Set ABSTRACTVISION_BACKEND=diffusers or ABSTRACTVISION_BACKEND=sdcpp when you want AbstractCore to launch local AbstractVision generation directly.

Interactive testing (CLI / REPL)

abstractvision models
abstractvision provider-models --openai --task text_to_image
abstractvision provider-models --base-url http://localhost:1234/v1 --task text_to_image
abstractvision tasks
abstractvision show-model runwayml/stable-diffusion-v1-5

abstractvision repl

Inside the REPL:

/t2i "a watercolor painting of a lighthouse" --width 512 --height 512 --steps 10 --open

For a newer but still relatively small local model, try black-forest-labs/FLUX.2-klein-4B after installing Diffusers from source (see docs/getting-started.md):

/backend diffusers black-forest-labs/FLUX.2-klein-4B mps float16
/t2i "a product photo of a matte black espresso machine" --steps 4 --guidance-scale 1.0 --open

OpenAI-compatible server example:

/backend openai http://localhost:1234/v1
/t2i "a watercolor painting of a lighthouse" --width 512 --height 512 --steps 10 --open

The CLI/REPL can also be configured via ABSTRACTVISION_* env vars; see docs/reference/configuration.md.

Local web playground

The playground is owned by AbstractVision and runs without AbstractCore. It is a local/dev testing surface; use AbstractCore/Gateway for production routing, authentication, and browser-origin policy.

abstractvision playground --port 8091

Open http://127.0.0.1:8091/vision_playground.html, select a cached model, then load it. The page and the API are served by the same process.

One-shot commands (OpenAI-compatible HTTP backend only):

abstractvision t2i --base-url http://localhost:1234/v1 "a studio photo of an espresso machine"
abstractvision i2i --base-url http://localhost:1234/v1 --image ./input.png "make it watercolor"

Local GGUF via stable-diffusion.cpp

If you want to run GGUF diffusion models locally, use the stable-diffusion.cpp backend (sdcpp). Start with a single-file Stable Diffusion model when possible; Qwen Image and FLUX GGUF component sets are heavier.

Recommended:

macOS (Apple Silicon / Metal): install sd-cli (stable-diffusion.cpp executable) from releases and use CLI mode for Metal acceleration.
Otherwise (pip-only convenience): pip install "abstractvision[sdcpp]" installs the stable-diffusion.cpp python bindings (stable-diffusion-cpp-python>=0.4.2,<0.4.6), but this may run CPU-only depending on the wheel build.

Alternative (external executable):

Install sd-cli: https://github.com/leejet/stable-diffusion.cpp/releases

In the REPL:

/backend sdcpp /path/to/sd-v1-5.gguf /path/to/sd-cli
/t2i "a watercolor painting of a lighthouse" --width 512 --height 512 --steps 10 --open

FLUX.2-klein-4B GGUF component example:

/backend sdcpp /path/to/flux-2-klein-4b-Q8_0.gguf /path/to/flux2_ae.safetensors /path/to/Qwen3-4B-Q4_K_M.gguf /path/to/sd-cli
/t2i "a product photo of a matte black espresso machine" --steps 4 --guidance-scale 1.0 --sampling-method euler --diffusion-fa --offload-to-cpu --open

Extra flags are forwarded via request.extra. In CLI mode they are forwarded to sd-cli; in python bindings mode, keys are mapped to python binding kwargs when supported and unsupported keys are ignored.

AbstractCore tool integration (artifact refs)

If you’re using AbstractCore tool calling, AbstractVision can expose vision tasks as tools:

from abstractvision.integrations.abstractcore import make_vision_tools

tools = make_vision_tools(vision_manager=vm, model_id="zai-org/GLM-Image")

Install abstractcore in the host application environment when you use these helpers; it is not pulled in by AbstractVision.

AbstractFramework ecosystem

AbstractVision is part of the AbstractFramework ecosystem and is designed to compose with:

AbstractFramework (project hub): https://github.com/lpalbou/AbstractFramework
AbstractCore (orchestration + tool calling): https://github.com/lpalbou/abstractcore
AbstractRuntime (runtime services, including artifact storage): https://github.com/lpalbou/abstractruntime

In practice:

AbstractVision standardizes generative vision outputs (image/video) behind VisionManager.
AbstractCore can discover and use AbstractVision via the capability plugin (src/abstractvision/integrations/abstractcore_plugin.py) or you can expose vision tasks as tools (src/abstractvision/integrations/abstractcore.py).
Artifact refs returned by AbstractVision are designed to travel across processes; RuntimeArtifactStoreAdapter bridges to an AbstractRuntime-style artifact store (src/abstractvision/artifacts.py).

Project

Release notes: CHANGELOG.md
Contributing: CONTRIBUTING.md
Security: SECURITY.md
Acknowledgments: ACKNOWLEDGMENTS.md
Agent docs: llms.txt and llms-full.txt

Requirements

Python >= 3.9

License

MIT License - see LICENSE file for details.

Author

Laurent-Philippe Albou

Contact

contact@abstractcore.ai

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

lpalbou

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.18

May 31, 2026

0.3.17

May 29, 2026

0.3.16

May 26, 2026

0.3.15

May 26, 2026

0.3.14

May 26, 2026

0.3.13

May 23, 2026

0.3.12

May 22, 2026

0.3.11

May 22, 2026

0.3.10

May 22, 2026

0.3.9

May 21, 2026

0.3.8

May 20, 2026

0.3.7

May 19, 2026

0.3.6

May 17, 2026

This version

0.3.5

May 13, 2026

0.3.4

May 9, 2026

0.3.3

May 8, 2026

0.3.2

May 8, 2026

0.3.1

May 7, 2026

0.3.0

May 7, 2026

0.2.6

May 6, 2026

0.2.5

May 6, 2026

0.2.4

May 6, 2026

0.2.3

May 6, 2026

0.2.2

May 6, 2026

0.2.1

Feb 5, 2026

0.1.0

Jan 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abstractvision-0.3.5.tar.gz (204.1 kB view details)

Uploaded May 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

abstractvision-0.3.5-py3-none-any.whl (77.3 kB view details)

Uploaded May 13, 2026 Python 3

File details

Details for the file abstractvision-0.3.5.tar.gz.

File metadata

Download URL: abstractvision-0.3.5.tar.gz
Upload date: May 13, 2026
Size: 204.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for abstractvision-0.3.5.tar.gz
Algorithm	Hash digest
SHA256	`bc89c231e296c365f6eacdf93bb041eea3023df1d6bb54c318c4d5bb4ae44e90`
MD5	`b758f94bd1b2c4da7df1400e340424cb`
BLAKE2b-256	`297beb3659afa8a1e7e99465b5956a359c42955dd56ebbf5d48a1a6292fe31d6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for abstractvision-0.3.5.tar.gz:

Publisher: release.yml on lpalbou/AbstractVision

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: abstractvision-0.3.5.tar.gz
- Subject digest: bc89c231e296c365f6eacdf93bb041eea3023df1d6bb54c318c4d5bb4ae44e90
- Sigstore transparency entry: 1522951925
- Sigstore integration time: May 13, 2026
Source repository:
- Permalink: lpalbou/AbstractVision@d154ec883fd0f00ba9e1a0fbc3715828e1f78c21
- Branch / Tag: refs/heads/main
- Owner: https://github.com/lpalbou
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@d154ec883fd0f00ba9e1a0fbc3715828e1f78c21
- Trigger Event: workflow_dispatch

File details

Details for the file abstractvision-0.3.5-py3-none-any.whl.

File metadata

Download URL: abstractvision-0.3.5-py3-none-any.whl
Upload date: May 13, 2026
Size: 77.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for abstractvision-0.3.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a972a79ddb32cfeeaeb62b2c25290cc719a5d4b1d42e303a137cca6f332a2b4b`
MD5	`c6dccfad5572ffaf1eb0f5649271590c`
BLAKE2b-256	`7ef92154495596f1b7b72089c3761d315840d3c54e3efdfe015949e4def7d48a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for abstractvision-0.3.5-py3-none-any.whl:

Publisher: release.yml on lpalbou/AbstractVision

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: abstractvision-0.3.5-py3-none-any.whl
- Subject digest: a972a79ddb32cfeeaeb62b2c25290cc719a5d4b1d42e303a137cca6f332a2b4b
- Sigstore transparency entry: 1522951940
- Sigstore integration time: May 13, 2026
Source repository:
- Permalink: lpalbou/AbstractVision@d154ec883fd0f00ba9e1a0fbc3715828e1f78c21
- Branch / Tag: refs/heads/main
- Owner: https://github.com/lpalbou
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@d154ec883fd0f00ba9e1a0fbc3715828e1f78c21
- Trigger Event: workflow_dispatch

abstractvision 0.3.5

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

AbstractVision

What you get

How it fits together (diagram)

Status (current backend support)

Installation

Usage

First local model (Diffusers / cross-platform)

Capability-driven model selection

Backend wiring + generation (artifact outputs)

Interactive testing (CLI / REPL)

Local web playground

Local GGUF via stable-diffusion.cpp

AbstractCore tool integration (artifact refs)

AbstractFramework ecosystem

Project

Requirements

License

Author

Contact

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance