Z-Vision Generator — cross-platform AI image and video generator
Project description
Z-Vision Generator
Local AI image and video generation — hassle-free and fun. No tangled node graphs, no cloud dependencies, just prompts and results. Runs on macOS (Apple Silicon / MLX) and Windows (NVIDIA / CUDA), tuned for an M-series Mac with 32 GB unified memory and an NVIDIA RTX 3080.
Features
- Image generation — text-to-image with Z-Image and FLUX.2 Klein (4B/9B) model families
- Video generation — text-to-video and image-to-video with LTX-2.3 (macOS)
- Cross-platform — automatic backend selection: MLX on macOS, CUDA on Windows
- Prompt system — YAML prompt files with variables, structured prompts, snippets, and batch runs
- Model store — central
~/.ziv/directory with bare-name resolution and HuggingFace fallback - LoRA support — single or stacked, configurable weights, bare-name resolution
- Image upscale — generate small → Lanczos → img2img refine → CAS sharpen
- Video upscale — distilled-only two-stage 2× spatial upscaling
- Reference images — img2img steering from any starting image
- Quantization — 4-bit and 8-bit on both platforms
- Post-processing — contrast, saturation, and CAS sharpening (image only)
- Interactive controls — skip, quit, pause, and repeat during batch runs (image only)
Platform Support
| Platform | Image Generation | Video Generation |
|---|---|---|
| macOS (Apple Silicon) | ✅ Z-Image / FLUX via mflux/MLX | ✅ LTX-2.3 via MLX |
| Windows (NVIDIA GPU) | ✅ Z-Image / FLUX via diffusers/CUDA | ❌ Not supported |
Installation
Requires Python 3.14+ and uv.
uv is required. This package cannot be installed with pip — some dependencies require uv-specific resolution that pip does not support. All commands below use uv.
# Install globally from PyPI
uv tool install z-vision-generator
# Install globally from repository
uv tool install -e git+https://github.com/knuthelge/ZVisionGenerator.git
# Development setup
git clone https://github.com/knuthelge/ZVisionGenerator && cd ZVisionGenerator
uv sync
Video generation requires ffmpeg. On macOS,
ziv-videooffers to install it via Homebrew on first run.
Quick Start
# Generate an image (bare name from ~/.ziv/models/)
ziv-image -m my-model --prompt "a beautiful sunset"
# Generate from a HuggingFace model
ziv-image -m Tongyi-MAI/Z-Image-Turbo --prompt "a cat in a garden"
# Batch run from a prompts file
ziv-image -m my-model -p prompts.yaml -r 3
# Generate a video
ziv-video -m dgrauet/ltx-2.3-mlx-q4 --prompt "A cat walking through a garden"
# Image-to-video
ziv-video -m dgrauet/ltx-2.3-mlx-q4 --image photo.jpg --prompt "Camera zooms in slowly"
Tip:
ziv image,ziv video, andziv modelare also available as subcommands of the unifiedzivparent command.
Documentation
Full documentation is available at knuthelge.github.io/ZVisionGenerator.
- Getting Started — installation, model store, quick start
- Image Guide — aliases, sizes, reference images, LoRA, upscaling, quantization
- Video Guide — T2V, I2V, upscale, audio, LoRA, constraints
- Prompts Guide — prompt files, variables, structured prompts, snippets
- Model & LoRA Guide — checkpoint conversion, LoRA import, asset listing
- CLI Reference — full argument tables for all commands
- Development — setup, testing, architecture
Contributing
Contributions are welcome! See CONTRIBUTING.md for guidelines.
License
This project is licensed under the GNU Affero General Public License v3.0 or later.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file z_vision_generator-0.6.5.tar.gz.
File metadata
- Download URL: z_vision_generator-0.6.5.tar.gz
- Upload date:
- Size: 396.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d62de07791754e418032878d4a5f59fb5a406e980c1f66bed2566bbbbd315def
|
|
| MD5 |
ebc84d396c4fca76eddf10ffe5f0e755
|
|
| BLAKE2b-256 |
179b79402a0da2dacb90a052859e128715bd62ea0c826b59fb80f48142271a49
|
Provenance
The following attestation bundles were made for z_vision_generator-0.6.5.tar.gz:
Publisher:
release.yml on knuthelge/ZVisionGenerator
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
z_vision_generator-0.6.5.tar.gz -
Subject digest:
d62de07791754e418032878d4a5f59fb5a406e980c1f66bed2566bbbbd315def - Sigstore transparency entry: 1319705235
- Sigstore integration time:
-
Permalink:
knuthelge/ZVisionGenerator@70913e93b44ae3a6853d47bf0751bb1ae2dc2ec6 -
Branch / Tag:
refs/tags/v0.6.5 - Owner: https://github.com/knuthelge
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@70913e93b44ae3a6853d47bf0751bb1ae2dc2ec6 -
Trigger Event:
push
-
Statement type:
File details
Details for the file z_vision_generator-0.6.5-py3-none-any.whl.
File metadata
- Download URL: z_vision_generator-0.6.5-py3-none-any.whl
- Upload date:
- Size: 231.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
579f3768a69cc0fb6699779c745a79216a9eeccd3a23e2fa7792a21905c34e60
|
|
| MD5 |
2a02d5e22b94557b5fbd50c6208eda17
|
|
| BLAKE2b-256 |
d8343aa1b9cec2cfa730510337e3c6b83b748496a79234690d51db227b52812d
|
Provenance
The following attestation bundles were made for z_vision_generator-0.6.5-py3-none-any.whl:
Publisher:
release.yml on knuthelge/ZVisionGenerator
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
z_vision_generator-0.6.5-py3-none-any.whl -
Subject digest:
579f3768a69cc0fb6699779c745a79216a9eeccd3a23e2fa7792a21905c34e60 - Sigstore transparency entry: 1319705325
- Sigstore integration time:
-
Permalink:
knuthelge/ZVisionGenerator@70913e93b44ae3a6853d47bf0751bb1ae2dc2ec6 -
Branch / Tag:
refs/tags/v0.6.5 - Owner: https://github.com/knuthelge
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@70913e93b44ae3a6853d47bf0751bb1ae2dc2ec6 -
Trigger Event:
push
-
Statement type: