Skip to main content

Tool plugin for zk-rag to generate images with stable diffusion 3.5 medium

Project description

zk-rag-image-generator

A plugin for zk-rag that enables image generation using Stable Diffusion 3.5 Medium model.

Description

This plugin provides a tool to generate images from textual descriptions using the Stable Diffusion 3.5 Medium model. It can be used to enhance your RAG (Retrieval-Augmented Generation) applications with image generation capabilities.

Features

  • Generate images from textual descriptions
  • Save generated images as PNG files
  • Integration with mojentic LLM tools
  • Comprehensive error handling
  • Configurable through a gateway pattern

Installation

pip install zk-rag-image-generator

Usage

The plugin will be automatically discovered by zk-rag when installed. It provides the GenerateImage tool which can be used to create images from textual descriptions.

Example usage through zk-rag:

from zk_rag import get_tool

image_tool = get_tool("generate_image")
filename = image_tool.run(
    image_description="A serene mountain landscape at sunset with snow-capped peaks and a clear lake reflecting the orange sky",
    base_filename="mountain_sunset"
)
print(f"Image saved as: {filename}")  # Will print: Image saved as: mountain_sunset.png

Requirements

  • Python >= 3.11
  • mojentic (LLM tools framework)
  • torch, torchvision, torchaudio (PyTorch stack)
  • diffusers (Stable Diffusion implementation)
  • transformers (Hugging Face Transformers)
  • accelerate (Hardware acceleration support)
  • safetensors (Model weight handling)
  • Additional dependencies: scipy, protobuf, sentencepiece

For the complete list with specific versions, see requirements.txt.

Local Development

  1. Clone the repository
  2. Install dependencies:
    pip install -r requirements.txt
    pip install -r dev-requirements.txt
    
  3. Install pre-commit hooks (recommended):
    # Create a pre-commit hook that runs pytest
    cat > .git/hooks/pre-commit << 'EOL'
    #!/bin/sh
    
    # Run pytest
    echo "Running pytest..."
    python -m pytest
    
    # Store the exit code
    exit_code=$?
    
    # Exit with pytest's exit code
    exit $exit_code
    EOL
    
    # Make the hook executable
    chmod +x .git/hooks/pre-commit
    

Testing Guidelines

  • Tests are co-located with implementation files
  • Run tests: pytest
  • Linting: flake8 src
  • Code style:
    • Max line length: 127
    • Max complexity: 10
    • Follow numpy docstring style

Testing Best Practices

  • Use pytest for testing, with mocker if you require mocking
  • Do not use unittest or MagicMock directly
  • Use @fixture markers for pytest fixtures
  • Break up fixtures into smaller fixtures if they are too large
  • Separate test phases with a single blank line
  • Each test must fail for only one clear reason
  • Only write mocks for our own gateway classes

License

MIT License

Author

Stacey Vetzal (stacey@vetzal.com)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zk_rag_image_generator-1.0.2.tar.gz (3.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

zk_rag_image_generator-1.0.2-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file zk_rag_image_generator-1.0.2.tar.gz.

File metadata

  • Download URL: zk_rag_image_generator-1.0.2.tar.gz
  • Upload date:
  • Size: 3.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for zk_rag_image_generator-1.0.2.tar.gz
Algorithm Hash digest
SHA256 4a51a04b956953016911c2b49e41cd5b2b1827339e3c06ddbc4990c522eb1d8a
MD5 20a9071cc1ebdf86ed55f188dc5cef09
BLAKE2b-256 1a4cd608b87692341cbba7bbf0799b7356241f66977e8c769ccc9b67cd51e04d

See more details on using hashes here.

Provenance

The following attestation bundles were made for zk_rag_image_generator-1.0.2.tar.gz:

Publisher: python-publish.yml on svetzal/zk-rag-image-generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file zk_rag_image_generator-1.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for zk_rag_image_generator-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 1d5036ceb7fb86f8925b2c6a96efa01d0afff827f927644977ac6735e7aaaf12
MD5 2ed8f0a76fd5dfb802e530517b5afa3c
BLAKE2b-256 cb0f057f743c04616a4cf05b6f1e0ae005c4756a22cff4f16a22acdc6cceb14a

See more details on using hashes here.

Provenance

The following attestation bundles were made for zk_rag_image_generator-1.0.2-py3-none-any.whl:

Publisher: python-publish.yml on svetzal/zk-rag-image-generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page