Skip to main content

Tool plugin for zk-rag to generate images with stable diffusion 3.5 medium

Project description

zk-rag-image-generator

A plugin for zk-rag that enables image generation using Stable Diffusion 3.5 Medium model.

Description

This plugin provides a tool to generate images from textual descriptions using the Stable Diffusion 3.5 Medium model. It can be used to enhance your RAG (Retrieval-Augmented Generation) applications with image generation capabilities.

Features

  • Generate images from textual descriptions
  • Save generated images as PNG files
  • Integration with mojentic LLM tools
  • Comprehensive error handling
  • Configurable through a gateway pattern

Installation

pip install zk-rag-image-generator

Usage

The plugin will be automatically discovered by zk-rag when installed. It provides the GenerateImage tool which can be used to create images from textual descriptions.

Example usage through zk-rag:

from zk_rag import get_tool

image_tool = get_tool("generate_image")
filename = image_tool.run(
    image_description="A serene mountain landscape at sunset with snow-capped peaks and a clear lake reflecting the orange sky",
    base_filename="mountain_sunset"
)
print(f"Image saved as: {filename}")  # Will print: Image saved as: mountain_sunset.png

Requirements

  • Python >= 3.11
  • mojentic (LLM tools framework)
  • torch, torchvision, torchaudio (PyTorch stack)
  • diffusers (Stable Diffusion implementation)
  • transformers (Hugging Face Transformers)
  • accelerate (Hardware acceleration support)
  • safetensors (Model weight handling)
  • Additional dependencies: scipy, protobuf, sentencepiece

For the complete list with specific versions, see requirements.txt.

Local Development

  1. Clone the repository
  2. Install dependencies:
    pip install -r requirements.txt
    pip install -r dev-requirements.txt
    
  3. Install pre-commit hooks (recommended):
    # Create a pre-commit hook that runs pytest
    cat > .git/hooks/pre-commit << 'EOL'
    #!/bin/sh
    
    # Run pytest
    echo "Running pytest..."
    python -m pytest
    
    # Store the exit code
    exit_code=$?
    
    # Exit with pytest's exit code
    exit $exit_code
    EOL
    
    # Make the hook executable
    chmod +x .git/hooks/pre-commit
    

Testing Guidelines

  • Tests are co-located with implementation files
  • Run tests: pytest
  • Linting: flake8 src
  • Code style:
    • Max line length: 127
    • Max complexity: 10
    • Follow numpy docstring style

Testing Best Practices

  • Use pytest for testing, with mocker if you require mocking
  • Do not use unittest or MagicMock directly
  • Use @fixture markers for pytest fixtures
  • Break up fixtures into smaller fixtures if they are too large
  • Separate test phases with a single blank line
  • Each test must fail for only one clear reason
  • Only write mocks for our own gateway classes

License

MIT License

Author

Stacey Vetzal (stacey@vetzal.com)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zk_rag_image_generator-1.0.5.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

zk_rag_image_generator-1.0.5-py3-none-any.whl (4.6 kB view details)

Uploaded Python 3

File details

Details for the file zk_rag_image_generator-1.0.5.tar.gz.

File metadata

  • Download URL: zk_rag_image_generator-1.0.5.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for zk_rag_image_generator-1.0.5.tar.gz
Algorithm Hash digest
SHA256 fe5a34c013c22a707c787fb40311efc1c824b6b5240caf31418eacebb846f725
MD5 b3e01c17f6033011508b30070e14d03d
BLAKE2b-256 840b49debb171753b3fd2a998bdd91e00070440e708a39c1059b28350daa7d21

See more details on using hashes here.

Provenance

The following attestation bundles were made for zk_rag_image_generator-1.0.5.tar.gz:

Publisher: python-publish.yml on svetzal/zk-rag-image-generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file zk_rag_image_generator-1.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for zk_rag_image_generator-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 f97711a3290c51233eb52f446ff0b892561b5664ad870e37e55436541b9ff35e
MD5 982f051547ce61a31f5649cee696277b
BLAKE2b-256 b50c2de50e086b4fd769a3c106eb809dca3178065afd32a77beaec630e6f975f

See more details on using hashes here.

Provenance

The following attestation bundles were made for zk_rag_image_generator-1.0.5-py3-none-any.whl:

Publisher: python-publish.yml on svetzal/zk-rag-image-generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page