Skip to main content

Tool plugin for zk-rag to generate images with stable diffusion 3.5 medium

Project description

zk-rag-image-generator

A plugin for zk-rag that enables image generation using Stable Diffusion 3.5 Medium model.

Description

This plugin provides a tool to generate images from textual descriptions using the Stable Diffusion 3.5 Medium model. It can be used to enhance your RAG (Retrieval-Augmented Generation) applications with image generation capabilities.

Features

  • Generate images from textual descriptions
  • Save generated images as PNG files
  • Integration with mojentic LLM tools
  • Comprehensive error handling
  • Configurable through a gateway pattern

Installation

pip install zk-rag-image-generator

Usage

The plugin will be automatically discovered by zk-rag when installed. It provides the GenerateImage tool which can be used to create images from textual descriptions.

Example usage through zk-rag:

from zk_rag import get_tool

image_tool = get_tool("generate_image")
filename = image_tool.run(
    image_description="A serene mountain landscape at sunset with snow-capped peaks and a clear lake reflecting the orange sky",
    base_filename="mountain_sunset"
)
print(f"Image saved as: {filename}")  # Will print: Image saved as: mountain_sunset.png

Requirements

  • Python >= 3.11
  • mojentic (LLM tools framework)
  • torch, torchvision, torchaudio (PyTorch stack)
  • diffusers (Stable Diffusion implementation)
  • transformers (Hugging Face Transformers)
  • accelerate (Hardware acceleration support)
  • safetensors (Model weight handling)
  • Additional dependencies: scipy, protobuf, sentencepiece

For the complete list with specific versions, see requirements.txt.

Local Development

  1. Clone the repository
  2. Install dependencies:
    pip install -r requirements.txt
    pip install -r dev-requirements.txt
    
  3. Install pre-commit hooks (recommended):
    # Create a pre-commit hook that runs pytest
    cat > .git/hooks/pre-commit << 'EOL'
    #!/bin/sh
    
    # Run pytest
    echo "Running pytest..."
    python -m pytest
    
    # Store the exit code
    exit_code=$?
    
    # Exit with pytest's exit code
    exit $exit_code
    EOL
    
    # Make the hook executable
    chmod +x .git/hooks/pre-commit
    

Testing Guidelines

  • Tests are co-located with implementation files
  • Run tests: pytest
  • Linting: flake8 src
  • Code style:
    • Max line length: 127
    • Max complexity: 10
    • Follow numpy docstring style

Testing Best Practices

  • Use pytest for testing, with mocker if you require mocking
  • Do not use unittest or MagicMock directly
  • Use @fixture markers for pytest fixtures
  • Break up fixtures into smaller fixtures if they are too large
  • Separate test phases with a single blank line
  • Each test must fail for only one clear reason
  • Only write mocks for our own gateway classes

License

MIT License

Author

Stacey Vetzal (stacey@vetzal.com)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zk_rag_image_generator-1.1.0.tar.gz (4.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

zk_rag_image_generator-1.1.0-py3-none-any.whl (4.6 kB view details)

Uploaded Python 3

File details

Details for the file zk_rag_image_generator-1.1.0.tar.gz.

File metadata

  • Download URL: zk_rag_image_generator-1.1.0.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for zk_rag_image_generator-1.1.0.tar.gz
Algorithm Hash digest
SHA256 6f1f590062aa17a816ef744de96e6a2cfe4d04d00a37a2090e47ed8354f35fd1
MD5 f50dc23d37da237bd41ffed5fc013c55
BLAKE2b-256 88f1c522dbfe1573f072feeb7c0ecf6e4857ef34181ed3bb2972bf57a0494b71

See more details on using hashes here.

Provenance

The following attestation bundles were made for zk_rag_image_generator-1.1.0.tar.gz:

Publisher: python-publish.yml on svetzal/zk-rag-image-generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file zk_rag_image_generator-1.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for zk_rag_image_generator-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2d2236fcdca681fba71db51d8e51e25b4c7687edbe42813896134c02f68faf73
MD5 62ff6c2f6fd30df46adaa5ae2e608a5d
BLAKE2b-256 07c213bc3088524f2efda5624d4da5b54125277ee0445da83b9de0b6a196122f

See more details on using hashes here.

Provenance

The following attestation bundles were made for zk_rag_image_generator-1.1.0-py3-none-any.whl:

Publisher: python-publish.yml on svetzal/zk-rag-image-generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page