Composable helpers for OpenAI SDK agents, prompts, and storage

These details have not been verified by PyPI

Project description

openai-sdk-helpers

Shared primitives for composing OpenAI workflows: high-level agent abstractions (via openai-agents SDK) and low-level response handling (via openai SDK), plus structures, prompt rendering, and reusable utilities.

Installation • Quickstart • Features • Documentation • Contributing

Overview
Features
Requirements
Installation
Quickstart
- Text utilities
- Centralized OpenAI configuration
Advanced Usage
Development
Project Structure
Key modules
Contributing
License
Troubleshooting

Overview

openai-sdk-helpers packages the common building blocks required to assemble agent-driven applications. The library intentionally focuses on reusable primitives—data structures, configuration helpers, and orchestration utilities—while leaving application-specific prompts and tools to the consuming project.

Important: This library integrates with two distinct OpenAI SDKs:

openai-agents - Used by the agent module for high-level agent workflows with automatic tool handling
openai - Used by the response module for direct API interactions with fine-grained control over responses

The agent module provides a higher-level abstraction for building agents, while the response module offers lower-level control for custom response handling workflows.

Key Features

Agent Module (Built on `openai-agents` SDK)

Agent wrappers with synchronous and asynchronous entry points
Prompt rendering powered by Jinja2 for dynamic agent instructions
Vector and web search flows that coordinate planning, execution, and reporting with built-in concurrency control
Reusable text agents for common tasks:
- SummarizerAgent: Generate concise summaries from provided text
- TranslatorAgent: Translate text into target languages
- ValidatorAgent: Check inputs and outputs against safety guardrails
- TaxonomyClassifierAgent: Classify text into taxonomy-driven labels

Response Module (Built on `openai` SDK)

Response handling utilities for direct API control with fine-grained message management
Tool execution framework with custom handlers and structured outputs
Session persistence for saving and restoring conversation history

Infrastructure & Utilities

Centralized logger factory for consistent application logging
Retry patterns with exponential backoff and jitter
Output validation framework with JSON schema, semantic, and length validators
CLI tool for testing agents, validating templates, and inspecting registries
Deprecation utilities for managing API changes

Shared Components

Typed structures using Pydantic for prompts, responses, and search workflows to ensure predictable inputs and outputs
OpenAI configuration management with environment variable and .env file support
Vector storage abstraction for seamless integration with OpenAI vector stores
Type-safe interfaces with full type hints and py.typed marker for external projects
- ValidatorAgent: Check inputs and outputs against safety guardrails

Response Module (Built on `openai` SDK)

Response handling utilities for direct API control with fine-grained message management
Tool execution framework with custom handlers and structured outputs
Session persistence for saving and restoring conversation history

Shared Components

Typed structures using Pydantic for prompts, responses, and search workflows to ensure predictable inputs and outputs
OpenAI configuration management with environment variable and .env file support
Vector storage abstraction for seamless integration with OpenAI vector stores
Type-safe interfaces with full type hints and py.typed marker for external projects

Requirements

Python 3.10 or higher
OpenAI API key (set via OPENAI_API_KEY environment variable)

Note: This package depends on both:

openai - The standard OpenAI Python SDK
openai-agents - The OpenAI Agents SDK for high-level agent workflows

Both are automatically installed when you install openai-sdk-helpers.

Installation

Install the package directly from PyPI:

pip install openai-sdk-helpers

The package ships with type information via py.typed, enabling full type checking support in your IDE and with tools like mypy and pyright.

Development Installation

For local development with editable sources and optional dev dependencies:

# Clone the repository
git clone https://github.com/fatmambot33/openai-sdk-helpers.git
cd openai-sdk-helpers

# Install in editable mode with dev dependencies
pip install -e .[dev]

The dev dependencies include:

pydocstyle - Docstring style checking
pyright - Static type checking
black - Code formatting
pytest and pytest-cov - Testing and coverage

Quickstart

Basic Vector Search

Create a vector search workflow by wiring your own prompt templates and preferred model configuration. This example uses the agent module built on openai-agents SDK:

from pathlib import Path
from openai_sdk_helpers.agent.search.vector import VectorSearch

# Point to your custom prompt directory
prompts = Path("./prompts")

# Create and configure the vector search agent
vector_search = VectorSearch(
    prompt_dir=prompts,
    default_model="gpt-4o-mini"
)

# Run a synchronous search query
report = vector_search.run_agent_sync("Explain quantum entanglement for beginners")
print(report.report)

Note: The vector search workflow ships with default prompt templates (vector_planner.jinja, vector_search.jinja, and vector_writer.jinja). You only need to pass a prompt_dir when you want to override them; if the directory you supply is missing any of the templates, agent construction will fail with a FileNotFoundError.

Text utilities

The built-in text helpers provide lightweight single-step agents for common tasks. These use the agent module built on openai-agents SDK:

from openai_sdk_helpers.agent import (
    SummarizerAgent,
    TaxonomyClassifierAgent,
    TranslatorAgent,
    ValidatorAgent,
)
from openai_sdk_helpers.structure import TaxonomyNode

# Initialize agents with a default model
summarizer = SummarizerAgent(default_model="gpt-4o-mini")
classifier = TaxonomyClassifierAgent(
    model="gpt-4o-mini",
    taxonomy=[TaxonomyNode(label="Billing"), TaxonomyNode(label="Support")],
)
translator = TranslatorAgent(default_model="gpt-4o-mini")
validator = ValidatorAgent(default_model="gpt-4o-mini")

# Generate a summary
summary = summarizer.run_sync("Long-form content to condense...")
print(summary.text)

# Translate text
translation = translator.run_sync("Bonjour", target_language="English")
print(translation)

# Classify text against a taxonomy
classification = classifier.run_sync("I need help with my invoice")
print(classification.final_node)

# Equivalent Responses API classification (no app-side parsing glue)
from openai_sdk_helpers.response import classify_taxonomy_response

response_classification = classify_taxonomy_response(
    content="I need help with my invoice",
    taxonomy=[TaxonomyNode(label="Billing"), TaxonomyNode(label="Support")],
    model="gpt-4o-mini",
    return_summary=True,
)
print(response_classification.full_paths)

# Validate against guardrails
validation = validator.run_sync(
    "Share meeting notes with names removed",
    agent_output=summary.text
)
print(validation.is_safe)

Important: These text helpers ship with default prompt templates under src/openai_sdk_helpers/prompt, so you do not need to create placeholder files when installing from PyPI. Only pass a prompt_dir when you have custom templates you want to use instead.

Centralized OpenAI configuration

OpenAISettings provides a centralized way to manage OpenAI SDK configuration across your application:

from openai_sdk_helpers import OpenAISettings

# Load from environment variables or a local .env file
settings = OpenAISettings.from_env()

# Create an OpenAI client with loaded settings
client = settings.create_client()

# Reuse the default model across agents
from openai_sdk_helpers.agent.search.vector import VectorSearch

vector_search = VectorSearch(
    prompt_dir=prompts,
    default_model=settings.default_model or "gpt-4o-mini"
)

Supported Environment Variables:

OPENAI_API_KEY (required) - Your OpenAI API key
OPENAI_ORG_ID - Organization identifier
OPENAI_PROJECT_ID - Project identifier for billing
OPENAI_BASE_URL - Custom base URL for self-hosted deployments
OPENAI_MODEL - Default model name
OPENAI_TIMEOUT - Request timeout in seconds
OPENAI_MAX_RETRIES - Maximum retry attempts

Pass uncommon OpenAI client keyword arguments (such as default_headers, http_client, or custom base_url proxies) through extra_client_kwargs when instantiating OpenAISettings.

Direct Response Control (Response Module)

For more fine-grained control over API interactions, use the response module built on the standard openai SDK. This gives you direct access to message history, tool handlers, and custom response parsing:

from openai_sdk_helpers.response import ResponseBase
from openai_sdk_helpers import OpenAISettings

# Configure OpenAI settings
settings = OpenAISettings.from_env()

# Create a response handler with custom instructions
response = ResponseBase(
    instructions="You are a helpful code review assistant.",
    tools=None,  # Or provide custom tool definitions
    output_structure=None,  # Or a Pydantic model for structured output
    tool_handlers={},  # Map tool names to handler functions
    openai_settings=settings
)

# Execute and get a response
result = response.run_sync("Review this Python code for best practices...")
print(result)

# Clean up
response.close()

Key Differences:

Agent Module: Higher-level abstraction with automatic tool handling and agent-specific workflows
Response Module: Lower-level control with manual message management, custom tool handlers, and direct API access

ResponseConfiguration Tool Validation (Strict)

ResponseConfiguration now validates tool configuration eagerly at construction time:

tools must be a non-string sequence of mapping objects (for example [{"type": "web_search"}]).
String-like containers such as "abc", b"abc", and bytearray(b"abc") are invalid.
Each item in tools must be a mapping (dict-like). Invalid item types raise TypeError immediately.
When add_web_search_tool=True, gen_response() appends the OpenAI Responses API tool definition {"type": "web_search"}.

Now invalid (fails fast during configuration initialization):

from openai_sdk_helpers.response.configuration import ResponseConfiguration

# ❌ Invalid: string containers are not tool sequences
ResponseConfiguration(
    name="bad_tools",
    instructions="Use tools",
    tools="abc",
    input_structure=None,
    output_structure=None,
)

Advanced Usage

Image and File Analysis

The response module automatically detects file types and handles them appropriately:

from openai_sdk_helpers.response import ResponseBase
from openai_sdk_helpers import OpenAISettings

settings = OpenAISettings.from_env()

with ResponseBase(
    name="analyzer",
    instructions="You are a helpful assistant that can analyze files.",
    tools=None,
    output_structure=None,
    tool_handlers={},
    openai_settings=settings,
) as response:
    # Automatic type detection - single files parameter
    # Images are sent as base64-encoded images
    # PDF documents are sent as base64-encoded file data
    result = response.run_sync(
        "Analyze these files",
        files=["photo.jpg", "document.pdf"]
    )
    print(result)
    
    # Single file - automatically detected
    result = response.run_sync(
        "What's in this image?",
        files="photo.jpg"  # Automatically detected as image
    )
    print(result)
    
    # Use vector store for RAG (Retrieval-Augmented Generation)
    result = response.run_sync(
        "Search these documents",
        files=["doc1.pdf", "doc2.pdf"],
        use_vector_store=True  # Enable RAG with vector stores
    )
    print(result)

How It Works:

Images (jpg, png, gif, etc.) are automatically sent as base64-encoded images
Documents are sent as base64-encoded file data by default for PDFs only
Non-PDF documents should use use_vector_store=True (or be converted to PDF)
Vector Stores can optionally be used for documents when use_vector_store=True
Batch Processing is automatically used for multiple files (>3) for efficient encoding

Advanced File Processing:

from openai_sdk_helpers.response import process_files

# Process files directly with the dedicated module
vector_files, base64_files, images = process_files(
    response,
    files=["photo1.jpg", "photo2.jpg", "doc1.pdf", "doc2.pdf"],
    use_vector_store=False,
    batch_size=20,      # Files per batch
    max_workers=10,     # Concurrent workers
)

Base64 Encoding Utilities:

from openai_sdk_helpers.utils import (
    encode_image,
    encode_file,
    is_image_file,
    create_image_data_url,
    create_file_data_url,
)

# Check if a file is an image
is_image_file("photo.jpg")  # True
is_image_file("document.pdf")  # False

# Encode an image to base64
base64_image = encode_image("photo.jpg")

# Create a data URL for an image
image_url, detail = create_image_data_url("photo.jpg", detail="high")

# Encode a file to base64
base64_file = encode_file("document.pdf")

# Create a data URL for a file
file_data = create_file_data_url("document.pdf")

Custom Prompt Templates

Create custom Jinja2 templates for specialized agent behaviors:

from pathlib import Path
from openai_sdk_helpers.agent import SummarizerAgent

# Use custom prompt templates
custom_prompts = Path("./my_prompts")
summarizer = SummarizerAgent(
    prompt_dir=custom_prompts,
    default_model="gpt-4o"
)

Asynchronous Execution

All agents support both synchronous and asynchronous execution:

import asyncio
from openai_sdk_helpers.agent import SummarizerAgent

async def main():
    summarizer = SummarizerAgent(default_model="gpt-4o-mini")
    
    # Run asynchronously
    result = await summarizer.run_agent(
        text="Long document to summarize...",
        metadata={"source": "example.pdf"}
    )
    print(result.text)

asyncio.run(main())

Vector Storage Integration

Integrate with OpenAI vector stores for document search:

from openai_sdk_helpers.vector_storage import VectorStorage
from openai_sdk_helpers.agent.search.vector import VectorSearch

# Create or connect to a vector store
storage = VectorStorage(store_name="my_documents")

# Use it with vector search
vector_search = VectorSearch(
    default_model="gpt-4o-mini",
    vector_storage=storage
)

Development

The repository follows standard Python development practices with comprehensive quality checks.

Setting Up Your Development Environment

# Clone and install with dev dependencies
git clone https://github.com/fatmambot33/openai-sdk-helpers.git
cd openai-sdk-helpers
pip install -e .[dev]

Running Quality Checks

Before opening a pull request, ensure all quality checks pass locally:

# Style and docstring checks
pydocstyle src

# Code formatting check
black --check .

# Apply formatting (if needed)
black .

# Static type checking
pyright src

# Unit tests with coverage (minimum 70% required)
pytest -q --cov=src --cov-report=term-missing --cov-fail-under=70

All checks must pass for changes to be merged.

Project Structure

src/openai_sdk_helpers/
├── agent/              # Agent factories, orchestration, and search workflows
│   ├── base.py        # Base agent class with sync/async execution
│   ├── summarizer.py  # Text summarization agent
│   ├── translator.py  # Translation agent
│   ├── validation.py  # Input/output validation agent
│   ├── vector_search.py  # Multi-agent vector search workflow
│   └── coordinator_agent.py  # Coordinated multi-step workflows
├── prompt/             # Jinja2 template rendering utilities
├── response/           # Response parsing and transformation helpers
├── structure/          # Pydantic-based typed data structures
│   ├── base.py        # Base structure with schema generation
│   ├── plan/          # Task and plan structures
│   ├── summary.py     # Summary output structures
│   └── validation.py  # Validation result structures
├── vector_storage/     # Vector store abstraction layer
├── configuration.py          # OpenAI settings and configuration
└── utils/             # JSON serialization, logging, and helpers

tests/                  # Comprehensive unit test suite

Key Modules

The package is organized around cohesive, reusable building blocks:

Agent Modules (Built on `openai-agents` SDK)

These modules use the openai-agents SDK for high-level agent workflows with automatic tool handling and conversation management.

openai_sdk_helpers.agent.base.AgentBase
Base class for all agents with synchronous and asynchronous execution support. Handles prompt rendering, model configuration, and tool integration.
openai_sdk_helpers.agent.search.vector.VectorSearch
Complete vector search workflow that coordinates planning, searching, and reporting. Bundles VectorSearchPlanner, VectorSearchTool, and VectorSearchWriter into a single entry point.
openai_sdk_helpers.agent.coordinator_agent.ProjectManager
Orchestrates multi-step workflows with prompt creation, plan building, task execution, and summarization. Persists intermediate artifacts to disk.
openai_sdk_helpers.agent.summarizer.SummarizerAgent
Generates concise summaries from provided text using structured output.
openai_sdk_helpers.agent.translator.TranslatorAgent
Translates text into target languages with optional context.
openai_sdk_helpers.agent.validation.ValidatorAgent
Validates user inputs and agent outputs against safety guardrails.

Response Module (Built on `openai` SDK)

These modules use the standard openai SDK for direct API interactions with fine-grained control over request/response cycles.

openai_sdk_helpers.response.base.ResponseBase
Manages complete OpenAI API interaction lifecycle including input construction, tool execution, message history, and structured output parsing. Uses the client.responses.create() API (from the OpenAI Responses API, distinct from the Chat Completions API) for direct control over conversation flow.
openai_sdk_helpers.response.runner
Convenience functions for executing response workflows with automatic cleanup in both synchronous and asynchronous contexts.

Configuration and Data Structures (Shared)

openai_sdk_helpers.settings.OpenAISettings
Centralizes OpenAI API configuration with environment variable support. Creates configured OpenAI clients with consistent settings.
openai_sdk_helpers.structure.StructureBase
Pydantic-based foundation for all structured outputs. Provides JSON schema generation, validation, and serialization helpers.
openai_sdk_helpers.structure.plan
Task and plan structures for multi-step workflows with status tracking.

Utilities (Shared)

openai_sdk_helpers.prompt.PromptRenderer
Jinja2-based template rendering for dynamic prompt generation.
openai_sdk_helpers.vector_storage.VectorStorage
Minimal abstraction over OpenAI vector stores with search and file management.
openai_sdk_helpers.utils
JSON serialization helpers, logging utilities, and common validation functions.
openai_sdk_helpers.utils.langextract
Adapter helpers for running LangExtract-style extractors and validating the results into Pydantic models.

Related Projects

LangExtract
Google-maintained toolkit for extracting structured data from language model outputs, which can complement the validation and response utilities in openai-sdk-helpers.

Contributing

Contributions are welcome! We appreciate functional changes accompanied by relevant tests and documentation.

Guidelines

Fork and clone the repository
Create a feature branch from main
Make your changes following the project conventions
Add or update tests to cover your changes
Run all quality checks (see Development)
Submit a pull request with a clear description

Code Style

Follow PEP 8 for Python code formatting
Use NumPy-style docstrings as outlined in AGENTS.md
Write clear, descriptive commit messages in the imperative mood
Keep implementations Pythonic and maintainable

Documentation

Document all public classes, methods, and functions
Include a Methods section in class docstrings listing public methods
Add type hints to all function signatures
Provide examples in docstrings where helpful

Testing

Write unit tests for new functionality
Maintain minimum 70% code coverage
Ensure all tests pass before submitting

See AGENTS.md for detailed contributing guidelines and conventions.

License

This project is licensed under the MIT License - see the LICENSE file for details.

CLI Tool

The package includes a command-line tool for development and testing:

# List all registered response configurations
openai-helpers registry list

# Inspect a specific configuration
openai-helpers registry inspect my_config

# Validate Jinja2 templates
openai-helpers template validate ./templates

# Test an agent (coming soon)
openai-helpers agent test MyAgent --input "test input"

CLI Commands

registry list - Show all registered response configurations
registry inspect - Display details of a configuration
template validate - Check template syntax and structure
agent test - Test agents locally with sample inputs

Troubleshooting

Common Issues

"OPENAI_API_KEY is required" error

Ensure your OpenAI API key is set in the environment:

export OPENAI_API_KEY="your-api-key-here"

Or create a .env file in your project root:

OPENAI_API_KEY=your-api-key-here

"Prompt template not found" error

Vector search workflows require custom prompt templates. Either:

Create the required .jinja files in your prompt_dir
Omit the prompt_dir parameter to use built-in defaults (for text agents only)
Use the CLI to validate templates: openai-helpers template validate ./templates

Import errors after installation

Ensure you're using Python 3.10 or higher:

python --version

If using an older version, upgrade Python or create a virtual environment with Python 3.10+.

Type checking issues in your IDE

The package ships with py.typed for full type support. If your IDE isn't recognizing types:

Ensure your IDE's Python plugin is up to date
Restart your IDE after installing the package
Check that your IDE is using the correct Python interpreter

Getting Help

Check the Key Modules section for API documentation
Review examples in the Quickstart and Advanced Usage sections
Open an issue on GitHub for bugs or feature requests

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.4

Apr 14, 2026

This version

0.7.3

Feb 12, 2026

0.7.2

Feb 12, 2026

0.7.1

Feb 4, 2026

0.7.0

Feb 3, 2026

0.6.6

Feb 3, 2026

0.6.5

Feb 3, 2026

0.6.4

Feb 2, 2026

0.6.2

Jan 30, 2026

0.6.1

Jan 29, 2026

0.6.0

Jan 28, 2026

0.5.2

Jan 26, 2026

0.5.1

Jan 26, 2026

0.5.0

Jan 26, 2026

0.4.3

Jan 13, 2026

0.4.2

Jan 13, 2026

0.4.1

Jan 13, 2026

0.4.0

Jan 12, 2026

0.3.0

Jan 7, 2026

0.2.0

Jan 6, 2026

0.1.4

Jan 6, 2026

0.1.2

Jan 5, 2026

0.1.1

Jan 5, 2026

0.1.0

Jan 4, 2026

0.0.9

Jan 3, 2026

0.0.8

Dec 31, 2025

0.0.7

Dec 30, 2025

0.0.6

Dec 29, 2025

0.0.5

Dec 29, 2025

0.0.4

Dec 29, 2025

0.0.3

Dec 29, 2025

0.0.2

Dec 26, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openai_sdk_helpers-0.7.3.tar.gz (150.2 kB view details)

Uploaded Feb 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openai_sdk_helpers-0.7.3-py3-none-any.whl (198.0 kB view details)

Uploaded Feb 12, 2026 Python 3

File details

Details for the file openai_sdk_helpers-0.7.3.tar.gz.

File metadata

Download URL: openai_sdk_helpers-0.7.3.tar.gz
Upload date: Feb 12, 2026
Size: 150.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openai_sdk_helpers-0.7.3.tar.gz
Algorithm	Hash digest
SHA256	`3b72822dcb4e7d864497ee1b7aa999f6fdabfba80cb3e85e0cbac5e5b17148f7`
MD5	`05352f53d5572b0384429a311f1c4be9`
BLAKE2b-256	`1fd13e784979f75ab415520454849cc797c479b1a57c5790db82da18fa863969`

See more details on using hashes here.

File details

Details for the file openai_sdk_helpers-0.7.3-py3-none-any.whl.

File metadata

Download URL: openai_sdk_helpers-0.7.3-py3-none-any.whl
Upload date: Feb 12, 2026
Size: 198.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for openai_sdk_helpers-0.7.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0211028b955bd591601c3caba65c1967d9772b9adc0b38fcd8a83b8de983f162`
MD5	`0a9154af8f1d639ad337e46b0fceaa8d`
BLAKE2b-256	`cede19be8cc72192c8312c624a9339e7e18e56cef5d83bdf8ad62ae1980e5bba`

See more details on using hashes here.

openai-sdk-helpers 0.7.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

openai-sdk-helpers

Table of Contents

Overview

Key Features

Agent Module (Built on openai-agents SDK)

Response Module (Built on openai SDK)

Infrastructure & Utilities

Shared Components

Response Module (Built on openai SDK)

Shared Components

Requirements

Installation

Development Installation

Quickstart

Basic Vector Search

Text utilities

Centralized OpenAI configuration

Direct Response Control (Response Module)

ResponseConfiguration Tool Validation (Strict)

Advanced Usage

Image and File Analysis

Custom Prompt Templates

Asynchronous Execution

Vector Storage Integration

Development

Setting Up Your Development Environment

Running Quality Checks

Project Structure

Key Modules

Agent Modules (Built on openai-agents SDK)

Response Module (Built on openai SDK)

Configuration and Data Structures (Shared)

Utilities (Shared)

Related Projects

Contributing

Guidelines

Code Style

Documentation

Testing

License

CLI Tool

CLI Commands

Troubleshooting

Common Issues

Getting Help

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Agent Module (Built on `openai-agents` SDK)

Response Module (Built on `openai` SDK)

Response Module (Built on `openai` SDK)

Agent Modules (Built on `openai-agents` SDK)

Response Module (Built on `openai` SDK)