SDK for tracking and testing OpenAI API calls

These details have not been verified by PyPI

Project links

Project description

Playgent

Playgent is a Python SDK for tracking, testing, and evaluating OpenAI API calls. It provides seamless integration with OpenAI's Python client, allowing you to record API interactions, replay them for testing, and evaluate model outputs systematically.

Features

Session Tracking: Automatically track and record OpenAI API calls within sessions
Replay Testing: Replay recorded API calls to test different scenarios
Evaluation Framework: Evaluate model outputs against expected results
Drop-in Replacement: Works as a drop-in replacement for the OpenAI client
Async Support: Full support for both synchronous and asynchronous operations
Decorator Pattern: Simple @record decorator for automatic session management

Installation

Install Playgent using pip:

pip install playgent

Or using uv:

uv add playgent

Quick Start

Basic Usage

from playgent import init, create_session, record

# Initialize Playgent with your configuration
init(api_key="your-playgent-api-key", server_url="https://your-server.com")

# Create a session
session_id = create_session(
    person_id="user-123",
    endpoint="chat-endpoint",
    metadata={"purpose": "testing"}
)

# Your OpenAI code here - all calls will be tracked automatically

Using the Drop-in OpenAI Client

Replace your OpenAI import with Playgent's version:

# Instead of: from openai import OpenAI
from playgent.openai import OpenAI

client = OpenAI(api_key="your-openai-api-key")

# Use as normal - all calls are automatically tracked
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)

Using the @record Decorator

The simplest way to track OpenAI calls:

from playgent import record
from openai import OpenAI

@record(
    person_id="user-123",
    endpoint="my-endpoint"
)
def my_function():
    client = OpenAI()
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": "Hello!"}]
    )
    return response

# All OpenAI calls within this function are automatically tracked
result = my_function()

Async Support

from playgent import record
from openai import AsyncOpenAI
import asyncio

@record(
    person_id="user-123",
    endpoint="async-endpoint"
)
async def my_async_function():
    client = AsyncOpenAI()
    response = await client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": "Hello!"}]
    )
    return response

# Run async function
asyncio.run(my_async_function())

Replay Testing

from playgent import replay_test, get_session_events

# Get events from a session
events = get_session_events(session_id="your-session-id")

# Create a test case
test_case = {
    "name": "Test Chat Response",
    "session_id": "your-session-id",
    "events": events
}

# Replay the test
result = replay_test(test_case)
print(f"Test passed: {result['passed']}")

Evaluation

from playgent import evaluate

# Evaluate model outputs
evaluation = evaluate(
    session_id="your-session-id",
    criteria={
        "accuracy": lambda output: len(output) > 10,
        "relevance": lambda output: "expected" in output.lower()
    }
)

print(f"Evaluation results: {evaluation}")

Configuration

Playgent can be configured through environment variables or programmatically:

Environment Variables

export PLAYGENT_API_KEY="your-api-key"
export PLAYGENT_SERVER_URL="https://your-server.com"
export PLAYGENT_PERSON_ID="default-user"
export PLAYGENT_ENDPOINT="default-endpoint"

Programmatic Configuration

from playgent import init

init(
    api_key="your-api-key",
    server_url="https://your-server.com",
    batch_size=100,  # Number of events to batch before sending
)

Advanced Usage

Session Management

from playgent import create_session, get_session, session

# Create a session with metadata
session_id = create_session(
    person_id="user-123",
    endpoint="advanced-endpoint",
    metadata={
        "environment": "production",
        "version": "1.0.0",
        "feature_flags": ["new_model", "streaming"]
    }
)

# Get session details
session_info = get_session(session_id)
print(f"Session: {session_info}")

# Use session context manager
with session(person_id="user-123", endpoint="context-endpoint"):
    # All OpenAI calls in this block are tracked
    pass

Custom Event Tracking

from playgent import record
from playgent.types import EndpointEvent

# Track custom events alongside OpenAI calls
@record(person_id="user-123", endpoint="custom-endpoint")
def process_with_custom_events():
    # Your OpenAI calls here

    # Create custom event
    event = EndpointEvent(
        type="custom_metric",
        data={"metric": "response_time", "value": 0.125}
    )

    # Event will be included in session
    return result

API Reference

Core Functions

init(api_key, server_url, batch_size): Initialize Playgent
create_session(person_id, endpoint, metadata): Create a new tracking session
get_session(session_id): Get session details
get_session_events(session_id): Get all events from a session
replay_test(test_case): Replay a test case
evaluate(session_id, criteria): Evaluate session outputs
reset(): Reset Playgent state

Decorators

@record(person_id, endpoint, metadata): Decorator for automatic session management

Types

Session: Session data model
EndpointEvent: Event data model
TestCase: Test case definition
EvaluationResult: Evaluation result data

Development

Setting Up Development Environment

# Clone the repository
git clone https://github.com/yourusername/playgent-py.git
cd playgent-py

# Install development dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Format code
black src/
ruff check src/

# Type checking
mypy src/

Building and Publishing

# Build the package
python -m build

# Test on TestPyPI first
python -m twine upload --repository testpypi dist/*

# Publish to PyPI
python -m twine upload dist/*

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

For support, please open an issue on the GitHub repository.

Acknowledgments

Built for seamless integration with OpenAI's Python client
Inspired by modern testing and observability practices
Designed for production-grade API tracking and testing

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.5

Nov 19, 2025

0.1.4

Nov 15, 2025

0.1.3

Nov 12, 2025

0.1.2

Nov 12, 2025

0.1.1

Oct 23, 2025

This version

0.1.0

Oct 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

playgent-0.1.0.tar.gz (20.0 kB view details)

Uploaded Oct 23, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

playgent-0.1.0-py3-none-any.whl (16.8 kB view details)

Uploaded Oct 23, 2025 Python 3

File details

Details for the file playgent-0.1.0.tar.gz.

File metadata

Download URL: playgent-0.1.0.tar.gz
Upload date: Oct 23, 2025
Size: 20.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for playgent-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`de4b61fe9dd1475809c4e355dfa3ddaec720bba04f9e1807edbf62db7d3ce761`
MD5	`765cd268ace2704c9ab4df59bc9dbe36`
BLAKE2b-256	`942141ce45114c246a0ab1a73baf20837af4d4e9180d534df13126979fd0db4f`

See more details on using hashes here.

File details

Details for the file playgent-0.1.0-py3-none-any.whl.

File metadata

Download URL: playgent-0.1.0-py3-none-any.whl
Upload date: Oct 23, 2025
Size: 16.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for playgent-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3d1039988d273f6aed42fc61d15e39ea1efdd6adc0eaa1d6ae407eee3ed663ce`
MD5	`08dea6fa55e9bc18c86d3399adbbb6d0`
BLAKE2b-256	`8af32263a35c3fa30e1219b11842556c6a17ee954bd31df5d0e20f31cbda4c9d`

See more details on using hashes here.

playgent 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Playgent

Features

Installation

Quick Start

Basic Usage

Using the Drop-in OpenAI Client

Using the @record Decorator

Async Support

Replay Testing

Evaluation

Configuration

Environment Variables

Programmatic Configuration

Advanced Usage

Session Management

Custom Event Tracking

API Reference

Core Functions

Decorators

Types

Development

Setting Up Development Environment

Building and Publishing

Contributing

License

Support

Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes