
Python SDK for the evaluAte LLM evaluation framework

llmeval - Python SDK for evaluAte

A Python client library for the evaluAte LLM evaluation framework.

Installation

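Install the released package from PyPI, where the distribution is named llmeval-sdk:

pip install llmeval-sdk

Or, from a checkout of the source repository, install in editable mode:

pip install -e .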

For development, install from a source checkout with the dev extra:

pip install -e ".[dev]"

Quick Start

from llmeval import EvalClient

# Initialize the client
client = EvalClient(base_url="http://127.0.0.1:8080")

# Check server health
status = client.health_check()
print(status)

# Get available models
models = client.get_models()
print(f"Available models: {models}")

# Run a single evaluation
result = client.run_eval(
    model="anthropic:claude-sonnet-4",
    prompt="What is the capital of France?",
    expected="Paris",
    judge_model="gemini:gemini-1.5-pro"
)

print(f"Model output: {result.model_output}")
print(f"Judge verdict: {result.judge_verdict}")
print(f"Passed: {result.passed}")
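
When several evaluations are run, the fields printed above can be rolled up into a simple summary. The following is a minimal sketch, assuming only that each result exposes the `passed` attribute shown in the Quick Start. `summarize_results` is a hypothetical helper, not part of the SDK, and the `SimpleNamespace` objects (including their verdict strings) are made-up stand-ins so the snippet runs without a server.

```python
from types import SimpleNamespace


def summarize_results(results):
    """Return total, passed count, and pass rate for result-like objects.

    Hypothetical helper: relies only on the `passed` attribute.
    """
    results = list(results)
    total = len(results)
    passed = sum(1 for r in results if r.passed)
    # Avoid dividing by zero when no evaluations were run.
    pass_rate = passed / total if total else 0.0
    return {"total": total, "passed": passed, "pass_rate": pass_rate}


# Stand-in results with placeholder values (not produced by a real server).
fake_results = [
    SimpleNamespace(model_output="Paris", judge_verdict="correct", passed=True),
    SimpleNamespace(model_output="Lyon", judge_verdict="incorrect", passed=False),
]
summary = summarize_results(fake_results)
print(summary)
```

With real results, the same helper could be given a list collected from repeated `client.run_eval(...)` calls.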

Features

  • ✅ Simple, intuitive API
  • ✅ Type-safe with Pydantic models
  • ✅ Batch evaluation support
  • ✅ Real-time WebSocket streaming
  • ✅ Jupyter notebook integration
  • ✅ pandas DataFrame utilities
  • ✅ Comprehensive error handling
  • ✅ Context manager support

Documentation

See the examples/ directory for more usage examples:

  • basic_usage.py - Simple examples
  • advanced_usage.py - Advanced patterns
  • streaming_example.py - WebSocket streaming
  • jupyter_example.ipynb - Jupyter notebook

Requirements

  • Python 3.8+
  • requests
  • pydantic
  • websockets
  • pandas
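
A quick way to confirm these requirements in the current environment is sketched below. It assumes the import names match the package names listed above (which holds for these four), and `check_environment` is a hypothetical helper rather than something the SDK provides.

```python
import importlib.util
import sys

REQUIRED_MODULES = ("requests", "pydantic", "websockets", "pandas")
MIN_PYTHON = (3, 8)


def check_environment():
    """Return (python_ok, missing) for the requirements listed above."""
    python_ok = sys.version_info >= MIN_PYTHON
    # find_spec returns None when a top-level module cannot be located.
    missing = [
        name for name in REQUIRED_MODULES
        if importlib.util.find_spec(name) is None
    ]
    return python_ok, missing


python_ok, missing = check_environment()
print(f"Python >= 3.8: {python_ok}")
print(f"Missing modules: {missing or 'none'}")
```

This checks only whether the modules can be found, not which versions are installed.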

License

MIT License
