Python SDK for the evaluAte LLM evaluation framework
Project description
llmeval - Python SDK for evaluAte
A Python client library for the evaluAte LLM evaluation framework.
Installation
pip install -e .
For development with all extras:
pip install -e ".[dev]"
Quick Start
from llmeval import EvalClient
# Initialize the client
client = EvalClient(base_url="http://127.0.0.1:8080")
# Check server health
status = client.health_check()
print(status)
# Get available models
models = client.get_models()
print(f"Available models: {models}")
# Run a single evaluation
result = client.run_eval(
model="anthropic:claude-sonnet-4",
prompt="What is the capital of France?",
expected="Paris",
judge_model="gemini:gemini-1.5-pro"
)
print(f"Model output: {result.model_output}")
print(f"Judge verdict: {result.judge_verdict}")
print(f"Passed: {result.passed}")
Features
- ✅ Simple, intuitive API
- ✅ Type-safe with Pydantic models
- ✅ Batch evaluation support
- ✅ Real-time WebSocket streaming
- ✅ Jupyter notebook integration
- ✅ pandas DataFrame utilities
- ✅ Comprehensive error handling
- ✅ Context manager support
Documentation
See the examples/ directory for more usage examples:
basic_usage.py- Simple examplesadvanced_usage.py- Advanced patternsstreaming_example.py- WebSocket streamingjupyter_example.ipynb- Jupyter notebook
Requirements
- Python 3.8+
- requests
- pydantic
- websockets
- pandas
License
MIT License
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
llmeval-sdk-0.1.4.tar.gz
(6.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llmeval-sdk-0.1.4.tar.gz.
File metadata
- Download URL: llmeval-sdk-0.1.4.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0d8f1d299181a9c3cb1e247f7ff537da4d027255bc7882fe61818004e06ba330
|
|
| MD5 |
681a050bbb9cb5d390ddbc714b7ab9de
|
|
| BLAKE2b-256 |
ad76808bde0a9213fe979c82126e51d880bedfbc7f749346e4c674c7bd4fbfb0
|
File details
Details for the file llmeval_sdk-0.1.4-py3-none-any.whl.
File metadata
- Download URL: llmeval_sdk-0.1.4-py3-none-any.whl
- Upload date:
- Size: 7.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7dc3a1bf8797bc46d600eecb53e5abd2d15f472f528b06ac74cec8cec2ebb199
|
|
| MD5 |
eca74994485229d25faf8557fac25b99
|
|
| BLAKE2b-256 |
4f25c9fd80ee68662e343cc6972f71379ce77b917df92ff992f7eecd5f930fae
|