Skip to main content

Python client library for the AgentLab evaluation platform using Connect RPC

Reason this release was yanked:

Deprecated field in test datasets that broke this package, use 0.6.1

Project description

AgentLab Python Client

PyPI version Python 3.10+ License: MIT

A Python client library for the AgentLab evaluation platform using Connect RPC. This library provides a simple and intuitive interface for running AI agent evaluations, managing evaluators, and accessing evaluation results.

🚀 Quick Start

pip install agentlab-py

Set your API token as an environment variable:

export AGENTLAB_API_TOKEN=your-api-token-here
from agentlab import AgentLabClient, CreateEvaluationOptions

client = AgentLabClient()

evaluation = client.run_evaluation(CreateEvaluationOptions(
    agent_name='my-agent',
    agent_version='1.0.0',
    evaluator_names=['correctness-v1'],
    user_question='What is the capital of France?',
    agent_answer='The capital of France is Paris.',
    ground_truth='Paris is the capital of France',
    metadata={'confidence': 0.95}
))

print(f"Evaluation completed: {evaluation.name}")

Retrieving Results

evaluation_run = client.get_evaluation_run('evaluation-run-id')

result_data = client.get_evaluation_result('evaluation-run-id')
print(result_data['results'])  # Parsed evaluator outputs

for evaluator_name, result in evaluation_run.evaluator_results.items():
    print(f"{evaluator_name}: {result.output}")

Listing Evaluation Runs

runs_response = client.list_evaluation_runs('project-123')
for run in runs_response.evaluation_runs:
    print(f"Run: {run.name} - Question: {run.user_question}")

Managing Agent Prompts

from agentlab import CreateAgentVersionOptions

# Publish agent version with prompts (idempotent)
result = client.publish_agent_version(CreateAgentVersionOptions(
    agent_name='my-assistant',
    version='1.0.0',
    prompts={
        'system': 'You are a helpful AI assistant...',
        'guidelines': 'Always be polite and professional.'
    }
))

print(f"Published version: {result.create_time}")
for name, content in result.prompts.items():
    print(f"  {name}: {content[:50]}...")

Analyzing Agent Performance

from agentlab import AnalysisParameters

# Create analysis for the last 30 days
params = AnalysisParameters(min_evaluation_runs=5, time_range_days=30)
session = client.analyze_agent('my-agent', '1.0.0', params)

# Get results
session = client.get_analysis_session(session.id)
if session.status.value == "ANALYSIS_STATUS_COMPLETED":
    stats = session.analysis_data.statistical_summary
    print(f"Success rate: {stats.success_rate:.1%}")
    print(f"Average score: {stats.average_score:.3f}")

Working with Test Datasets

# List all test datasets
datasets = client.list_test_datasets()
print(f"Found {len(datasets.test_datasets)} dataset(s)")

# Get tests from a dataset
tests = client.get_tests('dataset-uuid')
for test in tests.tests:
    print(f"Q: {test.user_question}")
    print(f"A: {test.expected_answer}")

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Issues and Feature Requests

🔗 Links

🏢 About VectorLabs

AgentLab is developed by VectorLabs, a company focused on advancing AI agent evaluation and development tools.


Made with ❤️ by the VectorLabs team

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentlab_py-0.6.0.tar.gz (86.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agentlab_py-0.6.0-py3-none-any.whl (76.8 kB view details)

Uploaded Python 3

File details

Details for the file agentlab_py-0.6.0.tar.gz.

File metadata

  • Download URL: agentlab_py-0.6.0.tar.gz
  • Upload date:
  • Size: 86.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for agentlab_py-0.6.0.tar.gz
Algorithm Hash digest
SHA256 3d163047fbb602ce66f7f954880e8097ed0ae3c4b9f83b46093d48c633121a49
MD5 dedf965381e1ea72204c0dda084804a1
BLAKE2b-256 16887614fe35735af56bdc4ae6e122dd55f3cb4608194602b9bd0a06969e4d25

See more details on using hashes here.

File details

Details for the file agentlab_py-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: agentlab_py-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 76.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for agentlab_py-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 68e3869c6a20c2a71a3e5cfa605aab3e1a76f0b967cd1d8b6f517dd2a34538b7
MD5 dc66e83a898667c709a267e6b4d430e0
BLAKE2b-256 7ac6830b83b2e7556cb0864f167e352af7617a1314a32bbbc90d34bfac089877

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page