Python client library for the AgentLab evaluation platform using Connect RPC

These details have not been verified by PyPI

Project links

Project description

AgentLab Python Client

A Python client library for the AgentLab evaluation platform using Connect RPC. This library provides a simple and intuitive interface for running AI agent evaluations, managing evaluators, and accessing evaluation results.

🚀 Quick Start

pip install agentlab-py

Set your API token as an environment variable:

export AGENTLAB_API_TOKEN=your-api-token-here

from agentlab import AgentLabClient, CreateEvaluationOptions

# Initialize the client (automatically loads AGENTLAB_API_TOKEN from environment)
client = AgentLabClient()

# Run an evaluation
evaluation = client.run_evaluation(CreateEvaluationOptions(
    agent_name='my-agent',
    agent_version='1.0.0',
    evaluator_names=['correctness-v1'],
    user_question='What is the capital of France?',
    agent_answer='The capital of France is Paris.',
    ground_truth='Paris is the capital of France',
    metadata={'confidence': 0.95}  # Optional metadata for tracking
))

print(f"Evaluation completed: {evaluation.name}")

Retrieving Results

# Get evaluation run details
evaluation_run = client.get_evaluation_run('evaluation-run-id')

# Get structured results with parsed JSON
result_data = client.get_evaluation_result('evaluation-run-id')
print(result_data['results'])  # Parsed evaluator outputs

# Access raw evaluator results
for evaluator_name, result in evaluation_run.evaluator_results.items():
    print(f"{evaluator_name}: {result.output}")

Listing Evaluation Runs

# List recent evaluation runs
runs_response = client.list_evaluation_runs('project-123')
for run in runs_response.evaluation_runs:
    print(f"Run: {run.name} - Question: {run.user_question}")

🔧 Development

Setting up the development environment

# Clone the repository
git clone https://github.com/VectorLabsCZ/agentlab-py.git
cd agentlab-py

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install development dependencies
pip install -r requirements-dev.txt

# Install the package in development mode
pip install -e .

Running Examples

# Basic usage example
python examples/basic_usage.py

# Async usage example
python examples/async_usage.py

Running Tests

# Run tests
pytest

# Run tests with coverage
pytest --cov=agentlab

# Run type checking
mypy agentlab/

📦 Building and Publishing

Building the package

# Install build tools
pip install build

# Build the package
python -m build

Publishing to PyPI

# Install twine
pip install twine

# Upload to PyPI
twine upload dist/*

🌟 Examples

Complete Evaluation Workflow

import json
from agentlab import AgentLabClient, CreateEvaluationOptions

def main():
    # Initialize client
    client = AgentLabClient()
        
    try:
        # 1. List available evaluators
        print("📋 Available evaluators:")
        evaluators = client.list_evaluators()
        for evaluator in evaluators.evaluators[:3]:  # Show first 3
            print(f"  - {evaluator.name}: {evaluator.display_name}")
        
        # 2. Run evaluation
        print("\n🚀 Running evaluation...")
        evaluation = client.run_evaluation(CreateEvaluationOptions(
            agent_name='demo-agent',
            agent_version='1.0.0',
            evaluator_names=['correctness-v1'],
            user_question='What is the square root of 16?',
            agent_answer='The square root of 16 is 4.',
            ground_truth='4',
            metadata={'confidence': 1.0}  # Additional context/scores
        ))
        
        # 3. Get results
        print(f"\n✅ Evaluation completed: {evaluation.name}")
        result_data = client.get_evaluation_result(evaluation.name)
        
        print("\n📊 Results:")
        print(json.dumps(result_data['results'], indent=2))
        
        # 4. List recent runs
        print("\n📈 Recent evaluation runs:")
        runs = client.list_evaluation_runs()
        for run in runs.evaluation_runs[:3]:  # Show first 3
            print(f"  - {run.name}: {run.user_question[:50]}...")
            
    except Exception as e:
        print(f"❌ Error: {e}")

if __name__ == '__main__':
    main()

Error Handling

from agentlab import AgentLabClient, AgentLabClientOptions, AgentLabError, AuthenticationError, APIError

try:
    client = AgentLabClient(AgentLabClientOptions(api_token='invalid-token'))
    evaluation = client.run_evaluation(options)
    
except AuthenticationError as e:
    print(f"Authentication failed: {e}")
    
except APIError as e:
    print(f"API error (status {e.status_code}): {e}")
    
except AgentLabError as e:
    print(f"AgentLab error: {e}")
    
except Exception as e:
    print(f"Unexpected error: {e}")

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Issues and Feature Requests

🐛 Report a bug
💡 Request a feature

🔗 Links

🏢 About VectorLabs

AgentLab is developed by VectorLabs, a company focused on advancing AI agent evaluation and development tools.

Made with ❤️ by the VectorLabs team

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.1

Oct 9, 2025

0.6.0 yanked

Oct 9, 2025

Reason this release was yanked:

Deprecated field in test datasets that broke this package, use 0.6.1

0.5.4

Oct 8, 2025

0.5.3

Oct 8, 2025

0.5.2 yanked

Oct 8, 2025

Reason this release was yanked:

Broken package

0.5.1

Oct 8, 2025

0.5.0

Oct 3, 2025

This version

0.4.0

Sep 20, 2025

0.3.1

Sep 18, 2025

0.3.0

Sep 17, 2025

0.2.4

Sep 15, 2025

0.2.3

Sep 15, 2025

0.2.2

Sep 15, 2025

0.2.1

Sep 15, 2025

0.2.0

Sep 13, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentlab_py-0.4.0.tar.gz (65.2 kB view details)

Uploaded Sep 20, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentlab_py-0.4.0-py3-none-any.whl (53.0 kB view details)

Uploaded Sep 20, 2025 Python 3

File details

Details for the file agentlab_py-0.4.0.tar.gz.

File metadata

Download URL: agentlab_py-0.4.0.tar.gz
Upload date: Sep 20, 2025
Size: 65.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for agentlab_py-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`9450ac9e16c2fab340774efc2c22c165884a7a1c6fe3a513db2d61c3f922d962`
MD5	`b5272da18c78b5003838eecea7fb12d1`
BLAKE2b-256	`f158019849e6c6d9f87f5787269b7bcb6e1663979c66472e7972d4308ab822b9`

See more details on using hashes here.

File details

Details for the file agentlab_py-0.4.0-py3-none-any.whl.

File metadata

Download URL: agentlab_py-0.4.0-py3-none-any.whl
Upload date: Sep 20, 2025
Size: 53.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for agentlab_py-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`29b03db11c3ad5f723a7fa75b54d0632ebcda48321125915e421786e0df44c36`
MD5	`93c125d4470fc1f2aee1ae3d2b6833c2`
BLAKE2b-256	`2e3189db13f536c7e8f43232641d6d7fac9a3cfee5c38b88b529a117c0cb3260`

See more details on using hashes here.

agentlab-py 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AgentLab Python Client

🚀 Quick Start

Retrieving Results

Listing Evaluation Runs

🔧 Development

Setting up the development environment

Running Examples

Running Tests

📦 Building and Publishing

Building the package

Publishing to PyPI

🌟 Examples

Complete Evaluation Workflow

Error Handling

🤝 Contributing

Issues and Feature Requests

🔗 Links

🏢 About VectorLabs

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes