Skip to main content

Python SDK for developing AI agent evals and observability

Project description

agentops ๐Ÿ•ต๏ธ

AI agents suck. Weโ€™re fixing that.

Build your next agent with evals, observability, and replay analytics. Agentops is the toolkit for evaluating and developing robust and reliable AI agents.

License: MIT

Latest Release ๐Ÿ“ฆ

version: 0.1 This is an alpha release for early testers.

Agentops is still in closed beta. You can sign up for an API key here.

Quick Start

pip install agentops

And...

import agentops as ao

Documentation: http://docs.agentops.ai

Why Agentops? ๐Ÿค”

Agent developers often work in the dark, with little to no visibility into agent testing performance. This means their agents never leave the lab. We're changing that. The agentops SDK is designed to become the gold standard for evaluating, grading, and testing agents. Our mission is to make sure your agents are ready for production.

Evaluations Roadmap ๐Ÿงญ

Platform Dashboard Evals
โœ… Python SDK โœ… Multi-session and Cross-session metrics ๐Ÿšง Evaluation playground + leaderboard
๐Ÿšง Evaluation builder API โœ… Custom event tag trackingย  ๐Ÿ”œ Agent scorecards
๐Ÿ”œ Javascript/Typescript SDK ๐Ÿšง Session replays ๐Ÿ”œ Custom eval metrics

Debugging Roadmap ๐Ÿงญ

Performance testing Environments LAA (LLM augmented agents) specific tests Reasoning and execution testing
๐Ÿ”œ Event latency analysis ๐Ÿ”œ Non-stationary environment testing ๐Ÿ”œ LLM non-deterministic function detection ๐Ÿ”œ Infinite loops and recursive thought detection
๐Ÿ”œ Regression testing ๐Ÿ”œ Multi-modal environments ๐Ÿ”œ Token limit overflow flags ๐Ÿ”œ Faulty reasoning detection
๐Ÿ”œ Success validators (external) ๐Ÿ”œ Execution containers ๐Ÿ”œ Context limit overflow flags ๐Ÿ”œ Generative code validators
๐Ÿ”œ Agent controllers/skill tests ๐Ÿ”œ Honeypot and prompt injection evaluation ๐Ÿ”œ API bill tracking ๐Ÿ”œ Error breakpoint analysis
๐Ÿ”œ Information context constraint testing ๐Ÿ”œ Anti-agent roadblocks (i.e. Captchas)
๐Ÿ”œ Agent workflow execution pricing

Agent Arena ๐ŸฅŠ

(coming soon!)

Installation & Usage ๐Ÿ“˜

To start using Agentops SDK, follow these steps:

  1. Clone the GitHub repo:
git clone https://github.com/AgentOps-AI/agentops.git
  1. Install the requirements:
pip install -e agentops/
  1. Integrate the SDK into your AI agent application. Refer to our API documentation for detailed instructions.

Join the Revolution ๐ŸŽ‰

Is there a feature you'd like to see agenotps cover? Just raise it in the issues tab, and we'll work on adding it the roadmap.

We're on a mission to improve AI agents, and we want you to be a part of it. Start building your next agent with agentops SDK today!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentops-0.0.1.tar.gz (11.5 kB view hashes)

Uploaded Source

Built Distribution

agentops-0.0.1-py3-none-any.whl (10.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page