Python SDK for developing AI agent evals and observability
Project description
agentops ๐ต๏ธ
AI agents suck. Weโre fixing that.
Build your next agent with evals, observability, and replay analytics. Agentops is the toolkit for evaluating and developing robust and reliable AI agents.
Latest Release ๐ฆ
version: 0.1
This is an alpha release for early testers.
Agentops is still in closed beta. You can sign up for an API key here.
Quick Start
pip install agentops
And...
import agentops as ao
Documentation: http://docs.agentops.ai
Why Agentops? ๐ค
Agent developers often work in the dark, with little to no visibility into agent testing performance. This means their agents never leave the lab. We're changing that. The agentops SDK is designed to become the gold standard for evaluating, grading, and testing agents. Our mission is to make sure your agents are ready for production.
Evaluations Roadmap ๐งญ
Platform | Dashboard | Evals |
---|---|---|
โ Python SDK | โ Multi-session and Cross-session metrics | ๐ง Evaluation playground + leaderboard |
๐ง Evaluation builder API | โ Custom event tag trackingย | ๐ Agent scorecards |
๐ Javascript/Typescript SDK | ๐ง Session replays | ๐ Custom eval metrics |
Debugging Roadmap ๐งญ
Performance testing | Environments | LAA (LLM augmented agents) specific tests | Reasoning and execution testing |
---|---|---|---|
๐ Event latency analysis | ๐ Non-stationary environment testing | ๐ LLM non-deterministic function detection | ๐ Infinite loops and recursive thought detection |
๐ Regression testing | ๐ Multi-modal environments | ๐ Token limit overflow flags | ๐ Faulty reasoning detection |
๐ Success validators (external) | ๐ Execution containers | ๐ Context limit overflow flags | ๐ Generative code validators |
๐ Agent controllers/skill tests | ๐ Honeypot and prompt injection evaluation | ๐ API bill tracking | ๐ Error breakpoint analysis |
๐ Information context constraint testing | ๐ Anti-agent roadblocks (i.e. Captchas) | ||
๐ Agent workflow execution pricing |
Agent Arena ๐ฅ
(coming soon!)
Installation & Usage ๐
To start using Agentops SDK, follow these steps:
- Clone the GitHub repo:
git clone https://github.com/AgentOps-AI/agentops.git
- Install the requirements:
pip install -e agentops/
- Integrate the SDK into your AI agent application. Refer to our API documentation for detailed instructions.
Join the Revolution ๐
Is there a feature you'd like to see agenotps cover? Just raise it in the issues tab, and we'll work on adding it the roadmap.
We're on a mission to improve AI agents, and we want you to be a part of it. Start building your next agent with agentops SDK today!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.