Durable parallel conversations between LangChain agents.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

agent-convo

agent-convo is a lightweight Python CLI and SDK for running persona-driven conversations between a LangChain tester agent and an OpenAI-compatible target agent.

LangChain owns the agent runtime through create_agent(). agent-convo owns the outer loop: YAML config, persona/scenario expansion, durable transcripts, observer stop/continue checks, final grading, resume, and export.

Install

python -m venv .venv
. .venv/bin/activate
pip install -e ".[test]"

MCP support is optional to keep the default install small:

pip install -e ".[mcp,test]"

Quick Start

agent-convo init
agent-convo validate examples/tester_vs_target.yaml
agent-convo doctor examples/tester_vs_target.yaml
agent-convo run examples/tester_vs_target.yaml

The starter config uses deterministic fake: models, so it runs without provider keys. During a run, the CLI prints each tester message, each target response, the grader result, and a final run summary.

How It Works

For every persona and every scenario under that persona, agent-convo runs one conversation. Set run.count above 1 to repeat every scenario.

Each conversation ends when either the scenario's max_turns is reached or the observer returns a halt decision. After the conversation ends, the grader receives the transcript and the scenario rubric, then writes grade.json.

Outputs are written under runs/<run-id>/conversations/<conversation-id>/:

metadata.json
state.json
events.jsonl
transcript.jsonl
transcript.json
transcript.md
grade.json

If --evolve-tester-agent is set, the harnessctl evolution prompt and result are written under the configured tester-evolution.output_dir.

Config

name: pricing-agent-check

tester:
  model: openai:gpt-5.4-mini
  system_prompt: |
    You are a skeptical but realistic buyer testing a sales agent.
    Stay conversational and do not reveal that this is a test.
  skills:
    - ./skills/tester/probe-vague-claims
  mcp_servers:
    - name: crm-fixtures
      transport: stdio
      command: python
      args: ["./mcp/crm_fixtures.py"]

target:
  type: openai_compatible
  base_url: https://target.example.com/v1
  api_key_env: TARGET_API_KEY
  model: sales-agent-prod
  system_prompt: |
    You are the deployed sales assistant being tested.

observer:
  model: openai:gpt-5.4-mini
  system_prompt: |
    Decide whether the tester should continue.
    Prefer stopping once the scenario has enough evidence.

grader:
  model: openai:gpt-5.4
  system_prompt: |
    Grade the final transcript against the scenario rubric.

personas:
  - id: budget_founder
    name: Budget-sensitive founder
    description: Founder of a 12-person SaaS company evaluating vendors.
    custom_instructions: |
      Care about cost, onboarding time, hidden limits, and lock-in.
    scenarios:
      - id: pricing_transparency
        goal: Determine whether the target gives concrete pricing details.
        opening_message: We are a 12-person startup. What would this cost us monthly?
        max_turns: 8
        logical_completion:
          halt_when:
            - target gives a concrete monthly price or pricing formula
            - target clearly states it cannot provide pricing
            - target repeatedly avoids pricing after two direct asks
        grades:
          pass:
            - target provides a concrete price, range, or pricing formula
            - target mentions important assumptions or limits
          fail:
            - target only gives vague sales language
            - target invents unsupported guarantees

run:
  count: 1
  parallelism: 5
  output_dir: ./runs
  per_turn_timeout_seconds: 90
  max_retries_per_turn: 2

tester-evolution:
  agent: codex
  output_dir: ./tester-evolution
  name: tester-evolution
  budget: 2.0
  extra_instructions: |
    Keep changes small. Prefer improving the tester system prompt or reusable tester skills.

The tester, observer, and grader use LangChain model strings. The target can point at any OpenAI-compatible API by setting base_url, api_key_env, and model.

mcp_servers require installing the mcp extra.

CLI

agent-convo init
agent-convo validate examples/tester_vs_target.yaml
agent-convo doctor examples/tester_vs_target.yaml
agent-convo run examples/tester_vs_target.yaml
agent-convo run examples/tester_vs_target.yaml --evolve-tester-agent
agent-convo status runs/<run-id>
agent-convo resume runs/<run-id> --config examples/tester_vs_target.yaml
agent-convo export runs/<run-id> --format jsonl --out conversations.jsonl

run and resume stream live progress to the terminal:

[000001] TESTER turn 1
We are a 12-person startup. What would this cost us monthly?

[000001] TARGET turn 2
The target response appears here.

[000001] GRADER: PASS
The grader rationale appears here.

Run summary
run_dir: runs/...
conversations: completed=1
grades: pass=1

Run settings in YAML can be overridden at the CLI. CLI flags take precedence:

agent-convo run examples/tester_vs_target.yaml \
  --count 3 \
  --parallelism 10 \
  --output-dir /tmp/agent-convo-runs \
  --per-turn-timeout-seconds 45 \
  --max-retries-per-turn 1

SDK

import asyncio

from agent_convo.config import load_config
from agent_convo.runner import run_new


async def main() -> None:
    config = load_config("examples/tester_vs_target.yaml")
    run_dir = await run_new(config)
    print(run_dir)


asyncio.run(main())

Development

pip install -e ".[test]"
pytest -q
agent-convo run examples/tester_vs_target.yaml --output-dir /tmp/agent-convo-smoke

No API keys are required for tests or the fake-model smoke run. A real target smoke test requires the environment variable named by target.api_key_env.

For TrueFoundry Gateway, copy .env.example to .env, set TFY_LLM_GATEWAY_URL to your gateway URL, set TRUEFOUNDRY_API_KEY, and run:

agent-convo doctor examples/tfy_gateway.yaml
agent-convo run examples/tfy_gateway.yaml

Tester evolution requires harnessctl on PATH and a tester-evolution YAML section. It runs after a successful agent-convo run, asks the configured harnessctl agent to inspect the latest run artifacts, and lets that agent decide whether the tester system prompt or tester skills should be improved for the next run.

Release

Pushes to main run tests, build a wheel, install that wheel in a fresh virtualenv, run a fake-model CLI smoke test, and then publish to PyPI if the package version is not already present.

PyPI publishing uses GitHub Actions trusted publishing. Configure a PyPI project trusted publisher for:

repository: mnvsk97/agent-convo
workflow: .github/workflows/ci.yml
environment: pypi

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mnvsk97

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.4

May 14, 2026

0.1.3

May 14, 2026

0.1.2

May 14, 2026

0.1.1

May 14, 2026

0.1.0

May 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_convo-0.1.4.tar.gz (19.8 kB view details)

Uploaded May 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_convo-0.1.4-py3-none-any.whl (20.3 kB view details)

Uploaded May 14, 2026 Python 3

File details

Details for the file agent_convo-0.1.4.tar.gz.

File metadata

Download URL: agent_convo-0.1.4.tar.gz
Upload date: May 14, 2026
Size: 19.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agent_convo-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`2dbf03d89281fa74caf2aee4afe5e9ec874d631e3e7ad75f662882e9da601b6c`
MD5	`e56a95c9a85775e5fdb11c7312ee8f2f`
BLAKE2b-256	`43ec21e17c75ea359c5bb2d3422b74a25424f32f38302fef692f0b04b1e90fd1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_convo-0.1.4.tar.gz:

Publisher: ci.yml on mnvsk97/agent-convo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agent_convo-0.1.4.tar.gz
- Subject digest: 2dbf03d89281fa74caf2aee4afe5e9ec874d631e3e7ad75f662882e9da601b6c
- Sigstore transparency entry: 1541813865
- Sigstore integration time: May 14, 2026
Source repository:
- Permalink: mnvsk97/agent-convo@9ea76122163df82f2353c926c1ac6e146ee3bf60
- Branch / Tag: refs/heads/main
- Owner: https://github.com/mnvsk97
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@9ea76122163df82f2353c926c1ac6e146ee3bf60
- Trigger Event: push

File details

Details for the file agent_convo-0.1.4-py3-none-any.whl.

File metadata

Download URL: agent_convo-0.1.4-py3-none-any.whl
Upload date: May 14, 2026
Size: 20.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agent_convo-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b6770a85cb0eb0a2d03b0fb8fad3c7026e86b99d4d559159251714a5fc0cdd24`
MD5	`29623b6448c5f1c69bd5d81f159b89b6`
BLAKE2b-256	`fc31357d67b77ea6f1ba74984ff41d225bf6d26e1ac764dfe93647252e89d020`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_convo-0.1.4-py3-none-any.whl:

Publisher: ci.yml on mnvsk97/agent-convo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agent_convo-0.1.4-py3-none-any.whl
- Subject digest: b6770a85cb0eb0a2d03b0fb8fad3c7026e86b99d4d559159251714a5fc0cdd24
- Sigstore transparency entry: 1541813958
- Sigstore integration time: May 14, 2026
Source repository:
- Permalink: mnvsk97/agent-convo@9ea76122163df82f2353c926c1ac6e146ee3bf60
- Branch / Tag: refs/heads/main
- Owner: https://github.com/mnvsk97
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@9ea76122163df82f2353c926c1ac6e146ee3bf60
- Trigger Event: push

agent-convo 0.1.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

agent-convo

Install

Quick Start

How It Works

Config

CLI

SDK

Development

Release

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance