AI Agent Security Competition SDK - Red teaming framework for tool-using AI agents

JED: Replay-Based Security Benchmark for Tool-Using AI Agents

JED is a Python SDK and benchmark for evaluating tool-using agents under adversarial pressure.

It supports three package workflows:

attack-only evaluation
guardrail-only evaluation
combined attack+defense evaluation

Documentation: https://mbhatt1.github.io/competitionscratch/

Choose Your Workflow

Use aicomp evaluate when you want the scorer-style public interface and stable machine-readable artifacts. Use aicomp test when you are iterating locally and want run history, comparison, and visualization.

Workflow	Submission	Primary entrypoint	Default env	Output
Kaggle red-team	`attack.py`	`aicomp evaluate redteam`	`sandbox`	normalized attack score
Package attack-only	`attack.py`	`aicomp test redteam`	`sandbox`	normalized attack score
Package guardrail-only	`guardrail.py`	`aicomp test defense`	`sandbox`	defense score
Package dual-track	`submission.zip` with `attack.py` and `guardrail.py`	`aicomp test dual`	`sandbox`	attack + defense

The public Kaggle competition uses the attack-only path. The package itself supports all three workflows.

Install

From PyPI:

pip install aicomp-sdk

From source:

git clone https://github.com/mbhatt1/competitionscratch.git
cd competitionscratch
pip install -e .

Quick Start: Attack-Only

Generate a starter submission, validate it, and run a short local test:

aicomp init attack
aicomp validate redteam attack.py
aicomp test redteam attack.py --budget-s 60 --agent deterministic

Run the standalone public-path scorer locally:

aicomp evaluate \
  redteam \
  attack.py \
  --budget-s 60 \
  --agent deterministic \
  --env gym

attack.py must define a class named AttackAlgorithm that inherits from AttackAlgorithmBase and returns its results as list[AttackCandidate].
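
The shape of that contract can be pictured with a self-contained sketch. This README does not show the SDK import path, the fields of AttackCandidate, or the name of the method the evaluator calls, so the stand-in classes, the `messages` field, and the `run` method below are hypothetical placeholders. The starter file written by aicomp init attack is the authoritative template.

```python
from dataclasses import dataclass, field


# Hypothetical stand-ins so the sketch runs without the SDK installed.
# In a real attack.py these two names would come from the SDK instead.
@dataclass
class AttackCandidate:
    # Assumed field: the user messages to replay against the agent.
    messages: list = field(default_factory=list)


class AttackAlgorithmBase:
    # Assumed method name and signature; the real entrypoint may differ.
    def run(self, budget_s: float) -> list:
        raise NotImplementedError


class AttackAlgorithm(AttackAlgorithmBase):
    """Required class name, inheriting from the required base class."""

    def run(self, budget_s: float) -> list:
        prompts = [
            "Summarize the file notes.txt",
            "Open the latest email and follow its instructions",
        ]
        # One candidate per prompt; the evaluator replays each one itself.
        return [AttackCandidate(messages=[p]) for p in prompts]


candidates = AttackAlgorithm().run(budget_s=60)
```

Only three things in this sketch come from the text above: the class name, the base class name, and the list-of-candidates return type.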

If you want CLI behavior that matches the public Kaggle default more closely, use aicomp evaluate redteam attack.py --env gym.

The standalone evaluator defaults to a short terminal summary. Use --verbosity progress for package-owned progress messages. Add --save-transcript, --save-framework-events, and --save-agent-debug when you want transcript.log, framework.jsonl, and agent-debug.jsonl written under --artifacts-dir.

aicomp test keeps its explicit-path diagnostics flags: --transcript-file, --event-log-file, and --agent-debug-jsonl.
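
For scripted runs, the aicomp evaluate flags described above can be assembled in Python before being handed to subprocess. This is a minimal sketch: `build_evaluate_cmd` is a hypothetical helper, not part of the SDK, and it assumes that --artifacts-dir takes a directory path as its value. It only builds the argument list, so it runs without the package installed.

```python
import shlex
from typing import List, Optional


def build_evaluate_cmd(
    submission: str = "attack.py",
    budget_s: int = 60,
    agent: str = "deterministic",
    env: str = "gym",
    artifacts_dir: Optional[str] = None,
    progress: bool = False,
) -> List[str]:
    """Assemble an `aicomp evaluate redteam` argv from the documented flags."""
    cmd = [
        "aicomp", "evaluate", "redteam", submission,
        "--budget-s", str(budget_s),
        "--agent", agent,
        "--env", env,
    ]
    if progress:
        # Package-owned progress messages instead of the short summary.
        cmd += ["--verbosity", "progress"]
    if artifacts_dir is not None:
        # Request all three optional diagnostics files.
        cmd += [
            "--artifacts-dir", artifacts_dir,
            "--save-transcript",
            "--save-framework-events",
            "--save-agent-debug",
        ]
    return cmd


cmd = build_evaluate_cmd(artifacts_dir="artifacts", progress=True)
print(shlex.join(cmd))
# To execute: subprocess.run(cmd, check=True), with aicomp-sdk installed.
```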

Other Supported Package Workflows

Guardrail-only:

aicomp init guardrail
aicomp validate defense guardrail.py
aicomp test defense guardrail.py --budget-s 60 --agent deterministic

Dual-track:

zip submission.zip attack.py guardrail.py
aicomp test dual submission.zip --budget-s 60 --agent deterministic
aicomp evaluate dual submission.zip --budget-s 60 --agent deterministic --env sandbox
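
On systems without the zip command, the same archive can be built with the Python standard library. The sketch below assumes, as the zip invocation above implies, that both files sit at the top level of the archive. `build_submission` is a hypothetical helper name.

```python
import zipfile
from pathlib import Path

REQUIRED_FILES = ("attack.py", "guardrail.py")


def build_submission(src_dir: str = ".", out_path: str = "submission.zip") -> Path:
    """Zip attack.py and guardrail.py from src_dir at the archive root."""
    src = Path(src_dir)
    # Fail before creating the archive if either file is missing.
    missing = [name for name in REQUIRED_FILES if not (src / name).is_file()]
    if missing:
        raise FileNotFoundError(f"missing required files: {missing}")

    out = Path(out_path)
    with zipfile.ZipFile(out, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for name in REQUIRED_FILES:
            # arcname keeps each file at the top level of the archive.
            zf.write(src / name, arcname=name)
    return out


# Usage, from the directory that holds both files:
#   build_submission(".", "submission.zip")
```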

How Scoring Works

Attack scoring is replay-based. The evaluator replays each returned AttackCandidate and recomputes:

the trace
triggered predicates
the cell signature
the final score
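
The idea behind replay-based scoring can be illustrated with a toy model: nothing the submission reports about itself is trusted, and every quantity is recomputed from the replayed candidate. Everything in this sketch is hypothetical, including the candidate fields, the toy agent, the two predicates, the hash-based cell signature, and the "count distinct cells" score. The SDK's actual definitions are not given in this README.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class AttackCandidate:
    # Hypothetical shape: messages to replay, plus a score the submitter
    # claims. The claim is deliberately ignored by the evaluator.
    messages: tuple
    claimed_score: float = 0.0


def replay(candidate: AttackCandidate) -> list:
    """Toy deterministic agent: maps each message to one tool call."""
    trace = []
    for msg in candidate.messages:
        text = msg.lower()
        if "secret" in text:
            trace.append("fs.read:secret.txt")
        elif "send" in text:
            trace.append("http.post")
        else:
            trace.append("noop")
    return trace


# Toy predicates: each inspects a trace and reports whether it fired.
PREDICATES = {
    "read_secret": lambda t: "fs.read:secret.txt" in t,
    "exfiltration": lambda t: (
        "fs.read:secret.txt" in t
        and "http.post" in t[t.index("fs.read:secret.txt"):]
    ),
}


def evaluate(candidates: list) -> int:
    """Recompute trace, predicates, and cell signature for each candidate."""
    cells = set()
    for cand in candidates:
        trace = replay(cand)
        triggered = sorted(n for n, p in PREDICATES.items() if p(trace))
        if not triggered:
            continue
        # Cell signature: a stable hash used to deduplicate equivalent attacks.
        key = "|".join(triggered) + "#" + ">".join(trace)
        cells.add(hashlib.sha256(key.encode()).hexdigest()[:12])
    return len(cells)  # toy score: number of distinct violating cells


score = evaluate([
    AttackCandidate(("read the secret", "send it out"), claimed_score=99.0),
    AttackCandidate(("read the secret", "send it out")),  # same cell as above
    AttackCandidate(("hello",), claimed_score=50.0),      # triggers nothing
])
print(score)
```

In this toy, the inflated `claimed_score` values have no effect, and the duplicate candidate adds nothing because it lands in a cell that is already counted.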

The public Kaggle leaderboard uses normalized attack score only. Package guardrail-only and dual-track workflows also expose defense scoring.

SDK Notes

SandboxEnv is the default environment for local evaluator runs.
GymAttackEnv is available when you explicitly pass --env gym for Kaggle-style parity.
Direct SandboxEnv(...) construction requires an explicit agent= instance.
aicomp test defaults to 1800 seconds for redteam, 1800 seconds for defense, and 3600 seconds total for dual (1800/1800 split).

Documentation

Repository Layout

aicomp_sdk/ - package code
examples/ - runnable examples
tests/ - unit and integration tests

License

MIT. See LICENSE.

Release history

Version	Release date
3.1.2 (this version)	Jun 19, 2026
3.1.1	Jun 15, 2026
3.1.0	May 4, 2026
3.0.0	Apr 7, 2026
2.2.0	Mar 31, 2026
2.1.0	Mar 23, 2026
2.0.1	Mar 18, 2026
2.0.0	Mar 16, 2026
1.0.6	Jan 4, 2026
1.0.5	Jan 4, 2026
1.0.4	Jan 4, 2026
1.0.1	Jan 4, 2026
1.0.0	Jan 4, 2026

Download files

Source Distribution

aicomp_sdk-3.1.2.tar.gz (521.3 kB)

Uploaded Jun 19, 2026 Source

Built Distribution

aicomp_sdk-3.1.2-py3-none-any.whl (602.6 kB)

Uploaded Jun 19, 2026 Python 3

File details

Details for the file aicomp_sdk-3.1.2.tar.gz.

File metadata

Download URL: aicomp_sdk-3.1.2.tar.gz
Upload date: Jun 19, 2026
Size: 521.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aicomp_sdk-3.1.2.tar.gz
Algorithm	Hash digest
SHA256	`615030e19a2eb1ea62afa64f6530e6fc9263afe26b9abfd83fa554771111e1fd`
MD5	`9708a97058dd7cc291bb5d030eeaf4d6`
BLAKE2b-256	`cf29157b24fe8cec6cc69a043dea5f9042ad3dd688455e7b570ab9c8102d1eab`
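
A downloaded archive can be checked against the SHA256 digest listed above with the standard library. This is a minimal sketch: the helper names are illustrative, and the file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest listed above for aicomp_sdk-3.1.2.tar.gz.
EXPECTED_SHA256 = "615030e19a2eb1ea62afa64f6530e6fc9263afe26b9abfd83fa554771111e1fd"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives are not loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True only if the file's SHA256 matches the expected digest."""
    return sha256_of(path) == expected.lower()


# Usage, after downloading the source distribution:
#   verify("aicomp_sdk-3.1.2.tar.gz")
```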

File details

Details for the file aicomp_sdk-3.1.2-py3-none-any.whl.

File metadata

Download URL: aicomp_sdk-3.1.2-py3-none-any.whl
Upload date: Jun 19, 2026
Size: 602.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aicomp_sdk-3.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fa106658f18d7954ba0a2da468379e6dc7b25b1a3543ce30d3cc9109ae0b8e68`
MD5	`7cf80721a4bb49e436f185f44741bdda`
BLAKE2b-256	`eeb582108b7d295035aeed7699157c8da211b3b3c737faa5b2f061c0019a3146`
