CLI for authoring and running SWEAP benchmark tasks
SWEAP CLI
Command-line tooling for authoring, validating, and evaluating SWEAP benchmark tasks. Each task is a self-contained bundle containing repository metadata, guardrail tests, and a golden patch that can be reproduced locally or inside Modal sandboxes.
- Documentation index: docs/README.md
- Latest workflow guides:
Quick Start
# optional: create a virtual environment
python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install sweap-cli
# scaffold a new task bundle
task init --repo https://github.com/example/project.git --commit deadbeef
# iterate locally until guardrails behave as expected
task validate
# run the Modal evaluation pipeline (baseline + model + patched verification)
task run --model codex
Required Credentials
- SWEAP_API_URL and SWEAP_API_TOKEN for remote submissions and runs (request an API token from the SWEAP team).
- OPENAI_API_KEY for Codex access (optional for local runs; mandatory for remote runs processed by our hosted worker).
- modal CLI credentials (modal setup) if you plan to run Modal evaluations locally.
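Before starting a remote run, it can help to confirm that the variables listed above are set. The following is a minimal sketch, not part of the CLI: the function name `missing_sweap_credentials` and its `remote`/`local` mode argument are hypothetical, and Modal credentials are not checked because they are configured through `modal setup` rather than through these variables.

```shell
# Sketch: print the name of each required credential that is unset or empty.
# missing_sweap_credentials and its mode argument are illustrative only.
missing_sweap_credentials() {
  mode="${1:-local}"   # "remote" or "local" (hypothetical modes)
  required=""

  if [ "$mode" = "remote" ]; then
    # Remote submissions and runs need the API URL, the API token,
    # and an OpenAI key for the hosted worker.
    required="SWEAP_API_URL SWEAP_API_TOKEN OPENAI_API_KEY"
  fi

  for name in $required; do
    # Look up the value of the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "$name"
    fi
  done
}
```

Calling `missing_sweap_credentials remote` prints one variable name per line for anything still missing and prints nothing when all three are set. In `local` mode nothing is reported, since OPENAI_API_KEY is optional for local runs.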
Add --runner node or --runner maven during task init to scaffold non-Python
bundles. Use task validate --modal to reproduce validation inside Modal and
task build to cache Modal environments for pytest bundles.
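As a rough illustration of how the runner flag fits into the `task init` invocation, the helper below prints (but does not run) the command line for a given runner. `sweap_init_command` is a hypothetical name; only `--repo`, `--commit`, and `--runner node` / `--runner maven` are taken from this page, and other runner values may exist that are not documented here.

```shell
# Sketch: print a `task init` command line without executing it.
# sweap_init_command is a hypothetical helper for illustration.
sweap_init_command() {
  repo="$1"
  commit="$2"
  runner="${3:-}"   # empty means the default (Python) scaffold

  case "$runner" in
    "")
      echo "task init --repo $repo --commit $commit" ;;
    node|maven)
      echo "task init --repo $repo --commit $commit --runner $runner" ;;
    *)
      # Only node and maven are mentioned on this page.
      echo "runner not documented here: $runner" >&2
      return 1 ;;
  esac
}
```

For example, `sweap_init_command https://github.com/example/project.git deadbeef maven` prints the Maven variant of the Quick Start command, which can then be reviewed and run by hand.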
Core Commands
- task init – scaffold manifests, guardrail directories, and dependency stubs.
- task validate – run baseline vs. patched guardrails locally or in Modal.
- task run – execute the full evaluation loop (baseline, model attempt, patched verification, optional full suite) locally or via the backend.
- task submit – register/update tasks with the backend and upload bundle archives.
- task build – prebuild Modal environments for pytest bundles.
- task info / task fetch-bundle / task runs-get – inspect remote metadata, download bundles, and retrieve run artifacts.
See the CLI reference for detailed options.
Need Help?
- Troubleshooting and common questions: docs/faq.md
- Manifest schema and runner expectations: docs/reference/manifest.md
File details
Details for the file sweap_cli-0.1.0.tar.gz.
File metadata
- Download URL: sweap_cli-0.1.0.tar.gz
- Upload date:
- Size: 35.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 8d388e4adaaa1a093d253a28699849f7119783d54839c82546584ea2b74a7b99 |
| MD5 | 24a96b77c69e175cb374725973c3e93c |
| BLAKE2b-256 | 3eae826794765140f0b9c2708c0a55fb00db28173cc234d2e89e4cccef479671 |
File details
Details for the file sweap_cli-0.1.0-py3-none-any.whl.
File metadata
- Download URL: sweap_cli-0.1.0-py3-none-any.whl
- Upload date:
- Size: 38.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 7a24ec7a60310e1fb17bafe79dfae3251ed53150d271ed89a982585878097565 |
| MD5 | 2d19de6f5ead775e985acc76d70f2b7b |
| BLAKE2b-256 | 66570c205a5555338ad5123c26114f8433c7a9bd2772990353e6701f1c6169d1 |
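The SHA256 digests listed above can be compared against a downloaded file. The sketch below assumes GNU coreutils `sha256sum` is available (on macOS, `shasum -a 256` is the usual substitute); `verify_sha256` is a hypothetical helper name.

```shell
# Sketch: compare a file's SHA256 digest with an expected value.
# verify_sha256 is an illustrative helper, not part of sweap-cli.
verify_sha256() {
  file="$1"
  expected="$2"

  # sha256sum prints "<digest>  <filename>"; keep only the digest.
  actual="$(sha256sum "$file" | cut -d ' ' -f 1)"

  if [ "$actual" = "$expected" ]; then
    echo "OK: $file"
  else
    echo "MISMATCH: $file" >&2
    return 1
  fi
}
```

For the source distribution, this would be called as `verify_sha256 sweap_cli-0.1.0.tar.gz 8d388e4adaaa1a093d253a28699849f7119783d54839c82546584ea2b74a7b99`.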