Skip to main content

Human Time-to-Completion Evaluation CLI

Project description

hte-cli

Human Time-to-Completion Evaluation CLI - A tool for running assigned cybersecurity tasks with timing and result tracking.

Installation

# Recommended (pipx)
pipx install hte-cli

# Or with pip
pip install hte-cli

Quick Start

  1. Login (get credentials from your coordinator):

    hte-cli auth login
    
  2. View your assigned tasks:

    hte-cli tasks list
    
  3. Run a task:

    hte-cli tasks run
    

Commands

  • hte-cli auth login - Authenticate with the API
  • hte-cli auth status - Check authentication status
  • hte-cli tasks list - List your pending tasks
  • hte-cli tasks run [TASK_ID] - Run a task (defaults to highest priority)
  • hte-cli tasks pull-images - Pre-pull Docker images for upcoming tasks
  • hte-cli version - Show version info

System Requirements

All Platforms

  • Python 3.11+
  • Docker Engine >= 24.0.6
  • Docker Compose >= 2.21.0 (included in Docker Desktop 4.25+)

Windows

  • Docker Desktop with WSL2 backend enabled
  • WSL2 installed and configured (Microsoft docs)

macOS

  • Docker Desktop (Intel or Apple Silicon)
  • Note: Apple Silicon (M1/M2/M3) runs x86 containers via emulation - expect slightly slower performance

Linux

  • Docker Engine >= 24.0.6 with Compose plugin
  • User added to docker group: sudo usermod -aG docker $USER (log out and back in after)

Verify Docker Setup

# Check Docker Engine version (need >= 24.0.6)
docker version --format '{{.Client.Version}}'

# Check Docker Compose version (need >= 2.21.0)
docker compose version --short

# Test container can start
docker run --rm hello-world

If your Docker Compose version is too old, update Docker Desktop or install the latest compose plugin:

# Linux: Update compose plugin
sudo apt-get update && sudo apt-get install docker-compose-plugin

Configuration

Set HTE_API_URL environment variable to use a custom API endpoint:

export HTE_API_URL="http://your-server.com/api/v1/cli"

Support

For issues, contact your study coordinator or open an issue at: https://github.com/sean-peters-au/lyptus-mono


Developer Notes

This CLI is a thin wrapper with no consequential research decisions. It:

  • Wraps Inspect AI's human_cli agent for task execution
  • Syncs results to the backend API
  • Handles authentication via OAuth-style code exchange

The research-relevant code lives elsewhere:

  • Task sampling: scripts/sample_tasks_for_trials.py
  • Scoring criteria: src/human_ttc_eval/datasets/*/
  • Methodology: docs/methodology/human-expert-methodology-guide.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hte_cli-0.2.11.tar.gz (196.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hte_cli-0.2.11-py3-none-any.whl (29.9 kB view details)

Uploaded Python 3

File details

Details for the file hte_cli-0.2.11.tar.gz.

File metadata

  • Download URL: hte_cli-0.2.11.tar.gz
  • Upload date:
  • Size: 196.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.6

File hashes

Hashes for hte_cli-0.2.11.tar.gz
Algorithm Hash digest
SHA256 3d09a2c065a28cf040a9de6f420e2ed6b9824e64328dd8e0bd423b0f88012fbf
MD5 73e91d92c4f7fbf0dc7553e3d6de4307
BLAKE2b-256 f4546a0aa60a346abd1c6e6e3a7a9ccf64deea612180bfbfa0a1cd1bd1798c14

See more details on using hashes here.

File details

Details for the file hte_cli-0.2.11-py3-none-any.whl.

File metadata

  • Download URL: hte_cli-0.2.11-py3-none-any.whl
  • Upload date:
  • Size: 29.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.6

File hashes

Hashes for hte_cli-0.2.11-py3-none-any.whl
Algorithm Hash digest
SHA256 d67dc37927a38d49e254b5ff2d3f4f773b5d61613f61af36dbccbb4fdf75663d
MD5 d0546418e9360514a57068642c6b2825
BLAKE2b-256 e06be4ab891fd2a1967fad42df7c0f5743ee48fe8e4bbf4c77a50ebc274742f0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page