Skip to main content

HackAgent is an open-source security toolkit to detect vulnerabilities of your AI Agents.

Project description

HackAgent - AI Agent Security Testing Toolkit

AI Security Red-Team Toolkit


App -- Docs -- API


Python Version License uv Commitizen Ruff Test Coverage CI Status

What is HackAgent?

HackAgent is a comprehensive Python SDK and CLI designed to help security researchers, developers, and AI safety practitioners evaluate and strengthen the security of AI agents.

As AI agents become more powerful and autonomous, they face security challenges that traditional testing tools cannot address:

Threat Description
Prompt Injection Malicious inputs that hijack agent behavior
Jailbreaking Bypassing safety guardrails and content filters
Goal Hijacking Manipulating agents to pursue unintended objectives
Tool Misuse Exploiting agent capabilities for unauthorized actions

HackAgent automates testing for these vulnerabilities using research-backed attack techniques, helping you identify and fix security issues before they are exploited.

HackAgent CLI Demo

Interactive TUI with real-time attack progress and visual reporting.

Get Started Now

Quick Install

python3 -m venv .venv
source .venv/bin/activate
pip install hackagent

No API key required: HackAgent works locally out of the box.

Questions? Join community discussions or email ais@ai4i.it.

Architecture

HackAgent uses a modular pipeline to test agent robustness end-to-end.

Component Description
Attack Engine Orchestrates attacks using AdvPrefix, AutoDAN-Turbo, PAIR, TAP, FlipAttack, BoN, h4rm3l, CipherChat, PAP, and Baseline
Generator LLM role that creates adversarial prompts to test the target agent
Judge LLM role that evaluates whether attacks bypass safety measures
Target Agent Your AI agent under test across supported frameworks
Datasets Pre-built benchmark presets plus custom HuggingFace/file datasets

Supported Frameworks

Google ADK OpenAI SDK LiteLLM LangChain

Reporting

HackAgent supports both local and remote reporting.

  • Local mode stores test results in SQLite and includes a built-in dashboard.
  • Cloud mode syncs runs to the HackAgent remote platform when an API key is configured.
hackagent web

Access cloud reporting at https://app.hackagent.dev.

Responsible Use

HackAgent is designed for authorized security testing only. Always obtain explicit permission before testing any AI system.

Do

  • Test your own agents
  • Conduct authorized pentesting
  • Follow coordinated disclosure
  • Share security knowledge responsibly

Don't

  • Test systems without permission
  • Exploit vulnerabilities maliciously
  • Violate terms of service
  • Share harmful exploit instructions irresponsibly

Read the full guidelines: Responsible Disclosure

Contributing

Contributions are welcome. See CONTRIBUTING.md and CODE_OF_CONDUCT.md.

License

Licensed under Apache-2.0. See LICENSE.

Disclaimer

HackAgent is intended for security research and AI safety improvement. The authors are not responsible for misuse.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hackagent-0.8.0.tar.gz (511.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hackagent-0.8.0-py3-none-any.whl (702.8 kB view details)

Uploaded Python 3

File details

Details for the file hackagent-0.8.0.tar.gz.

File metadata

  • Download URL: hackagent-0.8.0.tar.gz
  • Upload date:
  • Size: 511.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for hackagent-0.8.0.tar.gz
Algorithm Hash digest
SHA256 d0e4457ce3fe7a290083a32bf0eca934a0f74226bd1feeeb7c3995da03daec25
MD5 2fec8009879110c2708090e6503ce2cb
BLAKE2b-256 dac06a5d0e17ae71a1b6317b4dea9e8886472c679745a2e0ae13bd7fb0d9b688

See more details on using hashes here.

File details

Details for the file hackagent-0.8.0-py3-none-any.whl.

File metadata

  • Download URL: hackagent-0.8.0-py3-none-any.whl
  • Upload date:
  • Size: 702.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for hackagent-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a254b14bb43e1faf3cad4c3fe37422de8f82d16557455b4b28fe413bf5c62c57
MD5 3d8b08ec8ce748d60574322095dd61bb
BLAKE2b-256 22a4b80cf343c65f821e7d4336636d08e78bdd12cc36385e703c9d599bd98249

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page