LLM Red Teaming Framework for defensive security research

These details have not been verified by PyPI

Project links

Project description

HiveTrace Red

HiveTrace Red: LLM Red Teaming Framework

A security framework for testing Large Language Model (LLM) vulnerabilities through systematic attack methodologies and evaluation pipelines.

HiveTrace Red can be used for:

Red teaming your LLM applications - Test safety guardrails before deployment
Research & benchmarking - Systematic evaluation of LLM robustness across attack vectors
Compliance testing - Validate AI safety requirements and regulatory standards
Attack technique research - Explore and compose novel jailbreak methodologies

HiveTrace Red combines static attack templates, dynamic prompt manipulation, and adaptive evaluation to systematically explore LLM failure modes. It's built for security researchers, AI safety teams, and anyone deploying LLMs who needs to ensure their systems are robust against adversarial attacks.

Features

80+ Attacks: Comprehensive library across 10 categories (roleplay, persuasion, token smuggling, etc.)
Multiple LLM Providers: OpenAI, GigaChat, YandexGPT, Google Gemini, and more
Advanced Evaluation: WildGuard evaluators and systematic response assessment
Async Pipeline: Efficient streaming architecture for large-scale testing
Multi-Language Support: Testing across multiple languages including Russian

Attack Categories

Category	Description
Roleplay	Persona-based jailbreaks using specific character roles
Persuasion	Social engineering techniques and psychological manipulation
Token Smuggling	Encoding and obfuscation methods to hide malicious intent
Context Switching	Conversation redirection to confuse safety filters
In-Context Learning	Few-shot examples to teach undesired behavior
Task Deflection	Reframing harmful requests as legitimate tasks
Text Structure Modification	Format manipulation to bypass detection
Output Formatting	Specific output format requests to bypass safety
Irrelevant Information	Content dilution to confuse safety filters
Simple Instructions	Direct instruction-based attacks

How It Works

Base Prompts → Apply Attacks → Modified Prompts → Target Model → Responses → Evaluator → Results

The framework provides a 3-stage pipeline:

Attack Generation: Apply various attack techniques to base prompts
Model Testing: Send modified prompts to target LLMs
Evaluation: Assess responses using WildGuard or custom evaluators

The hivetracered-report command generates comprehensive HTML reports with:

Executive summary with key metrics and OWASP LLM Top 10 mapping
Interactive charts showing attack success rates by type and name
Content analysis with response length distributions
Data explorer with filtering capabilities
Sample prompts and responses for detailed inspection

Results Example

The framework provides detailed attack analysis showing success rates across different attack types and individual attack techniques:

Attack Analysis Results

The analysis includes:

Success Rate by Attack Type: Comparative effectiveness of different attack categories (persuasion, roleplay, simple instructions, etc.)
Success Rate by Attack Name: Granular breakdown of individual attack technique performance

Installation

Install HiveTraceRed via pip:

pip install hivetracered

This will install the package and make the hivetracered and hivetracered-report CLI commands available.

Alternatively, install from source:

git clone https://github.com/HiveTrace/HiveTraceRed.git
cd HiveTraceRed
pip install -e .

Documentation

📖 Complete Documentation - Installation, tutorials, API reference, and attack guides

Requirements

Python 3.10 or higher
pip package manager
Virtual environment (recommended)

Responsible Use

⚠️ This tool is designed for defensive security research only.

HiveTrace Red should be used exclusively for:

Testing and improving your own LLM systems
Developing robust AI safety mechanisms
Conducting authorized security assessments
Academic research on LLM vulnerabilities

Do NOT use this tool for:

Attacking systems you don't own or have permission to test
Malicious purposes or causing harm
Bypassing safety measures in production systems without authorization

Users are responsible for ensuring their use complies with applicable laws and the terms of service of the LLM providers they test.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.16

May 21, 2026

1.0.15

Apr 21, 2026

1.0.14

Mar 13, 2026

1.0.13

Feb 20, 2026

1.0.12

Feb 16, 2026

1.0.11

Feb 16, 2026

1.0.10

Feb 13, 2026

1.0.9

Jan 29, 2026

1.0.8

Jan 21, 2026

1.0.7

Jan 12, 2026

1.0.6

Dec 29, 2025

1.0.5

Dec 17, 2025

1.0.4

Dec 9, 2025

1.0.3

Dec 3, 2025

1.0.2

Oct 17, 2025

This version

1.0.1

Oct 14, 2025

1.0.0

Oct 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hivetracered-1.0.1.tar.gz (163.0 kB view details)

Uploaded Oct 14, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hivetracered-1.0.1-py3-none-any.whl (267.9 kB view details)

Uploaded Oct 14, 2025 Python 3

File details

Details for the file hivetracered-1.0.1.tar.gz.

File metadata

Download URL: hivetracered-1.0.1.tar.gz
Upload date: Oct 14, 2025
Size: 163.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for hivetracered-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`e6e3960d88ea556df9dad0f3eff2571bfc97df972e86ddf8d5e6c6749ec8ba00`
MD5	`92511b9c0474e142b23707ed24669911`
BLAKE2b-256	`bf77726a4fbf31ee114dac7a51cafde737035ae569250d767d2ac67d80efe3f1`

See more details on using hashes here.

File details

Details for the file hivetracered-1.0.1-py3-none-any.whl.

File metadata

Download URL: hivetracered-1.0.1-py3-none-any.whl
Upload date: Oct 14, 2025
Size: 267.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for hivetracered-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c06ca8aee102c7cf2e6972bcc805517874a678b8339326287508c070d9cfe079`
MD5	`878674cd67860b9f6ae3a26c4c144c7c`
BLAKE2b-256	`c7ecb414d333067f09c47839a46fd572eca8a6d518b252ca274a72a83a017dad`

See more details on using hashes here.

hivetracered 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

HiveTrace Red: LLM Red Teaming Framework

Features

Attack Categories

How It Works

Results Example

Installation

Documentation

Requirements

Responsible Use

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes