Test ZenGuard AI against different datasets and benchmarks.
ZenGuard AI Benchmarks
This repository contains benchmarks for ZenGuard AI and information on how to run them.
There are two types of benchmarks that we run against ZenGuard AI:
- Benchmarks based on public Hugging Face datasets
- ZenGuard AI's own generated benchmark, Zen Bench
Here you can find both the benchmark results and instructions for running them yourself.
Public Dataset Benchmarks
We continuously monitor Hugging Face for new datasets related to GenAI security and run them against ZenGuard AI to uncover potential security issues in our product.
ZenGuard AI Accuracy against Hugging Face datasets
| # | Dataset | Accuracy | Date Added |
|---|---------|----------|------------|
| 1 | xTRam1/safe-guard-prompt-injection | 96% | 2024-07-01 |
| 2 | yanismiraoui/prompt_injections | 96.5% | 2024-08-01 |
| 3 | deepset/prompt-injections | 87% | 2024-05-15 |
| 4 | JasperLS/prompt-injections | 87% | 2024-05-15 |
| 5 | aporia-ai/prompt_injection | 87.68% | 2024-05-15 |
Check the results for yourself, or run your own dataset. We have developed the ZenGuard Benchmarks PyPI package to make testing and benchmarking ZenGuard AI easier; you can install it with `pip install zenguard-benchmarks`. Here are the instructions on how to use the package.
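As a rough illustration (not the official `zenguard-benchmarks` harness), the sketch below shows how a dataset like the ones in the table above could be scored against a prompt-injection detector. The `datasets` library call is real; the `text`/`label` column names and the `check_prompt` stand-in are assumptions to replace with the dataset's actual schema and your ZenGuard AI detection call.

```python
# Illustrative sketch only — not the zenguard-benchmarks package itself.
# Assumes: `pip install datasets`, and a dataset with "text" and "label" columns
# (label 1 = prompt attack, 0 = benign).
from datasets import load_dataset


def check_prompt(prompt: str) -> bool:
    # Stand-in detector: swap in your ZenGuard AI detection call here.
    return "ignore previous instructions" in prompt.lower()


dataset = load_dataset("deepset/prompt-injections", split="test")

correct = 0
for row in dataset:
    predicted_attack = check_prompt(row["text"])  # detector verdict
    actual_attack = bool(row["label"])            # ground-truth label
    if predicted_attack == actual_attack:
        correct += 1

print(f"Accuracy: {correct / len(dataset):.2%}")
```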
Benchmarking Output
A benchmarking run reports the following fields:
- Total Samples: The total number of prompts processed.
- Correct: The number of prompts that were classified correctly.
- False Positives: The number of prompts incorrectly identified as attacks.
- False Negatives: The number of actual prompt attacks that went undetected.
- Accuracy: The ratio of correctly classified prompts to the total number of samples.
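These fields follow directly from comparing detector verdicts with ground-truth labels. The snippet below is a minimal sketch of those relationships (it is not taken from the package itself):

```python
# How the reported fields relate: predictions are detector verdicts
# (True = flagged as attack), labels are ground truth (True = actual attack).
def summarize(predictions: list[bool], labels: list[bool]) -> dict:
    total = len(labels)
    correct = sum(p == t for p, t in zip(predictions, labels))
    false_positives = sum(p and not t for p, t in zip(predictions, labels))
    false_negatives = sum(t and not p for p, t in zip(predictions, labels))
    return {
        "Total Samples": total,
        "Correct": correct,
        "False Positives": false_positives,
        "False Negatives": false_negatives,
        "Accuracy": correct / total if total else 0.0,
    }


print(summarize([True, False, True, False], [True, False, False, False]))
# -> {'Total Samples': 4, 'Correct': 3, 'False Positives': 1,
#     'False Negatives': 0, 'Accuracy': 0.75}
```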
Zen Bench
More information
Much more detailed documentation is available at docs.zenguard.ai.
Test the capabilities of ZenGuard AI in our ZenGuard Playground. It's free to get started and lets you see how our guardrails can enhance your GenAI applications.
Check out our Client library to get started with integrating ZenGuard AI into your project.
Support
Book a Demo or just shoot us an email at hello@zenguard.ai.
Topics we care about: LLM Security, LLM Guardrails, Prompt Injections, GenAI Security.
Developed with ❤️ by ZenGuard AI
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution: zenguard_benchmarks-0.1.5.tar.gz
Built Distribution: zenguard_benchmarks-0.1.5-py3-none-any.whl
File details
Details for the file zenguard_benchmarks-0.1.5.tar.gz.
File metadata
- Download URL: zenguard_benchmarks-0.1.5.tar.gz
- Upload date:
- Size: 4.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.7.1 CPython/3.11.9 Darwin/23.6.0
File hashes
| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | 5aadb62748ac935137ebcfc651aa009856bd6027cccf4c392f8c255deea7fb25 |
| MD5 | ff51b83a7699bbc940f838d624e70562 |
| BLAKE2b-256 | ddd5a225a80df0ef2574d503d8b1ffa8fa35340b0d766565b2a8e0c6ffeab9ba |
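To confirm that a downloaded archive matches the published digest, a quick standard-library check is enough; the file path below assumes the sdist sits in your current directory.

```python
# Verify the downloaded sdist against the SHA256 digest listed above.
import hashlib

expected = "5aadb62748ac935137ebcfc651aa009856bd6027cccf4c392f8c255deea7fb25"

with open("zenguard_benchmarks-0.1.5.tar.gz", "rb") as f:  # assumes the file is in the CWD
    actual = hashlib.sha256(f.read()).hexdigest()

print("OK" if actual == expected else "MISMATCH")
```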
File details
Details for the file zenguard_benchmarks-0.1.5-py3-none-any.whl.
File metadata
- Download URL: zenguard_benchmarks-0.1.5-py3-none-any.whl
- Upload date:
- Size: 5.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.7.1 CPython/3.11.9 Darwin/23.6.0
File hashes
| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | 75dc5aa753c24b02b1519b6fada17a0eb8cf6c99cab2ce57bd0cf954bd174deb |
| MD5 | acd27d939543d0a011da019b6e20579e |
| BLAKE2b-256 | 1668ce3afecaa6a3eb4d15c3ef4f8e040f5b7032d976167439aa8914eee9d3eb |