Skip to main content

Test ZenGuard AI against different datasets and benchmarks.

Project description

Documentation License: MIT PyPI version Open In Colab

ZenGuard AI Benchmarks

This repository contains benchmarks for ZenGuard AI and information on how to run them.

There are two types of benchmarks that we run against ZenGuard AI:

  1. Hugging Face datasets based benchmarks
  2. ZenGuard AI Generated Benchmark - Zen Bench

Here you can find both benchmark results and how to run them yourself.

Public Datasets benchmarks

We are constantly monitoring Hugging Face for new datasets that relate to GenAI security. Then we run them against ZenGuard AI to find any potential security issues with our product.

ZenGuard AI Accuracy against Hugging Face datasets

# Dataset Accuracy Date Added
1 xTRam1/safe-guard-prompt-injection 96% 2024-07-01
2 yanismiraoui/prompt_injections 96.5% 2024-08-01
3 deepset/prompt-injections 87% 2024-05-15
4 JasperLS/prompt-injections 87% 2024-05-15
4 aporia-ai/prompt_injection 87.68% 2024-05-15

Check for yourself. Or run your own dataset.

Open In Colab

We have developed the ZenGuard Benchmarks PyPi package to help test and benchmark ZenGuard AI better.

Here are the instructions on how to use the package.

Benchmarking Output

Here is an example of what the benchmarking output looks like:

alt text

Where:

  • Total Samples: The total number of prompts processed.
  • Correct: The number of prompts that were classified correctly.
  • False Positives: The number of prompts incorrectly identified as attacks.
  • False Negatives: The number of actual prompt attacks that went undetected.
  • Accuracy: The ratio of correctly classified prompts to the total number of samples.

Zen Bench

More information

A much more detailed documentation is available at docs.zenguard.ai.

Test the capabilities of ZenGuard AI in our ZenGuard Playground. It's available to start for free to understand how our guardrails can enhance your GenAI applications.

Check out our Client library to get started with integrating ZenGuard AI into your project.

Support

Book a Demo or just shoot us an email to hello@zenguard.ai

Topics we care about - LLM Security, LLM Guardrails, Prompt Injections, GenAI Security.


Developed with ❤️ by ZenGuard AI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zenguard_benchmarks-0.1.5.tar.gz (4.7 kB view details)

Uploaded Source

Built Distribution

zenguard_benchmarks-0.1.5-py3-none-any.whl (5.5 kB view details)

Uploaded Python 3

File details

Details for the file zenguard_benchmarks-0.1.5.tar.gz.

File metadata

  • Download URL: zenguard_benchmarks-0.1.5.tar.gz
  • Upload date:
  • Size: 4.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.9 Darwin/23.6.0

File hashes

Hashes for zenguard_benchmarks-0.1.5.tar.gz
Algorithm Hash digest
SHA256 5aadb62748ac935137ebcfc651aa009856bd6027cccf4c392f8c255deea7fb25
MD5 ff51b83a7699bbc940f838d624e70562
BLAKE2b-256 ddd5a225a80df0ef2574d503d8b1ffa8fa35340b0d766565b2a8e0c6ffeab9ba

See more details on using hashes here.

File details

Details for the file zenguard_benchmarks-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for zenguard_benchmarks-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 75dc5aa753c24b02b1519b6fada17a0eb8cf6c99cab2ce57bd0cf954bd174deb
MD5 acd27d939543d0a011da019b6e20579e
BLAKE2b-256 1668ce3afecaa6a3eb4d15c3ef4f8e040f5b7032d976167439aa8914eee9d3eb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page