Skip to main content

The Python Risk Identification Tool for LLMs (PyRIT) is a library used to assess the robustness of LLMs

Project description

Python Risk Identification Tool for generative AI (PyRIT)

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.

Introduction

PyRIT is a library developed by the AI Red Team for researchers and engineers to help them assess the robustness of their LLM endpoints against different harm categories such as fabrication/ungrounded content (e.g., hallucination), misuse (e.g., bias), and prohibited content (e.g., harassment).

PyRIT automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft).​

The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

Additionally, this tool allows researchers to iterate and improve their mitigations against different harms. For example, at Microsoft we are using this tool to iterate on different versions of a product (and its metaprompt) so that we can more effectively protect against prompt injection attacks.

PyRIT architecture

Where can I learn more?

Microsoft Learn has a dedicated page on AI Red Teaming.

Check out our docs for more information on how to install PyRIT, our How to Guide, and more, as well as our demos folder.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyrit-0.1.2.tar.gz (2.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyrit-0.1.2-py3-none-any.whl (115.7 kB view details)

Uploaded Python 3

File details

Details for the file pyrit-0.1.2.tar.gz.

File metadata

  • Download URL: pyrit-0.1.2.tar.gz
  • Upload date:
  • Size: 2.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for pyrit-0.1.2.tar.gz
Algorithm Hash digest
SHA256 7208fed7c69bc5d50fb12b92a12332b38fffb7588ade58b621cfa17f411d9370
MD5 8bb7e97c21ba60cc791af37541f0415a
BLAKE2b-256 e609023bf1c50e1f6605fcf267942ed9c77b4f7a52441cb2be35fefaed871aa5

See more details on using hashes here.

File details

Details for the file pyrit-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: pyrit-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 115.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for pyrit-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 73f0dce7258d246a5fbebe9cefda52fff4e2c80eedf3a1ddbdc1eb41ce13b56d
MD5 e4fd9563380f1b5fd78c050fbe64cc29
BLAKE2b-256 e8821e4b8ae159de3cf8053ce29c07d2a5114a5ce78d405c15d80000f49944c1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page