Skip to main content

The Python Risk Identification Tool for LLMs (PyRIT) is a library used to assess the robustness of LLMs

Project description

Python Risk Identification Tool for generative AI (PyRIT)

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.

Introduction

PyRIT is a library developed by the AI Red Team for researchers and engineers to help them assess the robustness of their LLM endpoints against different harm categories such as fabrication/ungrounded content (e.g., hallucination), misuse (e.g., bias), and prohibited content (e.g., harassment).

PyRIT automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft).​

The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

Additionally, this tool allows researchers to iterate and improve their mitigations against different harms. For example, at Microsoft we are using this tool to iterate on different versions of a product (and its metaprompt) so that we can more effectively protect against prompt injection attacks.

PyRIT architecture

Where can I learn more?

Microsoft Learn has a dedicated page on AI Red Teaming.

Check out our docs for more information on how to install PyRIT, our How to Guide, and more, as well as our demos.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyrit-0.4.0.tar.gz (254.3 kB view details)

Uploaded Source

Built Distribution

pyrit-0.4.0-py3-none-any.whl (402.7 kB view details)

Uploaded Python 3

File details

Details for the file pyrit-0.4.0.tar.gz.

File metadata

  • Download URL: pyrit-0.4.0.tar.gz
  • Upload date:
  • Size: 254.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for pyrit-0.4.0.tar.gz
Algorithm Hash digest
SHA256 0482c547c41125ed06799805ca658a6c75a5dd8d1f1f765ebad09825b9390793
MD5 07843aedf9b74948c5a64e9a8b453148
BLAKE2b-256 e20f79e007af5d6ff4215b4e8a56e2a41ba75eeb7f74e2fb7db9dba26da4b6db

See more details on using hashes here.

File details

Details for the file pyrit-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: pyrit-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 402.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for pyrit-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4faa5be12c8f5c599477324d10a48607b53e4f90e309f0a9c1b20ba90f1b1d78
MD5 0771ad3be6087a853f8d0fe99faf858f
BLAKE2b-256 1f0811e1037ab26a6d142dc39e2b5fd9d8d79c9808dbf81a9fcf465d83f30c38

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page