A brief description of your project

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation [NeurIPS 2024]

world

ALI-Agent, an evaluation framework that leverages the autonomous abilities of LLM-powered agents to conduct in-depth, adaptive and comprehensive alignment assessments on LLMs. ALI-Agent operates through two principal stages: Emulation and Refinement. During the Emulation stage, ALI-Agent automates the generation of realistic test scenarios. In the Refinement stage, it iteratively refines the scenarios to probe long-tail risks. Specifically, ALI-Agent incorporates a memory module to guide test scenario generation, a tool-using module to reduce human labor in tasks such as evaluating feedback from target LLMs, and an action module to refine tests.

📋 Catalogue

Catalogue
Preparations
Evaluation
- Quick Start
- See the result

⚙️ Preparations

Step 1. Install requirements.txt

Set up a virtualenv and install the pytorch manually.

Our experiments have been tested on Python 3.9.17 with PyTorch 2.0.1+cu117.

conda create --name myenv python=3.9.17
conda activate myenv

After that, install all the dependencies listed in the requirements.txt file by running the following command:

pip install -r requirements.txt

Step 2. Download checkpoints of evaluator

You can find checkpoints of evaluators in the link : (checkpoints)

Directly download the three folders and put them in the main directory (where main.py can be found).

⌛️ Evaluation

Make sure you are in the main directory (where main.py can be found).

Replace "OPENAI_API_KEY" in simulation/utils.py with your own OpenAI API key.

Quick Start

To run the agent on a specified dataset, run code as

python main.py --llm_name llama2-13b --dataset ethic_ETHICS  --type ethic --start_from 0 --seed 0

Supported names for llm_name, data_set, type can be found in parse.py

To run the agent with web browsing, replace "BING_API_KEY" in simulation/utils.py with your own BING API key.

python main.py --llm_name llama2-13b --web_browsing

See the Results

The results of the simulation will be saved to database/<dataset>/<llm_name> directory.

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.2.4

Feb 3, 2025

0.2.3

Feb 3, 2025

This version

0.2.2

Feb 3, 2025

0.2.1

Feb 3, 2025

0.1.4

Feb 3, 2025

0.1.3

Feb 3, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ali_agent-0.2.2.tar.gz (5.5 MB view details)

Uploaded Feb 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ALI_Agent-0.2.2-py3-none-any.whl (5.5 MB view details)

Uploaded Feb 3, 2025 Python 3

File details

Details for the file ali_agent-0.2.2.tar.gz.

File metadata

Download URL: ali_agent-0.2.2.tar.gz
Upload date: Feb 3, 2025
Size: 5.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.17

File hashes

Hashes for ali_agent-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`0c92c18229189e02cb99d95dab6929bfee8708b777249bb8c417323201ee2449`
MD5	`04b28129287fdbb81ef39351df352809`
BLAKE2b-256	`3a3c207b99c7805c719ed2575c5578200629bc48dc73eaac986769076910ee94`

See more details on using hashes here.

File details

Details for the file ALI_Agent-0.2.2-py3-none-any.whl.

File metadata

Download URL: ALI_Agent-0.2.2-py3-none-any.whl
Upload date: Feb 3, 2025
Size: 5.5 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.17

File hashes

Hashes for ALI_Agent-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c7db251845cd8eed3e8f2d4c469ce88d9f470d51c294e673904cb014e019d8fc`
MD5	`8ace17333627c8411362a4787af9f311`
BLAKE2b-256	`24644949af8f9c429c4ce85814a9f31934ebbbdef0a016f93e311d17bb36e504`

See more details on using hashes here.

ALI-Agent 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation [NeurIPS 2024]

📋 Catalogue

⚙️ Preparations

Step 1. Install requirements.txt

Step 2. Download checkpoints of evaluator

⌛️ Evaluation

Quick Start

See the Results

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes