A brief description of your project
Project description
ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation [NeurIPS 2024]
ALI-Agent, an evaluation framework that leverages the autonomous abilities of LLM-powered agents to conduct in-depth, adaptive and comprehensive alignment assessments on LLMs. ALI-Agent operates through two principal stages: Emulation and Refinement. During the Emulation stage, ALI-Agent automates the generation of realistic test scenarios. In the Refinement stage, it iteratively refines the scenarios to probe long-tail risks. Specifically, ALI-Agent incorporates a memory module to guide test scenario generation, a tool-using module to reduce human labor in tasks such as evaluating feedback from target LLMs, and an action module to refine tests.
📋 Catalogue
⚙️ Preparations
Step 1. Install requirements.txt
Set up a virtualenv and install the pytorch manually.
Our experiments have been tested on Python 3.9.17 with PyTorch 2.0.1+cu117.
conda create --name myenv python=3.9.17
conda activate myenv
After that, install all the dependencies listed in the requirements.txt file by running the following command:
pip install -r requirements.txt
Step 2. Download checkpoints of evaluator
You can find checkpoints of evaluators in the link : (checkpoints)
Directly download the three folders and put them in the main directory (where main.py can be found).
⌛️ Evaluation
Make sure you are in the main directory (where main.py can be found).
Replace "OPENAI_API_KEY" in simulation/utils.py with your own OpenAI API key.
Quick Start
To run the agent on a specified dataset, run code as
python main.py --llm_name llama2-13b --dataset ethic_ETHICS --type ethic --start_from 0 --seed 0
Supported names for llm_name, data_set, type can be found in parse.py
To run the agent with web browsing, replace "BING_API_KEY" in simulation/utils.py with your own BING API key.
python main.py --llm_name llama2-13b --web_browsing
See the Results
The results of the simulation will be saved to database/<dataset>/<llm_name> directory.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ali_agent-0.1.3.tar.gz.
File metadata
- Download URL: ali_agent-0.1.3.tar.gz
- Upload date:
- Size: 5.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
72d3c50446bb22acfb6913c2b39bec2eb269bf0b4ad85855b9f3125b9d72bda5
|
|
| MD5 |
dbf65550041e20ea6859bec183b943a4
|
|
| BLAKE2b-256 |
be1babb959e32ee4935d998d09fb06e741327ebb23f022b222aa48ce9eda7f94
|
File details
Details for the file ALI_Agent-0.1.3-py3-none-any.whl.
File metadata
- Download URL: ALI_Agent-0.1.3-py3-none-any.whl
- Upload date:
- Size: 5.5 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e932928983745852779c0b359b05a02cea7d464c0eadb07dddb284556791bfa
|
|
| MD5 |
8ed59393e78b68c867824020e6ce6dca
|
|
| BLAKE2b-256 |
c3188dbd325e5164cac266da070ab3f43a1e96ed418a096f2429b5a30dea6bcc
|