Human Preference Aligned Video Generation Benchmark
Project description
HABench: Human Preference Aligned Video Generation Benchmark
HABench is a benchmark tool designed to systematically leverage MLLMs across all dimensions relevant to video generation assessment in generative models. By incorporating few-shot scoring and chain-of-query techniques, HA-Video-Bench provides a structured, scalable approach to generated video evaluation.
Evaluation
Multi-Dimensional Evaluation: Supports evaluation across several key dimensions of video generation:
| Dimension | Code Path |
|---|---|
| Image Quality | HAbench/staticquality.py |
| Aesthetic Quality | HAbench/staticquality.py |
| Temporal Consistency | HAbench/dynamicquality.py |
| Motion Effects | HAbench/dynamicquality.py |
| Object-Class Consistency | HAbench/VideoTextConsistency.py |
| Video-Text Consistency | HAbench/VideoTextConsistency.py |
| Color Consistency | HAbench/VideoTextConsistency.py |
| Action Consistency | HAbench/VideoTextConsistency.py |
| Scene Consistency | HAbench/VideoTextConsistency.py |
Support for Multiple Video Generation Models:
- Lavie
- Pika
- Show-1
- VideocrAfter2
- CogVideoX5B
- Kling
- Gen3
Installation Requirements
- Python >= 3.8
- OpenAI API access
Update your OpenAI API keys in
config.json:{ "GPT4o_API_KEY": "your-api-key", "GPT4o_BASE_URL": "your-base-url", "GPT4o_mini_API_KEY": "your-mini-api-key", "GPT4o_mini_BASE_URL": "your-mini-base-url" }
Installation
git clone https://github.com/yourusername/HABench.git
cd HABench
conda env create -f environment.yml
conda activate HABench
Data Preparation
Please organize your data according to the following structure:
/HABench/data/
├── color/ # 'color' dimension videos
│ ├── cogvideox5b/
│ │ ├── A red bird_0.mp4
│ │ ├── A red bird_1.mp4
│ │ └── ...
│ ├── lavie/
│ │ ├── A red bird_0.mp4
│ │ ├── A red bird_1.mp4
│ │ └── ...
│ ├── pika/
│ │ └── ...
│ └── ...
│
├── object_class/ # 'object_class' dimension videos
│ ├── cogvideox5b/
│ │ ├── A train_0.mp4
│ │ ├── A train_1.mp4
│ │ └── ...
│ ├── lavie/
│ │ └── ...
│ └── ...
│
├── scene/ # 'scene' dimension videos
│ ├── cogvideox5b/
│ │ ├── Botanical garden_0.mp4
│ │ ├── Botanical garden_1.mp4
│ │ └── ...
│ └── ...
│
├── action/ # 'action' 'temporal_consistency' 'motion_effects' dimension videos
│ ├── cogvideox5b/
│ │ ├── A person is marching_0.mp4
│ │ ├── A person is marching_1.mp4
│ │ └── ...
│ └── ...
│
└── overall_consistency/ # 'overall consistency' 'imaging_quality' 'aesthetic_quality' dimension videos
├── cogvideox5b/
│ ├── Close up of grapes on a rotating table._0.mp4
│ └── ...
├── lavie/
│ └── ...
├── pika/
│ └── ...
└── ...
Usage
Run the following command to evaluate the dimension you want to evaluate:
python evaluate.py \
--dimension $DIMENSION \
--videos_path ./data/{dimension} \
--config_path ./config.json/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file habench-0.1.0.tar.gz.
File metadata
- Download URL: habench-0.1.0.tar.gz
- Upload date:
- Size: 11.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d4788379dae1e6f1c54a0f616567a666c39363726c98627107a6b47fde02cca9
|
|
| MD5 |
14dd9a978b250cc5c1ab3f213d137db1
|
|
| BLAKE2b-256 |
4c78fc8322fb6f11ff28d0670c5a64dde102ed969b4a5fd9f33019837ad9d247
|
File details
Details for the file HAbench-0.1.0-py3-none-any.whl.
File metadata
- Download URL: HAbench-0.1.0-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ce1a28b84f3954f4ce3718dce32eb0c0d9d004a35bfe3ea9452cdb060c7b2ecd
|
|
| MD5 |
1f81f304a52fa7824e097f7389313633
|
|
| BLAKE2b-256 |
a6d3c10f22587d28afcf9a2572e9f7297aeb342d89c32605c67023c0365c3a63
|