VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.
Project description
VideoGenHub
VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.
- We define 2 prominent generation tasks (Text-to-Video and Image-to-Video).
- We built a unified inference pipeline to ensure fair comparison. We currently support around 10 models.
📰 News
- 2024 Jun 10: GenAI-Arena Paper is out. It is featured on Huggingface Daily Papers.
📄 Table of Contents
🛠️ Installation 🔝
To install from pypi:
pip install videogen-hub
To install from github:
git clone https://github.com/TIGER-AI-Lab/VideoGenHub.git
cd VideoGenHub
cd env_cfg
pip install -r requirements.txt
cd ..
pip install -e .
The requirement of opensora is in env_cfg/opensora.txt
For some models like show one, you need to login through huggingface-cli
.
huggingface-cli login
👨🏫 Get Started 🔝
Benchmarking
To reproduce our experiment using benchmark.
For text-to-video generation:
./t2v_inference.sh --<model_name> --<device>
Infering one model
import videogen_hub
model = videogen_hub.load('VideoCrafter2')
video = model.infer_one_video(prompt="A child excitedly swings on a rusty swing set, laughter filling the air.")
# Here video is a torch tensor of shape torch.Size([16, 3, 320, 512])
See Google Colab here: https://colab.research.google.com/drive/145UMsBOe5JLqZ2m0LKqvvqsyRJA1IeaE?usp=sharing
🧠 Philosophy 🔝
By streamlining research and collaboration, VideoGenHub plays a pivotal role in propelling the field of Video Generation.
- Purity of Evaluation: We ensure a fair and consistent evaluation for all models, eliminating biases.
- Research Roadmap: By defining tasks and curating datasets, we provide clear direction for researchers.
- Open Collaboration: Our platform fosters the exchange and cooperation of related technologies, bringing together minds and innovations.
Implemented Models
We included more than 10 Models in video generation.
Method | Venue | Type |
---|---|---|
LaVie | - | Text-To-Video Generation |
VideoCrafter2 | - | Text-To-Video Generation |
ModelScope | - | Text-To-Video Generation |
StreamingT2V | - | Text-To-Video Generation |
Show 1 | - | Text-To-Video Generation |
OpenSora | - | Text-To-Video Generation |
OpenSora-Plan | - | Text-To-Video Generation |
T2V-Turbo | - | Text-To-Video Generation |
DynamiCrafter2 | - | Image-To-Video Generation |
SEINE | ICLR'24 | Image-To-Video Generation |
Consisti2v | - | Image-To_Video Generation |
I2VGenXL | - | Image-To_Video Generation |
🎫 License 🔝
This project is released under the License.
🖊️ Citation 🔝
This work is a part of GenAI-Arena work.
Please kindly cite our paper if you use our code, data, models or results:
@misc{jiang2024genai,
title={GenAI Arena: An Open Evaluation Platform for Generative Models},
author={Dongfu Jiang and Max Ku and Tianle Li and Yuansheng Ni and Shizhuo Sun and Rongqi Fan and Wenhu Chen},
year={2024},
eprint={2406.04485},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file videogen_hub-0.1.2.tar.gz
.
File metadata
- Download URL: videogen_hub-0.1.2.tar.gz
- Upload date:
- Size: 420.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 38cefaeaca47289cb1f860d75a9e9980b78217991e532209cc4c61c9f670de4b |
|
MD5 | a3685b6b13dcf2f573ed4c4030fe011e |
|
BLAKE2b-256 | df954c43d963ee8824f679b3eff2ab28e6711605bdc2ac3aeb7247023a5b0032 |
File details
Details for the file videogen_hub-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: videogen_hub-0.1.2-py3-none-any.whl
- Upload date:
- Size: 555.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3ec54d59700abe897ba93c164acad1aa5db24608fdff6384a74381d1f90d8920 |
|
MD5 | 87141830aef70b93f6dbdca1d0526639 |
|
BLAKE2b-256 | 0d52378589fc1bc945574830b3b59e258a9d83f9e04db6c0bb333b67162e7ab9 |