Skip to main content

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

Project description

VideoGenHub

contributors license GitHub Hits

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

  • We define 2 prominent generation tasks (Text-to-Video and Image-to-Video).
  • We built a unified inference pipeline to ensure fair comparison. We currently support around 10 models.

📰 News

📄 Table of Contents

🛠️ Installation 🔝

To install from pypi:

pip install videogen-hub

To install from github:

git clone https://github.com/TIGER-AI-Lab/VideoGenHub.git
cd VideoGenHub
cd env_cfg
pip install -r requirements.txt
cd ..
pip install -e .

The requirement of opensora is in env_cfg/opensora.txt

For some models like show one, you need to login through huggingface-cli.

huggingface-cli login

👨‍🏫 Get Started 🔝

Benchmarking

To reproduce our experiment using benchmark.

For text-to-video generation:

./t2v_inference.sh --<model_name> --<device>

Infering one model

import videogen_hub

model = videogen_hub.load('VideoCrafter2')
video = model.infer_one_video(prompt="A child excitedly swings on a rusty swing set, laughter filling the air.")

# Here video is a torch tensor of shape torch.Size([16, 3, 320, 512])

See Google Colab here: https://colab.research.google.com/drive/145UMsBOe5JLqZ2m0LKqvvqsyRJA1IeaE?usp=sharing

🧠 Philosophy 🔝

By streamlining research and collaboration, VideoGenHub plays a pivotal role in propelling the field of Video Generation.

  • Purity of Evaluation: We ensure a fair and consistent evaluation for all models, eliminating biases.
  • Research Roadmap: By defining tasks and curating datasets, we provide clear direction for researchers.
  • Open Collaboration: Our platform fosters the exchange and cooperation of related technologies, bringing together minds and innovations.

Implemented Models

We included more than 10 Models in video generation.

Method Venue Type
LaVie - Text-To-Video Generation
VideoCrafter2 - Text-To-Video Generation
ModelScope - Text-To-Video Generation
StreamingT2V - Text-To-Video Generation
Show 1 - Text-To-Video Generation
OpenSora - Text-To-Video Generation
OpenSora-Plan - Text-To-Video Generation
T2V-Turbo - Text-To-Video Generation
DynamiCrafter2 - Image-To-Video Generation
SEINE ICLR'24 Image-To-Video Generation
Consisti2v - Image-To_Video Generation
I2VGenXL - Image-To_Video Generation

🎫 License 🔝

This project is released under the License.

🖊️ Citation 🔝

This work is a part of GenAI-Arena work.

Please kindly cite our paper if you use our code, data, models or results:

@misc{jiang2024genai,
      title={GenAI Arena: An Open Evaluation Platform for Generative Models}, 
      author={Dongfu Jiang and Max Ku and Tianle Li and Yuansheng Ni and Shizhuo Sun and Rongqi Fan and Wenhu Chen},
      year={2024},
      eprint={2406.04485},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videogen_hub-0.1.2.tar.gz (420.2 kB view details)

Uploaded Source

Built Distribution

videogen_hub-0.1.2-py3-none-any.whl (555.4 kB view details)

Uploaded Python 3

File details

Details for the file videogen_hub-0.1.2.tar.gz.

File metadata

  • Download URL: videogen_hub-0.1.2.tar.gz
  • Upload date:
  • Size: 420.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for videogen_hub-0.1.2.tar.gz
Algorithm Hash digest
SHA256 38cefaeaca47289cb1f860d75a9e9980b78217991e532209cc4c61c9f670de4b
MD5 a3685b6b13dcf2f573ed4c4030fe011e
BLAKE2b-256 df954c43d963ee8824f679b3eff2ab28e6711605bdc2ac3aeb7247023a5b0032

See more details on using hashes here.

File details

Details for the file videogen_hub-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: videogen_hub-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 555.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for videogen_hub-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3ec54d59700abe897ba93c164acad1aa5db24608fdff6384a74381d1f90d8920
MD5 87141830aef70b93f6dbdca1d0526639
BLAKE2b-256 0d52378589fc1bc945574830b3b59e258a9d83f9e04db6c0bb333b67162e7ab9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page