Skip to main content

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

Project description

VideoGenHub

contributors license GitHub Hits

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

  • We define 2 prominent generation tasks (Text-to-Video and Image-to-Video).
  • We built a unified inference pipeline to ensure fair comparison. We currently support around 10 models.

📰 News

📄 Table of Contents

🛠️ Installation 🔝

To install from pypi:

pip install videogen-hub

To install from github:

git clone https://github.com/TIGER-AI-Lab/VideoGenHub.git
cd VideoGenHub
cd env_cfg
pip install -r requirements.txt
cd ..
pip install -e .

The requirement of opensora is in env_cfg/opensora.txt

For some models like show one, you need to login through huggingface-cli.

huggingface-cli login

👨‍🏫 Get Started 🔝

Benchmarking

To reproduce our experiment using benchmark.

For text-to-video generation:

./t2v_inference.sh --<model_name> --<device>

Infering one model

import videogen_hub

model = videogen_hub.load('VideoCrafter2')
video = model.infer_one_video(prompt="A child excitedly swings on a rusty swing set, laughter filling the air.")

# Here video is a torch tensor of shape torch.Size([16, 3, 320, 512])

See Google Colab here: https://colab.research.google.com/drive/145UMsBOe5JLqZ2m0LKqvvqsyRJA1IeaE?usp=sharing

🧠 Philosophy 🔝

By streamlining research and collaboration, VideoGenHub plays a pivotal role in propelling the field of Video Generation.

  • Purity of Evaluation: We ensure a fair and consistent evaluation for all models, eliminating biases.
  • Research Roadmap: By defining tasks and curating datasets, we provide clear direction for researchers.
  • Open Collaboration: Our platform fosters the exchange and cooperation of related technologies, bringing together minds and innovations.

Implemented Models

We included more than 10 Models in video generation.

Method Venue Type
LaVie - Text-To-Video Generation
VideoCrafter2 - Text-To-Video Generation
ModelScope - Text-To-Video Generation
StreamingT2V - Text-To-Video Generation
Show 1 - Text-To-Video Generation
OpenSora - Text-To-Video Generation
OpenSora-Plan - Text-To-Video Generation
T2V-Turbo - Text-To-Video Generation
DynamiCrafter2 - Image-To-Video Generation
SEINE ICLR'24 Image-To-Video Generation
Consisti2v - Image-To_Video Generation
I2VGenXL - Image-To_Video Generation

🎫 License 🔝

This project is released under the License.

🖊️ Citation 🔝

This work is a part of GenAI-Arena work.

Please kindly cite our paper if you use our code, data, models or results:

@misc{jiang2024genai,
      title={GenAI Arena: An Open Evaluation Platform for Generative Models}, 
      author={Dongfu Jiang and Max Ku and Tianle Li and Yuansheng Ni and Shizhuo Sun and Rongqi Fan and Wenhu Chen},
      year={2024},
      eprint={2406.04485},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videogen_hub-0.1.1.tar.gz (387.1 kB view details)

Uploaded Source

Built Distribution

videogen_hub-0.1.1-py3-none-any.whl (508.5 kB view details)

Uploaded Python 3

File details

Details for the file videogen_hub-0.1.1.tar.gz.

File metadata

  • Download URL: videogen_hub-0.1.1.tar.gz
  • Upload date:
  • Size: 387.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for videogen_hub-0.1.1.tar.gz
Algorithm Hash digest
SHA256 a1d43844be3fc4e2dec63c7d591c6556e737d4c3fb70d7bf962d4567a12fa0b3
MD5 725938d8489df2c9ce4b1837632c052b
BLAKE2b-256 c553f757fccf91f9dad9274e11acf5eab322a5d4ce66b6f67f8922d7025802ce

See more details on using hashes here.

File details

Details for the file videogen_hub-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: videogen_hub-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 508.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for videogen_hub-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b0ec023254587aacd385a829cd1db39f91fb5d03cf0dff5bb3cd7b041457878e
MD5 9d288f005558a5c3f826bd2187475d25
BLAKE2b-256 e8bae2def7fb9f59c6bb4e89b82e2088e5f71425e740ef3548707edef6930ace

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page