Skip to main content

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

Project description

VideoGenHub

contributors license GitHub Hits

VideoGenHub is a one-stop library to standardize the inference and evaluation of all the conditional video generation models.

  • We define 2 prominent generation tasks (Text-to-Video and Image-to-Video).
  • We built a unified inference pipeline to ensure fair comparison. We currently support around 10 models.

📰 News

📄 Table of Contents

🛠️ Installation 🔝

To install from pypi:

pip install videogen-hub

To install from github:

git clone https://github.com/TIGER-AI-Lab/VideoGenHub.git
cd VideoGenHub
cd env_cfg
pip install -r requirements.txt
cd ..
pip install -e .

The requirement of opensora is in env_cfg/opensora.txt

For some models like show one, you need to login through huggingface-cli.

huggingface-cli login

👨‍🏫 Get Started 🔝

Benchmarking

To reproduce our experiment using benchmark.

For text-to-video generation:

./t2v_inference.sh --<model_name> --<device>

Infering one model

import videogen_hub

model = videogen_hub.load('VideoCrafter2')
video = model.infer_one_video(prompt="A child excitedly swings on a rusty swing set, laughter filling the air.")

# Here video is a torch tensor of shape torch.Size([16, 3, 320, 512])

See Google Colab here: https://colab.research.google.com/drive/145UMsBOe5JLqZ2m0LKqvvqsyRJA1IeaE?usp=sharing

🧠 Philosophy 🔝

By streamlining research and collaboration, VideoGenHub plays a pivotal role in propelling the field of Video Generation.

  • Purity of Evaluation: We ensure a fair and consistent evaluation for all models, eliminating biases.
  • Research Roadmap: By defining tasks and curating datasets, we provide clear direction for researchers.
  • Open Collaboration: Our platform fosters the exchange and cooperation of related technologies, bringing together minds and innovations.

Implemented Models

We included more than 10 Models in video generation.

Method Venue Type
LaVie - Text-To-Video Generation
VideoCrafter2 - Text-To-Video Generation
ModelScope - Text-To-Video Generation
StreamingT2V - Text-To-Video Generation
Show 1 - Text-To-Video Generation
OpenSora - Text-To-Video Generation
OpenSora-Plan - Text-To-Video Generation
T2V-Turbo - Text-To-Video Generation
DynamiCrafter2 - Image-To-Video Generation
SEINE ICLR'24 Image-To-Video Generation
Consisti2v - Image-To_Video Generation
I2VGenXL - Image-To_Video Generation

TODO

  • Add ComfyUI Support
  • Add Metrics Support
  • Add Visualization Support (Similar to ImagenHub)
  • Add Video Editing Task

🎫 License 🔝

This project is released under the License.

🖊️ Citation 🔝

This work is a part of GenAI-Arena work.

Please kindly cite our paper if you use our code, data, models or results:

@misc{jiang2024genai,
      title={GenAI Arena: An Open Evaluation Platform for Generative Models}, 
      author={Dongfu Jiang and Max Ku and Tianle Li and Yuansheng Ni and Shizhuo Sun and Rongqi Fan and Wenhu Chen},
      year={2024},
      eprint={2406.04485},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videogen_hub-0.1.4a0.tar.gz (958.8 kB view details)

Uploaded Source

Built Distribution

videogen_hub-0.1.4a0-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file videogen_hub-0.1.4a0.tar.gz.

File metadata

  • Download URL: videogen_hub-0.1.4a0.tar.gz
  • Upload date:
  • Size: 958.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for videogen_hub-0.1.4a0.tar.gz
Algorithm Hash digest
SHA256 5d9d0ef011a30f26ee4a1553a2d59392aa3b0923b7c1bba68e453c35d3d87d49
MD5 85f925367971b07fd154623742e48719
BLAKE2b-256 4625543fffa0b68ae5dd491a3c2a24032daecbfb2d73569d21c186492eaf3f8d

See more details on using hashes here.

File details

Details for the file videogen_hub-0.1.4a0-py3-none-any.whl.

File metadata

File hashes

Hashes for videogen_hub-0.1.4a0-py3-none-any.whl
Algorithm Hash digest
SHA256 0cf9dc51e4f23f008138e208e8851cfb22bfbe5a7f574100ef192017b1d1b137
MD5 673e7885dca92ec410a1590af5fe0993
BLAKE2b-256 5c7a28de547c93982a5466033235cef098e9f9bc4c93a3c69427966f6728daef

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page