Skip to main content

Text to Video synthesis

Project description

Multi-Modality

Gen1

My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML

The flow:

image => midas => clip => spacetime unet => diffusion

Install

pip3 install gen1

Usage

import torch
from gen1.model import Gen1

model = Gen1()

images = torch.randn(1, 3, 128, 128)
video = torch.randn(1, 3, 16, 128, 128)

run_out = model.forward(images, video)

Datasets

Here is a summary table of the datasets used in the Structure and Content-Guided Video Synthesis with Diffusion Models paper:

Dataset Type Size Domain Description Source
Internal dataset Images 240M General Uncaptioned images Private
Custom video dataset Videos 6.4M clips General Uncaptioned short video clips Private
DAVIS Videos - General Video object segmentation Link
Stock footage Videos - General Diverse video clips -

Citation

@misc{2302.03011,
Author = {Patrick Esser and Johnathan Chiu and Parmida Atighehchian and Jonathan Granskog and Anastasis Germanidis},
Title = {Structure and Content-Guided Video Synthesis with Diffusion Models},
Year = {2023},
Eprint = {arXiv:2302.03011},

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gen1-0.0.3.tar.gz (20.5 kB view details)

Uploaded Source

Built Distribution

gen1-0.0.3-py3-none-any.whl (21.4 kB view details)

Uploaded Python 3

File details

Details for the file gen1-0.0.3.tar.gz.

File metadata

  • Download URL: gen1-0.0.3.tar.gz
  • Upload date:
  • Size: 20.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0

File hashes

Hashes for gen1-0.0.3.tar.gz
Algorithm Hash digest
SHA256 a2e4d66f5656c3b3c237bd57c9e69aed03824d87503a50e8045cfbda60d01fbc
MD5 77c77609e1abac9513f3359c0d46ce2f
BLAKE2b-256 b386c2b477576b1e36680f1209cd8ad8bd8e61a10ee8fa52026e9db1dbae1b58

See more details on using hashes here.

File details

Details for the file gen1-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: gen1-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 21.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.3.2 CPython/3.11.0 Darwin/22.4.0

File hashes

Hashes for gen1-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 37c385c94b7e8b499d940d382093f7cb4c519454856cf9fa6ae984b694e7369e
MD5 8b098d0bb1ac5c350d39d412f55bf053
BLAKE2b-256 72c5694b742b8d02a074007beac9518519c88f08b52914f255b5f75715a4bd04

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page