Text to Video synthesis
Gen1
My implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML.
The flow:
image => midas => clip => spacetime unet => diffusion
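The flow above can be traced at the level of tensor shapes. The sketch below is not the package's code: the stage functions (`midas_depth`, `clip_embed`, `spacetime_unet`), the `(batch, channels, frames, height, width)` layout, and the `embed_dim` value are hypothetical stand-ins, chosen only to show what each stage would consume and produce. It assumes, as in the paper, that depth estimates carry the structure and CLIP image embeddings carry the content. The latent encoding and the iterative sampling loop are left out.

```python
from typing import Dict, Tuple

Shape = Tuple[int, ...]


def midas_depth(video: Shape) -> Shape:
    """Hypothetical stand-in for depth estimation: one depth channel per frame."""
    b, _c, t, h, w = video
    return (b, 1, t, h, w)


def clip_embed(image: Shape, embed_dim: int = 512) -> Shape:
    """Hypothetical stand-in for a CLIP image encoder: one vector per image.

    embed_dim is a placeholder value, not taken from the package.
    """
    return (image[0], embed_dim)


def spacetime_unet(noisy: Shape, depth: Shape, content: Shape) -> Shape:
    """Hypothetical stand-in denoiser: the prediction has the shape of its noisy input."""
    assert noisy[0] == depth[0] == content[0], "batch sizes must agree"
    assert noisy[2:] == depth[2:], "depth must align with the video frame by frame"
    return noisy


def trace(image: Shape, video: Shape) -> Dict[str, Shape]:
    """Run the shape-level flow: structure from the video, content from the image."""
    depth = midas_depth(video)
    content = clip_embed(image)
    noise_pred = spacetime_unet(video, depth, content)
    return {"depth": depth, "content": content, "noise_pred": noise_pred}


shapes = trace((1, 3, 128, 128), (1, 3, 16, 128, 128))
for name, shape in shapes.items():
    print(name, shape)
```

Working with shapes rather than tensors keeps the sketch free of dependencies; swapping in real models would replace each stand-in with the corresponding network call.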
Install
pip3 install gen1
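To confirm that the install is visible to the interpreter you intend to use, a small standard-library check can help. This is a minimal sketch; the helper name `is_installed` is made up for illustration and is not part of the package.

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if a top-level package can be located without importing it."""
    return importlib.util.find_spec(package) is not None


if is_installed("gen1"):
    print("gen1 is available")
else:
    print("gen1 not found; try: pip3 install gen1")
```

If the check fails right after installing, `pip3` and `python3` may be pointing at different environments.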
Usage
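import torch
from gen1.model import Gen1

model = Gen1()

# Random tensors stand in for real inputs
images = torch.randn(1, 3, 128, 128)     # one 3-channel 128x128 image
video = torch.randn(1, 3, 16, 128, 128)  # one 3-channel 128x128 clip; 16 is presumably the frame count

run_out = model.forward(images, video)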
Datasets
The table below summarizes the datasets used in the Structure and Content-Guided Video Synthesis with Diffusion Models paper:
Dataset | Type | Size | Domain | Description | Source |
---|---|---|---|---|---|
Internal dataset | Images | 240M | General | Uncaptioned images | Private |
Custom video dataset | Videos | 6.4M clips | General | Uncaptioned short video clips | Private |
DAVIS | Videos | - | General | Video object segmentation | Link |
Stock footage | Videos | - | General | Diverse video clips | - |
Citation
@misc{2302.03011,
Author = {Patrick Esser and Johnathan Chiu and Parmida Atighehchian and Jonathan Granskog and Anastasis Germanidis},
Title = {Structure and Content-Guided Video Synthesis with Diffusion Models},
Year = {2023},
Eprint = {arXiv:2302.03011},
}