
A Framework for Training and Evaluating Video Prediction Models


📚 Link to Documentation 📚

Introduction

Video prediction ('VP') is the task of predicting future frames given some context frames.

As with most Computer Vision sub-domains, scientific contributions in this field vary widely in the following aspects:

  • Training protocol (dataset usage, when to backprop, value ranges etc.)
  • Technical details of model implementation (deep learning framework, package dependencies etc.)
  • Benchmark selection and execution (this includes the choice of dataset, number of context/predicted frames, skipping frames in the observed sequences etc.)
  • Evaluation protocol (metrics chosen, variations in implementation/reduction modes, different ways of creating visualizations etc.)

Furthermore, while many contributors nowadays do share their code, seemingly minor omissions such as missing dataloaders make it much harder to assess, compare and improve existing models.

This repo aims to provide a suite that facilitates scientific work in this subfield, offering standardized yet customizable solutions for the aspects mentioned above. This way, validating existing VP models and creating new ones hopefully becomes much less tedious.

Installation

Requirements:

  • python >= 3.6 (code is tested with version 3.8)
  • pip

From PyPi:

pip install vp-suite

From source:

pip install git+https://github.com/Flunzmas/vp-suite.git

If you want to contribute:

git clone https://github.com/Flunzmas/vp-suite.git
cd vp-suite
pip install -e .[dev,doc]

Usage

When using this package, a folder vp-suite is created in your current working directory; it will contain all downloaded data as well as run logs, outputs and trained models.

Training models

  1. Set up the trainer.
  2. Load one of the provided datasets (it will be downloaded automatically) or create your own.
  3. Create a video prediction model (either from scratch or from a pretrained checkpoint; it can be one of the provided models or your own).
  4. Run the training loop, optionally providing a custom configuration:
from vp_suite import VPSuite

suite = VPSuite()
suite.load_dataset("MM")  # load moving MNIST dataset from default location

model_checkpoint = ""  # Set to a valid model path to load a checkpoint
if model_checkpoint != "":
    suite.load_model(model_checkpoint)
else:
    suite.create_model('lstm')  # create a ConvLSTM-based prediction model

suite.train(lr=2e-4, epochs=100)

This will train the model, log training progress to the console (and optionally to Weights & Biases), save model checkpoints on improvement and, optionally, generate and save prediction visualizations.

Evaluating models

  1. Set up the tester.
  2. Load one of the provided datasets (it will be downloaded automatically) or create your own.
  3. Load the models you'd like to test (by default, a CopyLastFrame baseline is already loaded).
  4. Run the testing on all models, optionally providing a custom configuration of the evaluation protocol:
from vp_suite import VPSuite

suite = VPSuite()
suite.load_dataset("MM")  # load moving MNIST dataset from default location

# get the filepaths to the models you'd like to test
model_dirs = ["out/model_foo/", "out/model_bar/"]
for model_dir in model_dirs:
    suite.load_model(model_dir, ckpt_name="best_model.pth")
suite.test(context_frames=5, pred_frames=10)

This code will evaluate the loaded models on the loaded dataset (its test portion, if available), creating detailed summaries of prediction performance across a customizable set of metrics. Optionally, the results as well as prediction visualizations can be saved and logged to Weights & Biases.

Note: if the specified evaluation protocol or the loaded dataset is incompatible with one of the models, this will raise an error with an explanation.

Hyperparameter Optimization

This package uses optuna to provide hyperparameter optimization functionality. The following snippet provides a full example:

import json
from vp_suite import VPSuite
from vp_suite.constants import PKG_RESOURCES

suite = VPSuite()
suite.load_dataset(dataset="KTH")  # select dataset of choice
suite.create_model(model_type="lstm")  # select model of choice
with open(str((PKG_RESOURCES / "optuna_example_config.json").resolve()), 'r') as cfg_file:
    optuna_cfg = json.load(cfg_file)
# optuna_cfg specifies the parameters' search intervals and scales; modify as you wish.
suite.hyperopt(optuna_cfg, n_trials=30, epochs=10)

This code will run 30 training loops (called trials by optuna), producing a trained model for each hyperparameter configuration and writing the configuration of the best-performing run to the console.
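For illustration, the configuration can also be assembled directly in Python instead of loading the bundled JSON file. The dictionary below is a hypothetical sketch only: the key names and structure (per-parameter min/max/scale entries) are assumptions, and the authoritative schema is the one used by optuna_example_config.json shipped with the package.

from vp_suite import VPSuite

suite = VPSuite()
suite.load_dataset(dataset="KTH")
suite.create_model(model_type="lstm")

# Hypothetical search space: key names and structure are assumptions,
# see optuna_example_config.json for the actual schema.
optuna_cfg = {
    "lr": {"min": 1e-5, "max": 1e-2, "scale": "log"},
    "batch_size": {"min": 8, "max": 64, "scale": "uniform"},
}
suite.hyperopt(optuna_cfg, n_trials=30, epochs=10)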

Note 1: For hyperopt, visualization, logging and model checkpointing are minimized to reduce I/O strain.

Note 2: Despite optuna's trial pruning capabilities, running a high number of trials might still take a lot of time. In that case, consider e.g. reducing the number of training epochs.

Customization

While this package comes with a few pre-defined models/datasets/metrics etc. for your convenience, it was designed with quick extensibility in mind. See the sections below for how to add new models, datasets or metrics.

Creating new VP models or integrating existing external models

  1. Create a file model_<your name>.py in the folder vp_suite/models.
  2. Create a class that derives from vp_suite.models.base_model.VideoPredictionModel and override the things you need (a minimal sketch follows this list).
  3. Write your model code or import existing code so that the superclass interface is still satisfied. If desired, you can implement a custom training loop iteration train_iter(self, config, loader, optimizer, loss_provider, epoch) that gets called instead of the default training loop iteration.
  4. Check training performance on different datasets, fix things and contribute to the project 😊
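A minimal sketch of such a model file is shown below. Everything beyond what is stated above is an assumption: the required constructor signature, attributes (e.g. a NAME identifier) and the prediction interface are defined by VideoPredictionModel and should be looked up there.

# vp_suite/models/model_simple_conv.py  (hypothetical file/class names)
import torch
import torch.nn as nn

from vp_suite.models.base_model import VideoPredictionModel


class SimpleConvModel(VideoPredictionModel):
    """Toy model that predicts future frames from the last context frame
    with a single conv layer. Attribute and method names below are
    illustrative assumptions; consult VideoPredictionModel for the
    actual required interface."""

    NAME = "simple_conv"  # assumed model identifier

    def __init__(self, device="cpu", **model_kwargs):
        super().__init__(device, **model_kwargs)  # actual signature: see base class
        channels = model_kwargs.get("img_c", 3)   # assumed config key
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, x, pred_frames=1, **kwargs):
        # x: [b, t, c, h, w] context frames; predict autoregressively
        frame = x[:, -1]
        preds = []
        for _ in range(pred_frames):
            frame = torch.relu(self.conv(frame))
            preds.append(frame)
        return torch.stack(preds, dim=1), None  # (predictions, extra outputs)

    # Optional: custom training loop iteration, called instead of the default one.
    # def train_iter(self, config, loader, optimizer, loss_provider, epoch):
    #     ...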

Training on custom datasets

  1. Create a file dataset_<your name>.py in the folder vp_suite/dataset.
  2. Create a class that derives from vp_suite.dataset.base_dataset.BaseVPDataset and override the things you need (a minimal sketch follows this list).
  3. Write your dataset code or import existing code so that the superclass interface is satisfied and the dataset initialization with vp_suite/dataset/factory.py still works.
  4. Register it in the DATASET_CLASSES dict of vp_suite/dataset/__init__.py.
  5. Run pytest, check training performance with different models, fix things and contribute to the project 😊
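Again, a heavily hedged sketch: the constructor arguments, required attributes and the expected sample format below are assumptions; the actual interface is defined by BaseVPDataset and by the dataset factory in vp_suite/dataset/factory.py.

# vp_suite/dataset/dataset_my_frames.py  (hypothetical file/class names)
import numpy as np
import torch

from vp_suite.dataset.base_dataset import BaseVPDataset


class MyFramesDataset(BaseVPDataset):
    """Toy dataset serving random frame sequences. Constructor arguments
    and sample format below are illustrative assumptions; consult
    BaseVPDataset and vp_suite/dataset/factory.py for the real interface."""

    NAME = "My Frames"  # assumed dataset identifier

    def __init__(self, split, **dataset_kwargs):
        super().__init__(split, **dataset_kwargs)  # actual signature: see base class
        # 100 sequences of 20 RGB frames at 64x64, values in [0, 1]
        self.sequences = np.random.rand(100, 20, 3, 64, 64).astype(np.float32)

    def __len__(self):
        return len(self.sequences)

    def __getitem__(self, idx):
        # assumed sample format: dict holding the full frame sequence
        return {"frames": torch.from_numpy(self.sequences[idx])}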

Custom losses, metrics and optimization

  1. Create a new file in vp_suite/measure, containing your loss or metric.
  2. Make vp_suite.measure.base_measure.BaseMeasure its superclass and provide all needed implementations and attributes (a minimal sketch follows this list).
  3. Register the measure in the METRIC_CLASSES dict of vp_suite/measure/__init__.py and, if it can also be used as a loss, in the LOSS_CLASSES dict.
  4. Run pytest, check training/evaluation performance with different models and datasets, fix things and contribute to the project 😊
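As a sketch (the required attributes and call signature are assumptions; the authoritative interface is BaseMeasure), a mean absolute error measure could look roughly like this:

# vp_suite/measure/mae.py  (hypothetical file/class names)
import torch

from vp_suite.measure.base_measure import BaseMeasure


class MAE(BaseMeasure):
    """Mean absolute error between predicted and ground-truth frames.
    Attribute names and signatures below are illustrative assumptions;
    consult BaseMeasure for the actual required interface."""

    NAME = "MAE"              # assumed identifier
    BIGGER_IS_BETTER = False  # assumed attribute: lower MAE means better predictions

    def __init__(self, device="cpu"):
        super().__init__(device)  # actual signature: see base class

    def forward(self, pred, target):
        # pred, target: [b, t, c, h, w]
        return torch.abs(pred - target).mean()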

Contributing

This project is always open to extension! It grows especially powerful with more models and datasets, so if you've made your code work on custom models/datasets/metrics/etc., feel free to submit a merge request!

Other kinds of contributions are also very welcome - just check the open issues on the tracker or open up a new issue there.

When submitting a merge request, please make sure all tests pass (execute from the root folder):

python -m pytest --runslow --cov=vp_suite

Note: this is the easiest way to run all tests without import hassles. Omit the --runslow argument to speed up testing by skipping the tests for the complete training/testing procedure.

API Documentation

Updating the API documentation can be done by executing build_docs.sh from the docs/ folder.

Acknowledgements

All other sources are acknowledged in the documentation of the respective point of usage (to the best of our knowledge).

License

This project comes with an MIT License, except for the following components:

  • Module vp_suite.measure.fvd.pytorch_i3d (Apache 2.0 License, taken and modified from here)
