Skip to main content

mmf: a modular framework for vision and language multimodal research.

Project description

MMF

NOTE: MMF is still in beta mode and will replace Pythia framework. To get the latest Pythia code which doesn't contain MMF changes, please use the following command:

git clone --branch v0.3 https://github.com/facebookresearch/mmf pythia

MMF is a modular framework for vision and language multimodal research. Built on top of PyTorch, it features:

  • Model Zoo: Reference implementations for state-of-the-art vision and language model including LoRRA (SoTA on VQA and TextVQA), Pythia model (VQA 2018 challenge winner), BAN and BUTD.
  • Multi-Tasking: Support for multi-tasking which allows training on multiple dataset together.
  • Datasets: Includes support for various datasets built-in including VQA, VizWiz, TextVQA, VisualDialog and COCO Captioning.
  • Modules: Provides implementations for many commonly used layers in vision and language domain
  • Distributed: Support for distributed training based on DataParallel as well as DistributedDataParallel.
  • Unopinionated: Unopinionated about the dataset and model implementations built on top of it.
  • Customization: Custom losses, metrics, scheduling, optimizers, tensorboard; suits all your custom needs.

You can use MMF to bootstrap for your next vision and language multimodal research project.

MMF can also act as starter codebase for challenges around vision and language datasets (TextVQA challenge, VQA challenge). MMF was formerly known as Pythia.

MMF Examples

Installation

Follow installation instructions in the documentation.

Documentation

Learn more about MMF here.

Citation

If you use MMF in your work, please cite:

@inproceedings{singh2018pythia,
  title={Pythia-a platform for vision \& language research},
  author={Singh, Amanpreet and Goswami, Vedanuj and Natarajan, Vivek and Jiang, Yu and Chen, Xinlei and Shah, Meet and Rohrbach, Marcus and Batra, Dhruv and Parikh, Devi},
  booktitle={SysML Workshop, NeurIPS},
  volume={2018},
  year={2018}
}

License

MMF is licensed under BSD license available in LICENSE file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mmf-1.0.0rc7.tar.gz (228.4 kB view details)

Uploaded Source

Built Distributions

mmf-1.0.0rc7-cp38-cp38-manylinux1_x86_64.whl (399.7 kB view details)

Uploaded CPython 3.8

mmf-1.0.0rc7-cp37-cp37m-manylinux1_x86_64.whl (399.5 kB view details)

Uploaded CPython 3.7m

mmf-1.0.0rc7-cp36-cp36m-manylinux1_x86_64.whl (399.4 kB view details)

Uploaded CPython 3.6m

File details

Details for the file mmf-1.0.0rc7.tar.gz.

File metadata

  • Download URL: mmf-1.0.0rc7.tar.gz
  • Upload date:
  • Size: 228.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7

File hashes

Hashes for mmf-1.0.0rc7.tar.gz
Algorithm Hash digest
SHA256 a8c87894e0c3a40b900aa298c86fda54b505b3c385b9efbead7d5e3970d8070e
MD5 adff889a54e74329a8e6caec4eb1fc8c
BLAKE2b-256 2e1616eff3f02e94ec2e343978845b94141b36792ae9979fec043186748fbd57

See more details on using hashes here.

File details

Details for the file mmf-1.0.0rc7-cp38-cp38-manylinux1_x86_64.whl.

File metadata

  • Download URL: mmf-1.0.0rc7-cp38-cp38-manylinux1_x86_64.whl
  • Upload date:
  • Size: 399.7 kB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7

File hashes

Hashes for mmf-1.0.0rc7-cp38-cp38-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 ce706eb10e732039edd5355d8ad2f9af556cdb2c2a3049b336d0f4e2394069ee
MD5 77706392c34997a51eb907b9381b2f2f
BLAKE2b-256 4159519bb5bae917a962936462161f68356ffe94bc1146f6e0f14fe840bce753

See more details on using hashes here.

File details

Details for the file mmf-1.0.0rc7-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: mmf-1.0.0rc7-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 399.5 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7

File hashes

Hashes for mmf-1.0.0rc7-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 32bb20dfccae8c50479daeaaff5d65fb0467541abe84d22adfbce374cd5681cc
MD5 a21096d30dbbf907535a292a90b7edbc
BLAKE2b-256 10210aa2b00c0d34cc83dfa5d18bed02249f75fc6ddc2c8d517f168229aca5ec

See more details on using hashes here.

File details

Details for the file mmf-1.0.0rc7-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: mmf-1.0.0rc7-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 399.4 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7

File hashes

Hashes for mmf-1.0.0rc7-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 b851b01f5d1a78e468bd6634576a0e3f23a77b2b5d9178981d7b7ac19545e98f
MD5 6a35d12c189657087df326623574ca39
BLAKE2b-256 5a42f1b36ccabc8f5c0beeef701008ab6fd4712e1ec67771dc41603122199345

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page