Skip to main content

mmf: a modular framework for vision and language multimodal research.

Project description

MMF is a modular framework for vision and language multimodal research from Facebook AI Research. MMF contains reference implementations of state-of-the-art vision and language models and has powered multiple research projects at Facebook AI Research. See full list of project inside or built on MMF here.

MMF is powered by PyTorch, allows distributed training and is un-opinionated, scalable and fast. Use MMF to bootstrap for your next vision and language multimodal research project by following the installation instructions. Take a look at list of MMF features here.

MMF also acts as starter codebase for challenges around vision and language datasets (The Hateful Memes, TextVQA, TextCaps and VQA challenges). MMF was formerly known as Pythia. The next video shows an overview of how datasets and models work inside MMF. Checkout MMF's video overview.


Follow installation instructions in the documentation.


Learn more about MMF here.


If you use MMF in your work or use any models published in MMF, please cite:

  author =       {Singh, Amanpreet and Goswami, Vedanuj and Natarajan, Vivek and Jiang, Yu and Chen, Xinlei and Shah, Meet and
                 Rohrbach, Marcus and Batra, Dhruv and Parikh, Devi},
  title =        {MMF: A multimodal framework for vision and language research},
  howpublished = {\url{}},
  year =         {2020}


MMF is licensed under BSD license available in LICENSE file

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mmf-1.0.0rc10.tar.gz (160.5 kB view hashes)

Uploaded Source

Built Distributions

mmf-1.0.0rc10-cp38-cp38-manylinux1_x86_64.whl (393.1 kB view hashes)

Uploaded CPython 3.8

mmf-1.0.0rc10-cp37-cp37m-manylinux1_x86_64.whl (404.9 kB view hashes)

Uploaded CPython 3.7m

mmf-1.0.0rc10-cp36-cp36m-manylinux1_x86_64.whl (393.1 kB view hashes)

Uploaded CPython 3.6m

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page