Skip to main content

MHC Binding Predictor

Project description

Build Status Coverage Status Open In Colab

mhcflurry

MHC I ligand prediction package with competitive accuracy and a fast and documented implementation.

[!IMPORTANT] Version 2.2.0 is the first release to use PyTorch as its neural network backend, replacing TensorFlow/Keras used in previous versions. It loads the same published weights and produces equivalent predictions, so existing workflows should continue to work with no changes.

Key changes in 2.2.0:

  • Backend: TensorFlow/Keras replaced by PyTorch (>= 2.0)
  • Python: Requires Python 3.10+ (previously 3.9+)
  • Dependencies: pandas >= 2.0 is now required; tensorflow and keras are no longer needed
  • Hardware: Automatic GPU detection; Apple Silicon (MPS) is now supported

If you are upgrading from 2.1.x, simply pip install --upgrade mhcflurry. The published pre-trained models are unchanged and will be loaded and converted automatically.

MHCflurry implements class I peptide/MHC binding affinity prediction. The current version provides pan-MHC I predictors supporting any MHC allele of known sequence. MHCflurry runs on Python 3.10+ using the PyTorch neural network library. It exposes command-line and Python library interfaces.

MHCflurry also includes two experimental predictors, an "antigen processing" predictor that attempts to model MHC allele-independent effects such as proteosomal cleavage and a "presentation" predictor that integrates processing predictions with binding affinity predictions to give a composite "presentation score." Both models are trained on mass spec-identified MHC ligands.

If you find MHCflurry useful in your research please cite:

T. O'Donnell, A. Rubinsteyn, U. Laserson. "MHCflurry 2.0: Improved pan-allele prediction of MHC I-presented peptides by incorporating antigen processing," Cell Systems, 2020. https://doi.org/10.1016/j.cels.2020.06.010

T. O'Donnell, A. Rubinsteyn, M. Bonsack, A. B. Riemer, U. Laserson, and J. Hammerbacher, "MHCflurry: Open-Source Class I MHC Binding Affinity Prediction," Cell Systems, 2018. https://doi.org/10.1016/j.cels.2018.05.014

Please file an issue if you have questions or encounter problems.

Have a bugfix or other contribution? We would love your help. See our contributing guidelines.

Try it now

You can generate MHCflurry predictions without any setup by running our Google colaboratory notebook.

Installation (pip)

Install the package:

$ pip install mhcflurry

Download our datasets and trained models:

$ mhcflurry-downloads fetch

You can now generate predictions:

$ mhcflurry-predict \
       --alleles HLA-A0201 HLA-A0301 \
       --peptides SIINFEKL SIINFEKD SIINFEKQ \
       --out /tmp/predictions.csv

Wrote: /tmp/predictions.csv

Or scan protein sequences for potential epitopes:

$ mhcflurry-predict-scan \
        --sequences MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHS \
        --alleles HLA-A*02:01 \
        --out /tmp/predictions.csv

Wrote: /tmp/predictions.csv

See the documentation for more details.

Docker

You can also try the latest (GitHub master) version of MHCflurry using the Docker image hosted on Dockerhub by running:

$ docker run -p 9999:9999 --rm openvax/mhcflurry:latest

This will start a jupyter notebook server in an environment that has MHCflurry installed. Go to http://localhost:9999 in a browser to use it.

To build the Docker image yourself, from a checkout run:

$ docker build -t mhcflurry:latest .
$ docker run -p 9999:9999 --rm mhcflurry:latest

Predicted sequence motifs

Sequence logos for the binding motifs learned by MHCflurry BA are available here.

Common issues and fixes

Problems downloading data and models

Some users have reported HTTP connection issues when using mhcflurry-downloads fetch. As a workaround, you can download the data manually (e.g. using wget) and then use mhcflurry-downloads just to copy the data to the right place.

To do this, first get the URL(s) of the downloads you need using mhcflurry-downloads url:

$ mhcflurry-downloads url models_class1_presentation
https://github.com/openvax/mhcflurry/releases/download/1.6.0/models_class1_presentation.20200205.tar.bz2```

Then make a directory and download the needed files to this directory:

$ mkdir downloads
$ wget  --directory-prefix downloads https://github.com/openvax/mhcflurry/releases/download/1.6.0/models_class1_presentation.20200205.tar.bz2```

HTTP request sent, awaiting response... 200 OK
Length: 72616448 (69M) [application/octet-stream]
Saving to: 'downloads/models_class1_presentation.20200205.tar.bz2'

Now call mhcflurry-downloads fetch with the --already-downloaded-dir option to indicate that the downloads should be retrived from the specified directory:

$ mhcflurry-downloads fetch models_class1_presentation --already-downloaded-dir downloads

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mhcflurry-2.2.1.tar.gz (178.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mhcflurry-2.2.1-py3-none-any.whl (157.0 kB view details)

Uploaded Python 3

File details

Details for the file mhcflurry-2.2.1.tar.gz.

File metadata

  • Download URL: mhcflurry-2.2.1.tar.gz
  • Upload date:
  • Size: 178.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mhcflurry-2.2.1.tar.gz
Algorithm Hash digest
SHA256 cda123729b649e94e1da9af40cdedf2d6823f39ebfd87dd36e47eb994bb3c4a0
MD5 144498d8f9ab43bde815cb6baa116f72
BLAKE2b-256 dd7f9001cdc0a1dabfa9dac11a53520eb84f1f6b1167c49390bf332864367dcf

See more details on using hashes here.

Provenance

The following attestation bundles were made for mhcflurry-2.2.1.tar.gz:

Publisher: release.yml on openvax/mhcflurry

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mhcflurry-2.2.1-py3-none-any.whl.

File metadata

  • Download URL: mhcflurry-2.2.1-py3-none-any.whl
  • Upload date:
  • Size: 157.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mhcflurry-2.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 cd934e9b092de76e477442c6f84aab1265def91acfa5ae129308116dbfa66ac7
MD5 5f3b515a8055a6ab017ea3faa1e984df
BLAKE2b-256 9ef7a465e7fef6c06fb649f5334550a73b4db5f387e7dbd3517a778d1c3fe923

See more details on using hashes here.

Provenance

The following attestation bundles were made for mhcflurry-2.2.1-py3-none-any.whl:

Publisher: release.yml on openvax/mhcflurry

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page