Python interface to MHC binding, presentation, immunogenicity, and antigen processing predictors

These details have been verified by PyPI

Maintainers

hammerlab iskander openvax tavinathanson timodonnell

These details have not been verified by PyPI

Project links

Homepage

Project description

mhctools

Python interface to MHC binding, presentation, immunogenicity, and antigen processing predictors.

Installation

pip install mhctools

For MHCflurry support, also run:

mhcflurry-downloads fetch

Data model

Every predictor returns results as two nested dataclasses:

PeptideResult — all predictions for one peptide (across alleles and prediction kinds). This is what you get back per peptide from predict().
Pred — a single prediction: one peptide, one allele, one measurement kind (e.g. affinity, presentation, immunogenicity). Frozen and self-contained.

predict(["SIINFEKL", "GILGFVFTL"])
  → [PeptideResult, PeptideResult]
       └── .preds = (Pred(allele=A0201, kind=affinity, ...),
                     Pred(allele=A0201, kind=presentation, ...),
                     Pred(allele=B0702, kind=affinity, ...),
                     ...)

Both convert to DataFrames and have consistent column names for easy downstream analysis.

Quick start

from mhctools import NetMHCpan41

predictor = NetMHCpan41(alleles=["HLA-A*02:01", "HLA-B*07:02"])

# predict() returns a list of PeptideResult — one per peptide
results = predictor.predict(["SIINFEKL", "GILGFVFTL"])

for result in results:
    best = result.best_affinity
    if best:
        print(f"{best.peptide} -> {best.allele} IC50={best.value:.1f}nM")

Python API

Predicting peptides

from mhctools import NetMHCpan41

predictor = NetMHCpan41(alleles=["HLA-A*02:01", "HLA-B*07:02"])
results = predictor.predict(["SIINFEKL", "GILGFVFTL"])

result = results[0]                   # PeptideResult for "SIINFEKL"
result.preds                          # tuple of Pred objects
result.best_affinity                  # Pred with highest affinity score
result.best_affinity.allele           # "HLA-A*02:01"
result.best_affinity.value            # IC50 in nM
result.best_affinity.score            # higher = better (~0-1)
result.best_affinity.percentile_rank  # lower = better (0-100)

result.best_affinity_by_rank          # Pred with lowest percentile rank
result.best_presentation              # best EL/presentation score
result.best_presentation_by_rank      # best EL percentile rank
result.best_stability                 # best pMHC stability (if available)
result.best_stability_by_rank

# filter by kind or allele
result.filter(kind=Kind.pMHC_affinity)
result.filter(allele="HLA-A*02:01")

NetMHCpan 4.1 automatically emits both pMHC_affinity and pMHC_presentation predictions per peptide-allele pair.

Scanning proteins

predict_proteins() takes a dictionary of protein sequences and returns {sequence_name: list[PeptideResult]}:

proteins = predictor.predict_proteins(
    {"TP53": "MEEPQSDPSVEPPLSQETFS...", "KRAS": "MTEYKLVVVGAGGVGKS..."},
    peptide_lengths=[9, 10],
)

for pp in proteins["TP53"]:
    best = pp.best_affinity
    if best and best.value < 500:
        print(f"  offset={best.offset} {best.peptide} IC50={best.value:.0f}")

DataFrames

Every level has a _dataframe variant that flattens to a pandas DataFrame with consistent columns:

df = predictor.predict_dataframe(["SIINFEKL"], sample_name="pat001")
df = predictor.predict_proteins_dataframe({"TP53": "MEEPQ..."}, sample_name="pat001")

Columns: sample_name, peptide, n_flank, c_flank, source_sequence_name, offset, predictor_name, predictor_version, allele, kind, score, value, percentile_rank.

Multi-sample predictions

MultiSample runs a predictor across multiple samples, each with its own HLA genotype:

from mhctools import MultiSample, NetMHCpan41

ms = MultiSample(
    samples={
        "pat001": ["HLA-A*02:01", "HLA-B*07:02"],
        "pat002": ["HLA-A*01:01", "HLA-B*08:01"],
    },
    predictor_class=NetMHCpan41,
)

# {sample_name: list[PeptideResult]}
results = ms.predict(["SIINFEKL", "GILGFVFTL"])

# {sample_name: {seq_name: list[PeptideResult]}}
protein_results = ms.predict_proteins({"TP53": "MEEPQ..."})

# flat DataFrames with sample_name column
df = ms.predict_dataframe(["SIINFEKL"])
df = ms.predict_proteins_dataframe({"TP53": "MEEPQ..."})

Measurement kinds

The Kind enum describes what biological quantity a Pred measures:

Kind	Meaning
`pMHC_affinity`	Peptide-MHC binding affinity
`pMHC_presentation`	Likelihood of surface presentation (EL/processing)
`pMHC_stability`	Peptide-MHC complex stability
`immunogenicity`	T-cell immunogenicity
`antigen_processing`	Combined processing score
`proteasome_cleavage`	Proteasomal cleavage score
`tap_transport`	TAP transport score (reserved, not yet used)
`erap_trimming`	ERAP trimming score (reserved, not yet used)

The Pred object

Every prediction is a frozen, self-contained Pred dataclass:

from mhctools import Pred, Kind

pred = Pred(
    kind=Kind.pMHC_affinity,
    score=0.85,           # ~0-1, higher = better
    peptide="SIINFEKL",
    allele="HLA-A*02:01",
    value=120.5,          # IC50 in nM
    percentile_rank=0.8,
    source_sequence_name="TP53",
    offset=42,
    predictor_name="netMHCpan",
    predictor_version="4.1",
)

score is always higher-is-better. value is in native units (nM for affinity, hours for stability). percentile_rank is always optional, 0-100, lower = stronger.

Supported predictors

MHC binding & presentation

Predictor	Kinds produced	Requires
`NetMHCpan` / `NetMHCpan41` / `NetMHCpan42`	affinity + presentation	NetMHCpan
`NetMHCpan4`	affinity or presentation	NetMHCpan 4.0
`NetMHCpan3` / `NetMHCpan28`	affinity	older NetMHCpan
`NetMHC` / `NetMHC3` / `NetMHC4`	affinity	NetMHC
`NetMHCIIpan` / `NetMHCIIpan43`	affinity or presentation	NetMHCIIpan
`NetMHCcons`	affinity	NetMHCcons
`NetMHCstabpan`	stability	NetMHCstabpan
`MHCflurry`	affinity + presentation	`pip install mhcflurry` + `mhcflurry-downloads fetch`
`BigMHC`	presentation or immunogenicity	BigMHC clone (set `BIGMHC_DIR`)
`MixMHCpred`	presentation	MixMHCpred
`IedbNetMHCpan` / `IedbSMM` / `IedbNetMHCIIpan`	affinity	IEDB web API
`RandomBindingPredictor`	affinity	(built-in)

Antigen processing

Predictor	Kinds produced	Requires
`Pepsickle`	proteasome cleavage	`pip install mhctools[pepsickle]`
`NetChop`	proteasome cleavage	NetChop

Processing predictors use configurable scoring to aggregate per-position cleavage probabilities into peptide-level scores. See ProcessingPredictor and ProteasomePredictor for details.

Commandline examples

Prediction for user-supplied peptide sequences

mhctools --sequence SIINFEKL SIINFEKLQ --mhc-predictor netmhc --mhc-alleles A0201

Automatically extract peptides as subsequences of specified length

mhctools --sequence AAAQQQSIINFEKL --extract-subsequences --mhc-peptide-lengths 8-10 --mhc-predictor mhcflurry --mhc-alleles A0201

Legacy API

The old predict_peptides() and predict_subsequences() methods still work and return BindingPredictionCollection objects:

predictor = NetMHCpan(alleles=["A*02:01"])
collection = predictor.predict_subsequences(
    {"1L2Y": "NLYIQWLKDGGPSSGRPPPS"},
    peptide_lengths=[9],
)
df = collection.to_dataframe()

for bp in collection:
    if bp.affinity < 100:
        print("Strong binder: %s" % bp)

To convert legacy results to the new types:

preds = collection.to_preds()           # list of Pred
pp_list = collection.to_peptide_preds() # list of PeptideResult

Project details

These details have been verified by PyPI

Maintainers

hammerlab iskander openvax tavinathanson timodonnell

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

3.14.1

May 8, 2026

3.13.7

May 7, 2026

3.13.6

May 6, 2026

3.13.5

May 5, 2026

3.13.4

May 5, 2026

3.13.3

May 4, 2026

3.13.2

Apr 18, 2026

3.13.1

Apr 15, 2026

3.13.0

Apr 15, 2026

3.12.3

Apr 15, 2026

3.12.2

Apr 15, 2026

3.12.1

Apr 14, 2026

3.12.0

Apr 12, 2026

3.11.0

Apr 11, 2026

3.10.1

Apr 11, 2026

3.10.0

Apr 11, 2026

3.9.0

Apr 11, 2026

3.8.2

Apr 11, 2026

3.8.1

Apr 11, 2026

3.8.0

Apr 11, 2026

3.7.1

Apr 10, 2026

3.7.0

Apr 10, 2026

3.6.0

Apr 10, 2026

3.5.1

Apr 10, 2026

3.5.0

Apr 10, 2026

3.4.0

Apr 10, 2026

This version

3.3.0

Apr 9, 2026

3.2.0

Apr 9, 2026

3.1.1

Apr 9, 2026

3.1.0

Apr 9, 2026

3.0.2

Apr 9, 2026

3.0.1

Apr 8, 2026

3.0.0

Apr 7, 2026

2.2.0

Apr 6, 2026

2.1.0

Mar 17, 2026

2.0.0

Mar 17, 2026

1.9.0

Feb 12, 2024

1.8.1

Oct 9, 2020

1.8.0

Sep 11, 2020

1.7.1

May 1, 2020

1.7.0

Nov 18, 2019

1.6.23

Oct 16, 2019

1.6.22

Oct 3, 2019

1.6.21

Oct 2, 2019

1.6.20

May 16, 2019

1.6.19

May 16, 2019

1.6.18

Jan 9, 2019

1.6.17

Feb 26, 2018

1.6.16

Feb 26, 2018

1.6.15

Feb 24, 2018

1.6.13

Feb 21, 2018

1.6.10

Feb 20, 2018

1.6.8

Dec 4, 2017

1.6.6

Oct 4, 2017

1.6.5

Aug 8, 2017

1.6.4

Jul 27, 2017

1.6.3

Jul 27, 2017

1.6.2

Jul 21, 2017

1.6.1

Jul 20, 2017

1.6.0

Jul 18, 2017

1.5.0

Jun 21, 2017

1.4.0

Jun 1, 2017

1.3.0

May 3, 2017

1.2.0

Apr 25, 2017

1.1.0

Apr 13, 2017

1.0.2

Mar 7, 2017

1.0.1

Mar 2, 2017

1.0.0

Mar 2, 2017

0.5.0

Feb 17, 2017

0.4.1

Dec 14, 2016

0.4.0

Dec 3, 2016

0.3.1

Oct 13, 2016

0.3.0

Aug 6, 2016

0.2.3

May 10, 2016

0.2.2

Feb 24, 2016

0.2.1

Feb 21, 2016

0.2.0

Feb 20, 2016

0.1.8

Sep 3, 2015

0.1.7

Sep 1, 2015

0.1.6

Aug 26, 2015

0.1.5

Aug 26, 2015

0.1.4

Aug 25, 2015

0.1.3

Aug 24, 2015

0.1.2

Aug 24, 2015

0.1.1

Aug 24, 2015

0.1.0

Aug 24, 2015

0.0.11

Aug 4, 2015

0.0.6

May 1, 2015

0.0.5

Apr 30, 2015

0.0.4

Apr 30, 2015

0.0.0

Apr 23, 2015

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mhctools-3.3.0.tar.gz (80.5 kB view details)

Uploaded Apr 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mhctools-3.3.0-py3-none-any.whl (80.1 kB view details)

Uploaded Apr 9, 2026 Python 3

File details

Details for the file mhctools-3.3.0.tar.gz.

File metadata

Download URL: mhctools-3.3.0.tar.gz
Upload date: Apr 9, 2026
Size: 80.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.6

File hashes

Hashes for mhctools-3.3.0.tar.gz
Algorithm	Hash digest
SHA256	`1ea4b731995fcd0220a5b680a1e107920d095a1510a4908c594a2e9d1bef7f3b`
MD5	`ac2f434e04daebef244bdacc61bf51ed`
BLAKE2b-256	`d5c33406504044235ceb60434d3e419ab19a61c8fa506bd4c71a9b67897ef236`

See more details on using hashes here.

File details

Details for the file mhctools-3.3.0-py3-none-any.whl.

File metadata

Download URL: mhctools-3.3.0-py3-none-any.whl
Upload date: Apr 9, 2026
Size: 80.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.6

File hashes

Hashes for mhctools-3.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`10be33fecd58783b97a6c5f38f486a6ff1fc547b64a65ad327bb5565c4d47e1f`
MD5	`3fbd28da9ba5820aa66f17f4fcbdb805`
BLAKE2b-256	`a8c50c6b0368d9e20f7d662464a7712a9ce5ee9cf9e26d07b90396aac58b44ca`

See more details on using hashes here.

mhctools 3.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

mhctools

Installation

Data model

Quick start

Python API

Predicting peptides

Scanning proteins

DataFrames

Multi-sample predictions

Measurement kinds

The Pred object

Supported predictors

MHC binding & presentation

Antigen processing

Commandline examples

Prediction for user-supplied peptide sequences

Automatically extract peptides as subsequences of specified length

Legacy API

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes