Skip to main content

A Python package for processing, manipulating and making inferences about antibody sequence data

Project description

AntPack

AntPack is a toolkit for antibody numbering, data processing, statistical inference and machine learning for antibody sequences. It is currently in active development -- more updates soon! For installation and how to use, see the docs.

What's new in version 0.2.6

Starting in version 0.2.0, there are only two dependencies, numpy and pybind. AntPack can now also indicate to you in its output which numbered positions are framework and CDR and can sort a list of position codes (which makes it easy to merge numbering for many different numbered sequences to generate a fixed length array or MSA file). Also there are some minor updates to the VJ gene tool; previously it just returned the V and J gene assignments, now it returns the percent identity to the assigned V and J genes as well.

Antibody numbering

Numbering antibody sequences is an important precursor for many statistical inference / machine learning applications. AntPack is orders of magnitude faster for numbering antibody sequences than existing tools in the literature (e.g. ANARCI, AbRSA), while providing >= reliability. AntPack also provides tools for merging lists of position codes and for easy extraction of specific CDRs and framework regions.

V / J genes

Identifying the most similar human V / J gene sequences is useful for a variety of purposes. AntPack provides tools for determining which human V and J gene sequences are most similar to the variable region chain provided as input.

Humanness and developability

Minimizing the risk of immunogenicity is important for selecting clinical candidates. In AntPack v0.1.0, we introduce a simple, fully interpretable generative model for human heavy and light chains that outperforms all comparators in the literature on a large held-out test set for distinguishing human sequences from those of other species. This scoring tool can be used to score sequences for humanness, suggest modifications to make them more human, identify liabilities, and generate highly human sequences that contain selected motifs.

Finding developability liabilities

Some sequence motifs are known to be associated with developability issues -- certain motifs are known, for example, to be prone to N-glycosylation or deamidation. AntPack provides a tool for finding these "liability" motifs in an input sequence. Note that that identifying liabilities through finding motifs in this way is known to be prone to false positives (an N-glycosylation motif, for example, will not always be glycosylated). Still, these kinds of alerts can be useful for making yourself aware of potential developability issues.

Citing this work

If using AntPack in research intended for publication, please cite either the preprint:

Jonathan Parkinson and Wei Wang. 2024. For antibody sequence generative modeling, mixture models may be all you need. bioRxiv: https://doi.org/10.1101/2024.01.27.577555

or the final paper in Bioinformatics.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

antpack-0.2.6.tar.gz (13.7 MB view details)

Uploaded Source

File details

Details for the file antpack-0.2.6.tar.gz.

File metadata

  • Download URL: antpack-0.2.6.tar.gz
  • Upload date:
  • Size: 13.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.12

File hashes

Hashes for antpack-0.2.6.tar.gz
Algorithm Hash digest
SHA256 fa07824b9fdda395dd608438af9c6d4ebee4f6f25f9b387c0fe5e50b66d214c4
MD5 b6e4d55269a1856600ced232e2920632
BLAKE2b-256 e09352cd3f5e769306b85b49f034c030ae996a1cdbc77e2de3e91ff47e154570

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page