Structure-based Antibody Renumbering
Project description
Structure-based Antibody Renumbering
This repo is currently in development. If you encounter any bugs, please report the issue here.
SAbR (Structure-based Antibody Renumbering) renumbers antibody PDB files using the 3D coordinate of backbone atoms. It uses custom forked versions of SoftAlign and ANARCI to align structures to SAbDaB-derived consensus embeddings and renumber to various antibody schemes, respectively.
Installation and use
- SAbR can be installed into a virtual environment via pip:
# Latest release
pip install sabr-kit
# Most recent version from Github
git clone --recursive https://github.com/delalamo/SAbR.git
cd SAbR/
pip install -e .
It can then be run using the sabr command (see below).
Alternatively, SAbR can be directly run with the latest docker container:
❌ This doesn't currently work. Please check back soon! ❌
docker run --rm ghcr.io/delalamo/sabr:latest -i input.pdb -o output.pdb -c CHAIN_ID
Running SAbR
Practical considerations:
- Heavy and light chain structures are similar enough that chain type should be manually declared with
--chain-typeif possible (leave blank if uncertain). - It is recommended for now to truncate the query structure to contain only the Fv when running SAbR, as it will sometimes align variable region beta-strands to those in the constant region.
- When running scFvs, it is recommended to run each variable domain independently.
If running on a Mac with apple silicon, set the environmental variable JAX_PLATFORMS to cpu.
Usage: sabr [OPTIONS]
Structure-based Antibody Renumbering (SAbR) renumbers antibody PDB files
using the 3D coordinates of backbone atoms.
Options:
-i, --input-pdb FILE Input PDB file. [required]
-c, --input-chain TEXT Chain identifier to renumber.
-o, --output FILE Destination structure file. Use .pdb
extension for PDB format or .cif extension
for mmCIF format. mmCIF is required when
using --extended-insertions. [required]
-n, --numbering-scheme [imgt|chothia|kabat|martin|aho|wolfguy]
Numbering scheme. [default: (IMGT)]
--overwrite Overwrite the output PDB if it already
exists.
-v, --verbose Enable verbose logging.
--max-residues INTEGER Maximum number of residues to process from
the chain. If 0 (default), process all
residues.
-t, --chain-type [heavy|light|auto]
Restrict alignment to specific chain type
embeddings. 'heavy' searches only heavy
chain (H) embeddings, 'light' searches only
light chain (K and L) embeddings, 'auto'
searches all embeddings and picks the best
match. [default: auto]
--extended-insertions Enable extended insertion codes (AA, AB,
..., ZZ, AAA, etc.) for antibodies with very
long CDR loops. Requires mmCIF output format
(.cif extension). Standard PDB format only
supports single-character insertion codes
(A-Z, max 26 insertions per position)
-v, --verbose Verbose output
Known issues
- SAbR currently struggles with scFvs for two reasons. First, it is unclear how to assign canonical numbering to multiple domains within a single chain, unless we accept a spacer (e.g., starting chain #2 at 201 instead of 1). Second, it will sometimes align across both chains, introducing a massive insertion in between. It is unclear how to prevent this; please see issue #2 for details.
- SAbR sometimes mistakenly includes sheets from the Fab in the VH.
- The algorithm for renumbering CDRs, which is the same as the one for IMGT, does not account for unassigned residues. So if a residue is missing due to heterogeneity, the CDR numbering algorithm will misnumber other residues in the CDR.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sabr_kit-0.2.2.tar.gz.
File metadata
- Download URL: sabr_kit-0.2.2.tar.gz
- Upload date:
- Size: 2.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
391f7ba8ad287ddcc72d2e64c7af69996564c69e546a095a23e8a5f2f83ee641
|
|
| MD5 |
cce64784d32af2ad2a765ca82636079d
|
|
| BLAKE2b-256 |
2ce8acd9690b5bae35f965d5a0ad6917e2bb162e0fb1995f8ca8c0bbe6bd28a2
|
File details
Details for the file sabr_kit-0.2.2-py3-none-any.whl.
File metadata
- Download URL: sabr_kit-0.2.2-py3-none-any.whl
- Upload date:
- Size: 2.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7d2839e2f9bccc919e353472f9b3ba9b194c29012d71fe8eefc751a9c16b4944
|
|
| MD5 |
2aaf4db306311b4a641398925499655f
|
|
| BLAKE2b-256 |
b8fc6d07e705831728b5efbb53f3070b02536597be80d9959541a9c4c59b0885
|