Skip to main content

Tiny protein-function classifier distilled from ProteinMPNN, for edge deployment.

Project description

capiti

Tiny protein-function classifier for edge deployment. Given a nucleotide sequence encoding a protein, capiti flags whether the encoded protein is expected to retain the enzymatic function of one of a small reference set.

Weighs ~1 MB on disk, runs inference in tens of milliseconds on a Raspberry Pi. Trained by distilling ProteinMPNN's function-preserving design prior into a small 1D CNN.

Overview (each ResidualDilatedBlock collapsed to one box):

CapitiCNN overview

Inside one ResidualDilatedBlock:

ResidualDilatedBlock detail

See docs/capiti.summary.txt for the full per-layer size / FLOP table.

Install

pip install capiti

Use

capiti ATGCGTAAAGTGGCC...           # prints TRUE or FALSE (default set ab9)
capiti ATGCGT...  --cutoff 0.8 -v   # TRUE  p_inset=0.995
capiti --fasta seqs.fa              # batch over a FASTA
echo ATGCGT... | capiti --stdin

Reference sets

capiti ships multiple bundled reference sets, selectable at invocation time. Use --set NAME (or the CAPITI_SET env var).

set status description
ab9 available 9 soluble enzymes (several beta-lactamases relevant to antibiotic resistance, plus other enzymes). Default.
C planned TBA.
E planned TBA.
capiti ATGCGT... --set ab9
capiti --fasta seqs.fa --set C
CAPITI_SET=E capiti --stdin

Exit code is 0 on TRUE, 1 on FALSE, suitable for shell pipelines:

capiti ATGCGT... && echo "in set" || echo "not in set"

Status

The 0.0.x line is research-grade. The CLI surface (flags, stdin/FASTA behaviour, exit codes) is stable; bundled models will be retrained and updated between 0.0.x releases, and should not be used for anything operational.

License

MIT.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

capiti-0.0.3.tar.gz (857.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

capiti-0.0.3-py3-none-any.whl (856.6 kB view details)

Uploaded Python 3

File details

Details for the file capiti-0.0.3.tar.gz.

File metadata

  • Download URL: capiti-0.0.3.tar.gz
  • Upload date:
  • Size: 857.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for capiti-0.0.3.tar.gz
Algorithm Hash digest
SHA256 993ab8aee81c144bca3f53a5660c3772508fc8cd0eef4dc295938492bb2de036
MD5 ada43aeb8057ee59b45af2a0c9730bc7
BLAKE2b-256 223cc215d673acf47dda12e41a16fbd4e6389b671980b177196fca03d932ccee

See more details on using hashes here.

File details

Details for the file capiti-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: capiti-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 856.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for capiti-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 27693e64e8368ea13a2361db91070588419b9548a87619ce889a6f5c55ead758
MD5 378fa2ada8e9dc92f9e304649c13cd3b
BLAKE2b-256 3524a91ec083651dc62287df662258e92259c71a95882f3cda763ad3cbfc9a44

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page