Skip to main content

A modular Python toolkit for peptide modelling, rescoring, and structural bioinformatics

Project description

🧬 PepKit

A Python Toolkit for Peptide Modeling, Analysis, and Benchmarking

PepKit Logo

PepKit is a modular, peptide-centric Python toolkit designed to support end-to-end computational peptide workflows, from sequence processing and descriptor calculation to structural modeling, confidence assessment, and docking-oriented analysis.

The package is built with reproducibility, scalability, and interoperability in mind, making it suitable for:

  • Peptide–protein interaction studies
  • Machine-learning–ready dataset construction
  • Structural bioinformatics and docking benchmarks
  • Large-scale peptide screening and analysis pipelines

✨ Key Features

1️⃣ Sequence I/O & Standardization

  • Convert between FASTA and SMILES formats (fasta_to_smiles, smiles_to_fasta).
  • Validate and standardize peptide sequences (canonical residues, charge models).
  • Batch processing for lists and pandas DataFrames.

2️⃣ Physicochemical Descriptors & Clustering

  • Compute peptide-level descriptors (molecular weight, charge, hydrophobicity, pI).
  • Generate descriptor tables for ML pipelines.
  • Cluster peptide libraries based on sequence or chemical similarity.

3️⃣ Structural Modeling & Confidence Metrics

  • Post-process AlphaFold / AlphaFold-Multimer outputs.
  • Compute confidence metrics:
    • pLDDT (global, peptide, interface)
    • PAE (interface-aware)
    • pTM / ipTM / composite pTM
    • pDockQ, pDockQ2, MPDockQ
  • Optional DockQ evaluation against experimental structures.

4️⃣ Peptide–Protein Dataset Construction

  • Automated querying of RCSB PDB for peptide–protein complexes.
  • Heuristic peptide-chain detection and interface extraction.
  • CSV + FASTA export for benchmarking and ML.

5️⃣ Docking & Benchmark Pipelines

  • Prepare inputs for docking and scoring workflows.
  • Integrate predicted and experimental metrics in unified tables.
  • Designed to interoperate with Rosetta, AlphaFold, and downstream scoring tools.

📦 Installation

Install from PyPI

pip install pepkit

Development installation

git clone https://github.com/Vivi-tran/PepKit.git
cd PepKit
pip install -e .

🚀 Quickstart

This section shows how to get started with PepKit in just a few lines of code, covering the most common peptide-centric tasks.


🔹 Sequence Conversion

Convert between peptide FASTA and SMILES representations:

from pepkit.conversion import fasta_to_smiles, smiles_to_fasta


# FASTA → SMILES
seq = "ACDEFGHIK"
smiles = fasta_to_smiles(seq)
print("SMILES:", smiles)

# SMILES → FASTA
back_seq = smiles_to_fasta(smiles)
print("FASTA:", back_seq)

📚 Documentation

Full documentation is available at:

👉 https://pepkit.readthedocs.io/en/latest/

License

This project is licensed under MIT License - see the License file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pepkit-0.0.5.tar.gz (49.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pepkit-0.0.5-py3-none-any.whl (380.0 kB view details)

Uploaded Python 3

File details

Details for the file pepkit-0.0.5.tar.gz.

File metadata

  • Download URL: pepkit-0.0.5.tar.gz
  • Upload date:
  • Size: 49.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pepkit-0.0.5.tar.gz
Algorithm Hash digest
SHA256 1e436d42bc857215c83267eb276f5f60283586f958980390d6e8b19f6d47eadc
MD5 bbf127fb2d2942ae50863f53c4ebc607
BLAKE2b-256 11697e65bdfe3c266d8f0182df9020bfcde2e2c72c0b8b0787d02b573417e465

See more details on using hashes here.

Provenance

The following attestation bundles were made for pepkit-0.0.5.tar.gz:

Publisher: publishing-package.yml on Vivi-tran/PepKit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pepkit-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: pepkit-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 380.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pepkit-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 19b1cdbf2b5baa9f5d37e9fd3e497f0827804babf1ac42185ca5ae7af852de59
MD5 c21a0470e5acf3d375a73d03beaf1311
BLAKE2b-256 e53c26353ba287e6619a3499d862d69aac51d714532932e7e81e45db3eb583bf

See more details on using hashes here.

Provenance

The following attestation bundles were made for pepkit-0.0.5-py3-none-any.whl:

Publisher: publishing-package.yml on Vivi-tran/PepKit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page