A modular Python toolkit for peptide modelling, rescoring, and structural bioinformatics
Project description
🧬 PepKit
A Python Toolkit for Peptide Modeling, Analysis, and Benchmarking
PepKit is a modular, peptide-centric Python toolkit designed to support end-to-end computational peptide workflows, from sequence processing and descriptor calculation to structural modeling, confidence assessment, and docking-oriented analysis.
The package is built with reproducibility, scalability, and interoperability in mind, making it suitable for:
- Peptide–protein interaction studies
- Machine-learning–ready dataset construction
- Structural bioinformatics and docking benchmarks
- Large-scale peptide screening and analysis pipelines
✨ Key Features
1️⃣ Sequence I/O & Standardization
- Convert between FASTA and SMILES formats (
fasta_to_smiles,smiles_to_fasta). - Validate and standardize peptide sequences (canonical residues, charge models).
- Batch processing for lists and pandas DataFrames.
2️⃣ Physicochemical Descriptors & Clustering
- Compute peptide-level descriptors (molecular weight, charge, hydrophobicity, pI).
- Generate descriptor tables for ML pipelines.
- Cluster peptide libraries based on sequence or chemical similarity.
3️⃣ Structural Modeling & Confidence Metrics
- Post-process AlphaFold / AlphaFold-Multimer outputs.
- Compute confidence metrics:
- pLDDT (global, peptide, interface)
- PAE (interface-aware)
- pTM / ipTM / composite pTM
- pDockQ, pDockQ2, MPDockQ
- Optional DockQ evaluation against experimental structures.
4️⃣ Peptide–Protein Dataset Construction
- Automated querying of RCSB PDB for peptide–protein complexes.
- Heuristic peptide-chain detection and interface extraction.
- CSV + FASTA export for benchmarking and ML.
5️⃣ Docking & Benchmark Pipelines
- Prepare inputs for docking and scoring workflows.
- Integrate predicted and experimental metrics in unified tables.
- Designed to interoperate with Rosetta, AlphaFold, and downstream scoring tools.
📦 Installation
Install from PyPI
pip install pepkit
Development installation
git clone https://github.com/Vivi-tran/PepKit.git
cd PepKit
pip install -e .
🚀 Quickstart
This section shows how to get started with PepKit in just a few lines of code, covering the most common peptide-centric tasks.
🔹 Sequence Conversion
Convert between peptide FASTA and SMILES representations:
from pepkit.conversion import fasta_to_smiles, smiles_to_fasta
# FASTA → SMILES
seq = "ACDEFGHIK"
smiles = fasta_to_smiles(seq)
print("SMILES:", smiles)
# SMILES → FASTA
back_seq = smiles_to_fasta(smiles)
print("FASTA:", back_seq)
📚 Documentation
Full documentation is available at:
👉 https://pepkit.readthedocs.io/en/latest/
License
This project is licensed under MIT License - see the License file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pepkit-0.0.5.tar.gz.
File metadata
- Download URL: pepkit-0.0.5.tar.gz
- Upload date:
- Size: 49.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1e436d42bc857215c83267eb276f5f60283586f958980390d6e8b19f6d47eadc
|
|
| MD5 |
bbf127fb2d2942ae50863f53c4ebc607
|
|
| BLAKE2b-256 |
11697e65bdfe3c266d8f0182df9020bfcde2e2c72c0b8b0787d02b573417e465
|
Provenance
The following attestation bundles were made for pepkit-0.0.5.tar.gz:
Publisher:
publishing-package.yml on Vivi-tran/PepKit
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
pepkit-0.0.5.tar.gz -
Subject digest:
1e436d42bc857215c83267eb276f5f60283586f958980390d6e8b19f6d47eadc - Sigstore transparency entry: 836860107
- Sigstore integration time:
-
Permalink:
Vivi-tran/PepKit@7d2abe4dff59dbd19e8a8a91318d4c9986594b9d -
Branch / Tag:
refs/tags/v0.0.5 - Owner: https://github.com/Vivi-tran
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publishing-package.yml@7d2abe4dff59dbd19e8a8a91318d4c9986594b9d -
Trigger Event:
release
-
Statement type:
File details
Details for the file pepkit-0.0.5-py3-none-any.whl.
File metadata
- Download URL: pepkit-0.0.5-py3-none-any.whl
- Upload date:
- Size: 380.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
19b1cdbf2b5baa9f5d37e9fd3e497f0827804babf1ac42185ca5ae7af852de59
|
|
| MD5 |
c21a0470e5acf3d375a73d03beaf1311
|
|
| BLAKE2b-256 |
e53c26353ba287e6619a3499d862d69aac51d714532932e7e81e45db3eb583bf
|
Provenance
The following attestation bundles were made for pepkit-0.0.5-py3-none-any.whl:
Publisher:
publishing-package.yml on Vivi-tran/PepKit
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
pepkit-0.0.5-py3-none-any.whl -
Subject digest:
19b1cdbf2b5baa9f5d37e9fd3e497f0827804babf1ac42185ca5ae7af852de59 - Sigstore transparency entry: 836860222
- Sigstore integration time:
-
Permalink:
Vivi-tran/PepKit@7d2abe4dff59dbd19e8a8a91318d4c9986594b9d -
Branch / Tag:
refs/tags/v0.0.5 - Owner: https://github.com/Vivi-tran
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publishing-package.yml@7d2abe4dff59dbd19e8a8a91318d4c9986594b9d -
Trigger Event:
release
-
Statement type: