Skip to main content

Mapping peptides to source protein for immunopeptidiomics data analysis

Project description

ProtPeptigram

CI/CD Cross-Platform PyPI version Python 3.10+ License: MIT

Visualization of Immunopeptides Mapped to Source Proteins Across Multiple Samples

ProtPeptigram provides a comprehensive visualization platform for mapping immunopeptides to their source proteins across different biological samples. This tool can enables to identify peptide coverage patterns, analyze density distributions, and compare peptide presentations between experimental conditions.

ProtPeptigram Visualization Example

Features

  • Intuitive Peptide Visualization: Map peptides to their source proteins with detailed positional information
  • Multi-Sample Support: Compare peptide presentation across different experimental conditions
  • Intensity-Based Coloring: Visualize peptide abundance with customizable color schemes
  • Automatic Highlighting: Identify regions of interest with dense peptide coverage
  • Publication-Quality Outputs: Generate high-resolution figures suitable for scientific publications
  • Customizable Visualizations: Adjust color schemes, highlighting, and display options to suit your needs

Installation

From PyPI (Recommended)

pip install protpeptigram

From TestPyPI

pip install -i https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ protpeptigram

From Source

git clone https://github.com/Sanpme66/ProtPeptigram.git
cd ProtPeptigram
pip install -e .

Requirements

  • Python ≥ 3.8
  • pandas
  • matplotlib
  • numpy
  • Biopython
  • rich

Quick Start

Command Line Usage

# Basic usage with minimal options
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_directory

# Specify top 10 proteins by peptide count
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_dir -tp 10

# Visualize specific proteins
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_dir -pl protein_list.txt

# Apply intensity threshold
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_dir -th 1000

Python API Usage

from ProtPeptigram.DataProcessor import PeptideDataProcessor
from ProtPeptigram.viz import ImmunoViz

# Initialize data processor
processor = PeptideDataProcessor()

# Load data
processor.load_peaks_data("data/peptides.csv")
processor.load_protein_sequences("data/proteome.fasta")

# Process data
formatted_data = processor.filter_and_format_data(
    filter_contaminants=True,
    intensity_threshold=1000,
    min_samples=2
)

# Create visualizations
viz = ImmunoViz(formatted_data)
fig, _ = viz.plot_peptigram(
    protein_ids=["P20152", "P32261"],
    group_by="Sample",
    color_by="protein",
    title="HLA Peptide Visualization"
)

# Save visualization
fig.savefig("protein_visualization.png", dpi=300, bbox_inches="tight")

Running on Google Colab

You can quickly try out ProtPeptigram on Google Colab without installing anything locally. Click the link below to open the example notebook:

Open in Colab

Documentation

For detailed documentation including API reference, tutorials, and examples:

Input Data Format

ProtPeptigram accepts peptide data in CSV format from PEAKS software with the following columns:

  • Peptide sequence
  • Protein accession
  • Intensity values for each sample

For protein sequences, standard FASTA format files are supported.

Citation

If you use ProtPeptigram in your research, please cite:

Krishna S, Li C, et al. (2024). ProtPeptigram: Visualization tool for mapping peptides to source proteins.
bioRxiv. https://www.monash.edu/research/compomics/

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Developed at Li Lab/Purcell Lab, Monash University, Australia
  • Inspired by the need for better visualization tools in immunopeptidomics research

Contact

Sanjay Krishna - GitHub

Project Link: https://github.com/Sanpme66/ProtPeptigram

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protpeptigram-1.2.0.dev0.tar.gz (28.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

protpeptigram-1.2.0.dev0-py3-none-any.whl (26.2 kB view details)

Uploaded Python 3

File details

Details for the file protpeptigram-1.2.0.dev0.tar.gz.

File metadata

  • Download URL: protpeptigram-1.2.0.dev0.tar.gz
  • Upload date:
  • Size: 28.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for protpeptigram-1.2.0.dev0.tar.gz
Algorithm Hash digest
SHA256 f67102cc5c7f6b20e21fbfdb87ef89846cd56ab0f1ea41b67fa4743fdb356982
MD5 b44ce2bb3d34261cb103bd02e5455790
BLAKE2b-256 7bd27e444ee546fa809b1794962fb14f2096e60b89431d6bf17201a11b631f08

See more details on using hashes here.

File details

Details for the file protpeptigram-1.2.0.dev0-py3-none-any.whl.

File metadata

File hashes

Hashes for protpeptigram-1.2.0.dev0-py3-none-any.whl
Algorithm Hash digest
SHA256 0620c3332531b6d0df1d01475a850cf7669cee416ba74d08f26663a3f691ab95
MD5 9f9e47fb70dbe5f1ab356d8357b5faf0
BLAKE2b-256 6859c92b54ba3e2c3c06254c2ab501ad0eef98ee5746cabda4f9f0856abb6964

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page