Skip to main content

Mapping peptides to source protein for immunopeptidiomics data analysis

Project description

ProtPeptigram

CI/CD Cross-Platform PyPI version Python 3.10+ License: MIT

Visualization of Immunopeptides Mapped to Source Proteins Across Multiple Samples

ProtPeptigram provides a comprehensive visualization platform for mapping immunopeptides to their source proteins across different biological samples. This tool can enables to identify peptide coverage patterns, analyze density distributions, and compare peptide presentations between experimental conditions.

ProtPeptigram Visualization Example

Features

  • Intuitive Peptide Visualization: Map peptides to their source proteins with detailed positional information
  • Multi-Sample Support: Compare peptide presentation across different experimental conditions
  • Intensity-Based Coloring: Visualize peptide abundance with customizable color schemes
  • Automatic Highlighting: Identify regions of interest with dense peptide coverage
  • Publication-Quality Outputs: Generate high-resolution figures suitable for scientific publications
  • Customizable Visualizations: Adjust color schemes, highlighting, and display options to suit your needs

Installation

From PyPI (Recommended)

pip install protpeptigram

From TestPyPI

pip install -i https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple/ protpeptigram

From Source

git clone https://github.com/Sanpme66/ProtPeptigram.git
cd ProtPeptigram
pip install -e .

Requirements

  • Python ≥ 3.8
  • pandas
  • matplotlib
  • numpy
  • Biopython
  • rich

Quick Start

Command Line Usage

# Basic usage with minimal options
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_directory

# Specify top 10 proteins by peptide count
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_dir -tp 10

# Visualize specific proteins
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_dir -pl protein_list.txt

# Apply intensity threshold
protpeptigram -i data/peptides.csv -f data/proteome.fasta -o output_dir -th 1000

Python API Usage

from ProtPeptigram.DataProcessor import PeptideDataProcessor
from ProtPeptigram.viz import ImmunoViz

# Initialize data processor
processor = PeptideDataProcessor()

# Load data
processor.load_peaks_data("data/peptides.csv")
processor.load_protein_sequences("data/proteome.fasta")

# Process data
formatted_data = processor.filter_and_format_data(
    filter_contaminants=True,
    intensity_threshold=1000,
    min_samples=2
)

# Create visualizations
viz = ImmunoViz(formatted_data)
fig, _ = viz.plot_peptigram(
    protein_ids=["P20152", "P32261"],
    group_by="Sample",
    color_by="protein",
    title="HLA Peptide Visualization"
)

# Save visualization
fig.savefig("protein_visualization.png", dpi=300, bbox_inches="tight")

Running on Google Colab

You can quickly try out ProtPeptigram on Google Colab without installing anything locally. Click the link below to open the example notebook:

Open in Colab

Documentation

For detailed documentation including API reference, tutorials, and examples:

Input Data Format

ProtPeptigram accepts peptide data in CSV format from PEAKS software with the following columns:

  • Peptide sequence
  • Protein accession
  • Intensity values for each sample

For protein sequences, standard FASTA format files are supported.

Citation

If you use ProtPeptigram in your research, please cite:

Krishna S, Li C, et al. (2024). ProtPeptigram: Visualization tool for mapping peptides to source proteins.
bioRxiv. https://www.monash.edu/research/compomics/

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Developed at Li Lab/Purcell Lab, Monash University, Australia
  • Inspired by the need for better visualization tools in immunopeptidomics research

Contact

Sanjay Krishna - GitHub

Project Link: https://github.com/Sanpme66/ProtPeptigram

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protpeptigram-1.0.0.tar.gz (28.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

protpeptigram-1.0.0-py3-none-any.whl (26.6 kB view details)

Uploaded Python 3

File details

Details for the file protpeptigram-1.0.0.tar.gz.

File metadata

  • Download URL: protpeptigram-1.0.0.tar.gz
  • Upload date:
  • Size: 28.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for protpeptigram-1.0.0.tar.gz
Algorithm Hash digest
SHA256 fdaca526062ec0a8bd275f9582961979ec7d96ed47e0b0c6016864a91c9d95f4
MD5 8ca40b6b78b63449a0b6fc04c9398586
BLAKE2b-256 0689fa98559f332701ad7cb03bdc207b8718f235b63db6a34287b61fc88c0573

See more details on using hashes here.

File details

Details for the file protpeptigram-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: protpeptigram-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 26.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for protpeptigram-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3c6d47c8f8a0676c6d92f4a070de416347e1548dae4fd380f479cb10dd8461e3
MD5 a7834fa47917be29e59236486fa4b46a
BLAKE2b-256 e3ad55c3a5a300686d4fa4fe47d585fccff38ea955d51e85ac0ac160fc5ec279

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page