IPTK is a library specialized in the analysis of HLA-peptidomes identified through an immunopeptidomics pipeline.
Project description
The immunopeptidomic toolkit library, IPTK
Introduction and Project Aim
IPTK is a Python library specialized in the analysis of HLA-peptidomes identified through an immunopeptidomic(IP) pipeline. The library provides a high level API for analyzing and visualizing the identified peptides, integrating transcritomics and protein structure information for a rich analysis of the identified immunopeptidomes. It also provides a toolbox for integrating and comparing different experiments and/or different runs.
Release 0.6 notice
Version 0.6 brings major upgrades to the library and introduce a wide array of function and classes for automating and accelerating IPTK performance
1- IPTK can now parse and work with mzIdentML files using the function parse_mzIdentML_to_identification_table define in the IO module of the library
2- IPTK can now process and read mzML files directly using PyOpenMS
3- IPTK has an improved function executional speed thanks to the AcceleratedFunctions module in the Analysis module which provides an acceleration using Numba
4- Current release also introduce, the Wrappers module which provide a simple abstraction for creating Experiment and ExperimentSet
5- Introducing ReplicatedExperiments which provides a simple API for creating experiments obtained from replicates
6- IPTK, current support concurrent execution, the wrapper submodules, now utilizes multiprocessing for parsing and reading multiple datasets on-parallel
7- Introducing, chordDiagram for showing overlap among experiments and Proband of experiments
Release 0.6.6 notice
1- Introducing GOEngine class which provides an easy-to-use wrapper around goatools for performing GOEA on the identified proteins.
2- Current release supports Jaccard index as a metric of similarity among experiments
3- Introducing support for visualizing GOEA results
4- Correction of minor bugs and documentation typos in previous releases
The road to version 1.0
The major plan is to, first, increase and enhance IPTK scale and execution speed by offloading computational intensive tasks to RUST. Second, increase automation by providing custom analysis recipes for performing commonly used routines. Third, provide an API for integrating other omics layers, namely metabolomics and proteomics. Finally, adding support to PTM modified HLA peptides and proteins
Planned features for 0.7.* Release
1. Release 0.7.1 will aim at supporting the integration of Proteomic data with the library
2. Release 0.7.2 will aim at supporting the integration of Metabolomics data with the library
3. Release 0.7.3 will aim at standardizing all omics API and provide a high-level abstraction for working with them
4. Release 0.7.4-0.7.7 will aim at re-implement all the class in Rust and provide a python wrapper around these classes, Thus ensuring fast and concurrent execution
Planned features for 0.8.*
1. Release 0.8.1-0.8.4 will aim at re-implement all IPTK parsers in Rust and provide a python binder to it
2. Release 0.8.5-0.8.8 will aim at re-implementing all analysis function using Rust
Planned features for 0.9.*
Different minor releases will introduce different analysis Recipes to automate analysis tasks
Planning for version 1.0.0
IPTK version 1.0 is release on PyPi and on BioConda
Previous versions Release notice
Release 0.5 notice:
1- Adding a class to query AFND database for allele frequency world-wide.
2- Adding function for plotting a choropleth for allele frequencies.
3- Adding classes for working directly with mzML files using pyopenMS framework
4- An experimental class that act as database interface and provide method for storing and querying immunopeptidomic data
Release 0.4.11 notice:
Adding more control to the function plot_MDS_from_ic_coverage to fine-tune its behavior, for example, by controlling the random seed.
Release 0.4.10 notice:
Corrected a bug in the Experiment class to correctly compute the length of peptides containing parentheses. This bug caused the len function to return the number of characters in the sequence instead of the number of amino acids.
Release 0.4.8 notice:
Corrected a bug in the Peptide class to manage peptides containing parentheses in the sequence. This bug caused the len function to return the number of characters in the sequence instead of the number of amino acids.
Release 0.4.7 notice:
Minor corrections in the visualization module
Release 0.4.6 notice:
Minor corrections in the documentation and the default values for some parameters in the visualization functions
Release 0.4.0 notice:
1- Adding function to compute immunopeptiomic coverage matrix
2- Introducing MDS plots for comparing the similarities between runs based on immunopeptidomic coverage
Tutorials
The library have three notebooks that provide a step-by-step guidance to use the library and to utilize its major APIs for interacting with an IPs data. These tutorial can be found at the Tutorial directory at the project's Github page.
IPTK has been documented using Sphinx, the manual of the library can be found at docs directory and online at readthedocs
Installation
The library can be installed using pip as follows:
pip install iptkl --user
Funding
The project was funded by the German Research Foundation (DFG) (Research Training Group 1743, ‘Genes, Environment and Inflammation’).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for iptkl-0.6.10.macosx-10.7-x86_64.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | d8e96a2e5a92ab73f0d3299ae638629e8f42ba0a2ad9a24edd46740883dca4c3 |
|
MD5 | 2b7288a9a30128d8dfca717687c1634b |
|
BLAKE2b-256 | b8ed9043503469489fcbb992ce528af0d314c688338fc8992dd0f5f72b20721c |
Hashes for iptkl-0.6.10-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1767bb0a90baa3904a8c75145f1a1ba21892178eb2f77177e6da745410b93743 |
|
MD5 | b849a906abfecfd4e07b67a39b36b869 |
|
BLAKE2b-256 | b8b33e699d288bf6773f8c9f628e0804a5d16da1c69d6d688d2399c81350e8a3 |