Skip to main content

Python Package for running custom protein inference algorithms on tab-formatted tandem MS/MS search results.

Project description

Py Protein Inference

Requirements

  1. Python 3.7 or greater. This package was created using Python 3.7
  2. Python Packages: numpy, pyteomics, pulp, PyYAML, matplotlib. These should be installed automatically during installation.

Quick Start Guide

  1. Install the package using pip:
pip install pyproteininference
  1. Run the Heuristic method with tab delimited results directly from percolator to generate results for peptide centric, parsimony, inclusion, and exclusion:
protein_inference_heuristic_cli.py \
-t /path/to/target/file1.txt \
-d /path/to/decoy/file1.txt \
-db /path/to/database/file.fasta 
  1. Run the standard commandline tool with tab delimited results directly from percolator to run a particular inference method. By default, peptide centric inference is selected if a parameter file is not specified:
protein_inference_cli.py \
-t /path/to/target/file.txt \
-d /path/to/decoy/file.txt \
-db /path/to/database/file.fasta 
  1. Specifying Parameters. The two most common parameters to change are the inference type, and the decoy symbol (for identifying decoy proteins vs target proteins). The parameters can be quickly altered by creating a file called params.yaml as follows:
parameters:
  inference:
    inference_type: parsimony
  identifiers:
    decoy_symbol: "decoy_"

The inference type can be one of: parsimony, peptide_centric, inclusion, exclusion, or first_protein. All parameters are optional, so you only need to define the ones you want to alter. Parameters that are not defined are set to default values. See the package documentation for the default parameters.

  1. Run the standard commandline tool again, this time specifying the parameters as above:
protein_inference_cli.py \
-t /path/to/target/file.txt \
-d /path/to/decoy/file.txt \
-db /path/to/database/file.fasta \
-y /path/to/params.yaml
  1. Running with docker
    • Either Pull the image from docker hub:
      • docker pull hinklet/pyproteininference:latest
    • Or Build the image with the following command (After having cloned the repository):
      • git clone REPOSITORY_URL
      • cd pyproteininference
      • docker build -t pyproteininference:latest .
    • Run the tool, making sure to volume mount in the directory with your input data and parameters. In the case below, that local directory would be /path/to/local/directory and the path in the container is /data
        	docker run -v /path/to/local/directory/:/data \
        	-it hinklet/pyproteininference:latest \
        	python /usr/local/bin/protein_inference_cli.py \
        	-f /data/input_file.txt \
        	-db /data/database.fasta \
        	-y /data/parameters.yaml \
        	-o /data/
      

Documentation

For more information please see the full package documentation (https://thinkle12.github.io/pyproteininference/).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyproteininference-1.0.1.tar.gz (638.8 kB view details)

Uploaded Source

Built Distribution

pyproteininference-1.0.1-py3-none-any.whl (67.4 kB view details)

Uploaded Python 3

File details

Details for the file pyproteininference-1.0.1.tar.gz.

File metadata

  • Download URL: pyproteininference-1.0.1.tar.gz
  • Upload date:
  • Size: 638.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for pyproteininference-1.0.1.tar.gz
Algorithm Hash digest
SHA256 420b03f89f9b1669747d5281ee77216dd1be90c3dc4b2b1dbd45877c68448071
MD5 ed21c4a8a71d64be69ae10da602a7209
BLAKE2b-256 fda2d71789bbebfe63a8d97943e89eaf6a28e4e76bf03b72f597633e4d7c5abe

See more details on using hashes here.

File details

Details for the file pyproteininference-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for pyproteininference-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bd6714931f645d0bea2ed546af48d114226a5780662a797c8d096541947c60cd
MD5 98868d9f0e7e04a049876f3188d76774
BLAKE2b-256 e1cf0d8a96aa5d031fb17895a98baadae9c93519d67948b2508d39fba5745423

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page