Skip to main content

Map protein sites to human reference sequence.

Project description

Protmapper

The Protmapper maps references to protein sites to the human reference sequence based on UniProt, PhosphoSitePlus, and manual curation.

Installation

Python package

The Protmapper is a Python package that is available on PyPI and can be installed as:

pip install protmapper

Docker container

Alternatively, the Protmapper Docker container can be run to expose it as a REST API as:

docker run -d -p 8008:8008 labsyspharm/protmapper:latest

Command line interface

In addition to supporting usage via a Python API and a REST service, Protmapper also provides a command line interface that can be used as follows.

Run Protmapper on a list of proteins with residues and sites provided in a
text file.

positional arguments:
  input                 Path to an input file. The input file is a text file
                        in which each row consists of four comma separated
                        values, with the first element being a protein ID, the
                        second, the namespace in which that ID is valid
                        (uniprot or hgnc),third, an amino acid represented as
                        a single capital letter, and fourth, a site position
                        on the protein.
  output                Path to the output file to be generated. Each line of
                        the output file corresponds to a line in the input
                        file. Each linerepresents a mapped site produced by
                        Protmapper.

optional arguments:
  -h, --help            show this help message and exit
  --peptide             If given, the third element of each row of the input
                        file is a peptide (amino acid sequence) rather than a
                        single amino acid residue. In this case, peptide-
                        oriented mappings are applied. In this mode the
                        following boolean arguments are ignored.
  --no_methionine_offset
                        If given, will not check for off-by-one errors in site
                        position (possibly) attributable to site numbering
                        from mature proteins after cleavage of the initial
                        methionine.
  --no_orthology_mapping
                        If given, will not check sequence positions for known
                        modification sites in mouse or rat sequences (based on
                        PhosphoSitePlus data).
  --no_isoform_mapping  If given, will not check sequence positions for known
                        modifications in other human isoforms of the protein
                        (based on PhosphoSitePlus data).

Documentation

For a detailed documentation of the Protmapper, visit http://protmapper.readthedocs.io

Funding

The development of protmapper is funded under the DARPA Automated Scientific Discovery Framework project (ARO grant W911NF018-1-0124).

Citation

@article{bachman2019protmapper,
  author = {Bachman, John A and Gyori, Benjamin M and Sorger, Peter K},
  doi = {10.1101/822668},
  journal = {bioRxiv},
  publisher = {Cold Spring Harbor Laboratory},
  title = {{Assembling a phosphoproteomic knowledge base using ProtMapper to normalize phosphosite information from databases and text mining}},
  url = {https://www.biorxiv.org/content/early/2019/11/06/822668.1},
  year = {2019}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protmapper-0.0.29.tar.gz (34.4 kB view details)

Uploaded Source

Built Distribution

protmapper-0.0.29-py3-none-any.whl (35.9 kB view details)

Uploaded Python 3

File details

Details for the file protmapper-0.0.29.tar.gz.

File metadata

  • Download URL: protmapper-0.0.29.tar.gz
  • Upload date:
  • Size: 34.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for protmapper-0.0.29.tar.gz
Algorithm Hash digest
SHA256 d0d10198fb000401d03ba6c271869970caf8e1c8b46db686d2eed153fa8c2257
MD5 6556c798765a1494f775a910cf31c947
BLAKE2b-256 1db8767041ed4ba4fa07ab812b4641ce4a88aa36b5ed3a4ae150089a3e2c7619

See more details on using hashes here.

File details

Details for the file protmapper-0.0.29-py3-none-any.whl.

File metadata

  • Download URL: protmapper-0.0.29-py3-none-any.whl
  • Upload date:
  • Size: 35.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for protmapper-0.0.29-py3-none-any.whl
Algorithm Hash digest
SHA256 411597039c0ea07e2c43973409ca4c5cb3da16f5eea48e1b9e6c54b0f00a9552
MD5 8af5abd8a04ac144455cbc213d8ad7ac
BLAKE2b-256 3cbccdf85d1a6e4fc5287eb4e492e50f1c8a3a7a5a9b6660e364f894f64d1b0b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page