Skip to main content

Compute protein descriptors

Project description

PyPI version Python Support Documentation Status Build Status Coverage Status

propy3

propy3 is a drop-in replacement for propy. The original project was developed by Dongsheng Cao and Yizeng Liang from 2010-2012. See the commit history for all changes made afterwards.

The reason for creating this fork of propy is to add Python 3 support.

The only point where you have to enter propy3 is at installation. Afterwards, you simply import propy.

Introduction

Sequence-derived structural and physicochemical features are highly useful for representing and distinguishing proteins or peptides of different structural, functional and interaction properties, and have been extensively used in developing methods and software for predicting protein structural and functional classes, protein-protein interactions, drug-target interactions, protein substrates, molecular binding sites on proteins, subcellular locations, protein crystallization propensity and peptides of specific properties. In order to conveniently apply these structural features from a protein sequence for researchers, we developed a propy package using pure python language, which could calculate a large number of protein descriptors from a protein sequence.

Features

The propy package has the following significant features:

  1. It is written by the pure python language. It only needs the support of some built-in modules in the python software.
  2. For academic users, it is free of charge. They can freely use and distribute it. For commercial purpose, they must contact the author.
  3. It can calculate a large number of protein descriptors including: amino acid composition descriptors, dipeptide composition descriptors, tri-peptide composition descriptors, Normalized Moreau-Broto autocorrelation descriptors, Moran autocorrelation descriptors, Geary autocorrelation descriptors, Composition, Transition, Distribution descriptors (CTD), sequence order coupling numbers, quasi-sequence order descriptors, pseudo amino acid composition descriptors, amphiphilic pseudo amino acid composition descriptors.
  4. The users could specify the needed properties of 20 amino acids to calculate the corresponding protein descriptors.
  5. The package includes the module which could directly download the protein sequence form uniprot website by uniprot id.
  6. The package includes the module which could automatrically download the property from the AAindex database. Thus, the user could calcualte thousands of protein features.

The protein descriptors calculated by propy

  1. AAC: amino acid composition descriptors (20)
  2. DPC: dipeptide composition descriptors (400)
  3. TPC: tri-peptide composition descriptors (8000)
  4. MBauto: Normalized Moreau-Broto autocorrelation descriptors (depend on the given properties, the default is 240)
  5. Moranauto: Moran autocorrelation descriptors(depend on the given properties, the default is 240)
  6. Gearyauto: Geary autocorrelation descriptors(depend on the given properties, the default is 240)
  7. CTD: Composition, Transition, Distribution descriptors (CTD) (21+21+105=147)
  8. SOCN: sequence order coupling numbers (depend on the choice of maxlag, the default is 60)
  9. QSO: quasi-sequence order descriptors (depend on the choice of maxlag, the default is 100)
  10. PAAC: pseudo amino acid composition descriptors (depend on the choice of lamda, the default is 50)
  11. APAAC: amphiphilic pseudo amino acid composition descriptors(depend on the choice of lamda, the default is 50)

Install

Pip

pip install propy3

BioConda

conda install -c bioconda propy3

Usage Example

For more examples, please see the user guide.

from propy import PyPro
from propy.GetProteinFromUniprot import GetProteinSequence

# download the protein sequence by uniprot id
proteinsequence = GetProteinSequence("P48039")

DesObject = PyPro.GetProDes(proteinsequence)  # construct a GetProDes object
print(DesObject.GetCTD())  # calculate 147 CTD descriptors
print(DesObject.GetAAComp())  # calculate 20 amino acid composition descriptors

# calculate 30 pseudo amino acid composition descriptors
paac = DesObject.GetPAAC(lamda=10, weight=0.05)

for i in paac:
    print(i, paaci)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

propy3-1.1.1.tar.gz (222.6 kB view details)

Uploaded Source

Built Distribution

propy3-1.1.1-py3-none-any.whl (290.3 kB view details)

Uploaded Python 3

File details

Details for the file propy3-1.1.1.tar.gz.

File metadata

  • Download URL: propy3-1.1.1.tar.gz
  • Upload date:
  • Size: 222.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2

File hashes

Hashes for propy3-1.1.1.tar.gz
Algorithm Hash digest
SHA256 ab67d6b469c4338a9f9994915065b48f84f131e60a54467e5633834cef08597a
MD5 680415d8ce39007b5833aa76826ef5b2
BLAKE2b-256 7119c36482a78c13b7a8494457e0e10351b51cb4391ba79c8aa4eacc9ae08b16

See more details on using hashes here.

File details

Details for the file propy3-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: propy3-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 290.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2

File hashes

Hashes for propy3-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0e8aa7ff37a1d6f0266ded966c89d1b76c08d7edf5f31a2f4fdbc07339a820af
MD5 480a920da20fe2eaab979b3715051416
BLAKE2b-256 4e89ae0b38c3e2c5c2fe3c23593b5aadea134ce268afc32d563db6176ff67fda

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page