Skip to main content

protclus: Library for protein complex discovery on PPI networks

Project description

protclus - Minimal Python 3 library for Protein Complex Discovery in PPI Networks

This is a Python 3 library containing clustering algorithms chiefly used for protein complex discovery in protein-protein interaction (PPI) networks.

It is inspired by the collection of Python 2 scripts in https://github.com/trueprice/python-graph-clustering (which is unfortunately due to deprecate in 2021) --- but updated for Python 3 and rewritten to as a set of classes and methods. As a result it is significantly easier to import and integrate into projects. Additionally it benefits from some additional flexibility in the output and some improvements on the speed of the algorithms.

It benefits from extremely simple usage that is shared between all of the methods

# Example using MCODE
filename = "data/unweighted_example_network.txt"
c = MCODE(filename)
c.cluster()

print(c.clusters())
c.save_clusters("mcode_example_clusters.txt")

Methods

Currently Included

  • MCODE by Gary D. Bader and Christopher W. V. Hogue "An automated method for finding molecular complexes in large protein interaction networks."
  • DPCLUS by Md Altaf-Ul-Amin et al. "Development and implementation of an algorithm for detection of protein complexes in large interaction networks."
  • IPCA by Min Li, Jian-er Chen, Jian-xin Wang, Bin Hu, and Gang Chen. Modifying the dpclus algorithm for identifying protein complexes based on new topological structures.
  • COACH by by Min Wu, Xiaoli Li, Chee-Keong Kwoh, and See-Kiong Ng. "A core-attachment based method to detect protein complexes in ppi networks."

Coming Soon

  • Graph Entropy by E. C. Kenley and Y. Cho "Entropy-Based Graph Clustering: Application to Biological and Social Networks"

Installation

Installation from PyPI

pip install protclus

Installation from source from the project root folder where setup.py can be done via

pip install -e .

Testing

python setup.py test

Citation

If this work was of use to you please cite the original authors of each of the algorithms and the previous Python2 script authors.

Additionally please consider citing the following paper, as this library was developed as part of

@misc{protclus,
      title={Incorporating network based protein complex discovery into automated model construction}, 
      author={Paul Scherer and Maja Trȩbacz and Nikola Simidjievski and Zohreh Shams and Helena Andres Terre and Pietro Liò and Mateja Jamnik},
      year={2020},
      eprint={2010.00387},
      archivePrefix={arXiv},
      primaryClass={q-bio.MN}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protclus-0.9.1.tar.gz (11.2 kB view details)

Uploaded Source

Built Distribution

protclus-0.9.1-py3-none-any.whl (20.0 kB view details)

Uploaded Python 3

File details

Details for the file protclus-0.9.1.tar.gz.

File metadata

  • Download URL: protclus-0.9.1.tar.gz
  • Upload date:
  • Size: 11.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9

File hashes

Hashes for protclus-0.9.1.tar.gz
Algorithm Hash digest
SHA256 b861e9bdf42de7b0e945391aff907aa485206742fdca3cca5b5c090ccd9ce97c
MD5 06e699ae5fd5e4c55d5bc3be05bbc643
BLAKE2b-256 c0b1548ecdf718d219f9286bbb2df5dca49426647ad81bd221045a0429f00358

See more details on using hashes here.

File details

Details for the file protclus-0.9.1-py3-none-any.whl.

File metadata

  • Download URL: protclus-0.9.1-py3-none-any.whl
  • Upload date:
  • Size: 20.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9

File hashes

Hashes for protclus-0.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 08c6c59d06b273a24d39e10bdef27f717b1e0ea970c7bfa374c912b7662fbf6b
MD5 026e3877c829c49645eb1f3c9adcadfd
BLAKE2b-256 d13e02665c104a61768c5b9fff92c8b435b1514e80ca649572c362098310645c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page