protclus: Library for protein complex discovery on PPI networks
Project description
protclus - Minimal Python 3 library for Protein Complex Discovery in PPI Networks
This is a Python 3 library containing clustering algorithms chiefly used for protein complex discovery in protein-protein interaction (PPI) networks.
It is inspired by the collection of Python 2 scripts in https://github.com/trueprice/python-graph-clustering (which is unfortunately due to deprecate in 2021) --- but updated for Python 3 and rewritten to as a set of classes and methods. As a result it is significantly easier to import and integrate into projects. Additionally it benefits from some additional flexibility in the output and some improvements on the speed of the algorithms.
It benefits from extremely simple usage that is shared between all of the methods
# Example using MCODE
filename = "data/unweighted_example_network.txt"
c = MCODE(filename)
c.cluster()
print(c.clusters())
c.save_clusters("mcode_example_clusters.txt")
Methods
Currently Included
- MCODE by Gary D. Bader and Christopher W. V. Hogue "An automated method for finding molecular complexes in large protein interaction networks."
- DPCLUS by Md Altaf-Ul-Amin et al. "Development and implementation of an algorithm for detection of protein complexes in large interaction networks."
- IPCA by Min Li, Jian-er Chen, Jian-xin Wang, Bin Hu, and Gang Chen. Modifying the dpclus algorithm for identifying protein complexes based on new topological structures.
- COACH by by Min Wu, Xiaoli Li, Chee-Keong Kwoh, and See-Kiong Ng. "A core-attachment based method to detect protein complexes in ppi networks."
Coming Soon
- Graph Entropy by E. C. Kenley and Y. Cho "Entropy-Based Graph Clustering: Application to Biological and Social Networks"
Installation
Installation from PyPI
pip install protclus
Installation from source from the project root folder where setup.py
can be done via
pip install -e .
Testing
python setup.py test
Citation
If this work was of use to you please cite the original authors of each of the algorithms and the previous Python2 script authors.
Additionally please consider citing the following paper, as this library was developed as part of
@misc{protclus,
title={Incorporating network based protein complex discovery into automated model construction},
author={Paul Scherer and Maja Trȩbacz and Nikola Simidjievski and Zohreh Shams and Helena Andres Terre and Pietro Liò and Mateja Jamnik},
year={2020},
eprint={2010.00387},
archivePrefix={arXiv},
primaryClass={q-bio.MN}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file protclus-0.9.1.tar.gz
.
File metadata
- Download URL: protclus-0.9.1.tar.gz
- Upload date:
- Size: 11.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b861e9bdf42de7b0e945391aff907aa485206742fdca3cca5b5c090ccd9ce97c |
|
MD5 | 06e699ae5fd5e4c55d5bc3be05bbc643 |
|
BLAKE2b-256 | c0b1548ecdf718d219f9286bbb2df5dca49426647ad81bd221045a0429f00358 |
File details
Details for the file protclus-0.9.1-py3-none-any.whl
.
File metadata
- Download URL: protclus-0.9.1-py3-none-any.whl
- Upload date:
- Size: 20.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.6.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 08c6c59d06b273a24d39e10bdef27f717b1e0ea970c7bfa374c912b7662fbf6b |
|
MD5 | 026e3877c829c49645eb1f3c9adcadfd |
|
BLAKE2b-256 | d13e02665c104a61768c5b9fff92c8b435b1514e80ca649572c362098310645c |