Skip to main content

Large scale Markov clustering (MCL) via subgraph extraction

Project description

bigmcl

Large scale Markov clustering (MCL) via subgraph extraction

bigmcl will isolate disconnected subgraphs from a large graph file and execute MCL on the subgraphs. bigmcl enables MCL on large, highly disconnected graphs, such as those used in orthogroup inference. Not recommended for graphs that are manageable with typical MCL.

Important to note that the inflation parameter is affected by this approach - I have noted clusters are more granular if anything. In the future, I plan on implementing a systematic approach option for identifying ideal inflations for each subgraph.

Please cite this git repository and MCL when this software contributes to your analysis.

DISCLAIMER

bigmcl is currently in a beta state, and while I appreciate bringing issues to my attention, I am currently focused on getting things working well for my own research, so I cannot guarantee timely issue resolution. My hope is bigmcl will be in a longterm stable state by publication 2022.


INSTALL

pip install bigmcl

Clone mcl from here, compile, and add to your path.


USE

Input and go:

bigmcl -i <GRAPH.imx> -I 1.5

More elaborate options:

usage: bigmcl.py [-h] -i INPUT -I INFLATION [-s] [-r ROW_FILE] [-m] [-o OUTPUT] [-c CORES]
                 [-v]

Isolates disconnected graphs and runs MCL on the subgraphs. Input data must be numerical.

optional arguments:
  -h, --help            show this help message and exit
  -i INPUT, --input INPUT
                        MCL graph file in imx format
  -I INFLATION, --inflation INFLATION
  -s, --symmetric       Matrix is symmetric (throughput increase)
  -r ROW_FILE, --row_file ROW_FILE
                        Continue from finished row.txt
  -m, --mcl_format      Output clusters in MCL format
  -o OUTPUT, --output OUTPUT
                        Alternative output directory
  -c CORES, --cores CORES
  -v, --verbose

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigmcl-0.2b2.tar.gz (13.3 kB view details)

Uploaded Source

Built Distribution

bigmcl-0.2b2-py3-none-any.whl (21.5 kB view details)

Uploaded Python 3

File details

Details for the file bigmcl-0.2b2.tar.gz.

File metadata

  • Download URL: bigmcl-0.2b2.tar.gz
  • Upload date:
  • Size: 13.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.6.0.post20210108 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.5

File hashes

Hashes for bigmcl-0.2b2.tar.gz
Algorithm Hash digest
SHA256 00597064f12f7282b6f7eef932229bce7ec9d86dd6da3b53fb6069db469ba909
MD5 17ef30e38a473f4e433af60167780e48
BLAKE2b-256 745b17291f8a5b15f9d0b4b531174adee9a2d379a8ec7d23e8ec4f35327c8f1c

See more details on using hashes here.

File details

Details for the file bigmcl-0.2b2-py3-none-any.whl.

File metadata

  • Download URL: bigmcl-0.2b2-py3-none-any.whl
  • Upload date:
  • Size: 21.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/49.6.0.post20210108 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.5

File hashes

Hashes for bigmcl-0.2b2-py3-none-any.whl
Algorithm Hash digest
SHA256 b5212b80a8fa6f00f0371b045208a24c55d72f54a547e704af2ad60bee11f447
MD5 6bcff18d1f304d48f640d7bd8382aa62
BLAKE2b-256 9b67154df6e7e66afe5a2431292d484b98cb15d6c525103c55db2b9a45bf421f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page