Skip to main content

ccHBGF - consensus clustering using Hybrid Bipartite Graph Formulation (HBGF)

Project description

ccHBGF: Graph-based Consensus Clustering

Overview of Consensus Clustering Workflow

A python-based consensus clustering function utilising Hybrid Bipartite Graph Formulation (HBGF).

The ccHBGF function performs consensus clustering by following these steps:

  1. Definition of a bipartite graph adjaceny matrix A
  2. Decomposition of A into a spectral embedding UVt
  3. Clustering of UVt into a consensus labels

Installation

pip install ccHBGF
pip install 'ccHBGF[tutorial]' # When running example notebooks

Hybrid Bipartite Graph Formulation (HBGF)

Overview of Consensus Clustering Workflow

HBGF is a graph-based consensus ensemble clustering technique. This method constructs a bipartite graph with two types of vertices: observations and clusters from different clusteirng solutions. An edge exists only between an observation vertex and a cluster vertex, indicating the object's membership in that cluster. The graph is then partitioned using spectral partitioning to derive consensus labels for all observations.

Example Usage

from ccHBGF import ccHBGF

consensus_labels = ccHBGF(solutions_matrix, init='orthogonal', tol=0.1, verbose=True, random_state=0)

Where the solutions_matrix is of shape (m,n):

  • m = the number of observations
  • n = the number of different clustering solutions.

Please refer to notebooks/ for more detailed examples.

References

[1] Hu, Tianming, et al. "A comparison of three graph partitioning based methods for consensus clustering." Rough Sets and Knowledge Technology: First International Conference, RSKT 2006, Chongquing, China, July 24-26, 2006. Proceedings 1. Springer Berlin Heidelberg, 2006.

[2] Fern, Xiaoli Zhang, and Carla E. Brodley. "Solving cluster ensemble problems by bipartite graph partitioning." Proceedings of the twenty-first international conference on Machine learning. 2004.

[3] Ng, Andrew, Michael Jordan, and Yair Weiss. "On spectral clustering: Analysis and an algorithm." Advances in neural information processing systems 14 (2001).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cchbgf-0.1.0.tar.gz (3.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cchbgf-0.1.0-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file cchbgf-0.1.0.tar.gz.

File metadata

  • Download URL: cchbgf-0.1.0.tar.gz
  • Upload date:
  • Size: 3.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.29

File hashes

Hashes for cchbgf-0.1.0.tar.gz
Algorithm Hash digest
SHA256 d37981dad6d326b8a71eb10e3a029d7722cb958302d01951b860619379073b4e
MD5 dd6c46bf1055ace36d8487a5a846172d
BLAKE2b-256 578cdc695e63524078333690d68ebff566b892c64433cf07c536cc496e8d497d

See more details on using hashes here.

File details

Details for the file cchbgf-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: cchbgf-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.29

File hashes

Hashes for cchbgf-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5cefd02cd104d9566548a0844b03c28176659fe5b938f38b5a93a32c08e59a04
MD5 d93ebf12892b44cbd505edd49ac85ff0
BLAKE2b-256 012e2a4d36ab230921b94e3b24d083ccaeea036068412b55c7c7eb5772bb75eb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page