Skip to main content

A package for automatic clustering hyperparameter optmization

Project description

Hypercluster

A package for clustering optimization with sklearn.

Requirements:

pandas
numpy
scipy
matplotlib
seaborn
scikit-learn
hdbscan

Optional: snakemake

Install

pip install hypercluster

or

conda install -c bioconda hypercluster

Right now there are issue with the bioconda install on linux. Try the pip, if you are having problems.

Docs

https://hypercluster.readthedocs.io/en/latest/index.html

Examples

https://github.com/liliblu/hypercluster/tree/dev/examples

Quickstart example

import pandas as pd
from sklearn.datasets import make_blobs
import hypercluster

data, labels = make_blobs()
data = pd.DataFrame(data)
labels = pd.Series(labels, index=data.index, name='labels')

# With a single clustering algorithm
clusterer = hypercluster.utilities.AutoClusterer()
clusterer.fit(data).evaluate(
  methods = hypercluster.constants.need_ground_truth+hypercluster.constants.inherent_metrics, 
  gold_standard = labels
  )

hypercluster.visualize.visualize_evaluations(clusterer.evaluation_, multiple_clusterers=False)

# With a range of algorithms

evals, labels_df, labels_dict = optimize_clustering(data)

hypercluster.visualize.visualize_evaluations(evals)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hypercluster-0.1.2.tar.gz (13.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hypercluster-0.1.2-py3-none-any.whl (22.1 kB view details)

Uploaded Python 3

File details

Details for the file hypercluster-0.1.2.tar.gz.

File metadata

  • Download URL: hypercluster-0.1.2.tar.gz
  • Upload date:
  • Size: 13.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191201 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.0

File hashes

Hashes for hypercluster-0.1.2.tar.gz
Algorithm Hash digest
SHA256 04e3e64011a941d08c72c16eb24a496a7960fcecbac0a9f36e2c61b630a79dd2
MD5 ab8ca535d75b3690455afe4b8a25c95b
BLAKE2b-256 b29604c821ff62a568ce6a3078da66311038e615793e1367ae4231433f7d942d

See more details on using hashes here.

File details

Details for the file hypercluster-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: hypercluster-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 22.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191201 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.0

File hashes

Hashes for hypercluster-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 5421aee1d2b95fdf93f34543c9b41293206cd0ef7f81e2b54a065f6f1ad6561c
MD5 d5dd5c19779b50f58c3646bf22fba309
BLAKE2b-256 da446489ed062fd885b163d342961510f85d273409d3ee2825bb79f54ca80061

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page