Skip to main content

Automatic Clustering selection with Ray Tune

Project description

Tests Codecov PythonVersion PyPi Docs

auto-clustering

Automatic Clustering selection with Ray Tune

Important Note:

This tool may optimize metrics such as the validity index, silhouette or the davies-bouldin score, but as this is based on unsupervised learning, these metrics may not always reflect the true usefulness of the resulting clusters.

Example: Clustering Selection

from autoclustering import AutoClustering
from sklearn.datasets import load_digits


data, _ = load_digits(return_X_y=True)

clustering = AutoClustering(num_samples=50,
                            metric='validity_index',
                            n_jobs=-1,
                            verbose=0)

clustering.fit(data)

clustering.best_params_
clustering.best_score_
clustering.n_clusters_
clustering.best_estimator_

clustering.predict(data)

Changelog

See the changelog for notes on the changes of auto-clustering

Source code

You can check the latest development version with the command:

git clone https://github.com/rodrigo-arenas/auto-clustering.git

Install the development dependencies:

pip install -r requirements.txt

Check the latest in-development documentation: https://auto-clustering.readthedocs.io/en/latest/

Contributing

Contributions are more than welcome!

There are several opportunities on the ongoing project, so please get in touch if you would like to help out. Make sure to check the current issues and also the Contribution guide.

Big thanks to the people who are helping with this project!

Contributors

Testing

After installation, you can launch the test suite from outside the source directory:

pytest autoclustering/

Disclaimer

The library is still experimental and under heavy development

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

auto-clustering-0.1.0.dev0.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

auto_clustering-0.1.0.dev0-py3-none-any.whl (9.4 kB view details)

Uploaded Python 3

File details

Details for the file auto-clustering-0.1.0.dev0.tar.gz.

File metadata

  • Download URL: auto-clustering-0.1.0.dev0.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.9.6 requests/2.28.2 setuptools/65.6.3 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.10.9

File hashes

Hashes for auto-clustering-0.1.0.dev0.tar.gz
Algorithm Hash digest
SHA256 83156be4b903063aaec869067175c142e5f24d9e5740ac2c9cfd7424fe109598
MD5 729fb3152a4c53c5d91cfc341d40071b
BLAKE2b-256 21c0f676118d290e3134ee2f3338fa80696dfca47ed93af44acd834cdacb1078

See more details on using hashes here.

File details

Details for the file auto_clustering-0.1.0.dev0-py3-none-any.whl.

File metadata

  • Download URL: auto_clustering-0.1.0.dev0-py3-none-any.whl
  • Upload date:
  • Size: 9.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.9.6 requests/2.28.2 setuptools/65.6.3 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.10.9

File hashes

Hashes for auto_clustering-0.1.0.dev0-py3-none-any.whl
Algorithm Hash digest
SHA256 8fe5933df15f33815442c37f864a6fb56e4fd96e77e3b4206cfeb69d0f4aaec4
MD5 ae8c77a6394a84c0996a63c5cd4b79ee
BLAKE2b-256 9ee75ade0d2427b46132c53924380a783a72aba618247dad03db557c038343aa

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page