Skip to main content

Automatic Clustering selection with Ray Tune

Project description

Tests Codecov PythonVersion PyPi Docs

auto-clustering

Automatic Clustering selection with Ray Tune

Important Note:

This tool may optimize metrics such as the validity index, silhouette or the davies-bouldin score, but as this is based on unsupervised learning, these metrics may not always reflect the true usefulness of the resulting clusters.

Example: Clustering Selection

from autoclustering import AutoClustering
from sklearn.datasets import load_digits


data, _ = load_digits(return_X_y=True)

clustering = AutoClustering(num_samples=50,
                            metric='validity_index',
                            n_jobs=-1,
                            verbose=0)

clustering.fit(data)

clustering.best_params_
clustering.best_score_
clustering.n_clusters_
clustering.best_estimator_

clustering.predict(data)

Changelog

See the changelog for notes on the changes of auto-clustering

Source code

You can check the latest development version with the command:

git clone https://github.com/rodrigo-arenas/auto-clustering.git

Install the development dependencies:

pip install -r requirements.txt

Check the latest in-development documentation: https://auto-clustering.readthedocs.io/en/latest/

Contributing

Contributions are more than welcome!

There are several opportunities on the ongoing project, so please get in touch if you would like to help out. Make sure to check the current issues and also the Contribution guide.

Big thanks to the people who are helping with this project!

Contributors

Testing

After installation, you can launch the test suite from outside the source directory:

pytest autoclustering/

Disclaimer

The library is still experimental and under heavy development

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

auto-clustering-0.1.0.dev0.tar.gz (7.9 kB view hashes)

Uploaded Source

Built Distribution

auto_clustering-0.1.0.dev0-py3-none-any.whl (9.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page