Automatic Clustering selection with Ray Tune
Project description
auto-clustering
Automatic Clustering selection with Ray Tune
Important Note:
This tool may optimize metrics such as the validity index, silhouette or the davies-bouldin score, but as this is based on unsupervised learning, these metrics may not always reflect the true usefulness of the resulting clusters.
Example: Clustering Selection
from autoclustering import AutoClustering
from sklearn.datasets import load_digits
data, _ = load_digits(return_X_y=True)
clustering = AutoClustering(num_samples=50,
metric='validity_index',
n_jobs=-1,
verbose=0)
clustering.fit(data)
clustering.best_params_
clustering.best_score_
clustering.n_clusters_
clustering.best_estimator_
clustering.predict(data)
Changelog
See the changelog for notes on the changes of auto-clustering
Important links
Official source code repo: https://github.com/rodrigo-arenas/auto-clustering/
Download releases: https://pypi.org/project/auto-clustering/
Issue tracker: https://github.com/rodrigo-arenas/auto-clustering/issues
Stable documentation: https://auto-clustering.readthedocs.io/en/stable/
Source code
You can check the latest development version with the command:
git clone https://github.com/rodrigo-arenas/auto-clustering.git
Install the development dependencies:
pip install -r requirements.txt
Check the latest in-development documentation: https://auto-clustering.readthedocs.io/en/latest/
Contributing
Contributions are more than welcome!
There are several opportunities on the ongoing project, so please get in touch if you would like to help out. Make sure to check the current issues and also the Contribution guide.
Big thanks to the people who are helping with this project!
Testing
After installation, you can launch the test suite from outside the source directory:
pytest autoclustering/
Disclaimer
The library is still experimental and under heavy development
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file auto-clustering-0.1.0.dev0.tar.gz
.
File metadata
- Download URL: auto-clustering-0.1.0.dev0.tar.gz
- Upload date:
- Size: 7.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.9.6 requests/2.28.2 setuptools/65.6.3 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 83156be4b903063aaec869067175c142e5f24d9e5740ac2c9cfd7424fe109598 |
|
MD5 | 729fb3152a4c53c5d91cfc341d40071b |
|
BLAKE2b-256 | 21c0f676118d290e3134ee2f3338fa80696dfca47ed93af44acd834cdacb1078 |
File details
Details for the file auto_clustering-0.1.0.dev0-py3-none-any.whl
.
File metadata
- Download URL: auto_clustering-0.1.0.dev0-py3-none-any.whl
- Upload date:
- Size: 9.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.9.6 requests/2.28.2 setuptools/65.6.3 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8fe5933df15f33815442c37f864a6fb56e4fd96e77e3b4206cfeb69d0f4aaec4 |
|
MD5 | ae8c77a6394a84c0996a63c5cd4b79ee |
|
BLAKE2b-256 | 9ee75ade0d2427b46132c53924380a783a72aba618247dad03db557c038343aa |