A k-means implementation using only NumPy
Project description
K-means clustering is a widely used unsupervised algorithm that partitions a dataset into K clusters. Combined with a model-selection criterion, it can also be used to estimate how many classes a dataset contains.
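As an illustration of the clustering step, here is a minimal sketch of Lloyd's algorithm, the standard k-means iteration, written with NumPy only. It is not taken from this package's source: the function name `kmeans`, its parameters, and the random initialization are assumptions made for the example.

```python
import numpy as np


def kmeans(points, k, n_iter=100, seed=0):
    """Hypothetical helper: Lloyd's algorithm on an (n_points, n_dims) array.

    Returns (centroids, labels, inertia), where inertia is the sum of squared
    distances from each point to its nearest centroid.
    """
    points = np.asarray(points, dtype=float)
    rng = np.random.default_rng(seed)
    # Assumption: initialize from k distinct data points chosen at random.
    centroids = points[rng.choice(len(points), size=k, replace=False)]

    for _ in range(n_iter):
        # Squared Euclidean distances, shape (n_points, k).
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)

        new_centroids = centroids.copy()
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # an empty cluster keeps its previous centroid
                new_centroids[j] = members.mean(axis=0)

        if np.allclose(new_centroids, centroids):
            break  # assignments are stable, so the iteration has converged
        centroids = new_centroids

    # Recompute assignments and inertia against the final centroids.
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    labels = dists.argmin(axis=1)
    inertia = dists.min(axis=1).sum()
    return centroids, labels, inertia
```

Calling `kmeans(points, 3)` on an array of shape `(n_points, n_dims)` returns three centroids, one label per point, and the total inertia. Because the result depends on the random starting centroids, a real pipeline would likely run several seeds and keep the lowest-inertia result.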
This project implements a k-means clustering pipeline that takes one or more dataset files, such as the one in the dataset folder, and computes the best K for each dataset. It then writes each file name, followed by its estimated K, to a separate text file.
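The description does not say how the best K is chosen. One common option is the elbow heuristic: run k-means for K = 1, 2, 3, ... and pick the K at which the inertia curve bends most sharply. The sketch below assumes that approach; `pick_elbow` is a hypothetical helper, not part of this package, and it expects the inertia values to have been computed already.

```python
import numpy as np


def pick_elbow(inertias):
    """Hypothetical helper: inertias[i] is the k-means inertia for K = i + 1.

    Returns the K with the largest discrete second difference of the curve.
    """
    inertias = np.asarray(inertias, dtype=float)
    if len(inertias) < 3:
        raise ValueError("need inertia values for at least K = 1, 2, 3")
    # Second difference I[K-1] - 2*I[K] + I[K+1], defined for K = 2 .. K_max - 1.
    curvature = np.diff(inertias, n=2)
    return int(curvature.argmax()) + 2


# Inertia drops sharply until K = 3 and then flattens out.
print(pick_elbow([133.3, 50.0, 0.5, 0.4]))  # 3
```

This heuristic can never return K = 1 or the largest K tried, since the second difference is undefined at the ends of the curve. Other criteria, such as the silhouette score or the gap statistic, are equally plausible choices.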
Only the NumPy package may be used; all other packages are prohibited. Each line in a dataset file represents one n-dimensional data point.
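The file handling around this format might look like the sketch below. It makes several assumptions that the description does not confirm: coordinates on each line are whitespace-separated, the output uses one `file name K` pair per line separated by a space, and the names `load_points`, `write_estimates`, and `estimate_k` are hypothetical.

```python
import os

import numpy as np


def load_points(path):
    """Read one data point per line into an (n_points, n_dims) array."""
    # Assumption: whitespace-separated values. ndmin=2 keeps the array 2-D
    # even when the file contains a single line.
    return np.loadtxt(path, ndmin=2)


def write_estimates(paths, estimate_k, out_path):
    """Write '<file name> <estimated K>' for each dataset file.

    estimate_k is any callable that maps a point array to an integer K.
    """
    with open(out_path, "w") as out:
        for path in paths:
            points = load_points(path)
            out.write(f"{os.path.basename(path)} {estimate_k(points)}\n")
```

Passing the estimator in as a callable keeps the file handling independent of whichever criterion is used to choose K.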
Hashes for kMeansBCMAssessment-cwildenb-0.3.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 2cd522ecf3a1b19514c4cfb05b1c4d75d177b58f3f1c808d058f935b170502b0
MD5 | 8b31c365d96c25baf70e7169ef4fc8ed
BLAKE2b-256 | 0b3e936f57a9bc82d0cfb158e0fa878820bf6dccb55d1d8eb13388b0fcef83d4
Hashes for kMeansBCMAssessment_cwildenb-0.3-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 8f8f46cede5122fbe4c321f7c58724f4bb8dffc04ac18a9455e91d2383755af3
MD5 | cfd9c88549f93b7d1b0a02cc5528623c
BLAKE2b-256 | 191b9fce3d2fdfd4a7a4eb19e8f9daf244ca60c4885a4e108c4bb519275ca181