A Kmeans implementation using only NumPy
Project description
K means clustering is often used as an unsupervised data-analytics algorithm meant to find the ideal number of possible classes in a given dataset.
This project implements a k-means clustering algorithm pipeline that takes in dataset file(s) such as the one found in the dataset folder and computes the best K for each dataset and outputs into another text file the file name followed by the estimated K for each one.
Allowed only to use numpy package, all other packages are prohibited. Each line in the dataset file represent 1, n dimensional datapoint.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for kMeansBCMAssessment-cwildenb-0.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 55ef852e4f65ef9ee86260ff9bfc326920e4004b28f47af75a0be18782bd2b34 |
|
MD5 | 09cfce13a3f0f839fc33fc64be3e8d14 |
|
BLAKE2b-256 | 2a89fc602f3eee0f47ba369afc687d19d9a352e232933675f398b4d96c5e4fe8 |
Close
Hashes for kMeansBCMAssessment_cwildenb-0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3bb7a02d806791cc18621f1310aafd9db11d86dbbf88adb0e0226948a421b608 |
|
MD5 | 16b5160408b4fca31acae2c5540c2bb2 |
|
BLAKE2b-256 | 2835a1a62afced07117172fe57389ff0a07e7187c4aaa36362c4c5263a000235 |