A k-means implementation using only NumPy

Project description

K-means clustering is an unsupervised data-analytics algorithm that partitions a dataset into K clusters. Because K must be chosen up front, it is often paired with a search for the number of clusters that best fits a given dataset.
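As a rough illustration of the clustering step, here is a minimal sketch of the standard Lloyd iteration in NumPy. It is not taken from this package: the function names `assign` and `kmeans`, the random initialisation, and the `n_iter` and `seed` parameters are all assumptions made for the example.

```python
import numpy as np

def assign(points, centroids):
    """Return, for each point, the index of its nearest centroid."""
    # Squared Euclidean distances via broadcasting, shape (n_points, k).
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

def kmeans(points, k, n_iter=100, seed=0):
    """Plain Lloyd iteration (illustrative); returns (centroids, labels)."""
    points = np.asarray(points, dtype=float)
    rng = np.random.default_rng(seed)
    # Assumption: start from k distinct data points picked at random.
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign(points, centroids)
        updated = centroids.copy()
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:
                # Move the centroid to the mean of its members;
                # an empty cluster keeps its previous centroid.
                updated[j] = members.mean(axis=0)
        if np.allclose(updated, centroids):
            break  # centroids stopped moving
        centroids = updated
    return centroids, assign(points, centroids)

data = np.array([[0, 0], [0, 1], [1, 0],
                 [10, 10], [10, 11], [11, 10]], dtype=float)
centroids, labels = kmeans(data, k=2)
```

On this toy input the two groups of three points end up in different clusters; which group receives label 0 depends on the random initialisation.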

This project implements a k-means clustering pipeline. It takes one or more dataset files, such as the one in the dataset folder, estimates the best K for each dataset, and writes each file name followed by its estimated K to an output text file.
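The description does not say how the best K is chosen. One common option, shown here purely as an assumption, is to run k-means for several candidate values and keep the K with the highest mean silhouette score. All names below (`sq_dists`, `farthest_first`, `kmeans_labels`, `silhouette`, `estimate_k`, `write_results`) are hypothetical, the deterministic farthest-first initialisation is a choice made for the sketch, and the space-separated output format is a guess at "file name followed by the estimated K".

```python
import numpy as np

def sq_dists(a, b):
    """Squared Euclidean distances between rows of a and rows of b."""
    return ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)

def farthest_first(points, k):
    """Assumed init: start at point 0, then repeatedly take the point
    farthest from all centroids chosen so far."""
    chosen = [0]
    nearest = sq_dists(points, points[[0]])[:, 0]
    for _ in range(1, k):
        idx = int(nearest.argmax())
        chosen.append(idx)
        nearest = np.minimum(nearest, sq_dists(points, points[[idx]])[:, 0])
    return points[chosen].copy()

def kmeans_labels(points, k, n_iter=100):
    """Lloyd iteration; returns the cluster label of each point."""
    centroids = farthest_first(points, k)
    for _ in range(n_iter):
        labels = sq_dists(points, centroids).argmin(axis=1)
        updated = centroids.copy()
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:
                updated[j] = members.mean(axis=0)
        if np.allclose(updated, centroids):
            break
        centroids = updated
    return sq_dists(points, centroids).argmin(axis=1)

def silhouette(points, labels):
    """Mean silhouette score; singleton clusters contribute 0."""
    dist = np.sqrt(sq_dists(points, points))
    clusters = np.unique(labels)
    if len(clusters) < 2:
        return 0.0
    scores = np.zeros(len(points))
    for i in range(len(points)):
        same = labels == labels[i]
        if same.sum() == 1:
            continue
        # a: mean distance to the other members of the same cluster.
        a = dist[i, same].sum() / (same.sum() - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(dist[i, labels == c].mean() for c in clusters if c != labels[i])
        if max(a, b) > 0:
            scores[i] = (b - a) / max(a, b)
    return float(scores.mean())

def estimate_k(points, k_max=10):
    """Pick the K in 2..k_max with the highest mean silhouette."""
    points = np.asarray(points, dtype=float)
    k_max = min(k_max, len(points) - 1)
    if k_max < 2:
        return 1  # too few points to compare candidate values
    scores = {k: silhouette(points, kmeans_labels(points, k))
              for k in range(2, k_max + 1)}
    return max(scores, key=scores.get)

def write_results(results, out_path):
    """Write one 'filename K' line per dataset (format is an assumption)."""
    with open(out_path, "w") as fh:
        for name, k in results:
            fh.write(f"{name} {k}\n")

blobs = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
                  [10, 0], [10, 1], [11, 0], [11, 1],
                  [5, 9], [5, 10], [6, 9], [6, 10]], dtype=float)
best_k = estimate_k(blobs, k_max=5)
```

The silhouette score cannot evaluate K = 1, so this sketch only distinguishes between two or more clusters. Other criteria, such as the elbow heuristic on within-cluster variance, would slot into `estimate_k` in the same way.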

Only the NumPy package may be used; all other packages are prohibited. Each line in a dataset file represents one n-dimensional data point.
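A file in this layout can be read with `np.loadtxt`. The sketch below assumes the coordinates on each line are separated by whitespace, which the description does not confirm; `load_points` is a hypothetical helper name, and an in-memory `io.StringIO` buffer stands in for a real dataset file.

```python
import io
import numpy as np

def load_points(source, delimiter=None):
    """Read one data point per line into an (n_points, n_dims) array.

    delimiter=None splits on whitespace (an assumption about the format);
    pass "," for comma-separated files.
    """
    # ndmin=2 keeps the result two-dimensional even for a one-line file.
    return np.loadtxt(source, delimiter=delimiter, ndmin=2)

sample = io.StringIO("1.0 2.0 3.0\n4.0 5.0 6.0\n")
points = load_points(sample)
print(points.shape)  # (2, 3)
```

`load_points` accepts a file path as well as an open file-like object, since `np.loadtxt` handles both.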

Download files

Files for kMeansBCMAssessment-cwildenb, version 0.3
Filename, size: kMeansBCMAssessment_cwildenb-0.3-py3-none-any.whl (4.5 kB); File type: Wheel; Python version: py3
Filename, size: kMeansBCMAssessment-cwildenb-0.3.tar.gz (3.1 kB); File type: Source; Python version: None
