Skip to main content

No project description provided

Project description

DESCRIPTION

Documentation Status

python-cluster is a “simple” package that allows to create several groups (clusters) of objects from a list. It’s meant to be flexible and able to cluster any object. To ensure this kind of flexibility, you need not only to supply the list of objects, but also a function that calculates the similarity between two of those objects. For simple datatypes, like integers, this can be as simple as a subtraction, but more complex calculations are possible. Right now, it is possible to generate the clusters using a hierarchical clustering and the popular K-Means algorithm. For the hierarchical algorithm there are different “linkage” (single, complete, average and uclus) methods available.

Algorithms are based on the document found at http://www.elet.polimi.it/upload/matteucc/Clustering/tutorial_html/

USAGE

A simple python program could look like this:

>>> from cluster import HierarchicalClustering
>>> data = [12,34,23,32,46,96,13]
>>> cl = HierarchicalClustering(data, lambda x,y: abs(x-y))
>>> cl.getlevel(10)     # get clusters of items closer than 10
[96, 46, [12, 13, 23, 34, 32]]
>>> cl.getlevel(5)      # get clusters of items closer than 5
[96, 46, [12, 13], 23, [34, 32]]

Note, that when you retrieve a set of clusters, it immediately starts the clustering process, which is quite complex. If you intend to create clusters from a large dataset, consider doing that in a separate thread.

For K-Means clustering it would look like this:

>>> from cluster import KMeansClustering
>>> cl = KMeansClustering([(1,1), (2,1), (5,3), ...])
>>> clusters = cl.getclusters(2)

The parameter passed to getclusters is the count of clusters generated.

Documentation Status

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cluster-1.3.1.tar.gz (40.5 kB view details)

Uploaded Source

Built Distribution

cluster-1.3.1-py2.py3-none-any.whl (20.2 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file cluster-1.3.1.tar.gz.

File metadata

  • Download URL: cluster-1.3.1.tar.gz
  • Upload date:
  • Size: 40.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for cluster-1.3.1.tar.gz
Algorithm Hash digest
SHA256 8ed5822f1b61b49257927fc6e74d4bad0b76c9a5491f2515f622a9c0a2f88167
MD5 7a904b5b4b6c34cdb949baec358f2dfd
BLAKE2b-256 3bd20067825452d6a49a5f163d1047ba5497f76b6d80b4a95cdd800c6f9021a6

See more details on using hashes here.

File details

Details for the file cluster-1.3.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for cluster-1.3.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 665f337ddd7157ce4af208818f66d8405b976468dd902dd6b29057819c834c19
MD5 2d01804cb6ef6e5c4c1d278bcc3fc649
BLAKE2b-256 c63ff7787f73464aed0febcf0a3d2ab920fd9397731a9e333bdaec47cadf1904

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page