Skip to main content

parameter-free clustering algorithm

Project description

TX-Means

TX-Means is a parameter-free clustering algorithm able to efficiently partitioning transactional data in a completely automatic way. TX-Means is designed for the case where clustering must be applied on a massive number of different datasets, for instance when a large set of users need to be analyzed individually and each of them has generated a long history of transactions.

In this repository we provide the source code of TX-Means, the clustering algorithm competitors and the dataset used in

Riccardo Guidotti, Anna Monreale, Mirco Nanni, Fosca Giannotti, Dino Pedreschi "Clustering Individual Transactional Data for Masses of Users", KDD 2017, 2017, Halifax, NS, Canada

Please cite the paper above if you use our code or dataets.

Where to get it

The source code is currently hosted on GitHub at: https://github.com/riccotti/TX-Means

How to install

pip install TXMeans

How to import (some examples)

from TXMeans.txmeans import TXmeans
from TXMeans.util import count_items, remap_items, sample_size (Util functions)
from TXMeans.util import basket_list_to_bitarray, basket_bitarray_to_list (Converting(Reverting) to(from) bitarray)
from TXMeans.datamanager import read_uci_data (Convert the data in nice basket format)
from TXMeans.validation_measures import delta_k, purity, normalized_mutual_info_score (Measure of Validation)
from TXMeans.util import jaccard_bitarray
Requirements:
  • python >= 3
  • numpy >= 1.10.1
  • pandas >= 0.18.1
  • scipy >= 0.17.1
  • bitarray >= 0.8.1
  • Java >= 8.1

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TXMeans-0.1.1.tar.gz (35.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

TXMeans-0.1.1-py3-none-any.whl (39.3 kB view details)

Uploaded Python 3

File details

Details for the file TXMeans-0.1.1.tar.gz.

File metadata

  • Download URL: TXMeans-0.1.1.tar.gz
  • Upload date:
  • Size: 35.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.2

File hashes

Hashes for TXMeans-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c717de6e1e0d19214f435844b4ca6f4d709e7d388ed5de5b12ec07adda7a2055
MD5 1a68d028fab1be2769180b21e01ef9d2
BLAKE2b-256 256d6e24c1f3bc376783c318abb9cfbe1b1ff1cf70625936ebac07606fcfe8aa

See more details on using hashes here.

File details

Details for the file TXMeans-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: TXMeans-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 39.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.2

File hashes

Hashes for TXMeans-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8d3c190af7a12a0e6fb7bb3e3695699c7d6c9673e3209b29a70dd666f8d1b8ff
MD5 893500dadce2fe8692593bc7925e2005
BLAKE2b-256 b4fd7c9a21d8e164275794392d6881418f68cebfd691e8c35231a4e50c81a2f5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page