Skip to main content

A package to create a deterministic classifier based on Zipf law

Project description

travis coveralls sonar_quality sonar_maintainability Maintainability pip

Introduction

ZipfClassifier is a classifier that, even though in principle usable on any distribution, leverages the assumption that some kind of datasets su as:

follow the Zipf law.

Dependecies

ZipfClassifier uses zipf, another package o’ mine. I also suggest to use dictances for the metrics used in classification.

Installation

pip install zipf_classifier

Working examples and explanation

A jupyter notebook is available with a full explanation, three working examples and respective link to datasets.

License

This package is licensed under MIT license.

FAQs

Frequenctly asked questions down below.

Generally, which metric do you suggest?

Experimental analysis suggests that, in particular when the learning set distributions contain a significant greater number of events than the distribution from the document you are trying to classify, the intersection_squared_hellinger seemed to work best.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zipf_classifier-1.2.0.tar.gz (5.0 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page