Skip to main content

Python package to mine association rules in datasets

Project description

ruleminer

Documentation image image License: MIT Ruff

Python package to discover association rules in Pandas DataFrames.

This package implements the code of the paper Discovering and ranking validation rules in supervisory data.

Features

Here is what the package does:

  • Generate human-readable validation rules using rule templates containing regular expressions and a Pandas DataFrame dataset

    • available functions: min, max, abs, quantile, sum, substr, split, count, sumif and countif
    • including parameters for metric filters and rule precisions (including XBRL tolerances)
  • Evaluate rules and calculate association rules metrics

    • available metrics: abs support, abs exceptions, confidence, support, added value, casual confidence, casual support, conviction, lift and rule power factor

Here are some examples of rule templates with regexes with which you can generate validation rules:

  • if ({"Type"} == ".") then ({"."} > 0)

  • if ({"."} > 0) then (({"."} == 0) & ({"."} > 0))*

  • (({"."} + {"."} + {"."}) == {"."})

  • ({"Own funds"} <= quantile({"Own funds"}, 0.95))

  • (substr({"Type"}, 0, 1) in ["a", "b"])

The first template generates (with the dataset described in the Usage section) rules like

  • if ({"Type"} == "non-life_insurer") then ({"TP-nonlife"} > 0)
  • if ({"Type"} == "life_insurer") then ({"TP-life"} > 0)

These generated validation rules can then be used to validate new datasets.

Contributors

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ruleminer-1.0.1.tar.gz (36.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ruleminer-1.0.1-py3-none-any.whl (39.2 kB view details)

Uploaded Python 3

File details

Details for the file ruleminer-1.0.1.tar.gz.

File metadata

  • Download URL: ruleminer-1.0.1.tar.gz
  • Upload date:
  • Size: 36.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ruleminer-1.0.1.tar.gz
Algorithm Hash digest
SHA256 44ea0495dd77a65da6c958a0635de7a2c01bb288687269a8981435b9bc883446
MD5 ad72d00f23c172a46bae293dec5cd69c
BLAKE2b-256 55b9f9cc716f53a03278c1f834387b09112c5d73cf3b47923c1ea691f989ae8e

See more details on using hashes here.

File details

Details for the file ruleminer-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: ruleminer-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 39.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ruleminer-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7c9234c7636e33f504a5b89c50a2b74c4aa5712076bb82e7e448ab46b938fb2e
MD5 c62abc0e3b84f683c8048b9959b63d1e
BLAKE2b-256 5bc55c2cced52a2a746dcc9ddef90731699bc2fe33946cb45de2372b94c78f24

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page