Skip to main content

Python package to mine association rules in datasets

Project description

ruleminer

Documentation image image License: MIT Ruff

Python package to discover association rules in Pandas DataFrames.

This package implements the code of the paper Discovering and ranking validation rules in supervisory data.

Features

Here is what the package does:

  • Generate human-readable validation rules using rule templates containing regular expressions and a Pandas DataFrame dataset

    • available functions: min, max, abs, quantile, sum, substr, split, count, sumif and countif
    • including parameters for metric filters and rule precisions (including XBRL tolerances)
  • Evaluate rules and calculate association rules metrics

    • available metrics: abs support, abs exceptions, confidence, support, added value, casual confidence, casual support, conviction, lift and rule power factor

Here are some examples of rule templates with regexes with which you can generate validation rules:

  • if ({"Type"} == ".") then ({"."} > 0)

  • if ({"."} > 0) then (({"."} == 0) & ({"."} > 0))*

  • (({"."} + {"."} + {"."}) == {"."})

  • ({"Own funds"} <= quantile({"Own funds"}, 0.95))

  • (substr({"Type"}, 0, 1) in ["a", "b"])

The first template generates (with the dataset described in the Usage section) rules like

  • if ({"Type"} == "non-life_insurer") then ({"TP-nonlife"} > 0)
  • if ({"Type"} == "life_insurer") then ({"TP-life"} > 0)

These generated validation rules can then be used to validate new datasets.

Contributors

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ruleminer-1.0.2.tar.gz (36.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ruleminer-1.0.2-py3-none-any.whl (39.4 kB view details)

Uploaded Python 3

File details

Details for the file ruleminer-1.0.2.tar.gz.

File metadata

  • Download URL: ruleminer-1.0.2.tar.gz
  • Upload date:
  • Size: 36.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ruleminer-1.0.2.tar.gz
Algorithm Hash digest
SHA256 ea63bcaec331674f7de7c73e8a88b6835ee9eca279c7452a2034e0562c3d166c
MD5 d1412367aad09dfe8a792eb0b6107604
BLAKE2b-256 5217477ff50f1ccc7f59f1c4ddad60b6ce9f6b406995706d0d0093d61b5b52c2

See more details on using hashes here.

File details

Details for the file ruleminer-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: ruleminer-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 39.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ruleminer-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8852b44fa1efd209b653e5345577209e21ef616cd5dc5a04fe066bdabd163f26
MD5 fe7c09733822251703da1abdda8527e2
BLAKE2b-256 3c8729ec1e278b2d64ac9cea2b67bfaa3a31b5a6c434bbfb7f5112f05b2cbeeb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page