Skip to main content

Python package to mine association rules in datasets

Project description

ruleminer

Documentation image image License: MIT Ruff

Python package to discover association rules in Pandas DataFrames.

This package implements the code of the paper Discovering and ranking validation rules in supervisory data.

Features

Here is what the package does:

  • Generate human-readable validation rules using rule templates containing regular expressions and a Pandas DataFrame dataset

    • available functions: min, max, abs, quantile, sum, substr, split, count, sumif and countif
    • including parameters for metric filters and rule precisions (including XBRL tolerances)
  • Evaluate rules and calculate association rules metrics

    • available metrics: abs support, abs exceptions, confidence, support, added value, casual confidence, casual support, conviction, lift and rule power factor

Here are some examples of rule templates with regexes with which you can generate validation rules:

  • if ({"Type"} == ".") then ({"."} > 0)

  • if ({"."} > 0) then (({"."} == 0) & ({"."} > 0))*

  • (({"."} + {"."} + {"."}) == {"."})

  • ({"Own funds"} <= quantile({"Own funds"}, 0.95))

  • (substr({"Type"}, 0, 1) in ["a", "b"])

The first template generates (with the dataset described in the Usage section) rules like

  • if ({"Type"} == "non-life_insurer") then ({"TP-nonlife"} > 0)
  • if ({"Type"} == "life_insurer") then ({"TP-life"} > 0)

These generated validation rules can then be used to validate new datasets.

Contributors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ruleminer-0.2.6.tar.gz (23.9 kB view details)

Uploaded Source

Built Distribution

ruleminer-0.2.6-py3-none-any.whl (25.5 kB view details)

Uploaded Python 3

File details

Details for the file ruleminer-0.2.6.tar.gz.

File metadata

  • Download URL: ruleminer-0.2.6.tar.gz
  • Upload date:
  • Size: 23.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for ruleminer-0.2.6.tar.gz
Algorithm Hash digest
SHA256 9d3a1ebd89c16414df97bb2e6c723fcbc27a3c578a0e82fb14a2e10311a2ebdc
MD5 d86f9d08e91f08bab378e2cd3bda73eb
BLAKE2b-256 d5b7deb866b07d43fb28637deaf00da0b450b3a92207bc88211adaa52e9927ee

See more details on using hashes here.

File details

Details for the file ruleminer-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: ruleminer-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 25.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for ruleminer-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 238b9c42aa905d0aa9e5f11cbce39d5550c0a3d082c6ef80798af6aaeb1808f1
MD5 1206cc85ee6c362bd00ae6453a9736e0
BLAKE2b-256 d315518504ab575da2c6d382e98bfa29e061c607b9d6e4d9535ee3ea0510cf66

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page