Skip to main content

Python package to mine association rules in datasets

Project description

ruleminer

Documentation image image License: MIT Ruff

Python package to discover association rules in Pandas DataFrames.

This package implements the code of the paper Discovering and ranking validation rules in supervisory data.

Features

Here is what the package does:

  • Generate human-readable validation rules using rule templates containing regular expressions and a Pandas DataFrame dataset

    • available functions: min, max, abs, quantile, sum, substr, split, count, sumif and countif
    • including parameters for metric filters and rule precisions (including XBRL tolerances)
  • Evaluate rules and calculate association rules metrics

    • available metrics: abs support, abs exceptions, confidence, support, added value, casual confidence, casual support, conviction, lift and rule power factor

Here are some examples of rule templates with regexes with which you can generate validation rules:

  • if ({"Type"} == ".") then ({"."} > 0)

  • if ({"."} > 0) then (({"."} == 0) & ({"."} > 0))*

  • (({"."} + {"."} + {"."}) == {"."})

  • ({"Own funds"} <= quantile({"Own funds"}, 0.95))

  • (substr({"Type"}, 0, 1) in ["a", "b"])

The first template generates (with the dataset described in the Usage section) rules like

  • if ({"Type"} == "non-life_insurer") then ({"TP-nonlife"} > 0)
  • if ({"Type"} == "life_insurer") then ({"TP-life"} > 0)

These generated validation rules can then be used to validate new datasets.

Contributors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ruleminer-0.2.8.tar.gz (24.8 kB view details)

Uploaded Source

Built Distribution

ruleminer-0.2.8-py3-none-any.whl (26.5 kB view details)

Uploaded Python 3

File details

Details for the file ruleminer-0.2.8.tar.gz.

File metadata

  • Download URL: ruleminer-0.2.8.tar.gz
  • Upload date:
  • Size: 24.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for ruleminer-0.2.8.tar.gz
Algorithm Hash digest
SHA256 fb88a2a4d3b79a6780ad3dc6016ae1eeef24b4df20e87b057f9a2eda55851474
MD5 4d3aedad122a6a0d9412d1b7082c0d20
BLAKE2b-256 8059faafc85f634dbc0f044cbbfba4de7405d1ee490556ad188eed068f176a6a

See more details on using hashes here.

File details

Details for the file ruleminer-0.2.8-py3-none-any.whl.

File metadata

  • Download URL: ruleminer-0.2.8-py3-none-any.whl
  • Upload date:
  • Size: 26.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for ruleminer-0.2.8-py3-none-any.whl
Algorithm Hash digest
SHA256 5362e4a9a3b0a9a469475ae64964972aed5688d32f447eb04aa8e81dff349869
MD5 25426268854296b7598ea00e26952b7f
BLAKE2b-256 fabed7ec4df0ee0fc34d143a248d6c796e7adba54b35a36b1b33f843ee210f53

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page