Skip to main content

Create explanation to dataframe

Project description

PD-EXPLAIN

PD-EXPLAIN is a Python library that wraps Pandas, allowing users to obtain multiple type of query explanations over Pandas DataFrames. PD-EXPLAIN is under active development, currently featuring interestingness based explanations, those are deviation-based explanations (for filter, join, and set operations) and explanations for high-variance group-by-and-aggregate operations. Both explainers utilizes the FEDEX system.

The system also supports aggregate outlier explanations, based on the SCORPION systems , and will soon fully support Boolean-query explanations based on this paper.

PD-EXPLAIN was demonstrated at VLDB '24.

Installation

Install pd-explain with pip or by git ssh

  pip install pd-explain
  
  pip install git+ssh://git@github.com/analysis-bots/pd-explain.git

For cloning this project use

git clone git@github.com:analysis-bots/pd-explain.git

cd pd_explain

pip install -r requirements.txt

Demo

Demo Spotify example

Demo Spotify example notebook - click to view

Documentation

Documentation

Citation Information

TBD

Authors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pd_explain-1.1.0.tar.gz (128.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pd_explain-1.1.0-py3-none-any.whl (133.6 kB view details)

Uploaded Python 3

File details

Details for the file pd_explain-1.1.0.tar.gz.

File metadata

  • Download URL: pd_explain-1.1.0.tar.gz
  • Upload date:
  • Size: 128.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pd_explain-1.1.0.tar.gz
Algorithm Hash digest
SHA256 7eae81ac4bce7642dbd48596df54263cd39ca38c16a994dc460c5a18b298b21f
MD5 2c070da05f2b0ce520852017fbecb6fc
BLAKE2b-256 757c41f285f3c2754c759de1e428fae30ebe2b80b287c1459ac837b9e5e835a4

See more details on using hashes here.

Provenance

The following attestation bundles were made for pd_explain-1.1.0.tar.gz:

Publisher: python-publish.yml on analysis-bots/pd-explain

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pd_explain-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: pd_explain-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 133.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pd_explain-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b7fd6d13d8b8255c96bb094ef756ac20ba5a7c0918139e2a9569b6bb6f25695d
MD5 7c626800750edb12da71d92461f4c85e
BLAKE2b-256 490352fb16d5a4a02bbcdfbafec9681da28357cce412305e5ae280f7202f0b46

See more details on using hashes here.

Provenance

The following attestation bundles were made for pd_explain-1.1.0-py3-none-any.whl:

Publisher: python-publish.yml on analysis-bots/pd-explain

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page