Skip to main content

Create explanation to dataframe

Project description

PD-EXPLAIN

PD-EXPLAIN is a Python library that wraps Pandas, allowing users to obtain multiple type of query explanations over Pandas DataFrames. PD-EXPLAIN is under active development, currently featuring interestingness based explanations, those are deviation-based explanations (for filter, join, and set operations) and explanations for high-variance group-by-and-aggregate operations. Both explainers utilizes the FEDEX system.

The system also supports aggregate outlier explanations, based on the SCORPION systems , and will soon fully support Boolean-query explanations based on this paper.

PD-EXPLAIN was demonstrated at VLDB '24.

Installation

Install pd-explain with pip or by git ssh

  pip install pd-explain
  
  pip install git+ssh://git@github.com/analysis-bots/pd-explain.git

For cloning this project use

git clone git@github.com:analysis-bots/pd-explain.git

cd pd_explain

pip install -r requirements.txt

Demo

Demo Spotify example

Demo Spotify example notebook - click to view

Documentation

Documentation

Citation Information

TBD

Authors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pd_explain-1.0.2.tar.gz (50.8 kB view details)

Uploaded Source

Built Distribution

pd_explain-1.0.2-py3-none-any.whl (45.1 kB view details)

Uploaded Python 3

File details

Details for the file pd_explain-1.0.2.tar.gz.

File metadata

  • Download URL: pd_explain-1.0.2.tar.gz
  • Upload date:
  • Size: 50.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pd_explain-1.0.2.tar.gz
Algorithm Hash digest
SHA256 61befd87c1acfaf6a690731463791efe98899f13404878b2e9551a37cd43d2ce
MD5 367c4ebb4184698c3c0d6bcd10569139
BLAKE2b-256 75ee4e78948e383a83fc68480a9b4971f54220c711901b9aa7189c9a6fe8dc06

See more details on using hashes here.

Provenance

The following attestation bundles were made for pd_explain-1.0.2.tar.gz:

Publisher: python-publish.yml on analysis-bots/pd-explain

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pd_explain-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: pd_explain-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 45.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pd_explain-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 05808d9119e4386ab68dff047cbec62e8db68fdee365f3144220693c48a3e643
MD5 775a39ce102f00095679c5d510d443ba
BLAKE2b-256 9166e1846d5016300fb77e74360e7566d32c1a4dce9a09c20c66f29610c8c867

See more details on using hashes here.

Provenance

The following attestation bundles were made for pd_explain-1.0.2-py3-none-any.whl:

Publisher: python-publish.yml on analysis-bots/pd-explain

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page