No project description provided
Project description
Fedex Genarator
Introduction
FEDEx Genarator is a system that assists in the process of EDA (Exploratory Data Analysis) sessions. Based on FEDEx work (https://github.com/TAU-DB/FEDEx), it gives the user the option to generate NL explanations + Visualizations to their queries (Filter/GroupBy/Join) results.
How it works
FEDEx generator is forked from on FEDEx system, and offer new process to get explanation:
- The user users query (filter/groupby/join), and pass to FEDEx the input dataframe, output dataframe and the query parameters.
- FEDEx calculates an Interestingness Measure (that works well with the specific operation, for example Exceptionality measure for Filter and Join operations) for every column in the output dataframe (the query result)
- FEDEx finds the most interesting columns and partition them to set of rows.
- Then it finds the set-of-rows that affects the Interesingness measure result the most (from [2]).
- Now FEDEx takes the top columns and set-of-rows and generates meaningful explanations
For the full details, you can either view the code or read the FEDEx article which will be referenced here really soon:)
Example
In FEDEx example they used the spotify dataset from Kaggle.
The first operation of our user was SELECT * FROM Spotify WHERE popularity > 65;
The raw output (Snip) -
The generated explanation -
The second operation of the user was SELECT AVG(dancability), AVG(loudness) FROM [SELECT * FROM Spotify WHERE year >= 1990] GROUPBY year;
The raw output (Snip) -
The generated explanation -
Usage
Notice - This project was tested on python version 3.6-3.8.
First, you have to install the requirements - py -3 -m pip install -r requirements.txt
Secondly, you should install latex on your system (the explanations inside the graphs require that). Things will still work even without latex but the experince might be a bit inferior.
This fork created to work on some adjusments for a API that will allow users to use pandas and generate explanations without effort and without using additional dedicated API.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file fedex_generator-1.0.6.0.tar.gz.
File metadata
- Download URL: fedex_generator-1.0.6.0.tar.gz
- Upload date:
- Size: 52.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
562c649ae0fcad847528cf4f6a6a9df142c26a85f92afdd100c1b7d9d4a9afab
|
|
| MD5 |
ebad487f169c8cf78aca692cf3ebc879
|
|
| BLAKE2b-256 |
d8f916ae0e45b29b12775ea537104433322efd648c88d1dbe91b8fdd2ca09302
|
Provenance
The following attestation bundles were made for fedex_generator-1.0.6.0.tar.gz:
Publisher:
python-publish.yml on analysis-bots/FEDEx_Generator
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
fedex_generator-1.0.6.0.tar.gz -
Subject digest:
562c649ae0fcad847528cf4f6a6a9df142c26a85f92afdd100c1b7d9d4a9afab - Sigstore transparency entry: 347736201
- Sigstore integration time:
-
Permalink:
analysis-bots/FEDEx_Generator@c4a2a9fa7011e2fad8724fee4713cfc3d1697787 -
Branch / Tag:
refs/tags/v1.0.6 - Owner: https://github.com/analysis-bots
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@c4a2a9fa7011e2fad8724fee4713cfc3d1697787 -
Trigger Event:
release
-
Statement type:
File details
Details for the file fedex_generator-1.0.6.0-py3-none-any.whl.
File metadata
- Download URL: fedex_generator-1.0.6.0-py3-none-any.whl
- Upload date:
- Size: 59.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c8b7b7bc2f13b81dbbc5fbb3bb4080be17d4908de377ccc7daaff1a916693c40
|
|
| MD5 |
571b534c4d4c16b1aacfa6264d56cfd1
|
|
| BLAKE2b-256 |
ac25b288a001f0890a63cfe1c444f4873af550b4bc29aec96420c99aa5c8bfd6
|
Provenance
The following attestation bundles were made for fedex_generator-1.0.6.0-py3-none-any.whl:
Publisher:
python-publish.yml on analysis-bots/FEDEx_Generator
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
fedex_generator-1.0.6.0-py3-none-any.whl -
Subject digest:
c8b7b7bc2f13b81dbbc5fbb3bb4080be17d4908de377ccc7daaff1a916693c40 - Sigstore transparency entry: 347736203
- Sigstore integration time:
-
Permalink:
analysis-bots/FEDEx_Generator@c4a2a9fa7011e2fad8724fee4713cfc3d1697787 -
Branch / Tag:
refs/tags/v1.0.6 - Owner: https://github.com/analysis-bots
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@c4a2a9fa7011e2fad8724fee4713cfc3d1697787 -
Trigger Event:
release
-
Statement type: