Skip to main content

No project description provided

Project description

Fedex Genarator

Introduction

FEDEx Genarator is a system that assists in the process of EDA (Exploratory Data Analysis) sessions. Based on FEDEx work (https://github.com/TAU-DB/FEDEx), it gives the user the option to generate NL explanations + Visualizations to their queries (Filter/GroupBy/Join) results.

How it works

FEDEx generator is forked from on FEDEx system, and offer new process to get explanation:

  1. The user users query (filter/groupby/join), and pass to FEDEx the input dataframe, output dataframe and the query parameters.
  2. FEDEx calculates an Interestingness Measure (that works well with the specific operation, for example Exceptionality measure for Filter and Join operations) for every column in the output dataframe (the query result)
  3. FEDEx finds the most interesting columns and partition them to set of rows.
  4. Then it finds the set-of-rows that affects the Interesingness measure result the most (from [2]).
  5. Now FEDEx takes the top columns and set-of-rows and generates meaningful explanations

For the full details, you can either view the code or read the FEDEx article which will be referenced here really soon:)

Example

In FEDEx example they used the spotify dataset from Kaggle. The first operation of our user was SELECT * FROM Spotify WHERE popularity > 65;

The raw output (Snip) -

Filter output

The generated explanation -

Filter explanation

The second operation of the user was SELECT AVG(dancability), AVG(loudness) FROM [SELECT * FROM Spotify WHERE year >= 1990] GROUPBY year;

The raw output (Snip) -

GroupBy output

The generated explanation -

GroupBy explanation

Usage

Notice - This project was tested on python version 3.6-3.8.

First, you have to install the requirements - py -3 -m pip install -r requirements.txt

Secondly, you should install latex on your system (the explanations inside the graphs require that). Things will still work even without latex but the experince might be a bit inferior.

This fork created to work on some adjusments for a API that will allow users to use pandas and generate explanations without effort and without using additional dedicated API.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fedex_generator-1.0.6.0.tar.gz (52.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fedex_generator-1.0.6.0-py3-none-any.whl (59.0 kB view details)

Uploaded Python 3

File details

Details for the file fedex_generator-1.0.6.0.tar.gz.

File metadata

  • Download URL: fedex_generator-1.0.6.0.tar.gz
  • Upload date:
  • Size: 52.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for fedex_generator-1.0.6.0.tar.gz
Algorithm Hash digest
SHA256 562c649ae0fcad847528cf4f6a6a9df142c26a85f92afdd100c1b7d9d4a9afab
MD5 ebad487f169c8cf78aca692cf3ebc879
BLAKE2b-256 d8f916ae0e45b29b12775ea537104433322efd648c88d1dbe91b8fdd2ca09302

See more details on using hashes here.

Provenance

The following attestation bundles were made for fedex_generator-1.0.6.0.tar.gz:

Publisher: python-publish.yml on analysis-bots/FEDEx_Generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file fedex_generator-1.0.6.0-py3-none-any.whl.

File metadata

File hashes

Hashes for fedex_generator-1.0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c8b7b7bc2f13b81dbbc5fbb3bb4080be17d4908de377ccc7daaff1a916693c40
MD5 571b534c4d4c16b1aacfa6264d56cfd1
BLAKE2b-256 ac25b288a001f0890a63cfe1c444f4873af550b4bc29aec96420c99aa5c8bfd6

See more details on using hashes here.

Provenance

The following attestation bundles were made for fedex_generator-1.0.6.0-py3-none-any.whl:

Publisher: python-publish.yml on analysis-bots/FEDEx_Generator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page