
Exploratory Data Analytics tool for SQL




SQL Bridge Tool to Exploratory Data Analysis

edaSQL is a library that links SQL to Exploratory Data Analysis and to further steps in Data Engineering. It aims to address limitations of the SQL studios available on the market: you use SQL queries to fetch your table results, then explore them in Python.

Installation

Install the dependency packages before installing edaSQL:

pip install pyodbc
pip install ipython

Optional dependency for better visualization (Jupyter Notebook):

pip install notebook

Now install edaSQL from the official Python package using pip:

pip install edaSQL

(OR)

Clone this repository and run the following from the root directory to install:

python setup.py install

Documentation

Read the detailed documentation on readthedocs.io.

edaSQL Jupyter Notebook Tutorial

Import Packages

import edaSQL
import pandas as pd

1. Connect to the Database

edasql = edaSQL.SQL()
edasql.connectToDataBase(server='your server name', 
                         database='your database', 
                         user='username', 
                         password='password',
                         sqlDriver='ODBC Driver 17 for SQL Server')
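For orientation, the parameters above correspond to the fields of a standard ODBC connection string for SQL Server. The sketch below only assembles such a string. `build_connection_string` is a hypothetical helper written for this illustration, not part of edaSQL, and how edaSQL builds its connection internally is an assumption here.

```python
def build_connection_string(server, database, user, password,
                            driver="ODBC Driver 17 for SQL Server"):
    """Assemble an ODBC connection string (hypothetical helper, not edaSQL API)."""
    parts = {
        "DRIVER": "{" + driver + "}",  # driver names are wrapped in braces
        "SERVER": server,
        "DATABASE": database,
        "UID": user,
        "PWD": password,
    }
    return ";".join(f"{key}={value}" for key, value in parts.items())


conn_str = build_connection_string(
    "your server name", "your database", "username", "password"
)
# With pyodbc installed, a string of this shape can be passed to pyodbc.connect().
```

If the connection fails, checking that the driver name matches one installed on the machine is usually a good first step.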

2. Query Data

# Run the query over the open connection and load the result into a DataFrame
sampleQuery = "SELECT * FROM INX"
data = pd.read_sql(sampleQuery, edasql.dbConnection)

3. Data Overview

insights = edaSQL.EDA(dataFrame=data, HTMLDisplay=True)
dataInsights = insights.dataInsights()
deepInsights = insights.deepInsights()

4. Correlation

eda = edaSQL.EDA(dataFrame=data)
eda.pearsonCorrelation()
eda.spearmanCorrelation()
eda.kendallCorrelation()
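To make the three coefficients concrete, here is a minimal pure-Python sketch of what each one measures. It is an illustration only and makes no claim about how edaSQL computes them; the function names and sample data are hypothetical, and the Kendall variant shown is tau-a, which applies no correction for ties.

```python
import math


def pearson(xs, ys):
    """Linear correlation: covariance scaled by both standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Pearson correlation applied to the ranks of the data."""
    return pearson(average_ranks(xs), average_ranks(ys))


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total pairs, no tie correction."""
    n = len(xs)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (xs[i] - xs[j]) * (ys[i] - ys[j])
            score += (prod > 0) - (prod < 0)
    return score / (n * (n - 1) / 2)


x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]  # monotonic but not linear in x
r_pearson = pearson(x, y)
r_spearman = spearman(x, y)
r_kendall = kendall_tau_a(x, y)
```

On this sample the two rank-based coefficients equal 1 because the relationship is perfectly monotonic, while Pearson falls slightly below 1 because the relationship is not linear.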

5. Missing Values

eda.missingValuesPlot(plot='matrix')
eda.missingValuesPlot(plot='bar')
eda.missingValuesPlot(plot='heatmap')
eda.missingValuesPlot(plot='dendrogram')
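The plots above visualize where values are missing. As a rough sketch of the underlying idea, the snippet below counts missing entries per column for a small set of rows. The `missing_counts` helper and the sample rows are hypothetical, and treating both `None` and NaN as missing is an assumption about how SQL NULLs arrive in Python.

```python
import math


def is_missing(value):
    """Treat None and float NaN as missing (assumed NULL representations)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def missing_counts(rows):
    """Count missing entries per column across a list of row dictionaries."""
    columns = list(rows[0].keys())
    return {col: sum(1 for row in rows if is_missing(row.get(col)))
            for col in columns}


rows = [
    {"id": 1, "name": "a", "score": 3.5},
    {"id": 2, "name": None, "score": float("nan")},
    {"id": 3, "name": "c", "score": None},
]
counts = missing_counts(rows)
```

A bar plot of missing values is essentially a chart of counts like these, one bar per column.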

6. Outliers

eda.outliersVisualization(plot='box')
eda.outliersVisualization(plot='scatter')
outliers = eda.getOutliers()
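The source does not state which rule getOutliers applies. As one common possibility, the sketch below flags values outside 1.5 times the interquartile range, the same rule that box plots conventionally use for their whiskers. The `iqr_outliers` function and the sample values are hypothetical, and edaSQL may use a different criterion.

```python
import statistics


def iqr_outliers(values, factor=1.5):
    """Return values outside [Q1 - factor*IQR, Q3 + factor*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    lower, upper = q1 - factor * iqr, q3 + factor * iqr
    return [v for v in values if v < lower or v > upper]


values = [10, 12, 11, 13, 12, 11, 95]
flagged = iqr_outliers(values)
```

Raising `factor` makes the rule more tolerant, so fewer points are reported as outliers.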
