Exploratory Data Analytics tool for SQL
Project description
SQL Bridge Tool to Exploratory Data Analysis
edaSQL is a library to link SQL to Exploratory Data Analysis and further more in the Data Engineering. This will solve many limitations in the SQL studios available in the market. Use the SQL Query language to get your Table Results.
Installation
Install dependency Packages before installing edaSQL
pip install pyodbc
pip install ipython
Optional dependency for better visualization - Jupyter Notebook
pip install notebook
Now Install using pip . Offical Python Package Here!!
pip install edaSQL
(OR)
Clone this Repository. Run this from the root directory to install
python setup.py install
Documentation
Read the detailed documentation in readthedocs.io
edaSQL Jupyter NoteBook Tutorial
Import Packages
import edaSQL
import pandas as pd
1. Connect to the DataBase
edasql = edaSQL.SQL()
edasql.connectToDataBase(server='your server name',
database='your database',
user='username',
password='password',
sqlDriver='ODBC Driver 17 for SQL Server')
2. Query Data
sampleQuery = "select * from INX"
data = pd.read_sql(sampleQuery, edasql.dbConnection)
3. Data Overview
insights = edaSQL.EDA(dataFrame=data,HTMLDisplay=True)
dataInsights =insights.dataInsights()
deepInsights = insights.deepInsights()
4. Correlation
eda = edaSQL.EDA(dataFrame=data)
eda.pearsonCorrelation()
eda.spearmanCorrelation()
eda.kendallCorrelation()
5. Missing Values
eda.missingValuesPlot(plot ='matrix')
eda.missingValuesPlot(plot ='bar')
eda.missingValuesPlot(plot ='heatmap')
eda.missingValuesPlot(plot ='dendrogram')
6. Outliers
eda.outliersVisualization(plot = 'box')
eda.outliersVisualization(plot = 'scatter')
outliers = eda.getOutliers()
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file edaSQL-0.0.1.5.tar.gz
.
File metadata
- Download URL: edaSQL-0.0.1.5.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | db955cec6eeff7a6fc892b29b8ae8a4dbf96d084a1e1c211890f8b67b247e251 |
|
MD5 | 0289dc22c8b3f42f5a583cf9a777a212 |
|
BLAKE2b-256 | 628d8df8941f4b3215fc8d919b0121743d1bae8be3270dbccfdfd82f0608f806 |
File details
Details for the file edaSQL-0.0.1.5-py3-none-any.whl
.
File metadata
- Download URL: edaSQL-0.0.1.5-py3-none-any.whl
- Upload date:
- Size: 5.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ac62a762a217a3e56f66f1ccea029d4f38908e47cf32f0be4b93ca2c8ed887a0 |
|
MD5 | 1631eb7f5322439f022177d9b1c99bc3 |
|
BLAKE2b-256 | 861db0ba54d1726f1f9b1818fe4fd00f0bea74c0c43f0a1d329fee4b2f5e1708 |