An all-encompassing, automated toolkit for deep Exploratory Data Analysis and reporting.
Project description
Decyphr
An all-encompassing, automated toolkit for generating deep, professional, and interactive Deep Data Analysis reports with a single line of code. It's not just a normal EDA.
Decyphr is designed to accelerate the data science workflow by automating the tedious and time-consuming process of initial data exploration. It goes beyond basic profiling to provide deep statistical insights, advanced machine learning-driven analysis, and a stunning, presentation-ready report that is as beautiful as it is informative.
Key Features
Decyphr provides a comprehensive suite of analyses, intelligently triggered based on your data's characteristics:
Complete Overview: Instant summary of dataset shape, memory usage, variable types, and data quality warnings.
Deep Univariate Analysis: Detailed statistical profiles, histograms, and frequency charts for every variable.
Multivariate Analysis: Stunning, interactive heatmaps for both linear (Pearson) and non-linear (Phik) correlations.
Advanced Data Quality: Automatically detects constant columns, whitespace issues, and potential outliers using multiple methods (IQR, Isolation Forest).
Statistical Inference: Performs automated Hypothesis Testing (T-Tests, ANOVA, Chi-Squared) to uncover statistically significant relationships.
Machine Learning Insights: PCA: Analyzes dimensionality reduction possibilities. Clustering: Automatically finds hidden segments in your data using K-Means. Feature Importance: Trains a baseline model to identify the most predictive features when a target is provided. Explainable AI (XAI): Generates SHAP summary plots to explain how your features impact model predictions.
Specialized Analysis: Includes dedicated modules for Deep Text Analysis (Sentiment, NER, Topics), Time-Series Decomposition, and Geospatial Mapping.
Data Drift Detection: Compare two datasets to quantify changes in data distribution over time.
High-End Interactive Report: A beautiful, modern dashboard with a toggleable light/dark theme, responsive charts, and a professional UI/UX.
Quick Start
- Installation
!pip install decyphr
- Importing
import decyphr
- Running Analysis
decyphr.analyze(filepath="data/your_data.csv")
- Generating Report (Jupyter Notebook)
import os, webbrowser path = os.path.abspath('Reports/your_generated_report_name.html') webbrowser.open(f'file://{path}')
- Generating Report (Google Colab)
from google.colab import files files.download("reports/your_report_name.html")
Generated Your First Report
Create a Python script add the above code. Just point it to your dataset and let Decyphr do the rest.
Your professional, interactive HTML report will be automatically saved in a new decyphr_reports/ directory.
Project Structure
This project uses a highly modular "plugin" architecture to ensure it is robust, maintainable, and easy to extend. All analysis and visualization logic is separated into self-contained modules located in the src/decyphr/analysis_plugins/ directory.
Capabilities
While there is no end of this vast ocean but as of now Decyphr can process more than 1 lakh rows with over 1 lakh cells in less than 3 mins. Isn't that amazing !
Contributing
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated. Please feel free to fork the repo and create a pull request, or open an issue with suggestions.
License
Distributed under the MIT License. See LICENSE file for more information. Designed and Created by - Ayush Singh
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file decyphr-1.1.2.tar.gz.
File metadata
- Download URL: decyphr-1.1.2.tar.gz
- Upload date:
- Size: 85.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7054e9aebb49f663ca432dd4d88b43a561425cdf04beffaeafd41ccad55f387f
|
|
| MD5 |
acca820105850396da570333313b1626
|
|
| BLAKE2b-256 |
8477eac169721a67f7de24ef7b0de269983c90c65e86f34ccb2d2db865781700
|
File details
Details for the file decyphr-1.1.2-py3-none-any.whl.
File metadata
- Download URL: decyphr-1.1.2-py3-none-any.whl
- Upload date:
- Size: 129.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9a5d03813e041145e74b0f3dc8b6c244111df4f086117123e91942f5748cc81a
|
|
| MD5 |
b1c874ca2c45e2e9c3e83d3932fda106
|
|
| BLAKE2b-256 |
8a0450d42c589981780d403e0c6671e066e1b07dfe0d104869aa66b9b86009b3
|