Skip to main content

A production-ready Python library for one-line data cleaning, EDA, and beautiful HTML reporting.

Project description

SagarAnalysis

A production-ready, beginner-friendly Python package designed for one-line data cleaning, one-line Exploratory Data Analysis (EDA), and stunning, glassmorphic HTML reporting.

Features

  • One-line Cleaning (clean(df)):
    • Automatically standardizes column names to snake_case.
    • Removes duplicate rows.
    • Detects and drops constant/highly missing columns.
    • Imputes missing values intelligently (median/mean/mode/unknown).
    • Handles numerical outliers via clipping.
    • Coerces data types (string to numeric, string to datetime, Yes/No to boolean).
  • One-line EDA & Report (analysis(df)):
    • Generates a comprehensive overview of the dataset.
    • Performs descriptive statistics and missingness analysis.
    • Discovers high correlations and skewness.
    • Auto-detects the target column and runs target profiling.
  • Stunning HTML Reports (report.save()):
    • Interactive, modern Glassmorphism design.
    • Supports light/dark theme toggles.
    • Fully responsive layout with embedded Base64 charts.
    • Searchable variables, collapsible details, and clear data insights.

Installation

pip install sagaranalysis

Quick Start

import pandas as pd
import sagaranalysis as sa

# Load your dirty dataset
df = pd.read_csv("dirty_data.csv")

# Clean your data in one line
cleaned_df = sa.clean(df)

# Perform automatic EDA and generate insights
report = sa.analysis(cleaned_df)

# View or save the report
report.show()                  # Opens in local browser or renders in Jupyter Notebook
report.save("report.html")     # Saves self-contained report to disk

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sagaranalysis-0.2.0.tar.gz (33.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sagaranalysis-0.2.0-py3-none-any.whl (33.7 kB view details)

Uploaded Python 3

File details

Details for the file sagaranalysis-0.2.0.tar.gz.

File metadata

  • Download URL: sagaranalysis-0.2.0.tar.gz
  • Upload date:
  • Size: 33.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for sagaranalysis-0.2.0.tar.gz
Algorithm Hash digest
SHA256 7ae851b1cdbd3f257229e88bd42dd65f07a20893fa3ac454b2dddd708c7568eb
MD5 a7149c3dba977e236187ee4b419302fc
BLAKE2b-256 72e5a331b5d5e91b1974d19938995261ac3de986b80693793a136c57b7e0658b

See more details on using hashes here.

File details

Details for the file sagaranalysis-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: sagaranalysis-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 33.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for sagaranalysis-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b917fbc87f70fafa6d9845f0a8713b04f6b9c17b057b35d73df3bd9b44b1a26e
MD5 4ea6e8b026ae1a8206c8dc7038527be9
BLAKE2b-256 c8a8432b1c7bf1841c227df64998d774cb0412fb908c1433792245f28dfd7488

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page