Skip to main content

Ultra-lightweight micro EDA (exploratory data analysis) tool for small datasets

Project description

microeda

microeda is an ultra-lightweight Python library for Exploratory Data Analysis (EDA) on small datasets. It provides quick insights into your data with minimal setup—detecting column types, summarizing distributions, spotting missing values, outliers, and exploring pairwise relationships.


✨ Features

  • Detect column types: numeric, categorical, boolean, datetime, text, or IDs
  • Summarize numeric columns: mean, std, quartiles, missing values, outliers
  • Summarize categorical columns: top values, unique counts
  • Summarize datetime columns: min, max, missing values
  • Quick text analysis: token counts, most frequent words
  • Missing value patterns and pairwise missing correlations
  • Outlier detection: IQR and Z-score methods
  • Pairwise hints for correlations and associations (Pearson, Cramer's V, Mutual Information)
  • Command-line interface (CLI) for generating reports in Markdown or HTML

📦 Installation

Install via PyPI:

pip install microeda

Or install from source:

git clone https://github.com/SaptarshiMondal123/microeda.git
cd microeda
pip install .

Usage

Python API

import pandas as pd
from microeda import analyze

df = pd.read_csv("your_data.csv")
report = analyze(df, name="My Dataset")

# View summaries
print(report["summaries"])
# View column types
print(report["column_types"])
# Missing values and pairwise hints
print(report["missingness"])
print(report["pairwise_hints"])

CLI

Generate a Markdown report directly from the terminal:

microeda path/to/data.csv --style md --out report.md

Options:

--style: md (Markdown) or html (HTML)

--out: output file path

Contributing

Contributions are welcome! Feel free to submit issues or pull requests.

  • Fork the repo

  • Create a new branch (git checkout -b feature-name)

  • Make your changes

  • Run tests (pytest)

  • Submit a pull request

License

MIT License © 2025 Saptarshi Mondal

Links

GitHub: https://github.com/SaptarshiMondal123/microeda

PyPI: https://pypi.org/project/microeda/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

microeda-0.2.0.tar.gz (10.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

microeda-0.2.0-py3-none-any.whl (10.6 kB view details)

Uploaded Python 3

File details

Details for the file microeda-0.2.0.tar.gz.

File metadata

  • Download URL: microeda-0.2.0.tar.gz
  • Upload date:
  • Size: 10.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for microeda-0.2.0.tar.gz
Algorithm Hash digest
SHA256 06b5767ce9db44adff6e2e6c521b228d3124cfd7394f91c900ba124782977724
MD5 4ea92bb61cd9de911e0c7f1d86bec184
BLAKE2b-256 3b89ede3933d325ee6f99199de8b7ac4a3dfdc1be0e61e4c6896ed7abc6e0c2b

See more details on using hashes here.

File details

Details for the file microeda-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: microeda-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 10.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for microeda-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a7d1853d7a77ef07db031e35507e3a83f1f6594998dac3cf4681046c4f59871f
MD5 87c3cc72491f4cb81a45297ef23ab615
BLAKE2b-256 5e65bc1d656be18102e97205e8bdde4399110e1f40c33c8ecc2998ec017a2e01

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page