Skip to main content

A beginner-friendly Python library that simplifies Exploratory Data Analysis (EDA) with AI-powered guide, and provides an interactive cheat-sheet for quick reference and tools for data visualization, cleaning and feature engineering.

Project description

๐Ÿง  pyedahelper - Simplify Your Exploratory Data Analysis (EDA)

pyedahelper is an educational and practical Python library designed to make Exploratory Data Analysis (EDA) simple, guided, and fast, especially for data analysts, students, and early-career data scientists who want to spend more time analyzing data and less time remembering syntax.

It's a lightweight, educational, and intelligent Python library that helps you perform Exploratory Data Analysis (EDA) faster โ€” with guided suggestions, ready-to-use utilities, and clean visualizations.

๐ŸŒŸ Key Features:

  • โšก A smart EDA cheat sheet (interactive and collapsible),
  • ๐Ÿ’ฌ AI-guided EDA assistant โ€” suggests the next logical step (e.g., โ€œView top rows with df.head()โ€).
  • ๐Ÿงฉ A suite of data tools for real-world EDA tasks (loading, cleaning, feature engineering, visualization, and summaries),
  • ๐Ÿ’ฌ Handy code hints and examples you can copy directly into your notebook.

๐ŸŒ Why pyedahelper?

Performing EDA often involves the use of numerous syntaxes to understand the dataset, it forces the narrative that good data professionals are those who know all the Python syntaxes by heart rather than those who can interprete accurately, the output of each of the EDA steps. And more importantly, Data Analysts spend more than 80% of their analytics time on iterative EDA, some of these hours spent checking documentary and Googling stuffs.

pyedahelper solves this by combining ready-to-use functions for your data workflow, AI-powered guide with inline learning โ€” you can see, learn, and apply the same steps.


โš™๏ธ Installation

pip install pyedahelper==1.0.2

Upgrade

pip install --upgrade pyedahelper

๐Ÿš€ Quick Start

import edahelper as eda
import pandas as pd

# Load your dataset
df = pd.read_csv("data.csv")

# ๐Ÿ“š Display the interactive EDA cheat-sheet
eda.show() -- for experienced analysts or
eda.core.show() -- for total newbies

# ๐Ÿ” Start guided suggestion
eda.next("read_csv")   # Suggests: "View first rows with df.head()"

# ๐Ÿ’ก View an example command with short explanation
eda.core.example("describe")

From there, the assistant automatically continues:

df.head() โ†’ df.columns โ†’ df.shape โ†’ df.info() โ†’ df.describe() โ†’ ...

If you want to skip a suggestion, simply type "Next".

๐Ÿ” Modules Overview

1๏ธโƒฃ EDA Guidance (AI Suggestion System)

The AI-powered step recommender helps complete beginners know what to do next.

Example flow:

eda.next("read_csv")   # Suggests df.head()
eda.next("head")       # Suggests df.columns
eda.next("columns")    # Suggests df.shape

It covers:

. Dataset overview (head, columns, shape, info, describe)

. Missing values (isnull, fillna, dropna)

. Data cleaning (duplicated, astype, replace)

. Visualization (plot_distribution, scatterplot, plot_correlation)

. Feature prep and modeling steps (label_encode, split, fit_model, predict)

5๏ธโƒฃ Visualization Module

Functions for exploring and visualizing data quickly.

from edahelper import visualization as vis

vis.plot_correlation(df)
vis.plot_distribution(df, "Age")
vis.scatter(df, "Age", "Income", hue="Gender")

๐ŸŽจ Uses matplotlib and seaborn under the hood for fast, clean plots.

๐Ÿ“˜ The Interactive Cheat-Sheet

When you forget a syntax, simply call:

eda.core.show()

โœจ Displays a colorful grouped guide of:

Data Loading Overview Missing Values Indexing & Grouping Visualization Feature Engineering NumPy & sklearn tips

๐Ÿง‘๐Ÿฝโ€๐Ÿ’ป Example Workflow

import pyedahelper as eda
import pandas as pd

# Load data
df = pd.read_csv("sales.csv")

# Start guided mode
eda.next("read_csv")    # Suggests df.head()
eda.next('head')        # Suggests df.info()

๐Ÿ“ฆ Project Structure

pyedahelper/
โ”‚
โ”œโ”€โ”€ __init__.py              # Main entrypoint
โ”œโ”€โ”€ core.py                  # Cheat-sheet + examples
โ”œโ”€โ”€ show.py                  # Display logic
โ”œโ”€โ”€ stats_summary.py         # Dataset summary helpers
โ”œโ”€โ”€ visualization.py         # Quick plots (hist, scatter, heatmap)
โ”œโ”€โ”€ nextstep.py              # AI-guided EDA assistant (eda.next)
โ””โ”€โ”€ __init__.py              # Exports unified functions

๐Ÿ›  Requirements

Python 3.8+ pandas numpy seaborn scikit-learn matplotlib rich (for colored terminal output)

๐Ÿงพ License

MIT License ยฉ 2025 Chidiebere Christopher Feel free to fork, contribute, or use it in your analytics workflow!

๐ŸŒŸ Contributing

We welcome contributions โ€” bug fixes, new EDA tools, or notebook examples.

  1. Fork the repo
  2. Create your feature branch (git checkout -b feature-name)
  3. Commit your changes
  4. Push and open a Pull Request ๐ŸŽ‰

๐Ÿ”— Links

๐Ÿ“ฆ PyPI: https://pypi.org/project/pyedahelper/ ๐Ÿ’ป GitHub: https://github.com/93Chidiebere/pyedahelper-Python-EDA-Helper โœ‰๏ธ Author: Chidiebere V. Christopher

๐Ÿš€ Learn. Explore. Analyze. Faster. pyedahelper โ€” your friendly companion for every EDA project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyedahelper-1.0.3.tar.gz (5.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyedahelper-1.0.3-py3-none-any.whl (4.6 kB view details)

Uploaded Python 3

File details

Details for the file pyedahelper-1.0.3.tar.gz.

File metadata

  • Download URL: pyedahelper-1.0.3.tar.gz
  • Upload date:
  • Size: 5.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for pyedahelper-1.0.3.tar.gz
Algorithm Hash digest
SHA256 dc682e552db3536a1c495eba166d16206280d575bdadebcec890a97f5fd83278
MD5 127b5867df6c89a50e3af89f79068a46
BLAKE2b-256 5c8ee37ae71d761a49497a4e161403569824c8b56ad03b24170f360eaf3a93d4

See more details on using hashes here.

File details

Details for the file pyedahelper-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: pyedahelper-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 4.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for pyedahelper-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 0cb6b86e211302be15646e0dc81e77eea35dfcda8ae7e894c6d8306d75d18567
MD5 8d347d623013166b102a5a8e35f368ac
BLAKE2b-256 4fc51a729fc23d0cc8552ec5761d9edebe44a130f1d9c7aef7f4c6e906e76df2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page