No project description provided
Project description
TidyPeek
TidyPeek is a lightweight CLI tool for quick CSV sanity checks.
It helps you inspect a dataset's structure, missing values, duplicates, likely identifier columns, and basic actionable insights - all from the terminal.
Features
- View total rows and columns
- Inspect column names and data types
- Check missing values per column
- Detect exact duplicate rows
- Detect likely identifier columns
- Check duplicate values in likely ID Columns
- Check duplicate email values in email-like columns
- Generate simple insights based on data quality issues
Installation
pip install tidypeek
🚀 Usage
Run with FilePath
tidypeek path/to/file.csv
Example:
tidypeek "/home/user/Datasets/UFC Fighters/UFCFighters.csv"
Run without a file path:
tidypeek
⚠️ Note
If your filepath consists of spaces, wrap it up in quotes:
tidypeek "/path/to/My Dataset/data.csv"
🧠 What it does
TidyPeek performs a quick analysis of your dataset and highlights:
- Missing values
- Duplicate rows
- Potential identifier columns
- Data inconsistencies
It then generates simple insights to help guide cleaning and analysis.
📊 Example Output
=== GENERAL CHECKS ===
Total Columns: 18
Total Rows: 4111
=== IDENTIFIER CHECKS ===
Likely ID columns:
- name (Uniqueness: 99.85%, Missing: 0.00%)
Duplicate ID counts:
- name: 6 duplicates
=== INSIGHTS ===
1. 4 columns have high missing values
2. 12 columns have low uniqueness
3. Column 'name' likely acts as an identifier but has duplicates
🎯 Use Cases
- Quick dataset inspection
- Data cleaning workflows
- Learning data analysis basics
- Pre-analysis sanity checks
📌 Current Scope
tidypeek focuses on lightweight inspection, not full data cleaning or transformation.
🤝 Contributing
Feel free to fork the repo and open a pull request.
📄 License MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tidypeek-0.1.1.tar.gz.
File metadata
- Download URL: tidypeek-0.1.1.tar.gz
- Upload date:
- Size: 4.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a247867063bd34df7d0d080fc2f038779e021c5364fb9ac4d47a8f2b113c5fd7
|
|
| MD5 |
e6159b1fbb7da42f5dfdf43a51d4ae48
|
|
| BLAKE2b-256 |
6ae9cbce3bb0a8cdfcf9055f8758ea2ff3ddac592475d965eadc5fbf6d39c61b
|
File details
Details for the file tidypeek-0.1.1-py3-none-any.whl.
File metadata
- Download URL: tidypeek-0.1.1-py3-none-any.whl
- Upload date:
- Size: 4.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3c0cf4a5cc8e7d0e80161ae577d92bf4faf5544de262296608f5db3e2e7fc2ae
|
|
| MD5 |
cf8f92b24ecd87c5ed64fbd6f00291f2
|
|
| BLAKE2b-256 |
ff8c1161f4afae778b8a2e7f066151b5107df97a7829cca7b69882ef6e276573
|