Skip to main content

Automated Data Cleaning for Faster Analytics

Project description

cleanframe-data 🪄

License: MIT Python Version PyPI version

Automated Data Cleaning for Faster Analytics

cleanframe-data is a lightweight, fast, and intuitive Python library designed to automate dataset diagnostics and cleaning. It helps data analysts, scientists, and beginners clean messy datasets, handle missing values, drop low-quality columns, and cap outliers in just one line of code.


🚀 Features

  • One-Line Auto-Clean: Drop duplicates, remove low-quality columns, impute missing values, and handle outliers instantly using cf.auto_clean(df).
  • Advanced Outlier Handling: Automatically detects and caps extreme numerical outliers using the Interquartile Range (IQR) method.
  • Smart Column Dropping: Drops columns automatically if their missing data percentage crosses your defined threshold.
  • Dataset Diagnostics: Get a quick, comprehensive report of data types, missing values, and percentages.
  • Modern Pandas Ready: Built from the ground up to support modern Pandas (2.0+) Copy-on-Write behaviors without annoying warnings.

🛠️ Installation

You can install the official stable release directly from PyPI:

pip install cleanframe-data

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanframe_data-0.0.3.tar.gz (4.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanframe_data-0.0.3-py3-none-any.whl (5.6 kB view details)

Uploaded Python 3

File details

Details for the file cleanframe_data-0.0.3.tar.gz.

File metadata

  • Download URL: cleanframe_data-0.0.3.tar.gz
  • Upload date:
  • Size: 4.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for cleanframe_data-0.0.3.tar.gz
Algorithm Hash digest
SHA256 35cc084cd0d0db47d5c2d33eac57d3fc3266f1438cb87cd1e957776e51dda7f8
MD5 c3f0a01c6c803c3e797533071ffd724c
BLAKE2b-256 a71d2d6f5a5e5592f4a6686fd2e0a78fd72c010217bcf56d009d87f7154b4e83

See more details on using hashes here.

File details

Details for the file cleanframe_data-0.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for cleanframe_data-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 54d70bd702baa2f0d48d459a7da7029443e8e4c762356031ba28376b1ff5bbcb
MD5 64233d21d95cf618a80e65bca695691a
BLAKE2b-256 63d4b8edc7be1a9147cd4d016e12070421b74c26a471d05169e14c0cee9bfbd7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page