Skip to main content

Automated Data Cleaning for Faster Analytics

Project description

cleanframe-data 🪄

License: MIT Python Version PyPI version

Automated Data Cleaning for Faster Analytics

cleanframe-data is a lightweight, fast, and intuitive Python library designed to automate dataset diagnostics and cleaning. It helps data analysts, scientists, and beginners clean messy datasets, handle missing values, drop low-quality columns, and cap outliers in just one line of code.


🚀 Features

  • One-Line Auto-Clean: Drop duplicates, remove low-quality columns, impute missing values, and handle outliers instantly using cf.auto_clean(df).
  • Advanced Outlier Handling: Automatically detects and caps extreme numerical outliers using the Interquartile Range (IQR) method.
  • Smart Column Dropping: Drops columns automatically if their missing data percentage crosses your defined threshold.
  • Dataset Diagnostics: Get a quick, comprehensive report of data types, missing values, and percentages.
  • Modern Pandas Ready: Built from the ground up to support modern Pandas (2.0+) Copy-on-Write behaviors without annoying warnings.

🛠️ Installation

You can install the official stable release directly from PyPI:

pip install cleanframe-data

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanframe_data-0.0.5.tar.gz (4.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanframe_data-0.0.5-py3-none-any.whl (5.6 kB view details)

Uploaded Python 3

File details

Details for the file cleanframe_data-0.0.5.tar.gz.

File metadata

  • Download URL: cleanframe_data-0.0.5.tar.gz
  • Upload date:
  • Size: 4.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for cleanframe_data-0.0.5.tar.gz
Algorithm Hash digest
SHA256 a92ff11569971ef204b9bada6e85161f3b42e74cdc8f571128a81aab757382cc
MD5 20ac4ff16d8156f3bf355a8496fe9b7f
BLAKE2b-256 c45455984495d068865477b0825576afc7f6766c0e2bf10a567670a7ec728f43

See more details on using hashes here.

File details

Details for the file cleanframe_data-0.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for cleanframe_data-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 62fbde2e91972391a4d646c54f222ea935e857f004c9e2215641a03a17e7e47f
MD5 6f3615c89cb5f791a44259bea04f453b
BLAKE2b-256 6adae55128f1199151898cc5b5167d8457f8ab122c0edb32e8c871c32212bf12

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page