Skip to main content

Automated Data Cleaning for Faster Analytics

Project description

cleanframe-data 🪄

License: MIT Python Version PyPI version

Automated Data Cleaning for Faster Analytics

cleanframe-data is a lightweight, fast, and intuitive Python library designed to automate dataset diagnostics and cleaning. It helps data analysts, scientists, and beginners clean messy datasets, handle missing values, drop low-quality columns, and cap outliers in just one line of code.


🚀 Features

  • One-Line Auto-Clean: Drop duplicates, remove low-quality columns, impute missing values, and handle outliers instantly using cf.auto_clean(df).
  • Advanced Outlier Handling: Automatically detects and caps extreme numerical outliers using the Interquartile Range (IQR) method.
  • Smart Column Dropping: Drops columns automatically if their missing data percentage crosses your defined threshold.
  • Dataset Diagnostics: Get a quick, comprehensive report of data types, missing values, and percentages.
  • Modern Pandas Ready: Built from the ground up to support modern Pandas (2.0+) Copy-on-Write behaviors without annoying warnings.

🛠️ Installation

You can install the official stable release directly from PyPI:

pip install cleanframe-data

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanframe_data-0.0.4.tar.gz (4.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanframe_data-0.0.4-py3-none-any.whl (5.6 kB view details)

Uploaded Python 3

File details

Details for the file cleanframe_data-0.0.4.tar.gz.

File metadata

  • Download URL: cleanframe_data-0.0.4.tar.gz
  • Upload date:
  • Size: 4.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for cleanframe_data-0.0.4.tar.gz
Algorithm Hash digest
SHA256 29ba685349ef4e2554367a013aca66675afde433fb35ba7b7e9db4fdd6ba370c
MD5 b0ff0da77136af52f9ebc9274f0f8421
BLAKE2b-256 2b274537e89b843754a987fd5be528063ee6b9b9cadb7988457986d51d638ad0

See more details on using hashes here.

File details

Details for the file cleanframe_data-0.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for cleanframe_data-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c7a5addef1758eeae51762a4f1a2f49d2b913a39952b4c4858801b4f656a9e40
MD5 b07aa052b9e0431aeaf36af0351ebe52
BLAKE2b-256 9d91f65e6d2fee3287164b9408f4cdae0a417ecc6e48d5f3ab64d87de537f100

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page