Skip to main content

Automated Data Cleaning for Faster Analytics

Project description

cleanframe 🪄

License: MIT Python Version

Automated Data Cleaning for Faster Analytics

cleanframe is a lightweight, fast, and intuitive Python library designed to automate dataset diagnostics and cleaning. It helps data analysts, scientists, and beginners clean messy datasets, handle missing values, drop low-quality columns, and cap outliers in just one line of code.


🚀 Features

  • One-Line Auto-Clean: Drop duplicates, remove low-quality columns, impute missing values, and handle outliers instantly using cf.auto_clean(df).
  • Advanced Outlier Handling: Automatically detects and caps extreme numerical outliers using the Interquartile Range (IQR) method.
  • Smart Column Dropping: Drops columns automatically if their missing data percentage crosses your defined threshold.
  • Dataset Diagnostics: Get a quick, comprehensive report of data types, missing values, and percentages.
  • Modern Pandas Ready: Built from the ground up to support modern Pandas (2.0+) Copy-on-Write behaviors without annoying warnings.

🛠️ Installation

(Once published to PyPI, you can install it via pip)

pip install cleanframe

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanframe_data-0.0.2.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanframe_data-0.0.2-py3-none-any.whl (5.6 kB view details)

Uploaded Python 3

File details

Details for the file cleanframe_data-0.0.2.tar.gz.

File metadata

  • Download URL: cleanframe_data-0.0.2.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for cleanframe_data-0.0.2.tar.gz
Algorithm Hash digest
SHA256 70e5d65125c1947a532884c9e561cfe57c3c35e67c4d3e4cc59b0b1b323a4012
MD5 fd5e33c71312802921023b20cd47a87d
BLAKE2b-256 fcabc0abaf8c2c12c17c12d6339b40b6426b3ccb4d028208594220104758fab4

See more details on using hashes here.

File details

Details for the file cleanframe_data-0.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for cleanframe_data-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 9004b9c0f0056918f8851a17319379e5da9326389e49a71bf430b3af2891bdd3
MD5 f1bab536502840dca8014169964290aa
BLAKE2b-256 e810b0be60d78290f3c2981c9ce206da315d48609e80f2ee923c4b02b048b1b2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page