Skip to main content

Intelligent, automatic missing value imputation library for pandas datasets with context-aware text handling.

Project description

AM_filler

Automatic Missing Value Filler - Fill missing values in datasets intelligently with one line of code.

Python 3.8+ License: MIT

🚀 Overview

AM_filler is a Python library that automatically detects column types and fills missing values using the best strategy — without any user configuration. It saves time and prepares datasets for ML/DL tasks in one line of code.

Distribution Logic

Why AM_filler?

Feature sklearn/PyCaret AM_filler
Choose imputation strategy Manual Automatic
Handle text columns Limited Built-in
Configuration required Yes None
One-line usage No Yes

📦 Installation

# Clone the repository
git clone https://github.com/yourusername/AM_filler.git
cd AM_filler

# Install in development mode
pip install -e .

⚡ Quick Start

from am_filler import AMFiller
import pandas as pd

# Your DataFrame with missing values
df = pd.read_csv("your_data.csv")

# One line to fill all missing values!
df_clean = AMFiller().fit_transform(df)

That's it! AM_filler automatically:

  • Detects column types (numeric, categorical, text)
  • Chooses the best imputation strategy
  • Fills all missing values
  • Logs what was done

🧠 How It Works - The Logic

AM_Filler uses intelligent algorithms to determine how to fill missing data:

1. Numeric Data (Intelligence)

It checks the distribution of data before deciding:

  • Normal Distribution: Uses MEAN.
  • Skewed / Outliers: Uses MEDIAN.

(See the graph above for visual representation)

2. Context-Aware Text Filling

Unlike other libraries that drop text or fill with "Missing", AM_Filler uses context-aware templates:

  • Description: "Information not available."
  • Feedback: "No review provided."
  • etc.

3. Categorical Data

  • Uses MODE (Most frequent value).

📁 Project Structure

AM_filler/
├── am_filler/
│   ├── __init__.py      # Public API exports
│   ├── core.py          # Main AMFiller class
│   ├── numeric.py       # Numeric imputation (smart mean/median)
│   ├── categorical.py   # Categorical imputation (mode)
│   ├── text.py          # Text imputation (sentences)
│   └── utils.py         # Helper functions
├── examples/
│   └── test_notebook.ipynb # Interactive Demo!
├── docs/
│   └── images/          # Documentation assets
├── README.md
├── pyproject.toml
└── setup.py

🧪 Running Tests

# Install test dependencies
pip install pytest

# Run all tests
pytest tests/ -v

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

am_filler-1.0.0.tar.gz (16.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

am_filler-1.0.0-py3-none-any.whl (11.5 kB view details)

Uploaded Python 3

File details

Details for the file am_filler-1.0.0.tar.gz.

File metadata

  • Download URL: am_filler-1.0.0.tar.gz
  • Upload date:
  • Size: 16.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.11

File hashes

Hashes for am_filler-1.0.0.tar.gz
Algorithm Hash digest
SHA256 0680f2770b12a3110fff4a70afa947d5ca6cc9608723e0e9bc509902ff4c6223
MD5 bb025100f14e602030c41b8627f1a3b6
BLAKE2b-256 a1f6d8584f531cc529411b18f9dbd5c4bc2aea69924187e749beb494a6520e85

See more details on using hashes here.

File details

Details for the file am_filler-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: am_filler-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 11.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.11

File hashes

Hashes for am_filler-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5e24c3ff8be67e76f493163025659ce7543b8f1f30abe0d4d652d19ba8471ccb
MD5 adb9ac09ba04a0f89c2a39332062f061
BLAKE2b-256 3d3ba5ce4ccf941c889b00dd962f3641fb3acc0ca63da757a7155441774b67b9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page