Intelligent, automatic missing value imputation library for pandas datasets with context-aware text handling.
Project description
AM_filler
Automatic Missing Value Filler - Fill missing values in datasets intelligently with one line of code.
🚀 Overview
AM_filler is a Python library that automatically detects column types and fills missing values using the best strategy — without any user configuration. It saves time and prepares datasets for ML/DL tasks in one line of code.
Why AM_filler?
| Feature | sklearn/PyCaret | AM_filler |
|---|---|---|
| Choose imputation strategy | Manual | Automatic |
| Handle text columns | Limited | Built-in |
| Configuration required | Yes | None |
| One-line usage | No | Yes |
📦 Installation
# Clone the repository
git clone https://github.com/yourusername/AM_filler.git
cd AM_filler
# Install in development mode
pip install -e .
⚡ Quick Start
from am_filler import AMFiller
import pandas as pd
# Your DataFrame with missing values
df = pd.read_csv("your_data.csv")
# One line to fill all missing values!
df_clean = AMFiller().fit_transform(df)
That's it! AM_filler automatically:
- Detects column types (numeric, categorical, text)
- Chooses the best imputation strategy
- Fills all missing values
- Logs what was done
🧠 How It Works - The Logic
AM_Filler uses intelligent algorithms to determine how to fill missing data:
1. Numeric Data (Intelligence)
It checks the distribution of data before deciding:
- Normal Distribution: Uses MEAN.
- Skewed / Outliers: Uses MEDIAN.
(See the graph above for visual representation)
2. Context-Aware Text Filling
Unlike other libraries that drop text or fill with "Missing", AM_Filler uses context-aware templates:
- Description: "Information not available."
- Feedback: "No review provided."
- etc.
3. Categorical Data
- Uses MODE (Most frequent value).
📁 Project Structure
AM_filler/
├── am_filler/
│ ├── __init__.py # Public API exports
│ ├── core.py # Main AMFiller class
│ ├── numeric.py # Numeric imputation (smart mean/median)
│ ├── categorical.py # Categorical imputation (mode)
│ ├── text.py # Text imputation (sentences)
│ └── utils.py # Helper functions
├── examples/
│ └── test_notebook.ipynb # Interactive Demo!
├── docs/
│ └── images/ # Documentation assets
├── README.md
├── pyproject.toml
└── setup.py
🧪 Running Tests
# Install test dependencies
pip install pytest
# Run all tests
pytest tests/ -v
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file am_filler-1.0.0.tar.gz.
File metadata
- Download URL: am_filler-1.0.0.tar.gz
- Upload date:
- Size: 16.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0680f2770b12a3110fff4a70afa947d5ca6cc9608723e0e9bc509902ff4c6223
|
|
| MD5 |
bb025100f14e602030c41b8627f1a3b6
|
|
| BLAKE2b-256 |
a1f6d8584f531cc529411b18f9dbd5c4bc2aea69924187e749beb494a6520e85
|
File details
Details for the file am_filler-1.0.0-py3-none-any.whl.
File metadata
- Download URL: am_filler-1.0.0-py3-none-any.whl
- Upload date:
- Size: 11.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5e24c3ff8be67e76f493163025659ce7543b8f1f30abe0d4d652d19ba8471ccb
|
|
| MD5 |
adb9ac09ba04a0f89c2a39332062f061
|
|
| BLAKE2b-256 |
3d3ba5ce4ccf941c889b00dd962f3641fb3acc0ca63da757a7155441774b67b9
|