Impute missing values using faiss
FaissImputer
FaissImputer is a Python library for imputing missing data with k-nearest neighbors, using Faiss for the neighbor search.
Installation
You can install faiss-imputer using pip:
pip install faiss-imputer
Usage
import pandas as pd
from faiss_imputer import FaissImputer
# Create your DataFrame and introduce missing values
# ...
# Create an instance of FaissImputer
imputer = FaissImputer(n_neighbors=3)
# Fit the imputer on the data frame with missing values
imputer.fit(df_missing)
# Transform the data frame with missing values
df_imputed = imputer.transform(df_missing)
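The usage snippet above leaves the construction of df_missing open. One possible way to build such a frame for experimentation is sketched below. The helper name make_missing, the column names, and the 20% missing rate are illustrative assumptions and not part of faiss-imputer; only pandas and NumPy are used.

```python
import numpy as np
import pandas as pd


def make_missing(df: pd.DataFrame, frac: float = 0.2, seed: int = 0) -> pd.DataFrame:
    """Return a copy of df with roughly `frac` of its cells set to NaN.

    Hypothetical helper for illustration; not part of faiss-imputer.
    """
    rng = np.random.default_rng(seed)
    # Boolean array with the same shape as df: True marks a cell to blank out
    drop = rng.random(df.shape) < frac
    # DataFrame.mask replaces cells where the condition is True with NaN
    return df.mask(drop)


# Synthetic numeric data: 100 rows, 4 columns (names are arbitrary)
rng = np.random.default_rng(42)
df = pd.DataFrame(rng.normal(size=(100, 4)), columns=["a", "b", "c", "d"])

df_missing = make_missing(df, frac=0.2)
```

A frame built this way can be passed to imputer.fit and imputer.transform as in the usage snippet, and because the complete df is still available, the imputed values can be compared against the originals.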
Parameters
- n_neighbors: Number of nearest neighbors to consider for imputation.
- metric: Distance metric to use for nearest neighbor search ('l2' for Euclidean distance or 'ip' for inner product).
- strategy: Imputation strategy, i.e. how the neighbors' values are combined ('mean' or 'median').
- index_factory: Faiss index type ('Flat' or others).
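To make the roles of n_neighbors and strategy concrete, here is a minimal pure-Python sketch of k-nearest-neighbor imputation in general. It is not FaissImputer's implementation, which delegates the neighbor search to Faiss. The function name knn_impute, the use of None for missing cells, and the choice to compute L2 distance only over features that both rows have observed are all assumptions made for this illustration.

```python
import math
import statistics


def knn_impute(rows, n_neighbors=3, strategy="mean"):
    """Fill None entries in a list of equal-length numeric rows.

    Illustrative sketch only; not the faiss-imputer algorithm.
    """
    aggregate = statistics.mean if strategy == "mean" else statistics.median

    def distance(a, b):
        # Assumption: L2 distance over the features observed in both rows
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared))

    imputed = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate donors: other rows that have a value in column j
            candidates = [
                (distance(row, other), other[j])
                for k, other in enumerate(rows)
                if k != i and other[j] is not None
            ]
            candidates.sort(key=lambda pair: pair[0])
            nearest = [v for _, v in candidates[:n_neighbors]]
            if nearest:
                # Combine the neighbors' values with the chosen strategy
                imputed[i][j] = aggregate(nearest)
    return imputed


data = [
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.1],
    [0.9, 1.9, 2.9],
    [10.0, 20.0, 30.0],
    [1.0, None, 3.0],  # second feature is missing
]
filled = knn_impute(data, n_neighbors=3, strategy="mean")
print(filled[4][1])  # close to 2.0; the distant fourth row is not among the 3 neighbors
```

With n_neighbors=3 the outlying row is ignored. Raising n_neighbors to 4 would pull it into the average, which is where the 'median' strategy tends to be more robust than 'mean'.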
Example
For a detailed example, refer to the example.py file.
Contributing
Contributions are welcome! If you find a bug or have an enhancement suggestion, please open an issue or create a pull request.
License
This project is licensed under the MIT License.
Third-Party Licenses
This project utilizes code from Meta's Faiss library, which is distributed under the Apache License 2.0.
Please note that while this project includes code from the Faiss library, it is not officially associated with or endorsed by the Faiss maintainers or Meta.
For detailed licensing information of the Faiss library, please refer to the Faiss repository.
Hashes for faiss_imputer-0.1.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 1bcf730e192c77cfaf157fe9c07a221a4c2e90708f38a689afd5e3ecb802bc3d
MD5 | 3b439807524304312a42dbd91995a4e8
BLAKE2b-256 | b9f5903803213b073e4b5253ce59d5355522bf73aaf2186ece4e161e8f1a99b2