
Impute missing values using faiss


FaissImputer


FaissImputer is a Python library for missing data imputation with k nearest neighbors, using Faiss for the neighbor search.

Installation

You can install faiss-imputer using pip:

pip install faiss-imputer

Usage

import pandas as pd
from faiss_imputer import FaissImputer

# Create your DataFrame and introduce missing values
# ...

# Create an instance of FaissImputer
imputer = FaissImputer(n_neighbors=3)

# Fit the imputer on the data frame with missing values
imputer.fit(df_missing)

# Transform the data frame with missing values
df_imputed = imputer.transform(df_missing)
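The snippet above leaves the construction of df_missing open. As a minimal sketch, assuming a purely numeric DataFrame, one way to build such a frame for experimentation is to mask random cells with NaN. The column names, the 20% missing rate, and the random seed are illustrative choices and are not part of faiss-imputer.

```python
import numpy as np
import pandas as pd

# Illustrative data: column names, sizes, and seed are arbitrary choices.
rng = np.random.default_rng(0)
df = pd.DataFrame(rng.normal(size=(100, 4)), columns=["a", "b", "c", "d"])

# Hide roughly 20% of the cells; DataFrame.mask replaces True cells with NaN.
hide = rng.random(df.shape) < 0.2
df_missing = df.mask(hide)

# Count of missing cells per column, as a quick sanity check.
print(df_missing.isna().sum())
```

Keeping the complete df around makes it possible to compare the imputed values against the hidden originals afterwards.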

Parameters

n_neighbors: Number of nearest neighbors to consider for imputation.
metric: Distance metric to use for nearest neighbor search ('l2' or 'ip').
strategy: Imputation strategy ('mean' or 'median').
index_factory: Faiss index type ('Flat' or others).
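To make the roles of n_neighbors and strategy concrete, here is a rough pure-Python sketch of k-nearest-neighbor imputation. It is not the library's implementation: a brute-force L2 search stands in for the Faiss index, the function name knn_impute is hypothetical, and it assumes that only rows without missing values serve as neighbors. The 'ip' metric is not covered.

```python
import math
from statistics import mean, median


def knn_impute(rows, n_neighbors=3, strategy="mean"):
    """Fill None cells from the n_neighbors closest complete rows (L2 distance).

    Hypothetical helper for illustration; requires at least one complete row.
    """
    aggregate = {"mean": mean, "median": median}[strategy]
    complete = [r for r in rows if None not in r]
    imputed = []
    for row in rows:
        missing = [j for j, v in enumerate(row) if v is None]
        if not missing:
            imputed.append(list(row))
            continue
        observed = [j for j, v in enumerate(row) if v is not None]

        # Distance is measured over the columns this row actually has.
        def dist(other):
            return math.sqrt(sum((row[j] - other[j]) ** 2 for j in observed))

        neighbors = sorted(complete, key=dist)[:n_neighbors]
        filled = list(row)
        for j in missing:
            # Aggregate the neighbors' values in the missing column.
            filled[j] = aggregate([n[j] for n in neighbors])
        imputed.append(filled)
    return imputed


rows = [
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.3],
    [0.9, 1.9, 3.0],
    [9.0, 9.0, 9.0],
    [1.0, 2.0, None],
]
result = knn_impute(rows, n_neighbors=3, strategy="median")
print(result[-1])  # → [1.0, 2.0, 3.0]
```

With strategy="mean" the same three neighbors would give roughly 3.1 instead of 3.0, which shows how the median is less sensitive to a single differing neighbor.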

Example

For a detailed example, refer to the example.py file.

Contributing

Contributions are welcome! If you find a bug or have an enhancement suggestion, please open an issue or create a pull request.

License

This project is licensed under the MIT License.

Third-Party Licenses

This project utilizes code from Meta's Faiss library, which is distributed under the Apache License 2.0.

Please note that while this project includes code from the Faiss library, it is not officially associated with or endorsed by the Faiss maintainers or Meta.

For detailed licensing information of the Faiss library, please refer to the Faiss repository.
