LLM-powered feature engineering with scikit-learn API compatibility
Project description
SKFeatureLLM
SKFeatureLLM is a Python library that brings the power of Large Language Models (LLMs) to feature engineering for tabular data, wrapped in a familiar scikit-learn–style API. The library aims to leverage LLMs' capabilities to automatically generate and implement meaningful features for your machine learning tasks.
📑 Table of Contents
🌟 Key Features
- 🤖 LLM-powered feature engineering
- 🔌 Model-agnostic: works with any LLM provider (OpenAI, Anthropic, etc.)
- 🛠 Scikit-learn compatible API
- 📊 Comprehensive feature evaluation and reporting
- 🎯 Support for both supervised and unsupervised feature engineering
📦 Installation
To install the latest release of SKFeatureLLM from PyPI:
pip install skfeaturellm
This will install the library and its core dependencies for end users.
✅ Running Tests
To run the test suite, ensure pytest is installed and execute:
poetry run pytest
Tests are located in the tests/ directory and cover the core functionality of SKFeatureLLM.
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🤝 Contributing
We welcome contributions! Here's how you can help:
- Report Bugs: If you find a bug, please open an issue with a detailed description.
- Suggest Features: Have an idea for a new feature? Open an issue to discuss it.
- Submit Pull Requests: We love PRs! Here's how to submit one:
- Fork the repository
- Create a new branch for your feature
- Make your changes
- Submit a pull request
Development Setup
- Clone the repository:
git clone https://github.com/yourusername/skfeaturellm.git
cd skfeaturellm
- Install development dependencies:
pip install -e ".[dev]"
- Run tests:
pytest
- Format code:
black .
isort .
Code Style
We use:
- Black for code formatting
- isort for import sorting
- pylint for linting
- mypy for type checking
Please ensure your code passes all checks before submitting a PR.
👤 Author
📚 Documentation
Our documentation is hosted on Read the Docs and includes:
- 📖 User Guide - Learn how to use SKFeatureLLM effectively
- 🔍 API Reference - Detailed documentation of all classes and methods
- 🎯 Examples - Practical examples for different use cases
- 📊 Feature Engineering Guide - Best practices for feature engineering with LLMs
For any questions or issues, please open an issue on our GitHub repository.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file skfeaturellm-0.0.8.tar.gz.
File metadata
- Download URL: skfeaturellm-0.0.8.tar.gz
- Upload date:
- Size: 22.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d2ca4305fd9217fcd77f1c749beda84cd55230dac9abae40c07fc485bd422551
|
|
| MD5 |
ed5351806c53699801715103198b7dff
|
|
| BLAKE2b-256 |
b91e63dee522d7e261c038bc28c876e161647e2fa2dcea14e95d84485ebb6d59
|
Provenance
The following attestation bundles were made for skfeaturellm-0.0.8.tar.gz:
Publisher:
ci-cd.yml on RobertoCorti/skfeaturellm
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
skfeaturellm-0.0.8.tar.gz -
Subject digest:
d2ca4305fd9217fcd77f1c749beda84cd55230dac9abae40c07fc485bd422551 - Sigstore transparency entry: 975963258
- Sigstore integration time:
-
Permalink:
RobertoCorti/skfeaturellm@58ab2adaea96a7ab2fe83c96673af41957ef5cf0 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/RobertoCorti
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci-cd.yml@58ab2adaea96a7ab2fe83c96673af41957ef5cf0 -
Trigger Event:
push
-
Statement type:
File details
Details for the file skfeaturellm-0.0.8-py3-none-any.whl.
File metadata
- Download URL: skfeaturellm-0.0.8-py3-none-any.whl
- Upload date:
- Size: 26.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
baf1ab5baaacd94a1e9c107d17f9db0c268db0d1ee5a3d93a2015ca5c5ea75ce
|
|
| MD5 |
9f94e75202bcd22196fc89d4e14ef34d
|
|
| BLAKE2b-256 |
b10bec8e11b74039471777e656b160094c9bcf12062972988df87bc94b8f909b
|
Provenance
The following attestation bundles were made for skfeaturellm-0.0.8-py3-none-any.whl:
Publisher:
ci-cd.yml on RobertoCorti/skfeaturellm
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
skfeaturellm-0.0.8-py3-none-any.whl -
Subject digest:
baf1ab5baaacd94a1e9c107d17f9db0c268db0d1ee5a3d93a2015ca5c5ea75ce - Sigstore transparency entry: 975963261
- Sigstore integration time:
-
Permalink:
RobertoCorti/skfeaturellm@58ab2adaea96a7ab2fe83c96673af41957ef5cf0 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/RobertoCorti
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci-cd.yml@58ab2adaea96a7ab2fe83c96673af41957ef5cf0 -
Trigger Event:
push
-
Statement type: