Skip to main content

Automatic Discretization of Features with Optimal Target Association

Project description

AutoCarver Logo

PyPI PyPI - Python Version License

ReadTheDocs

Check out the package documentation on ReadTheDocs!

Install

AutoCarver can be installed from PyPI:

pip install autocarver

Why AutoCarver?

AutoCarver is a powerful Python package designed to address the fundamental question of What's the best processing for my model's features?

It offers an automated and optimized approach to processing and engineering your data, resulting in improved model performance, enhanced explainability, and reduced feature dimensionality. As of today, this set of tools is available for binary classification and regression problems only.

Key Features:

  1. Data Processing and Engineering: AutoCarver performs automatic bucketization and carving of a DataFrame's columns to maximize their correlation with a target variable. By leveraging advanced techniques, it optimizes the preprocessing steps for your data, leading to enhanced predictive accuracy.

  2. Improved Model Explainability: AutoCarver aids in understanding the relationship between the processed features and the target variable. By uncovering meaningful patterns and interactions, it provides valuable insights into the underlying data dynamics, enhancing the interpretability of your models.

  3. Reduced Feature Dimensionality: AutoCarver excels at reducing feature dimensionality, especially in scenarios involving one-hot encoding. It identifies and preserves only the most statistically relevant modalities, ensuring that your models focus on the most informative aspects of the data while eliminating noise and redundancy.

  4. Statistical Accuracy and Relevance: AutoCarver incorporates statistical techniques to ensure that the selected modalities have a sufficient number of observations, minimizing the risk of drawing conclusions based on insufficient data. This helps maintain the reliability and validity of your models.

  5. Robustness Testing: AutoCarver goes beyond feature processing by assessing the robustness of the selected modalities. It performs tests to evaluate the stability and consistency of the chosen features across different datasets or subsets, ensuring their reliability in various scenarios.

AutoCarver is a valuable tool for data scientists and practitioners involved in binary classification or regression problems, such as credit scoring, fraud detection, and risk assessment. By leveraging its automated feature processing capabilities, you can unlock the full potential of your data, leading to more accurate predictions, improved model explainability, and better decision-making in your classification tasks.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autocarver-7.0.10.tar.gz (64.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autocarver-7.0.10-py3-none-any.whl (101.0 kB view details)

Uploaded Python 3

File details

Details for the file autocarver-7.0.10.tar.gz.

File metadata

  • Download URL: autocarver-7.0.10.tar.gz
  • Upload date:
  • Size: 64.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.21

File hashes

Hashes for autocarver-7.0.10.tar.gz
Algorithm Hash digest
SHA256 8adb4971bb1132958cd4cf47cc0e6e23a71f4cb00a297db07d84f22c127e5aef
MD5 ed91a239d9bf4f6c809e3e2e0227dacb
BLAKE2b-256 ca84fbeb423bbdbe47f89bfd64fc9409c2155c07d4645be4276e44b9e8255b18

See more details on using hashes here.

File details

Details for the file autocarver-7.0.10-py3-none-any.whl.

File metadata

  • Download URL: autocarver-7.0.10-py3-none-any.whl
  • Upload date:
  • Size: 101.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.21

File hashes

Hashes for autocarver-7.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 8c5c1e249396617d39fc661a7fbc0d378c3baebdbf98d2c8c5621c56f64a63ff
MD5 e012e156a191c095f725562d1c564d7a
BLAKE2b-256 6dd770fba47c7971dd0c2de402853ef05a046f0de654024f21f08e0389ef8317

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page