Skip to main content

Statistical IV: Statistical Hypothesis Testing for the Information Value (IV). Evaluation of the predictive power of features using the IV with specific thresholds for each dataset.

Project description

Statistical IV

Our J-Divergence test is under the next null hypothesis

H0: The predictive power of the variable is not significant.

The null hypothesis is tested using a two-tailed distribution, and this should be taken into consideration when interpreting the p-value.

Explanation

Optimize your machine learning models with 'Statistical-IV'. Perform automated feature selection based on statistics and customize error control.

  1. Import package

    from statistical_iv import api
    
  2. Provide a DataFrame as Input:

    • Supply a DataFrame df containing your data for IV calculation.
  3. Specify Predictor Variables:

    • Prived a list of predictor variable names (variables_names) to analyze.
  4. Define the Target Variable:

    • Specify the name of the target variable (var_y) in your DataFrame.
  5. Indicate Variable Types:

    • Define the type of your predictor variables as 'categorical' or 'numerical' using the type_vars parameter.
  6. Optional: Set Maximum Bins:

    • Adjust the maximum number of bins for discretization (optional) using the max_bins parameter.
  7. Call the statistical_iv Function:

    • Calculate Statistical IV information by calling the statistical_iv function from api with the specified parameters (That is used for OptimalBinning package).
    result_df = api.statistical_iv(df, variables_names, var_y, type_vars, max_bins)
    

Example Result:

Output Example

Full Paper:

For a comprehensive exploration of the topic, we recommend perusing the contents of the article available at this link.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

statistical_iv-0.3.1.tar.gz (5.4 kB view hashes)

Uploaded Source

Built Distribution

statistical_iv-0.3.1-py3-none-any.whl (5.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page