category-encoders

A collection sklearn transformers to encode categorical variables as numeric

These details have not been verified by PyPI

Project links

Project description

A set of example problems examining different encoding methods for categorical variables for the purpose of classification. Optionally, install the library of encoders as a package and use them in your projects directly. They are all available as methods or as scikit-learn compatible transformers.

Docs [here](http://wdm0006.github.io/categorical_encoding/)

Encoding Methods

Ordinal

One-Hot

Binary

Helmert Contrast

Sum Contrast

Polynomial Contrast

Backward Difference Contrast

Simple Hashing

Usage

Either run the examples in encoding_examples.py, or install as:

pip install category_encoders

To use:

import category_encoders as ce

encoder = ce.BackwardDifferenceEncoder(cols=[…]) encoder = ce.BinaryEncoder(cols=[…]) encoder = ce.HashingEncoder(cols=[…]) encoder = ce.HelmertEncoder(cols=[…]) encoder = ce.OneHotEncoder(cols=[…]) encoder = ce.OrdinalEncoder(cols=[…]) encoder = ce.SumEncoder(cols=[…]) encoder = ce.PolynomialEncoder(cols=[…])

All of these are fully compatible sklearn transformers, so they can be used in pipelines or in your existing scripts. If the cols parameter isn’t passed, every column will be encoded, so be careful with that.

Datasets

The datasets used in the examples are car, mushroom, and splice datasets from the UCI dataset repository, found here:

[datasets](https://archive.ics.uci.edu/ml/datasets)

License

BSD

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.9.0

Nov 2, 2025

2.8.1

Mar 15, 2025

2.8.0

Jan 19, 2025

2.7.0

Jan 7, 2025

2.6.4

Oct 1, 2024

2.6.3

Oct 29, 2023

2.6.2

Aug 15, 2023

2.6.1

May 15, 2023

2.6.0

Jan 14, 2023

2.5.1.post0

Oct 6, 2022

2.5.1

Oct 5, 2022

2.5.0

Jun 2, 2022

2.4.1

May 10, 2022

2.4.0

Mar 9, 2022

2.3.0

Oct 13, 2021

2.2.2

Apr 29, 2020

2.1.0

Oct 1, 2019

2.0.0

Apr 28, 2019

1.3.0

Oct 14, 2018

1.2.8

Jun 6, 2018

1.2.7

Jun 4, 2018

1.2.6

Jan 21, 2018

1.2.5

Nov 3, 2017

1.2.4

Jul 12, 2017

1.2.3

Jul 28, 2016

1.2.2

Jul 26, 2016

1.2.1

Jun 20, 2016

1.2.0

Jun 17, 2016

1.1.2

May 31, 2016

1.1.1

May 31, 2016

1.1.0

May 31, 2016

1.0.5

May 18, 2016

This version

1.0.4

Apr 6, 2016

1.0.3

Mar 9, 2016

1.0.2

Mar 2, 2016

1.0.1

Mar 2, 2016

1.0.0

Feb 24, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

category_encoders-1.0.4.tar.gz (5.4 kB view details)

Uploaded Apr 6, 2016 Source

File details

Details for the file category_encoders-1.0.4.tar.gz.

File metadata

Download URL: category_encoders-1.0.4.tar.gz
Upload date: Apr 6, 2016
Size: 5.4 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for category_encoders-1.0.4.tar.gz
Algorithm	Hash digest
SHA256	`de3a13422562eef3151497e11306db7c376521e8d65a366fe046fff85c2ab05d`
MD5	`3d106275940ab9455e0efa075ed6f030`
BLAKE2b-256	`ff9cf24e95db6b7d9d9503ce61f6daa980d69cd70a953b7e04b6d19b6e673234`

See more details on using hashes here.

category-encoders 1.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Encoding Methods

Usage

Datasets

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes