Python library for converting Scikit-Learn pipelines to PMML

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
- Science/Research
Operating System
- OS Independent
Programming Language
- Python
Topic
- Scientific/Engineering
- Software Development

Project description

SkLearn2PMML

Python package for converting Scikit-Learn pipelines to PMML.

Features

This package is a thin Python wrapper around the JPMML-SkLearn library.

News and Updates

The current version is 0.132.0 (30 June, 2026):

pip install sklearn2pmml==0.132.0

See the NEWS.md file.

Prerequisites

Java 11 or newer. The Java executable must be available on the system path.
Python 3.8 or newer.
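
The two prerequisites above can be checked from Python before installing. The following is a minimal sketch that uses only the standard library; the function name check_prerequisites is hypothetical and not part of SkLearn2PMML. It only confirms that a java executable is discoverable on the system path and does not inspect the Java version.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 8)):
    """Return a list of human-readable problems; an empty list means OK.

    Hypothetical helper, not part of the sklearn2pmml package.
    """
    problems = []
    # sys.version_info compares element-wise against a (major, minor) tuple
    if sys.version_info < min_python:
        problems.append("Python %d.%d or newer is required" % min_python)
    # shutil.which() searches the directories listed in the PATH variable
    if shutil.which("java") is None:
        problems.append("No 'java' executable found on the system path")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print(problem)
```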

Installation

Installing a release version from PyPI:

pip install sklearn2pmml

Alternatively, installing the latest snapshot version from GitHub:

pip install --upgrade git+https://github.com/jpmml/sklearn2pmml.git

Usage

Native Scikit-Learn

SkLearn2PMML can convert a wide variety of Scikit-Learn and Scikit-Learn adjacent estimators as-is.

The list of supported transformer, selector and predictor (aka model) classes is given in the features.md file of the JPMML-SkLearn project.

Keep SkLearn2PMML as up-to-date as possible. A single package version, preferably the latest, is able to work with Scikit-Learn 0.17 (ca. 2015) and all newer versions.

Library

Use the sklearn2pmml.sklearn2pmml(estimator, pmml_path) utility function to convert a fitted estimator object to PMML:

from sklearn2pmml import sklearn2pmml

estimator = ...
estimator.fit(X, y)

# Convert a live estimator object
sklearn2pmml(estimator, "Estimator.pmml")

The estimator argument may also be a path-like object that points to an estimator pickle file in the local filesystem:

from sklearn2pmml import sklearn2pmml

import joblib

joblib.dump(estimator, "Estimator.pkl")

sklearn2pmml("Estimator.pkl", "Estimator.pmml")

SkLearn2PMML uses a custom Java component (rather than the built-in Python unpickler component) for reading pickle files. As such, it is safe to use with unvetted pickle files.

Command-line application

The sklearn2pmml module is executable.

The main application simply calls the sklearn2pmml.sklearn2pmml() utility function. At minimum, it is necessary to provide the path of the input pickle file (-i or --input; supports joblib, pickle or dill variants) and the path of the output PMML file (-o or --output):

python -m sklearn2pmml --input Estimator.pkl --output Estimator.pmml

To see all supported command-line options, pass --help:

python -m sklearn2pmml --help

On some platforms, the Pip package installer additionally makes the main application available as a top-level command:

sklearn2pmml --input pipeline.pkl --output pipeline.pmml
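
When the conversion has to be launched from another Python process (for example, from a build script), the documented module invocation can be wrapped with the standard subprocess module. This is a sketch under stated assumptions: the helper names build_command and convert are hypothetical, and only the --input and --output options shown above are used.

```python
import subprocess
import sys


def build_command(input_path, output_path):
    """Build the argument list for 'python -m sklearn2pmml' (hypothetical helper)."""
    return [
        sys.executable, "-m", "sklearn2pmml",
        "--input", str(input_path),
        "--output", str(output_path),
    ]


def convert(input_path, output_path):
    """Run the command-line application in a child process."""
    # check=True makes subprocess.run() raise CalledProcessError
    # if the child process exits with a non-zero status
    subprocess.run(build_command(input_path, output_path), check=True)
```

Using sys.executable rather than a bare python keeps the child process in the same interpreter (and virtual environment) in which SkLearn2PMML was installed.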

PMML-enhanced Scikit-Learn

Native Scikit-Learn estimators have rather limited portability between environments, because they lack adequate metadata. For example, prior to Scikit-Learn 1.0 (ca. 2021), they did not collect and store even the most crucial metadata about the feature matrix (i.e. the feature_names_in_ attribute).

SkLearn2PMML provides the sklearn2pmml.pipeline.PMMLPipeline meta-estimator class, which extends the sklearn.pipeline.Pipeline class with the following functionality:

- Collect feature and label metadata using the fit(X, y) method:
  - If the X dataset has column names, they become input field names. Otherwise, input field names default to x1, x2, ..., x{n_features_in_}.
  - If the y dataset has column names, they become target field name(s). Otherwise, target field names default to y (single-output case) or y1, y2, ..., y{n_outputs_} (multi-output case).
- Perform prediction post-processing using the predict_transform(X), predict_proba_transform(X) and apply_transform(X) methods (operating on the predict_transformer, predict_proba_transformer and apply_transformer attributes, respectively).
- Embed model verification data using the verify(X) method.
- Configure the representation of the final estimator step using the configure(**pmml_options) method.
- Perform extra edits (i.e. insert, update or delete PMML XML fragments) on the PMML document using the customize(command, xpath_expr, pmml_element) method.
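
The default naming rule from the first item can be illustrated in plain Python. This sketch only mirrors the rule as stated above; it is not the PMMLPipeline implementation, and the function names are hypothetical.

```python
def default_input_names(n_features):
    """Fallback input field names: x1, x2, ..., x{n_features}."""
    return ["x{}".format(i) for i in range(1, n_features + 1)]


def default_target_names(n_outputs):
    """Fallback target field names: y (single output) or y1, ..., y{n_outputs}."""
    if n_outputs == 1:
        return ["y"]
    return ["y{}".format(i) for i in range(1, n_outputs + 1)]


if __name__ == "__main__":
    print(default_input_names(3))
    print(default_target_names(1))
    print(default_target_names(2))
```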

PMML-enhanced workflow:

#from sklearn.pipeline import Pipeline
from sklearn2pmml import sklearn2pmml
from sklearn2pmml.pipeline import PMMLPipeline

#pipeline = Pipeline(...)
# Activate prediction post-processing
pipeline = PMMLPipeline(..., predict_transformer = ...)
pipeline.fit(X, y)

# Embed small but representative sample for self-check purposes during deployment
pipeline.verify(X.sample(n = 10))

# Default prediction
yt = pipeline.predict(X)
# Default prediction, together with its transformation results
yt_transformed = pipeline.predict_transform(X)

# Default PMML representation
sklearn2pmml(pipeline, "Pipeline.pmml")

pipeline.configure(...)
#pipeline.customize(...)

# Customized PMML representation
sklearn2pmml(pipeline, "Pipeline-customized.pmml")

Additionally, SkLearn2PMML provides a number of PMML-oriented transformer, selector and predictor classes:

- sklearn2pmml.decoration: Capture or declare the domain of individual features by their operational type using ContinuousDomain, CategoricalDomain or OrdinalDomain meta-transformers. Give transformed features meaningful names using Alias and MultiAlias meta-transformers.
- sklearn2pmml.preprocessing: Transform features using ExpressionTransformer (any to any), CutTransformer (continuous to discrete), LookupTransformer (discrete to discrete), and many other transformers.
- sklearn2pmml.cross_reference: Cross-reference features and transformed features at subsequent transformer steps using Memorizer and Recaller meta-transformers.
- sklearn2pmml.ensemble: Estimate conditionally using the SelectFirstTransformer meta-transformer, plus SelectFirstClassifier and SelectFirstRegressor meta-predictors. Combine predictors using GBDTLRClassifier and GBDTLMRegressor meta-predictors.
- sklearn2pmml.postprocessing: Transform predictions using the BusinessDecisionTransformer transformer.

For example, mapping and pre-processing the Audit dataset:

from sklearn.compose import ColumnTransformer
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder
from sklearn2pmml.decoration import Alias, CategoricalDomain, ContinuousDomain
from sklearn2pmml.preprocessing import ExpressionTransformer

import pandas

df = pandas.read_csv("Audit.csv")

# Group features by type (operational type plus data type)
cat_cols = ["Education", "Employment", "Marital", "Occupation", "Gender"]
cont_int_cols = ["Age", "Hours"]
cont_float_cols = ["Income"]

transformer = ColumnTransformer([
	# Features
	("cat", make_pipeline(CategoricalDomain(), OneHotEncoder()), cat_cols),
	("cont_int", ContinuousDomain(), cont_int_cols),
	("cont_float", ContinuousDomain(), cont_float_cols),
	# Transformed features
	("hourly_income", Alias(ExpressionTransformer("X['Income'] / (X['Hours'] * 52)"), name = "Hourly_Income"), ["Income", "Hours"])
], remainder = "drop")
transformer.fit(df)

Xt = transformer.transform(df)
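
To make the Hourly_Income expression above concrete, the same arithmetic can be worked through in plain Python. This is only an illustration of the formula Income / (Hours * 52); it assumes that Income is an annual amount and Hours is a weekly figure (the source does not say so, but the factor of 52 suggests it), and the sample rows are made up.

```python
def hourly_income(income, hours):
    """Same arithmetic as the expression X['Income'] / (X['Hours'] * 52)."""
    # Assumption: 'income' is per year and 'hours' is per week
    return income / (hours * 52)


# Made-up sample rows, not taken from the Audit dataset
rows = [
    {"Income": 52000.0, "Hours": 40},
    {"Income": 31200.0, "Hours": 30},
]

for row in rows:
    print(row["Income"], row["Hours"], hourly_income(row["Income"], row["Hours"]))
```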

Documentation

Integrations:

Extensions:

Miscellaneous:

Archived:

Converting Scikit-Learn to PMML

License

SkLearn2PMML is licensed under the terms and conditions of the GNU Affero General Public License, Version 3.0.

If you would like to use SkLearn2PMML in a proprietary software project, then it is possible to enter into a licensing agreement which makes SkLearn2PMML available under the terms and conditions of the BSD 3-Clause License instead.

Additional information

SkLearn2PMML is developed and maintained by Openscoring Ltd, Estonia.

Interested in using Java PMML API software in your company? Please contact info@openscoring.io

Release history

This version

0.132.0

Jun 30, 2026

0.131.0

Jun 16, 2026

0.130.1

Jun 2, 2026

0.130.0

Apr 4, 2026

0.129.2

Mar 13, 2026

0.129.1

Mar 11, 2026

0.129.0

Mar 9, 2026

0.128.1

Feb 27, 2026

0.128.0

Feb 18, 2026

0.127.2

Feb 5, 2026

0.127.1

Jan 27, 2026

0.127.0

Jan 23, 2026

0.126.1

Jan 20, 2026

0.126.0

Jan 15, 2026

0.125.2

Jan 13, 2026

0.125.1

Dec 31, 2025

0.125.0

Dec 26, 2025

0.124.0

Dec 6, 2025

0.123.1

Oct 24, 2025

0.123.0

Oct 5, 2025

0.122.2

Sep 19, 2025

0.122.1

Sep 15, 2025

0.122.0

Sep 2, 2025

0.121.1

Aug 4, 2025

0.121.0

Jun 30, 2025

0.120.0

Jun 26, 2025

0.119.1

Jun 20, 2025

0.119.0

Jun 15, 2025

0.118.0

Jun 9, 2025

0.117.0

May 21, 2025

0.116.4

Apr 13, 2025

0.116.3

Apr 7, 2025

0.116.2

Apr 2, 2025

0.116.1

Mar 30, 2025

0.116.0

Mar 18, 2025

0.115.0

Mar 13, 2025

0.114.0

Mar 9, 2025

0.113.0

Feb 1, 2025

0.112.1.post1

Dec 15, 2024

0.112.1

Dec 14, 2024

0.112.0

Dec 8, 2024

0.111.2

Dec 4, 2024

0.111.1

Oct 28, 2024

0.111.0

Oct 21, 2024

0.110.0

Aug 5, 2024

0.109.0

Jun 19, 2024

0.108.0

May 20, 2024

0.107.1

May 9, 2024

0.107.0

Apr 25, 2024

0.106.0

Apr 22, 2024

0.105.2

Apr 2, 2024

0.105.1

Mar 29, 2024

0.105.0

Mar 21, 2024

0.104.1

Mar 14, 2024

0.104.0

Mar 10, 2024

0.103.3

Mar 3, 2024

0.103.2

Feb 23, 2024

0.103.1

Feb 13, 2024

0.103.0

Feb 11, 2024

0.102.0

Jan 28, 2024

0.101.0

Jan 7, 2024

0.100.2

Dec 18, 2023

0.100.1

Dec 16, 2023

0.100.0

Dec 8, 2023

0.99.3

Dec 4, 2023

0.99.2

Nov 1, 2023

0.99.1

Oct 23, 2023

0.99.0

Sep 24, 2023

0.98.1

Sep 17, 2023

0.98.0

Aug 28, 2023

0.97.3

Aug 20, 2023

0.97.2

Aug 12, 2023

0.97.1

Aug 5, 2023

0.97.0

Aug 3, 2023

0.96.0

Jul 30, 2023

0.95.1

Jul 18, 2023

0.95.0

Jul 14, 2023

0.94.1

Jul 8, 2023

0.94.0

Jun 19, 2023

0.93.0

Jun 6, 2023

0.92.2

May 17, 2023

0.92.1

May 1, 2023

0.92.0

Apr 2, 2023

0.91.1

Mar 12, 2023

0.91.0

Feb 26, 2023

0.90.4

Jan 30, 2023

0.90.3

Jan 21, 2023

0.90.2

Jan 9, 2023

0.90.1

Jan 3, 2023

0.90.0

Dec 26, 2022

0.89.1

Dec 18, 2022

0.89.0

Dec 13, 2022

0.88.1

Dec 10, 2022

0.88.0

Nov 29, 2022

0.87.1

Nov 15, 2022

0.87.0

Oct 29, 2022

0.86.3

Sep 19, 2022

0.86.2

Sep 11, 2022

0.86.1

Sep 10, 2022

0.86.0

Sep 5, 2022

0.85.0

Jul 16, 2022

0.84.2

Jun 28, 2022

0.84.1

Jun 12, 2022

0.84.0

Jun 8, 2022

0.83.0

May 13, 2022

0.82.0

May 9, 2022

0.81.0

Apr 23, 2022

0.80.0

Apr 16, 2022

0.79.0

Apr 7, 2022

0.78.1

Apr 1, 2022

0.78.0

Mar 17, 2022

0.77.2

Jan 17, 2022

0.77.1

Dec 23, 2021

0.77.0

Dec 2, 2021

0.76.1

Nov 3, 2021

0.76.0

Oct 17, 2021

0.75.0

Oct 4, 2021

0.74.4

Sep 21, 2021

0.74.3

Sep 19, 2021

0.74.2

Sep 5, 2021

0.74.1

Aug 27, 2021

0.74.0

Aug 15, 2021

0.73.5

Jul 13, 2021

0.73.4

Jul 8, 2021

0.73.3

Jul 7, 2021

0.73.2

Jul 4, 2021

0.73.1

Jun 27, 2021

0.73.0

Jun 22, 2021

0.72.0

Jun 19, 2021

0.71.1

Apr 25, 2021

0.71.0

Apr 11, 2021

0.70.0

Apr 5, 2021

0.69.0

Mar 14, 2021

0.68.0

Mar 7, 2021

0.67.0

Mar 4, 2021

0.66.1

Jan 27, 2021

0.66.0

Jan 12, 2021

0.65.0

Dec 28, 2020

0.64.1

Dec 25, 2020

0.64.0

Nov 22, 2020

0.63.1

Oct 19, 2020

0.63.0

Oct 18, 2020

0.62.0

Oct 13, 2020

0.61.0

Jul 30, 2020

0.60.0

Jul 5, 2020

0.59.0

Jun 23, 2020

0.58.0

May 29, 2020

0.57.0

May 24, 2020

0.56.2

May 21, 2020

0.56.1

May 18, 2020

0.56.0

May 17, 2020

0.55.4

Mar 25, 2020

0.55.3

Mar 20, 2020

0.55.2

Mar 18, 2020

0.55.1

Mar 5, 2020

0.55.0

Mar 4, 2020

0.54.0

Feb 29, 2020

0.53.0

Jan 14, 2020

0.52.1

Jan 5, 2020

0.52.0

Dec 29, 2019

0.51.2

Dec 23, 2019

0.51.1.post1

Jun 2, 2023

0.51.1

Dec 22, 2019

0.51.0

Dec 8, 2019

0.50.1

Nov 28, 2019

0.50.0

Nov 25, 2019

0.49.3

Nov 19, 2019

0.49.2

Nov 18, 2019

0.49.1

Oct 22, 2019

0.49.0

Aug 31, 2019

0.48.0

Jul 22, 2019

0.47.2

Jul 7, 2019

0.47.1

Jun 27, 2019

0.47.0

Jun 20, 2019

0.46.0

Jun 9, 2019

0.45.0

May 21, 2019

0.44.0

Mar 26, 2019

0.43.0

Feb 24, 2019

0.42.0

Feb 9, 2019

0.41.0

Jan 20, 2019

0.40.0

Dec 19, 2018

0.39.0

Oct 6, 2018

0.38.2

Sep 17, 2018

0.38.1

Sep 16, 2018

0.38.0

Sep 4, 2018

0.37.0

Aug 31, 2018

0.36.1

Jun 24, 2018

0.36.0

May 23, 2018

0.35.2

May 20, 2018

0.35.1

May 5, 2018

0.35.0

Apr 8, 2018

0.34.0

Mar 29, 2018

0.33.0

Mar 20, 2018

0.32.0

Mar 12, 2018

0.31.1

Mar 7, 2018

0.31.0

Mar 4, 2018

0.30.0

Feb 19, 2018

0.29.0

Jan 7, 2018

0.28.2

Jan 1, 2018

0.28.1

Dec 27, 2017

0.28.0

Dec 25, 2017

0.27.0

Dec 11, 2017

0.26.0

Oct 18, 2017

0.25.0

Oct 15, 2017

Download files

Download the file for your platform.

Source Distribution

sklearn2pmml-0.132.0.tar.gz (7.7 MB)

Uploaded Jun 30, 2026 Source

Built Distribution

sklearn2pmml-0.132.0-py3-none-any.whl (7.7 MB)

Uploaded Jun 30, 2026 Python 3

File details

Details for the file sklearn2pmml-0.132.0.tar.gz.

File metadata

Download URL: sklearn2pmml-0.132.0.tar.gz
Upload date: Jun 30, 2026
Size: 7.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for sklearn2pmml-0.132.0.tar.gz
Algorithm	Hash digest
SHA256	`eb349f9bdeb23ec08edb5291d25775ccabdccef1990454c7a92f7e4f270da05a`
MD5	`42780334ca5ae10fef2e497be9da7a14`
BLAKE2b-256	`fa2e65305d321940ad2ee239ab0f9d9acac5b7fc25e22c060c5bfaedf4216645`
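
A downloaded file can be checked against the SHA256 digest listed above using the standard hashlib module. This is a minimal sketch; the helper names sha256_of_file and verify_sha256 are hypothetical, and the file is assumed to have been downloaded into the current directory under its published name.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes at end of file
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected_hex):
    """Compare the computed digest with the published one (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Digest copied from the table above; the local path is an assumption
    expected = "eb349f9bdeb23ec08edb5291d25775ccabdccef1990454c7a92f7e4f270da05a"
    try:
        print(verify_sha256("sklearn2pmml-0.132.0.tar.gz", expected))
    except FileNotFoundError:
        print("File not found; download it first")
```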

File details

Details for the file sklearn2pmml-0.132.0-py3-none-any.whl.

File metadata

Download URL: sklearn2pmml-0.132.0-py3-none-any.whl
Upload date: Jun 30, 2026
Size: 7.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for sklearn2pmml-0.132.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`db96beda256c98d19b4e2cc0b52ba034e3d96c713b5083a025231b6b52ac2ce5`
MD5	`77fb4d0cbff42cbd199560fdf5909c95`
BLAKE2b-256	`32dfc80523563635f5a39bdd149fefa430bdb63127b7bdc07131c2e1eae7e51d`
