Designed Experiments; Latent Variables (PCA, PLS, multivariate methods with missing data); Process Monitoring; Batch data analysis.

Process Improvement using Data

A Python package for multivariate data analysis, designed experiments, and process monitoring. Companion to the online textbook Process Improvement using Data. This package also powers the statistical engine behind factori.al.

Installation

pip install process-improve

Quick Start

PCA — Principal Component Analysis

import pandas as pd
from process_improve.multivariate.methods import PCA, MCUVScaler

# Load and scale your data
X = pd.read_csv("your_data.csv", index_col=0)
scaler = MCUVScaler().fit(X)
X_scaled = scaler.transform(X)

# Fit a PCA model
pca = PCA(n_components=3).fit(X_scaled)

# Inspect results
print(pca.scores_)  # Score matrix (N x A)
print(pca.loadings_)  # Loading matrix (K x A)
print(pca.r2_cumulative_)  # Cumulative R² per component

# Detect outliers
outliers = pca.detect_outliers(conf_level=0.95)

# Contribution analysis
contrib = pca.score_contributions(pca.scores_.iloc[0].values)

# Select number of components via cross-validation
result = PCA.select_n_components(X_scaled, max_components=10)
print(result.n_components)

# Built-in plots
pca.score_plot()
pca.spe_plot()
pca.t2_plot()
pca.loading_plot()
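To make the shapes in the comments above concrete, here is a minimal NumPy sketch of the same idea: scale the data, take an SVD, and read off scores, loadings, and cumulative R². It does not use the package's own code. The synthetic data and the use of the sample standard deviation (`ddof=1`) for scaling are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (illustrative only): N = 50 observations, K = 5 variables
X = rng.normal(size=(50, 5))
X[:, 1] = 2.0 * X[:, 0] + rng.normal(scale=0.1, size=50)  # make two columns correlated

# Mean-center each column and scale it to unit variance
X_scaled = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)

A = 3  # number of components to keep
_, s, Vt = np.linalg.svd(X_scaled, full_matrices=False)
loadings = Vt[:A].T           # K x A, orthonormal columns
scores = X_scaled @ loadings  # N x A

# Cumulative R²: share of the total sum of squares captured by the first a components
total_ss = np.sum(X_scaled**2)
r2_cumulative = np.cumsum(s[:A] ** 2) / total_ss

print(scores.shape, loadings.shape)
print(np.round(r2_cumulative, 3))
```

The cumulative R² values never decrease as components are added, which is why a separate criterion such as cross-validation is needed to choose the number of components.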

PLS — Projection to Latent Structures

from process_improve.multivariate.methods import PLS, MCUVScaler

# X and Y are pandas DataFrames with one row per observation; scale each block separately
scaler_x = MCUVScaler().fit(X)
scaler_y = MCUVScaler().fit(Y)

# Fit a PLS model
pls = PLS(n_components=3).fit(scaler_x.transform(X), scaler_y.transform(Y))

# Inspect results
print(pls.scores_)  # X scores (N x A)
print(pls.beta_coefficients_)  # Regression coefficients (K x M)
print(pls.r2_cumulative_)  # Cumulative R² for Y

# Predict new observations (X_new is scaled with the scaler fitted on X)
result = pls.predict(scaler_x.transform(X_new))
print(result.y_hat)  # Predicted Y values
print(result.spe)  # SPE for new data
print(result.hotellings_t2)  # Hotelling's T² for new data

# Detect outliers and analyze contributions
outliers = pls.detect_outliers(conf_level=0.95)
contrib = pls.score_contributions(pls.scores_.iloc[0].values)
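The SPE and Hotelling's T² values referred to above can be sketched from first principles. The example below uses a PCA model built with NumPy rather than the package's PLS class, purely to keep it short. The data are synthetic, and the definitions are the common textbook ones. Conventions differ (some texts report the square root of the SPE shown here), so treat this as an illustration and not as the package's exact calculation.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic, scaled data: N = 40 observations, K = 6 variables
X = rng.normal(size=(40, 6))
X_scaled = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)

A = 2  # number of components in the model
_, _, Vt = np.linalg.svd(X_scaled, full_matrices=False)
P = Vt[:A].T      # loadings, K x A
T = X_scaled @ P  # scores, N x A

# Hotelling's T²: squared scores, each divided by that component's score variance
score_var = T.var(axis=0, ddof=1)
hotellings_t2 = np.sum(T**2 / score_var, axis=1)

# SPE: squared length of the residual after projecting each row onto the model plane
residuals = X_scaled - T @ P.T
spe = np.sum(residuals**2, axis=1)

print(hotellings_t2[:3])
print(spe[:3])
```

T² measures distance within the model plane and SPE measures distance off it, so an observation can be unusual on one and ordinary on the other.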

DOE — Experimental Strategy Recommendation

Plan a complete multi-stage experimental program before running any experiments:

from process_improve.experiments.factor import Factor, Response
from process_improve.experiments.strategy import recommend_strategy

# Define factors for a fermentation optimization
factors = [
    Factor(name="Temperature", low=25, high=40, units="degC"),
    Factor(name="pH", low=5.0, high=7.5),
    Factor(name="Glucose", low=10, high=50, units="g/L"),
    Factor(name="Yeast extract", low=1, high=10, units="g/L"),
    Factor(name="Agitation", low=100, high=400, units="rpm"),
    Factor(name="Aeration", low=0.5, high=2.0, units="vvm"),
    Factor(name="Inoculum", low=2, high=10, units="%v/v"),
]
responses = [Response(name="Yield", goal="maximize", units="g/L")]

# Get a complete experimental plan
strategy = recommend_strategy(
    factors=factors,
    responses=responses,
    budget=40,
    domain="fermentation",
)

# Inspect the multi-stage strategy
for stage in strategy["stages"]:
    print(f"Stage {stage['stage_number']}: {stage['stage_name']}")
    print(f"  Design: {stage['design_type']}, Runs: {stage['estimated_runs']}")
    print(f"  Purpose: {stage['purpose']}")

# Review the budget allocation and the reasoning behind the plan
print(strategy["budget_allocation"])
print(strategy["reasoning"])

The engine applies about 50 deterministic rules (drawn from Montgomery, NIST, and Stat-Ease) to recommend screening, optimization, and confirmation stages. Recommendations include budget-aware run allocation and domain-specific advice for fermentation, cell culture, pharma, and 5 other application domains.
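Why a staged plan is needed can be seen with a little arithmetic. The sketch below builds a two-level full factorial in coded units (-1 for low, +1 for high) for the first three factors and maps it back to real units. It uses only the standard library. The `ranges` dictionary and the `to_real` helper are hypothetical names introduced for this illustration and are not part of the package.

```python
from itertools import product

# Hypothetical (low, high) ranges, copied from the first three factors above
ranges = {
    "Temperature": (25.0, 40.0),
    "pH": (5.0, 7.5),
    "Glucose": (10.0, 50.0),
}


def to_real(coded: float, low: float, high: float) -> float:
    """Map a coded level (-1 = low, +1 = high) to real-world units."""
    center = (high + low) / 2
    half_range = (high - low) / 2
    return center + coded * half_range


# Every combination of -1/+1 across the factors: a 2^k full factorial
coded_runs = list(product([-1, 1], repeat=len(ranges)))
real_runs = [
    {name: to_real(level, *ranges[name]) for name, level in zip(ranges, run)}
    for run in coded_runs
]

print(len(coded_runs))  # 8 runs for three factors
print(real_runs[0])     # all three factors at their low levels
print(2**7)             # 128 runs for all seven factors
```

A full factorial in all seven factors would need 128 runs, more than three times the budget of 40, which is why a screening stage comes before optimization.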

Features

PCA with SVD, NIPALS, and missing data (TSR) algorithms
PLS regression with sklearn-compatible API
TPLS (Total PLS) for multi-block data
Missing data handling via TSR and NIPALS algorithms
Outlier detection combining Hotelling's T² and SPE with robust ESD test
Score contributions for variable-level diagnostics
Cross-validation for component selection (PRESS with Wold's criterion)
Interactive plots (Plotly) for scores, loadings, SPE, and T²
Designed experiments — full factorial, fractional factorial, response surface
DOE strategy recommender — multi-stage experimental planning (screening, optimization, confirmation) with budget-aware allocation and 8 application domains
Process monitoring — Shewhart, CUSUM, EWMA control charts
Batch data analysis — alignment, feature extraction, multivariate batch monitoring
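As an illustration of one of the monitoring charts listed above, here is a minimal standard-library sketch of an EWMA control chart with the usual time-varying limits. The function name `ewma_chart`, its parameters, and the data are assumptions made for this example. The package's own interface may differ.

```python
import math


def ewma_chart(values, target, sigma, lam=0.2, width=3.0):
    """Return (ewma, lower, upper) lists for an EWMA control chart."""
    ewma, lower, upper = [], [], []
    z = target  # start the EWMA at the target value
    for t, x in enumerate(values, start=1):
        z = lam * x + (1 - lam) * z
        # Time-varying limit: it widens toward an asymptote as t grows
        half_width = width * sigma * math.sqrt(
            lam / (2 - lam) * (1 - (1 - lam) ** (2 * t))
        )
        ewma.append(z)
        lower.append(target - half_width)
        upper.append(target + half_width)
    return ewma, lower, upper


# Illustrative data: near the target of 10.0 at first, then shifted upward
data = [10.1, 9.8, 10.2, 9.9, 10.0, 11.5, 11.8, 11.6, 11.9, 12.1]
ewma, lcl, ucl = ewma_chart(data, target=10.0, sigma=0.5)

# Indices where the EWMA falls outside its control limits
alarms = [i for i, (z, lo, hi) in enumerate(zip(ewma, lcl, ucl)) if not lo <= z <= hi]
print(alarms)
```

A smaller `lam` gives the chart a longer memory, which makes it more sensitive to small sustained shifts and slower to react to a single large jump.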

API Design

Both PCA and PLS follow sklearn conventions:

Fitted attributes end with _ (e.g., scores_, loadings_, spe_)
fit() returns self
predict() returns a Bunch object with named fields
score() is compatible with sklearn.model_selection.cross_val_score
Works with pandas.DataFrame inputs (preserves index and column names)
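The first two conventions in the list above can be shown with a toy estimator. `ColumnScaler` below is a hypothetical class written for this illustration using only the standard library. It is not the package's MCUVScaler, although it follows the same mean-center, unit-variance idea.

```python
from statistics import mean, stdev


class ColumnScaler:
    """Hypothetical mean-center, unit-variance scaler for a list of rows."""

    def fit(self, rows):
        columns = list(zip(*rows))
        # A trailing underscore marks attributes learned during fit()
        self.center_ = [mean(col) for col in columns]
        self.scale_ = [stdev(col) for col in columns]
        return self  # returning self allows ColumnScaler().fit(rows).transform(rows)

    def transform(self, rows):
        return [
            [(value - c) / s for value, c, s in zip(row, self.center_, self.scale_)]
            for row in rows
        ]


rows = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
scaled = ColumnScaler().fit(rows).transform(rows)
print(scaled)
```

Because `fit()` returns the fitted object, fitting and transforming chain on one line, the same pattern as `MCUVScaler().fit(X)` in the Quick Start.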

Documentation

Full documentation is available at https://kgdunn.github.io/process-improve/.

To build the documentation locally:

cd docs
make html

License

MIT License. See LICENSE for details.

Release history

1.13.11

May 6, 2026

1.13.9

May 6, 2026

1.13.8

May 6, 2026

1.13.7

May 6, 2026

1.13.6

May 3, 2026

1.13.5

Apr 29, 2026

1.13.4

Apr 29, 2026

1.13.3

Apr 29, 2026

1.13.2

Apr 29, 2026

1.13.1

Apr 29, 2026

1.13.0

Apr 29, 2026

1.9.3

Apr 28, 2026

1.9.2

Apr 28, 2026

1.9.1

Apr 28, 2026

1.9.0

Apr 28, 2026

1.8.1

Apr 28, 2026

1.8.0

Apr 27, 2026

1.7.1

Apr 27, 2026

This version

1.7.0

Apr 27, 2026

1.6.2

Apr 27, 2026

1.6.1

Apr 25, 2026

1.6.0

Apr 23, 2026

1.5.1

Apr 21, 2026

1.5.0

Apr 21, 2026

1.4.1

Apr 19, 2026

1.4.0

Apr 19, 2026

1.3.3

Apr 17, 2026

1.3.2

Apr 16, 2026

1.3.1

Apr 15, 2026

1.2.8

Apr 14, 2026

1.2.7

Mar 25, 2026

1.2.6

Mar 25, 2026

1.2.5

Mar 25, 2026

1.2.0

Mar 23, 2026

1.1.0

Mar 16, 2026

1.0.0

Mar 14, 2026

0.9.99

Jan 24, 2026

0.9.98

Jan 23, 2026

0.9.97

Jan 22, 2026

0.9.96

Sep 18, 2025

0.9.95

Jul 15, 2025

0.9.94

Jul 10, 2025

0.9.93

Jul 9, 2025

0.9.92

Jul 8, 2025

0.9.92rc1 pre-release

Jul 8, 2025

0.9.91

Jul 8, 2025

0.9.90

Jul 7, 2025

0.9.89

Jul 4, 2025

0.9.88

Jul 1, 2025

0.9.87

Jun 29, 2025

0.9.86

Jun 29, 2025

0.9.85

May 16, 2025

0.9.84

Mar 11, 2025

0.9.83

Mar 11, 2025

0.9.82

Mar 11, 2025

0.9.81

Mar 3, 2025

0.9.80

Feb 27, 2025

0.9.78

Feb 7, 2025

0.9.76

Feb 7, 2025

0.9.75

Oct 17, 2024

0.9.74

Sep 10, 2024

0.9.73

Aug 4, 2024

0.9.72

Aug 4, 2024

0.9.71

Apr 22, 2024

0.9.70

Mar 26, 2024

0.9.66

Aug 24, 2023

0.9.65

May 18, 2023

0.9.64

Sep 16, 2022

0.9.63

Aug 31, 2022

0.9.62

Aug 31, 2022

0.9.60

May 16, 2022

0.9.59

Apr 12, 2022

0.9.58

Feb 22, 2022

0.9.57

Feb 22, 2022

0.9.56

Feb 3, 2022

0.9.55

Jan 31, 2022

0.9.54

Oct 10, 2021

0.9.53

Oct 9, 2021

0.9.52

Sep 28, 2021

0.9.51

Sep 27, 2021

0.9.50

Sep 27, 2021

0.9.49

Sep 27, 2021

0.9.48

Sep 27, 2021

0.9.47

Sep 25, 2021

0.9.46

Sep 14, 2021

0.9.45

Aug 31, 2021

0.9.44

Aug 31, 2021

0.9.43

Aug 30, 2021

0.9.42

Aug 17, 2021

0.9.41

Aug 17, 2021

0.9.40

Aug 16, 2021

0.9.39

Aug 16, 2021

0.9.38

Aug 16, 2021

0.9.37

Aug 13, 2021

0.9.36

Aug 13, 2021

0.9.35

Aug 13, 2021

0.9.34

Aug 13, 2021

0.9.33

Aug 12, 2021

0.9.32

Aug 12, 2021

0.9.31

Aug 12, 2021

0.9.30

Aug 12, 2021

0.9.29

Aug 12, 2021

0.9.28

Aug 10, 2021

0.9.27

Aug 10, 2021

0.9.26

Aug 9, 2021

0.9.25

Aug 2, 2021

0.9.24

Jul 21, 2021

0.9.23

Jul 21, 2021

0.9.22

Jul 21, 2021

0.9.21

Jun 28, 2021

0.9.20

Jun 28, 2021

0.9.19

Jun 28, 2021

0.9.18

Jun 27, 2021

0.9.17

Jun 27, 2021

0.9.16

Jun 14, 2021

0.9.14

Jun 5, 2021

0.9.13

Jun 5, 2021

0.9.12

Jun 4, 2021

0.9.11

Jun 4, 2021

0.9.0

Jun 4, 2021

0.8.8

Apr 27, 2021

0.8.7

Apr 26, 2021

0.8.6

Apr 26, 2021

0.8.4

Apr 20, 2021

0.8.3

Apr 19, 2021

0.8.1

Apr 8, 2021

0.8.0

Apr 6, 2021

0.7.9

Mar 31, 2021

0.7.7

Mar 24, 2021

0.7.6

Mar 24, 2021

0.7.5

Mar 19, 2021

0.7.3

Mar 15, 2021

0.7.2

Mar 15, 2021

0.7.1

Mar 4, 2021

0.7.0

Mar 4, 2021

0.6.9

Nov 28, 2019

0.6.8 yanked

Nov 28, 2019

0.6.5

Nov 21, 2019

0.6.4

Nov 21, 2019

0.6.3

Nov 20, 2019

0.6.2

Nov 20, 2019

0.6.1

Nov 20, 2019

0.5.8

Nov 6, 2019

0.5.7

Nov 6, 2019

0.5.6

Nov 6, 2019

0.5.5

Nov 6, 2019

0.5.4

Nov 6, 2019

0.5.3

Nov 6, 2019

0.5.2

Oct 29, 2019

0.5.1

Oct 24, 2019

0.5.0

Oct 23, 2019

0.4.9

Oct 23, 2019

0.4.8

Oct 23, 2019

0.4.7

Oct 23, 2019

0.4.6

Oct 17, 2019

0.4.5

Oct 17, 2019

0.4.4

Oct 17, 2019

0.4.3

Oct 10, 2019

0.4.2

Oct 10, 2019

0.4.1

Oct 10, 2019

0.4.0

Oct 9, 2019

0.3.5

Oct 8, 2019

0.3.3

Oct 3, 2019

0.3.1

Oct 3, 2019

0.3.0

Oct 3, 2019

0.2.8

Oct 3, 2019

0.2.6

Oct 3, 2019

0.2.5

Oct 3, 2019

0.2.3

Oct 3, 2019

0.2.2 yanked

Oct 3, 2019

Reason this release was yanked:

Out of date

Download files

Download the file for your platform.

Source Distribution

process_improve-1.7.0.tar.gz (3.5 MB)

Uploaded Apr 27, 2026 Source

Built Distribution

process_improve-1.7.0-py3-none-any.whl (3.5 MB)

Uploaded Apr 27, 2026 Python 3

File details

Details for the file process_improve-1.7.0.tar.gz.

File metadata

Download URL: process_improve-1.7.0.tar.gz
Upload date: Apr 27, 2026
Size: 3.5 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for process_improve-1.7.0.tar.gz
Algorithm	Hash digest
SHA256	`20cf52323fefc1ea6be29baa925d9ffdf36535a5ce4f2fb3a1e25fc7a4335915`
MD5	`c816ac3ded95de5c93b42a19e0a181ea`
BLAKE2b-256	`478ff0287595c2acb470ebbb30e70015894cb5e47d75c57473596bf4e146a8c5`
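A downloaded file can be checked against the SHA256 digest in the table with Python's standard `hashlib` module. This is a minimal sketch. The helper name `sha256_of_file` is an assumption, and the final check is left commented out because it requires the source distribution to be present in the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# SHA256 digest listed for process_improve-1.7.0.tar.gz
expected = "20cf52323fefc1ea6be29baa925d9ffdf36535a5ce4f2fb3a1e25fc7a4335915"

# Assumes the sdist was downloaded to the current directory:
# assert sha256_of_file("process_improve-1.7.0.tar.gz") == expected
```

Reading in chunks keeps memory use constant regardless of file size.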

Provenance

The following attestation bundles were made for process_improve-1.7.0.tar.gz:

Publisher: publish.yml on kgdunn/process-improve

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: process_improve-1.7.0.tar.gz
- Subject digest: 20cf52323fefc1ea6be29baa925d9ffdf36535a5ce4f2fb3a1e25fc7a4335915
- Sigstore transparency entry: 1392735515
- Sigstore integration time: Apr 27, 2026
Source repository:
- Permalink: kgdunn/process-improve@caf1edb558b119be35ce6f5a05b8be2f2287e46a
- Branch / Tag: refs/heads/main
- Owner: https://github.com/kgdunn
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@caf1edb558b119be35ce6f5a05b8be2f2287e46a
- Trigger Event: push

File details

Details for the file process_improve-1.7.0-py3-none-any.whl.

File metadata

Download URL: process_improve-1.7.0-py3-none-any.whl
Upload date: Apr 27, 2026
Size: 3.5 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for process_improve-1.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`92cc7dd3fc56c41fbc8e435a450fce8032a61c8aa37e12b75a89436124051ce2`
MD5	`5afd5e9e1cc6db9719fa2a4b32145cd6`
BLAKE2b-256	`4d8ae88ae86f986ebe923eebdd32de1f0e50c9972970925dfcc5c9001acdc3a0`

Provenance

The following attestation bundles were made for process_improve-1.7.0-py3-none-any.whl:

Publisher: publish.yml on kgdunn/process-improve

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: process_improve-1.7.0-py3-none-any.whl
- Subject digest: 92cc7dd3fc56c41fbc8e435a450fce8032a61c8aa37e12b75a89436124051ce2
- Sigstore transparency entry: 1392735519
- Sigstore integration time: Apr 27, 2026
Source repository:
- Permalink: kgdunn/process-improve@caf1edb558b119be35ce6f5a05b8be2f2287e46a
- Branch / Tag: refs/heads/main
- Owner: https://github.com/kgdunn
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@caf1edb558b119be35ce6f5a05b8be2f2287e46a
- Trigger Event: push

process-improve 1.7.0
