A fast and modular scikit-learn replacement for generalized linear models

These details have not been verified by PyPI

Project links

Project description

A fast ⚡ and modular ⚒️ scikit-learn replacement for sparse GLMs

build License

skglm is a Python package that offers fast estimators for sparse Generalized Linear Models (GLMs) that are 100% compatible with scikit-learn. It is highly flexible and supports a wide range of GLMs. You get to choose from skglm's already-made estimators or customize your own by combining the available datafits and penalties.

Excited to have a tour on skglm documentation?

Cite

skglm is the result of perseverant research. It is licensed under BSD 3-Clause. You are free to use it and if you do so, please cite

@inproceedings{skglm,
    title     = {Beyond L1: Faster and better sparse models with skglm},
    author    = {Q. Bertrand and Q. Klopfenstein and P.-A. Bannier and G. Gidel and M. Massias},
    booktitle = {NeurIPS},
    year      = {2022},
}

@article{moufad2023skglm,
  title={skglm: improving scikit-learn for regularized Generalized Linear Models},
  author={Moufad, Badr and Bannier, Pierre-Antoine and Bertrand, Quentin and Klopfenstein, Quentin and Massias, Mathurin},
  year={2023}
}

Why `skglm`?

skglm is specifically conceived to solve sparse GLMs. It supports many missing models in scikit-learn and ensures high performance. There are several reasons to opt for skglm among which:


Speed	Fast solvers able to tackle large datasets, either dense or sparse, with millions of features up to 100 times faster than `scikit-learn`
Modularity	User-friendly API that enables composing custom estimators with any combination of its existing datafits and penalties
Extensibility	Flexible design that makes it simple and easy to implement new datafits and penalties, a matter of few lines of code
Compatibility	Estimators fully compatible with the `scikit-learn` API and drop-in replacements of its GLM estimators

Get started with `skglm`

Installing `skglm`

skglm is available on PyPi. Run the following command to get the latest version of the package

pip install -U skglm

It is also available on conda-forge and can be installed using, for instance:

conda install -c conda-forge skglm

First steps with `skglm`

Once you installed skglm, you can run the following code snippet to fit a MCP Regression model on a toy dataset

# import model to fit
from skglm.estimators import MCPRegression
# import util to create a toy dataset
from skglm.utils.data import make_correlated_data

# generate a toy dataset
X, y, _ = make_correlated_data(n_samples=10, n_features=100)

# init and fit estimator
estimator = MCPRegression()
estimator.fit(X, y)

# print R²
print(estimator.score(X, y))

You can refer to the documentation to explore the list of skglm's already-made estimators.

Didn't find one that suits you? you can still compose your own. Here is a code snippet that fits a MCP-regularized problem with Huber loss.

# import datafit, penalty and GLM estimator
from skglm.datafits import Huber
from skglm.penalties import MCPenalty
from skglm.estimators import GeneralizedLinearEstimator

from skglm.utils.data import make_correlated_data
from skglm.solvers import AndersonCD

X, y, _ = make_correlated_data(n_samples=10, n_features=100)
# create and fit GLM estimator with Huber loss and MCP penalty
estimator = GeneralizedLinearEstimator(
    datafit=Huber(delta=1.),
    penalty=MCPenalty(alpha=1e-2, gamma=3),
    solver=AndersonCD()
)
estimator.fit(X, y)

You will find detailed description on the supported datafits and penalties and how to combine them in the API section of the documentation. You can also take our tutorial to learn how to create your own datafit and penalty.

Contribute to `skglm`

skglm is a continuous endeavour that relies on the community efforts to last and evolve. Your contribution is welcome and highly valuable. It can be

bug report: you may encounter a bug while using skglm. Don't hesitate to report it on the issue section.
feature request: you may want to extend/add new features to skglm. You can use the issue section to make suggestions.
pull request: you may have fixed a bug, added a features, or even fixed a small typo in the documentation, ... you can submit a pull request and we will reach out to you asap.

Useful links

link to documentation: https://contrib.scikit-learn.org/skglm/
link to skglm arXiv article: https://arxiv.org/pdf/2204.07826.pdf

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.5

Jul 29, 2025

0.4

Apr 8, 2025

0.3.1

Dec 21, 2023

0.3

Jun 30, 2023

0.2

Oct 18, 2022

0.1

Apr 25, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skglm-0.5.tar.gz (90.7 kB view details)

Uploaded Jul 29, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

skglm-0.5-py3-none-any.whl (114.3 kB view details)

Uploaded Jul 29, 2025 Python 3

File details

Details for the file skglm-0.5.tar.gz.

File metadata

Download URL: skglm-0.5.tar.gz
Upload date: Jul 29, 2025
Size: 90.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.14

File hashes

Hashes for skglm-0.5.tar.gz
Algorithm	Hash digest
SHA256	`11037aec119dd2869d6b574b6cca8bae69091eeeefe06cbbd0a5aa879ad79dc7`
MD5	`42c31cf8b15b9b3e462488b70b1bbd66`
BLAKE2b-256	`9605580b6e46a116e67e77980dc4a60386c51e37a5477b41201aa3c85205bfa5`

See more details on using hashes here.

File details

Details for the file skglm-0.5-py3-none-any.whl.

File metadata

Download URL: skglm-0.5-py3-none-any.whl
Upload date: Jul 29, 2025
Size: 114.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.14

File hashes

Hashes for skglm-0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a45a0065ccfea2f85a633b7fe59b2f71bbffed152e9e78ceb00da8c90ad07513`
MD5	`09db76f44ac4fac70bd0ffd0e517877f`
BLAKE2b-256	`1b1b36cf9dc5f0049624b627f785892a1c169ba440ac0dd6f6bd4b1ef685680d`

See more details on using hashes here.

skglm 0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

A fast ⚡ and modular ⚒️ scikit-learn replacement for sparse GLMs

Cite

Why `skglm`?

Get started with `skglm`

Installing `skglm`

First steps with `skglm`

Contribute to `skglm`

Useful links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

skglm 0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

A fast ⚡ and modular ⚒️ scikit-learn replacement for sparse GLMs

Cite

Why skglm?

Get started with skglm

Installing skglm

First steps with skglm

Contribute to skglm

Useful links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Why `skglm`?

Get started with `skglm`

Installing `skglm`

First steps with `skglm`

Contribute to `skglm`