
glmpca-fast


Fast GLM-PCA (Generalised Linear Model PCA) for non-Gaussian count data — Rust core with optional PyTorch GPU backend.

Implements the algorithm of:

Townes, F. W., Hicks, S. C., Aryee, M. J., & Irizarry, R. A. (2019). "Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model." Genome Biology, 20:295. doi:10.1186/s13059-019-1861-6

Why

Standard PCA assumes Gaussian, homoscedastic noise, which is the wrong likelihood for count data (RNA-seq UMIs, genotype dosages, etc.). GLM-PCA instead fits the appropriate Poisson / Multinomial / Bernoulli / NB likelihood, capturing the mean–variance relationship inside the model.
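A quick numpy check of that mean–variance coupling: for Poisson counts the variance tracks the mean, so no single noise scale fits all features.

```python
# Poisson noise is heteroscedastic: Var(Y) = E(Y), so the constant-variance
# assumption behind standard PCA is violated for count data.
import numpy as np

rng = np.random.default_rng(0)
for mu in (0.5, 5.0, 50.0):
    y = rng.poisson(mu, size=100_000)
    print(f"mu={mu:5.1f}  sample mean={y.mean():6.2f}  sample var={y.var():6.2f}")
```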

The reference implementation (willtownes/glmpca-py) is CPU-only Python. glmpca-fast ports the algorithm to:

  • Rust (rayon-parallel coordinate-block Newton): ~13× faster than glmpca-py on a multi-core CPU.
  • PyTorch with batched torch.linalg.solve: ~290× faster per fit on a modern GPU (RTX A6000).

Install

With uv (recommended)

# Add to a project (preferred)
uv add glmpca-fast                    # CPU (Rust + numpy)
uv add "glmpca-fast[torch]"           # + GPU (PyTorch)

# Or install into the active environment
uv pip install glmpca-fast
uv pip install "glmpca-fast[torch]"

# Or run a one-shot command in an ephemeral environment
uv run --with glmpca-fast python -c "from glmpca_fast import fit_poisson; ..."

With pip

pip install glmpca-fast              # CPU
pip install "glmpca-fast[torch]"     # + GPU

Quick start

import numpy as np
from glmpca_fast import fit_poisson

# Synthetic Binomial(2, p) genotype dosage matrix
rng = np.random.default_rng(0)
N, M, L = 2504, 200, 8
p = rng.uniform(0.05, 0.5, M)
Y = rng.binomial(2, p, size=(N, M)).astype(np.float32)

# Rust backend (default, CPU)
res = fit_poisson(Y, L=L, max_iter=100)
print(res["factors"].shape)         # (2504, 8)
print(res["loadings"].shape)        # (200, 8)
print(res["deviance"][-1])          # final deviance
print(res["backend"])               # 'rust'

# Auto backend — picks GPU if CUDA is available
res = fit_poisson(Y, L=L, backend="auto")
print(res["backend"])               # 'torch' if CUDA, else 'rust'

# Explicit GPU device
res = fit_poisson(Y, L=L, backend="torch", device="cuda:0")
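The auto dispatch presumably reduces to a CUDA availability check; a hypothetical sketch (not the package's actual source):

```python
def pick_backend(requested="auto"):
    """Resolve 'auto' to 'torch' when CUDA is usable, else 'rust'.
    Hypothetical sketch of the dispatch logic, not the package's code."""
    if requested != "auto":
        return requested          # explicit choice wins
    try:
        import torch
        if torch.cuda.is_available():
            return "torch"
    except ImportError:
        pass                      # torch extra not installed
    return "rust"

print(pick_backend("rust"))       # 'rust'
print(pick_backend("auto"))       # 'torch' on a CUDA machine, else 'rust'
```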

API

fit_poisson(
    Y,                   # (n_samples, n_features) non-negative counts
    L,                   # latent dim, >= 2
    max_iter=100,
    tol=1e-4,            # relative deviance tolerance
    penalty=1.0,         # L2 ridge on factors and loadings
    seed=42,
    backend="rust",      # 'rust' | 'torch' | 'auto'
    device=None,         # torch device override (e.g. 'cuda:1')
) -> dict
# returns: factors, loadings, intercept, deviance, n_iter, backend

project_ols(X_held, train_mean, loadings) -> ndarray
# Approximate OLS projection of held-out samples (Pearson-residual approx).
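How a Pearson-residual OLS projection of this shape can work is sketched below in plain numpy (hypothetical helper `project_ols_sketch`, mirroring the documented signature; the package's internals may differ):

```python
import numpy as np

def project_ols_sketch(X_held, train_mean, loadings):
    """Approximate factors for held-out rows: least-squares fit of
    Pearson residuals onto the trained loadings (hypothetical sketch)."""
    mu = np.clip(train_mean, 1e-8, None)      # per-feature training means
    R = (X_held - mu) / np.sqrt(mu)           # Pearson residuals under Poisson
    # Solve R.T ~= loadings @ F.T for F by least squares
    F_T, *_ = np.linalg.lstsq(loadings, R.T, rcond=None)
    return F_T.T                              # (n_held, L)

rng = np.random.default_rng(1)
Y_held = rng.poisson(3.0, size=(10, 200)).astype(float)
V = rng.normal(size=(200, 8))
F = project_ols_sketch(Y_held, Y_held.mean(axis=0), V)
print(F.shape)  # (10, 8)
```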

Benchmark

One fit on 2,504 samples × 200 variants, L=8:

| Backend                          | Time / fit | Speedup vs glmpca-py |
|----------------------------------|------------|----------------------|
| glmpca-py (reference, NumPy)     | 7.58 s     | 1×                   |
| glmpca-fast (Rust, 11 cores)     | 0.56 s     | 13.5×                |
| glmpca-fast (PyTorch, RTX A6000) | 0.026 s    | 290×                 |

Both backends converge to within ~1 % of the reference final deviance (non-convex objective, different random init).
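For comparing final deviances across implementations, the standard Poisson deviance can be computed directly (a sketch; implementations may differ in constants or scaling):

```python
import numpy as np

def poisson_deviance(Y, mu):
    """D = 2 * sum[ y*log(y/mu) - (y - mu) ], with y*log(y/mu) = 0 when y = 0."""
    Y = np.asarray(Y, dtype=float)
    mu = np.clip(mu, 1e-12, None)
    term = np.where(Y > 0, Y * np.log(np.where(Y > 0, Y, 1.0) / mu), 0.0)
    return 2.0 * np.sum(term - (Y - mu))

Y = np.array([[0.0, 2.0], [3.0, 1.0]])
print(poisson_deviance(Y, np.clip(Y, 1e-12, None)))  # ~0 at the saturated model
```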

Limitations / scope

  • Currently Poisson family only. Multinomial / NB / Bernoulli branches are planned for v0.2.
  • Newton step uses full update without line-search damping. For degenerate Hessians the implementation falls back to a small gradient step.
  • Held-out projection uses the OLS approximation (not full per-sample IRLS).
  • Built and tested on Linux x86-64 + CUDA 12. Other platforms via source build.
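The degenerate-Hessian fallback mentioned above can be illustrated on a scalar Poisson problem (illustrative only, not the package's code):

```python
import numpy as np

def newton_step_with_fallback(y, eta, lr=0.01, eps=1e-8):
    """One Newton update for a Poisson log-link parameter eta.
    For the negative log-likelihood, gradient g = mu - y and Hessian h = mu;
    fall back to a small gradient step when the curvature is near-degenerate."""
    mu = np.exp(eta)
    g = mu - y
    h = mu
    if h < eps:                  # degenerate Hessian
        return eta - lr * g      # small gradient step instead
    return eta - g / h           # full undamped Newton step

eta = 0.0
for _ in range(20):
    eta = newton_step_with_fallback(y=5.0, eta=eta)
print(np.exp(eta))  # converges toward y = 5
```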

Citation

If you use this package, please cite both the original paper and this software:

@article{Townes2019GLMPCA,
  title   = {Feature selection and dimension reduction for single-cell
             RNA-Seq based on a multinomial model},
  author  = {Townes, F. William and Hicks, Stephanie C. and Aryee,
             Martin J. and Irizarry, Rafael A.},
  journal = {Genome Biology},
  volume  = {20},
  number  = {1},
  pages   = {295},
  year    = {2019},
  doi     = {10.1186/s13059-019-1861-6}
}

@software{glmpca_fast,
  title  = {glmpca-fast: Fast GLM-PCA with Rust and GPU backends},
  author = {zongseung},
  year   = {2026},
  url    = {https://github.com/zongseung/glmpca-fast}
}

License

GPL-3.0-or-later — see LICENSE.
