Intrinsic Green Learning: task-conditioned intrinsic-dimensionality discovery via a learned encoder and a multi-scale Green's-function kernel.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

aquemy

These details have not been verified by PyPI

Project description

Intrinsic Green Learning

High-dimensional inputs — pixel grids, EEG channels, embedding vectors — almost never use all the dimensions they appear to. The handful that actually matter depends on the question you ask: a binary classifier may need only one or two latent axes, a regressor to a continuous target may need a few more, and a full reconstruction needs whatever dimension the data manifold genuinely has.

Intrinsic Green Learning (IGL) discovers that task-conditioned effective dimension while it fits the model. A learned encoder maps the ambient input to a low-dimensional latent space; a multi-scale Green's-function kernel computes a structured design matrix on that latent space; and Variable Projection with random Matryoshka truncation trains the encoder and reads off the smallest dimension that still solves the task. There's no separate "dimensionality reduction" step and no fixed bottleneck — the dimension you should use falls out of training.

The key difference from PCA, UMAP, t-SNE, or any other purely-geometric manifold-learning method: the effective dimension IGL reports is a property of (input, task), not of the input alone. The same dataset will resolve into different d_eff values for a classifier, a regressor, and an autoencoder — and the hierarchy $d_{\text{cls}} \le d_{\text{reg}} \le d_{\text{recon}}$ holds out of the box.

What ships:

scikit-learn-compatible estimators (IGLClassifier, IGLRegressor, IGLAutoencoder) for drop-in use in existing pipelines.
Bare PyTorch building blocks (IGLModule, GreenKernel, MatryoshkaTrainer, …) for custom training loops, novel kernels, and research extensions.
Spectral formulation (SpectralKernel + closed-form Fourier / Chebyshev / Legendre / Hermite / Laguerre bases, plus learned Laplace–Beltrami and user-supplied graph bases) with kernel-agnostic null-space augmentation for operators with non-trivial $\ker(L)$.
Riemannian / SPD extension (igl.spd) for covariance-valued data — EEG, fMRI-derived connectivity, financial covariances — with an AIRM-based loss plugged in through the same ExtraLoss seam used by every other training-time regulariser.

Note on the import name. The distribution is intrinsic-green-learning; the import name is igl. This collides with libigl; if you need both in the same env, install one of them under a different module name.

Why IGL?

For the same input data, a classifier usually needs fewer latent dimensions than a regressor, which in turn needs fewer than a full autoencoder. IGL discovers this hierarchy automatically:

$$ d_{\text{eff}}(\text{classification}) ;\le; d_{\text{eff}}(\text{regression}) ;\le; d_{\text{eff}}(\text{reconstruction}) $$

The library ships an examples/synthetic/moons_xor.py script that fits all three estimators on the same data and reports the discovered dimensions — the hierarchy holds out of the box.

Installation

pip install intrinsic-green-learning

Optional extras:

Extra	Adds	Use case
`[viz]`	matplotlib	Plot dimension curves via `igl.viz.plot_dimension_curve`.
`[eeg]`	mne + moabb + pyriemann	Future EEG / clinical loaders (placeholder for v0.2).
`[nlp]`	transformers + datasets	Future NLP loaders.
`[elbow]`	kneed	Alternative elbow detector.
`[all]`	all of the above	One-shot install for development.

Quickstart

The library exposes three sklearn-compatible estimators plus a SPD extension. All accept numpy arrays at the API boundary.

Classification

import numpy as np
import igl
from igl.data import embed_in_high_dim, make_moons

x_2d, y = make_moons(400, noise=0.1, seed=0)
x = embed_in_high_dim(x_2d, target_dim=16, seed=0).numpy()

clf = igl.IGLClassifier(max_dim=8, random_state=0).fit(x, y.numpy())
print(f"accuracy = {clf.score(x, y.numpy()):.3f}")
print(f"discovered d_eff = {clf.effective_dimension_}")  # ~ 1 on moons

Regression and reconstruction

from igl.data import make_swiss_roll

x, params = make_swiss_roll(800, seed=0)
x_np = x.numpy(); params_np = params.numpy()

reg = igl.IGLRegressor(max_dim=8, random_state=0).fit(x_np, params_np)
ae = igl.IGLAutoencoder(max_dim=8, random_state=0).fit(x_np)

print(reg.effective_dimension_)   # ~ 2 on swiss roll (intrinsic dim)
print(ae.effective_dimension_)    # ~ 2 on swiss roll

Cross-task hierarchy check

report = igl.compare_d_eff(
    cls=clf.dimension_curve_,
    reg=reg.dimension_curve_,
    recon=ae.dimension_curve_,
)
print(report.d_effs)            # {'cls': 1, 'reg': 2, 'recon': 2}
print(report.hierarchy_holds)   # True

SPD / Riemannian extension

For covariance-valued data (EEG, clinical signals, …), igl.spd ships an AIRM-based reconstruction classifier:

from igl.data import make_spd_dataset
from igl.spd import IGLReconSPDClassifier, LogEigVectorizer

spd, y = make_spd_dataset(400, d=8, n_classes=3, seed=0)
x = LogEigVectorizer().fit(spd.numpy()).transform(spd.numpy())

clf = IGLReconSPDClassifier(
    latent_dim=8, max_dim=12,
    orthogonality_weight=0.1,   # plug-in via the ExtraLoss seam
    random_state=0,
).fit(x, y.numpy())
print(clf.effective_dimension_)

For EEG (raw signals → covariances → AIRM), the make_igl_airm factory (in the [eeg] extra) composes Ledoit-Wolf vs sample-cov auto-selection with Tikhonov-preconditioned IGL-AIRM into a single sklearn pipeline:

import igl  # requires: pip install intrinsic-green-learning[eeg]

pipe = igl.make_igl_airm(latent_dim=22)
pipe.fit(X_raw, y)   # X_raw: [N, channels, time]

Tikhonov ε = 10⁻⁶ is applied to every input SPD by default — bit-near identical to no preconditioning at d ≤ 64 (with a BatchNorm encoder) and rescues torch.linalg.eigh from LAPACK error 8481 at d ≥ 128.

Custom training loop

If sklearn's surface is too high-level, use the bare PyTorch entry points directly:

import torch
import igl

module = igl.IGLModule(
    input_dim=16, max_dim=8, output_dim=2,
    config=igl.IGLConfig(
        encoder=igl.EncoderConfig(hidden=(128, 64)),  # pyramidal MLP
        kernel=igl.KernelConfig(n_anchors=64, operator=igl.OperatorName.GAUSSIAN),
    ),
)

trainer = igl.MatryoshkaTrainer(
    loss=igl.CrossEntropyLoss(n_classes=2),
    config=igl.MatryoshkaConfig(epochs=500),
)
history = trainer.fit(module, x_train_t, y_train_t, x_val=x_val_t, y_val=y_val_t)
curve = igl.eval_dimension_curve(module, x_val_t, y_val_t, loss=igl.CrossEntropyLoss(n_classes=2))
print("d_eff =", igl.detect_elbow(curve))

Documentation

Local build:

uv sync --group doc
uv run mkdocs serve

Published at https://hotherio.github.io/intrinsic-green-learning/latest/ after the first release.

Examples

Three runnable scripts under examples/synthetic/:

Script	Manifold	Tasks	Expected `d_eff`
`torus_classification.py`	T² ⊂ R⁴ → R³²	XOR cls + sin/cos reg	≈ 2
`moons_xor.py`	Moons ⊂ R² → R¹⁶	cls + reg + recon	d_cls ≤ d_reg ≤ d_recon
`swiss_roll_recon.py`	Swiss roll ⊂ R³	autoencoder + reg	≈ 2

Run with python -m examples.synthetic.<name>; outputs land in results/<name>/<git_short_sha>/. Install [viz] for PNG plots.

Development

uv sync --all-groups
uv run lefthook install

Verify your environment:

uv run pytest                                # tests + 100% coverage
uv run basedpyright src                      # strict type check
uv run lefthook run pre-commit --all-files   # full pre-commit pass

Conventions

Style, typing, exceptions, commit-message format, and the rest are documented in CONTRIBUTING.md and docs/guidelines/.

Release process

Releases are fully automated by python-semantic-release on every push to main via .github/workflows/semantic-release.yml. See docs/security.md for the supply-chain posture (OIDC, sigstore attestations, GPG-signed checksums, pip-audit).

Bibliography

If you use IGL in academic work, please cite the paper this library implements:

Quemy, A. (2026). Intrinsic Green's Learning: Supervised Learning on Manifolds via Inverse PDE. ICLR 2026 Workshop on AI and PDE. https://openreview.net/forum?id=Y6RpdS98l8

@inproceedings{quemy2026igl,
  title     = {{Intrinsic Green's Learning: Supervised Learning on Manifolds via Inverse PDE}},
  author    = {Quemy, Alexandre},
  booktitle = {ICLR 2026 Workshop on AI and PDE},
  year      = {2026},
  month     = {3},
  url       = {https://openreview.net/forum?id=Y6RpdS98l8}
}

For a citation to this exact software version, GitHub's "Cite this repository" widget reads CITATION.cff; its preferred-citation block points back to the paper above.

License

MIT. See LICENSE and REUSE.toml.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

aquemy

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.4.0

Jun 10, 2026

0.2.8

May 29, 2026

0.2.7

May 29, 2026

0.2.6

May 29, 2026

0.2.5

May 29, 2026

0.2.4

May 28, 2026

0.2.3

May 28, 2026

0.2.2

May 28, 2026

0.2.1

May 28, 2026

0.2.0

May 28, 2026

0.1.0

May 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

intrinsic_green_learning-0.4.0.tar.gz (118.2 kB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

intrinsic_green_learning-0.4.0-py3-none-any.whl (112.2 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file intrinsic_green_learning-0.4.0.tar.gz.

File metadata

Download URL: intrinsic_green_learning-0.4.0.tar.gz
Upload date: Jun 10, 2026
Size: 118.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for intrinsic_green_learning-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`3605511504f32ed318080411afb76defc54328f9f6d2f0ad93328eca2a613152`
MD5	`f5266e7a347a3958b926a9d5794de05a`
BLAKE2b-256	`178e2efcec7c0d865f718710204a2331aaae14f6deffae4f95a9e506e99fd8bd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for intrinsic_green_learning-0.4.0.tar.gz:

Publisher: semantic-release.yml on hotherio/intrinsic-green-learning

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: intrinsic_green_learning-0.4.0.tar.gz
- Subject digest: 3605511504f32ed318080411afb76defc54328f9f6d2f0ad93328eca2a613152
- Sigstore transparency entry: 1779510589
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: hotherio/intrinsic-green-learning@d6f492f842732fc2cde7a850dbfb01e91fee537b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/hotherio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: semantic-release.yml@d6f492f842732fc2cde7a850dbfb01e91fee537b
- Trigger Event: workflow_dispatch

File details

Details for the file intrinsic_green_learning-0.4.0-py3-none-any.whl.

File metadata

Download URL: intrinsic_green_learning-0.4.0-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 112.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for intrinsic_green_learning-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b756dad773b96711e3331d4a11792cb8b40a6a9f20fd03986504fc95ebb345bc`
MD5	`e6466bb38cca21680a34b25dfdd45306`
BLAKE2b-256	`6282441572d07be92cb06c09e4baa97231bf86a465e5fa750a363a7c76573370`

See more details on using hashes here.

Provenance

The following attestation bundles were made for intrinsic_green_learning-0.4.0-py3-none-any.whl:

Publisher: semantic-release.yml on hotherio/intrinsic-green-learning

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: intrinsic_green_learning-0.4.0-py3-none-any.whl
- Subject digest: b756dad773b96711e3331d4a11792cb8b40a6a9f20fd03986504fc95ebb345bc
- Sigstore transparency entry: 1779510707
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: hotherio/intrinsic-green-learning@d6f492f842732fc2cde7a850dbfb01e91fee537b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/hotherio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: semantic-release.yml@d6f492f842732fc2cde7a850dbfb01e91fee537b
- Trigger Event: workflow_dispatch

intrinsic-green-learning 0.4.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Intrinsic Green Learning

Why IGL?

Installation

Quickstart

Classification

Regression and reconstruction

Cross-task hierarchy check

SPD / Riemannian extension

Custom training loop

Documentation

Examples

Development

Conventions

Release process

Bibliography

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance