Gradient-free machine learning for any numpy-compatible function

These details have not been verified by PyPI

Project links

Repository

Project description

LambdaML

Gradient-free machine learning. Give it any function; it learns the parameters.

LambdaML lets you use any numpy-compatible function as your model and automatically fits its parameters using numerical (finite-difference) differentiation — no hand-derived gradients required. The "lambda" really can be anything: logistic regression, a neural network with custom activations, a physics equation, a learnable signal transform, or something entirely your own.

Quick-start

pip install lambdaml
# With progress bars:
pip install lambdaml[progress]

import numpy as np
from lambdaml import LambdaClassifierModel, Optimizer, DiffMethod, LRSchedule

# 1. Write your model — anything numpy-compatible works
def my_model(x, p):
    return (np.tanh(p['w'].dot(x) + p['b']) + 1) / 2

# 2. Initial parameters (scalars or numpy arrays)
p = {'w': np.zeros(2), 'b': 0.0}

# 3. Create and fit
model = LambdaClassifierModel(
    f=my_model,
    p=p,
    diff_method=DiffMethod.COMPLEX_STEP,   # recommended
    l2_factor=0.001,
    optimizer=Optimizer.ADAM,
    lr_schedule=LRSchedule.cosine_annealing(T_max=100),
)
model.fit(X_train, Y_train, n_iter=100, lr=0.01,
          early_stopping=True, patience=10)

print(model.score(X_test, Y_test))       # accuracy
print(model.predict_proba(X_test))       # probabilities

For regression, swap in LambdaRegressorModel with loss='mse', 'mae', 'huber', or 'pseudo_huber'.

See the examples/ folder for runnable scripts and LambdaML_Showcase.ipynb for an interactive walkthrough with charts.

What's new in v1.0.3

Progress bars — fit() shows a live tqdm epoch bar with loss and lr in the postfix. predict() / predict_proba() accept progress_bar=True for a per-sample bar.

eval_every — the hidden cost of training was a full forward pass on the entire dataset every epoch just to log the loss. Set eval_every=10 to evaluate only every 10th epoch — no effect on gradient updates, but can cut training time significantly.

vectorized=True — if your f can accept the full X matrix at once (any pure-numpy function already can), set vectorized=True to eliminate the Python sample loop and get a 2–10× speedup.

# Standard (per-sample loop)
def f(x, p):
    return p['w'].dot(x) + p['b']

# Vectorized (full matrix at once — much faster)
def f(X, p):
    return X @ p['w'] + p['b']

model = LambdaRegressorModel(f=f, p={...}, vectorized=True)
model.fit(X, Y, n_iter=200, lr=0.01, eval_every=10)   # progress bar on by default

What is finite-difference differentiation?

The term you're looking for is finite-difference approximation (sometimes called numerical differentiation). Rather than deriving f′(θ) analytically, we estimate it by evaluating the function at nearby points:

f'(θ) ≈ [f(θ+h) - f(θ-h)] / (2h)     ← Central difference, O(h²)

LambdaML supports six methods with different accuracy/cost trade-offs:

Method	Order	f-evals/param	Notes
Forward	O(h)	1	Fast, low accuracy
Backward	O(h)	1	Fast, low accuracy
Central	O(h²)	2	Default — good balance
Five-Point	O(h⁴)	4	High accuracy, smooth f
Complex-Step	O(h²)	1 (complex)	Recommended — no cancellation error
Richardson	O(h⁴)	4	High accuracy, no complex inputs needed

Derivative methods comparison

Left: all six estimates on a known function. Right: absolute error vs step size h — complex-step never hits the cancellation-error floor.

Is it tractable? Yes, for models up to ~10k parameters. Each gradient step costs O(n_params) forward passes instead of O(1) for analytic backprop. For small-to-medium models on a CPU+numpy backend this is entirely practical.

Speed tips

The main cost per epoch is n_params × diff_evals × n_samples calls to f. To reduce it:

Technique	How	Typical gain
`eval_every=N`	Skip loss re-evaluation on most epochs	~1.5–2×
`vectorized=True`	Write `f(X, p)` to accept the full matrix	2–10×
`batch_size=N`	Mini-batch gradient steps	Scales with batch ratio
`DiffMethod.FORWARD`	1 f-eval/param instead of 2	~1.5× (noisier grads)

The lambda can be any function

Six completely different model functions, one .fit() call:

Decision boundaries

From top-left: logistic regression, tanh, sine activation (non-standard), Gaussian RBF, softplus, and a physics-inspired decay+oscillation model σ(a·exp(−λ|x₀|)·cos(ω·x₁+φ)) — the kind of thing nobody derives analytically.

Neural network with numerically computed gradients

A 2-layer ELU network on non-linearly separable data, fitted entirely via finite-difference backprop. No autograd, no torch, no chain rule.

Neural network training

Clockwise from top-left: log-loss curve, final decision boundary, weight trajectories for hidden and output layers, bias evolution across epochs.

Regression — recovering true sine parameters

Starting from wrong values (a=2.5, ω=0.4, φ=1.8, c=−1) on outlier-corrupted data, the optimizer converges back to the true parameters using pseudo-Huber loss (complex-step safe).

Sine regression

Optimizer comparison

SGD vs Momentum vs RMSProp vs Adam on the same logistic task:

Optimizer comparison

Derivative method benchmark

All 6 methods on the same problem — speed, accuracy, and Pareto trade-off:

Diff method benchmark

Regularization — L1 vs L2

With the corrected L1 formula (Σ|θ| not Σθ — a bug in the original), L1 now induces true sparsity on a 10-feature problem where only features 0 and 1 matter:

Regularization

Learning rate schedules

Five schedules visualised and compared for convergence speed:

LR schedules

Gradient accuracy verification

Per-component absolute error vs an analytically known gradient — complex-step and Richardson hit near-machine-precision:

Gradient accuracy

API reference

`LambdaClassifierModel(f, p, **kwargs)`

Parameter	Default	Description
`f`	—	Model: `f(x, p) → float ∈ (0,1)` — or `f(X, p) → array` when `vectorized=True`
`p`	—	Parameter dict (scalars or numpy arrays)
`diff_method`	`DiffMethod.CENTRAL`	Finite-difference method
`diff_h`	`None`	Custom step size (None = optimal default per method)
`l1_factor`	`0.0`	L1 regularization strength
`l2_factor`	`0.01`	L2 regularization strength
`regularize_bias`	`False`	Whether to regularize `b*` params
`optimizer`	`Optimizer.ADAM`	`sgd`, `momentum`, `rmsprop`, `adam`
`lr_schedule`	`None` (constant)	Learning rate schedule callable
`vectorized`	`False`	If `True`, `f` receives the full `X` matrix — faster

.fit(X, Y, ...)

Parameter	Default	Description
`n_iter`	`100`	Max gradient steps
`lr`	`0.01`	Initial learning rate
`batch_size`	`None`	Mini-batch size; `None` = full batch
`early_stopping`	`False`	Stop if loss stalls for `patience` steps
`patience`	`10`	Early stopping patience
`tol`	`1e-6`	Minimum improvement threshold
`verbose`	`False`	Print loss every 10 iterations
`validation_data`	`None`	`(X_val, Y_val)` tuple
`progress_bar`	`True`	Show tqdm epoch bar (requires `tqdm`)
`eval_every`	`1`	Evaluate loss every N epochs — increase for speed

Other methods: .predict(X, progress_bar=False) · .predict_proba(X, progress_bar=False) · .score(X, Y) · .compute_loss(X, Y) · .get_params() · .loss_history

`LambdaRegressorModel(f, p, loss='mse', **kwargs)`

Parameter	Default	Description
`loss`	`'mse'`	`'mse'`, `'mae'`, `'huber'`, `'pseudo_huber'`
`huber_delta`	`1.0`	Threshold for Huber / pseudo-Huber

Methods: .fit(...) · .predict(X, progress_bar=False) · .score(X, Y) (R²)

`DiffMethod` · `Optimizer` · `LRSchedule`

# Derivative methods
DiffMethod.FORWARD | BACKWARD | CENTRAL | FIVE_POINT | COMPLEX_STEP | RICHARDSON

# Optimizers
Optimizer.SGD | MOMENTUM | RMSPROP | ADAM

# LR schedules
LRSchedule.constant()
LRSchedule.step_decay(drop=0.5, epochs_drop=10)
LRSchedule.exponential_decay(k=0.01)
LRSchedule.cosine_annealing(T_max=100)
LRSchedule.warmup_cosine(warmup=10, T_max=100)

Bug fixes from the original library

Bug	Original	Fixed
Epsilon	`float16.eps ≈ 0.001` — catastrophically large	Float64-optimal per method (~6e-6 for central)
L1 regularization	Summed raw `θ` — negative weights reduced penalty	Summed `\|θ\|` using smooth complex-safe approximation
Closure-in-loop	Array gradient loop captured last index for all closures	Fixed with factory functions
L1/L2 complex-step safety	`float()` cast stripped imaginary part	Uses `vv` and `sqrt(vv+eps)` to preserve imaginary parts
No test split	Accuracy reported on training data	Train/test split in all examples

Is LambdaML useful for Kaggling?

As a primary model for large nets — rarely. As a prototyping and ensembling tool — genuinely yes.

The core insight: LambdaML decouples your model definition from gradient computation. Anywhere you want a custom functional form but don't want to derive its gradients by hand, LambdaML fills that gap.

Concrete use cases: fitting domain equations with unknown parameters (physics-based pricing, pharmacokinetics, decay curves); directly optimising non-differentiable competition metrics (NDCG, F-beta, Cohen's kappa) as the loss function; building exotic meta-learners in stacking ensembles; small-data + custom hypothesis problems where sklearn doesn't have your model form.

Project structure

LambdaML/
├── lambdaml/                # Installable package (pip install lambdaml)
│   ├── __init__.py
│   ├── lambda_model.py      # LambdaClassifierModel, LambdaRegressorModel, Optimizer
│   └── lambda_utils.py      # NumericalDiff, GradientComputer, Regularization, LossFunctions, LRSchedule
├── pyproject.toml           # Package metadata
├── LambdaML_Showcase.ipynb  # Interactive notebook with all charts
├── examples/
│   ├── example_tanh_regression.py
│   ├── example_neural_network.py
│   ├── example_diff_methods.py
│   └── example_regressor.py
├── assets/                  # Notebook-generated figures for README
├── data/
│   └── circles.csv
└── legacy/                  # Original library files (pre-rewrite)

License

See LICENSE.

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

1.2.0

Apr 19, 2026

1.1.0

Apr 4, 2026

This version

1.0.5

Apr 4, 2026

1.0.3

Apr 4, 2026

1.0.2

Apr 4, 2026

1.0.1

Apr 4, 2026

1.0.0

Apr 4, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lambdaml-1.0.5.tar.gz (21.6 kB view details)

Uploaded Apr 4, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lambdaml-1.0.5-py3-none-any.whl (18.8 kB view details)

Uploaded Apr 4, 2026 Python 3

File details

Details for the file lambdaml-1.0.5.tar.gz.

File metadata

Download URL: lambdaml-1.0.5.tar.gz
Upload date: Apr 4, 2026
Size: 21.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for lambdaml-1.0.5.tar.gz
Algorithm	Hash digest
SHA256	`5a766d7dc1254f82d4a34f4c2cef3129a3a78ace123794e304cf55e1f25c24cd`
MD5	`ee6c91f0cd657493f39d200cfd3f3eca`
BLAKE2b-256	`dbe577a6e47eeb6ed60b48854123f9cce59c2882829b9305fcd894677a6b297f`

See more details on using hashes here.

File details

Details for the file lambdaml-1.0.5-py3-none-any.whl.

File metadata

Download URL: lambdaml-1.0.5-py3-none-any.whl
Upload date: Apr 4, 2026
Size: 18.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for lambdaml-1.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3e6971b5829635ca3ea74453b9d44746254a7cf26e60a12229fa72ad0f88a410`
MD5	`f1e3c59cbb0365c7bca3b69ff718afb6`
BLAKE2b-256	`2274e40758cbca5431336b031bc8427bfcfca29d555238b1f4e84acc1c756657`

See more details on using hashes here.

lambdaml 1.0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

LambdaML

Quick-start

What's new in v1.0.3

What is finite-difference differentiation?

Speed tips

The lambda can be any function

Neural network with numerically computed gradients

Regression — recovering true sine parameters

Optimizer comparison

Derivative method benchmark

Regularization — L1 vs L2

Learning rate schedules

Gradient accuracy verification

API reference

LambdaClassifierModel(f, p, **kwargs)

LambdaRegressorModel(f, p, loss='mse', **kwargs)

DiffMethod · Optimizer · LRSchedule

Bug fixes from the original library

Is LambdaML useful for Kaggling?

Project structure

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`LambdaClassifierModel(f, p, **kwargs)`

`LambdaRegressorModel(f, p, loss='mse', **kwargs)`

`DiffMethod` · `Optimizer` · `LRSchedule`