Constructing minimum variance portfolios

Project description

fast-minimum-variance: Solving Minimum Variance Portfolios Fast

Overview

fast-minimum-variance solves the long-only minimum variance portfolio without ever forming the sample covariance matrix. The key observation is that the KKT stationarity condition $2\Sigma w = \lambda\mathbf{1}$ immediately gives $w \propto \Sigma^{-1}\mathbf{1}$: the entire problem reduces to one symmetric positive definite linear system $\Sigma v = \mathbf{1}$, solved matrix-free by conjugate gradients. The budget constraint is recovered by a single rescaling $w = v / (\mathbf{1}^\top v)$.

Working directly with the returns matrix $X \in \mathbb{R}^{T \times N}$ — rather than the assembled covariance $X^\top X$ — has two consequences. First, each conjugate gradient iteration costs $O(TN)$ rather than $O(N^2)$, and $X^\top X$ is never stored. Second, Ledoit-Wolf shrinkage enters as a simple row-augmentation of $X$: stacking $[\sqrt{1-\alpha},X;,\sqrt{\gamma},I]$ yields a matrix whose Gram matrix equals $\Sigma_{\text{LW}}$. The same CG code handles both the plain and shrunk problem without modification.

Quick Start

import numpy as np
from fast_minimum_variance import Problem

# 500 daily returns, 20 assets
X = np.random.default_rng(42).standard_normal((500, 20))

w, iters = Problem(X).solve_cg()   # matrix-free CG — recommended
w, iters = Problem(X).solve_kkt()  # direct dense solve — exact baseline

assert abs(w.sum() - 1.0) < 1e-8
assert (w >= 0).all()

Ledoit-Wolf Shrinkage

Ledoit-Wolf shrinkage plays a dual role: statistically it reduces estimation error; numerically it compresses the eigenvalue spectrum and directly cuts CG iteration counts. Use alpha = N / (N + T) as a simple analytical estimate of the optimal shrinkage intensity:

T, N = X.shape
w, iters = Problem(X, alpha=N / (N + T)).solve_cg()

On S&P 500 equity data (495 assets, 1192 days), shrinkage cuts CG iterations from 685 to 205 and makes the matrix-free solver the fastest option by a wide margin.

Solvers

All solvers are methods on Problem and return (w, iters) where $w \in \mathbb{R}^N$, $\sum_i w_i = 1$, $w_i \geq 0$.

Method	Approach	When to use
`solve_cg()`	Matrix-free conjugate gradients on the SPD reduced system	Default — fastest for large $N$, especially with shrinkage
`solve_kkt()`	Direct dense factorisation via `numpy.linalg.solve`	Small problems or when an exact solve is needed
`solve_nnls()`	Non-negative least squares via Lawson-Hanson	Single-shot; useful when no outer loop is desired
`solve_clarabel()`	Clarabel interior-point solver (direct API)	Comparison baseline without CVXPY overhead
`solve_cvxpy()`	CVXPY + Clarabel	Ground-truth reference; requires `[convex]` extra

`solve_cg` — matrix-free conjugate gradients

The inner step builds a LinearOperator that applies

$$v ;\mapsto; (1-\alpha),X_a^\top(X_a v) + \gamma v, \qquad \gamma = \frac{\alpha|X|_F^2}{N}$$

to a vector using two matrix-vector products with the active-asset submatrix $X_a$, without ever forming $\Sigma_a = X_a^\top X_a$. Standard CG then solves $\Sigma_a v = \mathbf{1}$. Ledoit-Wolf shrinkage ($\alpha > 0$) compresses the eigenvalue spectrum and reduces iteration counts dramatically — from nearly 2000 iterations at $\alpha \approx 0$ to single digits at $\alpha \approx 1$ in rank-deficient settings.

`solve_kkt` — direct dense solve

Assembles $\Sigma_a = (1-\alpha)X_a^\top X_a + \gamma I$ explicitly and calls numpy.linalg.solve. Exact to machine precision. Scales as $O(N^3)$ in the active portfolio size, so it becomes expensive for $N \gtrsim 500$ without shrinkage (which reduces the number of active assets). With shrinkage, the active-set outer loop converges in 2–4 steps and the inner systems are small, making the direct solve competitive.

`solve_nnls` — non-negative least squares

Reformulates the problem as a non-negative least squares problem on an augmented matrix:

$$\min_{w \geq 0};\left|\begin{pmatrix}\sqrt{1-\alpha},X \ \sqrt{\gamma},I \ M\mathbf{1}^\top\end{pmatrix}w - \begin{pmatrix}\mathbf{0} \ \mathbf{0} \ M\end{pmatrix}\right|^2$$

where $M = |X|_F \cdot T$ enforces the budget constraint as a large penalty. The Lawson-Hanson algorithm handles $w \geq 0$ natively, so no outer primal-dual loop is needed. Single-shot but does not benefit from the matrix-free structure: Lawson-Hanson implicitly forms normal equations of the augmented matrix. With shrinkage the augmented matrix grows from $T \times N$ to $(T+N) \times N$, making solve_nnls slower with shrinkage than without.

`solve_clarabel` — Clarabel direct API

Calls the Clarabel interior-point solver directly, bypassing CVXPY's problem-construction overhead. Assembles $P = 2\Sigma_{\text{LW}}$ as a sparse CSC matrix and solves the standard QP. Useful for benchmarking: on a 1000-asset synthetic problem, Clarabel direct takes 0.28 s while the CVXPY wrapper takes 8.2 s — over 97% of solve_cvxpy's time is Python interface overhead, not solving. CG is still 15× faster than Clarabel direct.

The Primal-Dual Active-Set Loop

Long-only weights are enforced by an outer loop that wraps any inner solver:

Primal step. Solve the budget-only equality system over the current active asset set. Drop any asset with weight below $-\varepsilon$ (multiple assets at once if violations are large).
Dual step. Once all active weights are non-negative, compute the gradient $\nabla_i f(w) = 2[(1-\alpha)(X^\top X w)_i + \gamma w_i] - \rho\mu_i$ for every excluded asset. If any excluded asset has $\nabla_i f(w) < \lambda$ (the budget multiplier), it would decrease variance if added — re-insert the most-violated asset and repeat.
Termination. The loop exits when primal and dual feasibility hold simultaneously. Combined with stationarity from the inner solve, this is sufficient for global optimality.

With Ledoit-Wolf shrinkage at the analytically optimal $\alpha$, the loop typically converges in 2–4 outer iterations on real equity data.

Problem Variants

The same solver handles a range of portfolio construction problems by choosing $\alpha$, $\rho$, $\mu$:

Problem	`alpha`	`rho`	`mu`
Minimum variance	$0$	$0$	—
Mean-variance (Markowitz)	any	$> 0$	expected returns
Minimum tracking error to benchmark $b$	any	$2$	`X.T @ (X @ b)`
LW-regularised minimum variance	$N/(N+T)$	$0$	—

# Mean-variance
mu = np.random.default_rng(0).standard_normal(N)  # expected returns, shape (N,)
w, _ = Problem(X, rho=1.0, mu=mu).solve_cg()

# Minimum tracking error to benchmark b
b = np.ones(N) / N  # equal-weight benchmark
mu_te = X.T @ (X @ b)
w, _ = Problem(X, rho=2.0, mu=mu_te).solve_cg()

When rho != 0, two SPD solves are performed per outer step: $\Sigma_a v_1 = \mathbf{1}$ and $\Sigma_a v_2 = \mu_a$. The budget multiplier $\lambda$ is recovered analytically from the budget constraint, avoiding the full saddle-point system.

Custom Constraints

For problems beyond budget + long-only (sector limits, turnover bounds, factor-exposure constraints), pass explicit constraint matrices:

A = np.ones((N, 1))   # budget: 1'w = 1
b = np.ones(1)
C = -np.eye(N)        # long-only: w >= 0
d = np.zeros(N)
w, _ = Problem(X, A=A, b=b, C=C, d=d).solve_kkt()

This routes to a general active-set solver that handles arbitrary linear equality and inequality constraints. Use this path sparingly — the default path (no A, b, C, d) is significantly faster for the standard long-only problem.

Benchmarks

All timings on Apple M4 Pro, Python 3.12, NumPy 2.4, SciPy 1.17.

Synthetic: $N=1000$, $T=2000$, i.i.d. Gaussian returns

Method	Time (s)	Speedup vs CVXPY
`solve_cvxpy`	8.16	1×
`solve_clarabel`	0.28	29×
`solve_kkt`	0.063	129×
`solve_cg`	0.019	430×
`solve_nnls`	1.69	5×

With Ledoit-Wolf shrinkage ($\alpha = 0.333$), 56 CG iterations.

S&P 500: $N=495$, $T=1192$ (Jul 2021–Apr 2026)

Method	Time (s)	Speedup vs CVXPY
`solve_cvxpy`	1.48	1×
`solve_clarabel`	0.067	22×
`solve_kkt`	0.018	84×
`solve_cg`	0.0091	162×
`solve_nnls`	0.088	17×

With Ledoit-Wolf shrinkage ($\alpha = 0.293$), 205 CG iterations.

Installation

pip install fast-minimum-variance

To use the CVXPY and Clarabel reference solvers:

pip install fast-minimum-variance[convex]

For development:

git clone https://github.com/Jebel-Quant/fast_minimum_variance
cd fast_minimum_variance
make install

Requirements

Python 3.11+
numpy
scipy
cvxpy (optional, only required for solve_cvxpy and solve_clarabel)

Citing

If you use this library in academic work or research, please cite:

@software{fast_minimum_variance,
  author  = {Schmelzer, Thomas},
  title   = {fast-minimum-variance: Solving Minimum Variance Portfolios Fast},
  url     = {https://github.com/Jebel-Quant/fast_minimum_variance},
  year    = {2026},
  license = {MIT}
}

License

MIT License — see LICENSE for details.

Project details

Release history Release notifications | RSS feed

0.7.0

May 7, 2026

0.6.1

May 3, 2026

This version

0.6.0

May 3, 2026

0.5.0

May 2, 2026

0.4.0

May 1, 2026

0.3.0

Apr 30, 2026

0.2.1

Apr 29, 2026

0.2.0

Apr 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fast_minimum_variance-0.6.0.tar.gz (5.5 MB view details)

Uploaded May 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fast_minimum_variance-0.6.0-py3-none-any.whl (15.6 kB view details)

Uploaded May 3, 2026 Python 3

File details

Details for the file fast_minimum_variance-0.6.0.tar.gz.

File metadata

Download URL: fast_minimum_variance-0.6.0.tar.gz
Upload date: May 3, 2026
Size: 5.5 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for fast_minimum_variance-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`a16f57051c9daf542251ecd292149cf282187da3e68dc1115bfd58223efbdf1d`
MD5	`7d13f2010a1940219ca1a68a119138e5`
BLAKE2b-256	`7ebba6a6017dc0d86df2ef1f539c32e260f5c3a7c99b6d1efdd92ca1b64aa4a0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fast_minimum_variance-0.6.0.tar.gz:

Publisher: rhiza_release.yml on Jebel-Quant/fast_minimum_variance

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fast_minimum_variance-0.6.0.tar.gz
- Subject digest: a16f57051c9daf542251ecd292149cf282187da3e68dc1115bfd58223efbdf1d
- Sigstore transparency entry: 1435922115
- Sigstore integration time: May 3, 2026
Source repository:
- Permalink: Jebel-Quant/fast_minimum_variance@cd30dca84f1b7b853837410adb87aa677c1aa846
- Branch / Tag: refs/tags/v0.6.0
- Owner: https://github.com/Jebel-Quant
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: rhiza_release.yml@cd30dca84f1b7b853837410adb87aa677c1aa846
- Trigger Event: push

File details

Details for the file fast_minimum_variance-0.6.0-py3-none-any.whl.

File metadata

Download URL: fast_minimum_variance-0.6.0-py3-none-any.whl
Upload date: May 3, 2026
Size: 15.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for fast_minimum_variance-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`790125cbaf056bff8d733d1a52339d25d20e5ccade7cf0eee7cb3906f56f6f05`
MD5	`a61a86c14469c2b7431ea46879c72cd7`
BLAKE2b-256	`419e1c591e378a694cbeafaebf98fdcd6bc21f86c9daf0a342431877c2e55a20`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fast_minimum_variance-0.6.0-py3-none-any.whl:

Publisher: rhiza_release.yml on Jebel-Quant/fast_minimum_variance

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fast_minimum_variance-0.6.0-py3-none-any.whl
- Subject digest: 790125cbaf056bff8d733d1a52339d25d20e5ccade7cf0eee7cb3906f56f6f05
- Sigstore transparency entry: 1435922117
- Sigstore integration time: May 3, 2026
Source repository:
- Permalink: Jebel-Quant/fast_minimum_variance@cd30dca84f1b7b853837410adb87aa677c1aa846
- Branch / Tag: refs/tags/v0.6.0
- Owner: https://github.com/Jebel-Quant
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: rhiza_release.yml@cd30dca84f1b7b853837410adb87aa677c1aa846
- Trigger Event: push

fast-minimum-variance 0.6.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

fast-minimum-variance: Solving Minimum Variance Portfolios Fast

Overview

Quick Start

Ledoit-Wolf Shrinkage

Solvers

solve_cg — matrix-free conjugate gradients

solve_kkt — direct dense solve

solve_nnls — non-negative least squares

solve_clarabel — Clarabel direct API

The Primal-Dual Active-Set Loop

Problem Variants

Custom Constraints

Benchmarks

Synthetic: $N=1000$, $T=2000$, i.i.d. Gaussian returns

S&P 500: $N=495$, $T=1192$ (Jul 2021–Apr 2026)

Installation

Requirements

Citing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`solve_cg` — matrix-free conjugate gradients

`solve_kkt` — direct dense solve

`solve_nnls` — non-negative least squares

`solve_clarabel` — Clarabel direct API