Online covariance and precision estimation

License: MIT

A collection of autonomous incremental estimators for covariance, precision, correlation and associated quantities.

TLDR: "Just a pile of functions that forecast covariance in online fashion"

The running_empirical_covariance Colab notebook illustrates the style. To see all the other online methods of covariance estimation supplied here, run the cov skaters manifest notebook. Or, to look at Elo ratings, run the elo_ratings_and_urls notebook.

Install

pip install precise 

Covariance skaters

Similar in style to the skaters used in the timemachines package, this package may be thought of as a collection of covariance prediction functions. Each takes one data vector at a time, together with the prior state, and returns a prediction mean vector x, a prediction covariance matrix x_cov, and a posterior state whose interpretation is the responsibility of the skater, not the caller.

This mildly unusual convention requires the caller to maintain state from one call to the next:

from precise.skatertools.syntheticdata.miscellaneous import create_correlated_dataset
from precise.skaters.covariance.runemmp import run_emp_pcov_d0 # <-- Running empirical population covariance
from pprint import pprint

if __name__=='__main__':
    ys = create_correlated_dataset(n=500)
    s = {}  # the skater state starts empty
    for y in ys:
        x, x_cov, s = run_emp_pcov_d0(s=s, y=y)  # pass the previous state back in on each call
    pprint(x_cov)

See /examples_basic_usage.

See the timemachines faq for justification of this style.

Skater Elo ratings

As noted above, see the elo_ratings_and_urls notebook.

Browsing for skaters

You can hunt for skaters other than run_emp_pcov_d0 in precise/skaters/covariance. There are some location utilities in precise/whereami.

Examples of interpretation:

| Skater name | Location | Meaning |
|---|---|---|
| buf_huber_pcov_d1_a1_b2_n50 | skaters/covariance/bufhuber | Applies an approach that exploits Huber pseudo-means to a buffer of data of length 50 in need of differencing once, with generalized Huber loss parameters a=1, b=2 |
| buf_sk_ld_pcov_d0_n100 | skaters/covariance/bufsk | Applies sk-learn's implementation of Ledoit-Wolf to stationary buffered data of length 100 |
| ewa_pm_emp_scov_r01 | skaters/covariance/ewapartial | Performs an incremental, recency-weighted sample covariance estimate that exploits partial moments. Uses a memory parameter r=0.01 |
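
Since every skater shares the same calling convention, any of the names above can be dropped into the loop from the earlier example. A minimal sketch, assuming buf_sk_ld_pcov_d0_n100 is importable from precise.skaters.covariance.bufsk as the location column suggests:

from precise.skatertools.syntheticdata.miscellaneous import create_correlated_dataset
from precise.skaters.covariance.bufsk import buf_sk_ld_pcov_d0_n100  # <-- Ledoit-Wolf on a buffer of length 100 (path assumed from the table)
from pprint import pprint

if __name__=='__main__':
    ys = create_correlated_dataset(n=500)
    s = {}
    for y in ys:
        x, x_cov, s = buf_sk_ld_pcov_d0_n100(s=s, y=y)
    pprint(x_cov)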

Reading skater names

Broad calculation style categories

| Shorthand | Interpretation | Incremental? |
|---|---|---|
| buf | Performs classical batch calculation on a fixed window of data each time | No |
| win | Performs incremental fixed window calculation | Yes |
| run | Running calculation weighting all observations equally | Yes |
| ewa | Running calculation weighting recent observations more | Yes |
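
The distinction that matters most for the estimate itself is the weighting. A minimal sketch comparing an equally weighted running skater with a recency-weighted one, assuming ewa_pm_emp_scov_r01 is importable from precise.skaters.covariance.ewapartial as the interpretation table above suggests:

import numpy as np
from precise.skatertools.syntheticdata.miscellaneous import create_correlated_dataset
from precise.skaters.covariance.runemmp import run_emp_pcov_d0          # run: all observations weighted equally
from precise.skaters.covariance.ewapartial import ewa_pm_emp_scov_r01   # ewa: recent observations weighted more (path assumed)

if __name__=='__main__':
    ys = create_correlated_dataset(n=500)
    s_run, s_ewa = {}, {}
    for y in ys:
        _, run_cov, s_run = run_emp_pcov_d0(s=s_run, y=y)
        _, ewa_cov, s_ewa = ewa_pm_emp_scov_r01(s=s_ewa, y=y)
    print(np.round(run_cov - ewa_cov, 4))  # compare the equally weighted and recency-weighted estimates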

Methodology hints (can be combined)

| Shorthand | Inspiration |
|---|---|
| emp | "Empirical" (not shrunk or augmented) |
| lz | Le-Zhong variable-by-variable updating |
| lw | Ledoit-Wolf |
| pm | Partial moments |
| huber | Generalized Huber pseudo-mean |
| oas | Oracle approximating shrinkage |
| gl | Graphical Lasso |
| mcd | Minimum covariance determinant |

Intended main target (more than one may be produced in the state)

| Shorthand | Intent |
|---|---|
| scov | Sample covariance |
| pcov | Population covariance |
| spre | Inverse of sample covariance |
| ppre | Inverse of population covariance |

Differencing hints:

| Shorthand | Intent |
|---|---|
| d0 | For use on stationary, ideally IID data |
| d1 | For use on data that is IID after taking one difference |
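
The differencing suffix changes what data you feed the skater, not how you call it. A minimal sketch, assuming a once-differenced variant run_emp_pcov_d1 sits alongside run_emp_pcov_d0 in the same module (the d1 name is inferred from the convention above):

import numpy as np
from precise.skatertools.syntheticdata.miscellaneous import create_correlated_dataset
from precise.skaters.covariance.runemmp import run_emp_pcov_d1  # <-- assumed d1 counterpart of run_emp_pcov_d0
from pprint import pprint

if __name__=='__main__':
    ys = np.cumsum(create_correlated_dataset(n=500), axis=0)  # trending levels whose increments are roughly IID
    s = {}
    for y in ys:
        x, x_cov, s = run_emp_pcov_d1(s=s, y=y)  # d1 skater: intended for data that is IID after one difference
    pprint(x_cov)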

Portfolio & mixture of experts

See the portfolio directories in skaters. Work in progress.

Stand-alone utilities

  1. The covariance/statefunctions are illustrated by the example running_oas_covariance.
  2. The covariance/statemutations do things like ensuring both covariance and precision matrices exist in the state. For instance, s = both_cov(s) ensures both sample and population covariances are present (see the sketch after this list).
  3. There are some /covariance/datascatterfunctions.
  4. The /covariance/datacovfunctions take data and produce covariance functions.
  5. The /covariance/covfunctions manipulate 2d cov arrays.
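
Continuing the running example, a state produced by one skater can be mutated so that related quantities are also available. A minimal sketch, assuming both_cov is importable from precise.skaters.covariance.statemutations (the path is inferred from the directory name in item 2):

from precise.skatertools.syntheticdata.miscellaneous import create_correlated_dataset
from precise.skaters.covariance.runemmp import run_emp_pcov_d0
from precise.skaters.covariance.statemutations import both_cov  # <-- path inferred from covariance/statemutations above

if __name__=='__main__':
    ys = create_correlated_dataset(n=500)
    s = {}
    for y in ys:
        x, x_cov, s = run_emp_pcov_d0(s=s, y=y)
    s = both_cov(s)         # ensure both sample and population covariances are present in the state
    print(list(s.keys()))   # inspect which quantities the state now carries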

Miscellaneous

  • Here is some related, and potentially related, literature.
  • This is a piece of the microprediction project, should you ever care to cite it. Its uses include mixture-of-experts models for time-series analysis, buried somewhere in timemachines.
  • If you just want univariate calculations, and don't want numpy as a dependency, there is momentum. However, if you want univariate forecasts of the variance of something, as distinct from mere online calculations of the same, I would suggest checking the time-series Elo ratings and the "special" category in particular.
  • The name of this package refers to precision matrices, not numerical precision. This isn't a source of high-precision covariance calculations per se; the emphasis is on forecasting future realized covariance. Perhaps I'll include some more numerically stable methods from this survey to make the name more fitting. Pull requests are welcome!
  • The intent is that methods are parameter-free. However, some not-quite-autonomous methods admit a few parameters (the factories). A few might even use just one additional scalar parameter r with a space-filling curve convention, somewhat akin to the tuning of skaters explained here in the timemachines package.
