Ratio mean-reversion testing for asset pairs. Hurst, ADF, walk-forward backtest.
Project description
pairscan-rmr
Ratio mean-reversion testing for asset pairs. Hurst exponent + ADF + range filters + walk-forward backtest. No lookahead. MIT license.
This is the open-source utility behind pairscan.io — a screener for pair trading on crypto and tokenized US equities. It does one thing: takes two price series, tells you whether their log-ratio shows mean-reversion, and if so, what a walk-forward backtest would have looked like.
What this does (and doesn't do)
✅ Does:
- Compute Hurst exponent via R/S analysis
- Run Augmented Dickey-Fuller test for stationarity
- Test range width and alternating boundary touches
- Combine all four into a single
is_mean_reverting()predicate - Walk-forward backtest one pair (rolling P5/P95, no lookahead)
❌ Doesn't:
- Fetch data from exchanges (use
ccxt,yfinance, or your own pipeline) - Screen multiple pairs (this is a single-pair tool)
- Validate tokenized asset pegs against oracles
- Run scheduled, multi-source data fallback
- Send alerts
If you need those, pairscan.io does them as a hosted product — that's our commercial offering. This package is the math, free and open.
Install
pip install pairscan-rmr
Quick start
import numpy as np
from pairscan_rmr import is_mean_reverting, walk_forward_backtest
# Your price series — daily closes for two assets
price_a = np.array([...]) # e.g. ETH daily closes
price_b = np.array([...]) # e.g. BTC daily closes
# Step 1: Does this pair mean-revert?
result = is_mean_reverting(price_a, price_b)
print(result)
# MeanReversionResult(passed=True, hurst=0.42, adf_pvalue=0.31,
# range_width=0.53, low_touches=3, high_touches=2)
# Step 2: If yes, run a walk-forward backtest
if result.passed:
backtest = walk_forward_backtest(
price_a, price_b,
lookback_days=540,
entry_low=0.2,
entry_high=0.8,
fee_pct=0.001,
)
print(f"Final A qty: {backtest.final_a_qty:.2f}")
print(f"Final B qty: {backtest.final_b_qty:.2f}")
print(f"Trades: {backtest.n_trades}")
print(f"Max drawdown: {backtest.max_drawdown:.1%}")
Why we open-sourced this
Because the math has been public since 1951. Hurst (1951), Dickey-Fuller (1979), Lo-MacKinlay (1988) — none of this is proprietary. Anyone can reimplement it in an afternoon.
What's not in this repo is what makes pairscan.io worth $19/mo: 5-source data fallback, oracle peg-check on tokenized assets, 170-pair screening every 6 hours, cross-sector matching, Telegram alerts. That's operational engineering, and that's what we sell.
The math should be free. The pipeline costs money to run.
Methodology
Brief intro below. Full walkthrough with derivations and academic references at pairscan.io/methodology.
Hurst exponent (R/S analysis)
Measures long-term memory of a time series:
H < 0.5— anti-persistent / mean-reverting (we want this)H = 0.5— random walkH > 0.5— persistent / trending
We compute it on the log-ratio, not raw prices.
ADF test
Augmented Dickey-Fuller checks for unit root. Low p-value → stationarity → mean to revert to. We use a loose threshold (p < 0.7) combined with other filters — strict p < 0.05 throws out genuinely mean-reverting crypto pairs because crypto data is noisier than equities.
Range width and alternating touches
Operational filters: range must span ≥ 40% (so swap fees don't kill returns) and the series must touch both boundaries multiple times alternately (so it's genuinely oscillating, not just visiting an extreme once).
Walk-forward backtest
At each decision point t, only data up to t is used to set entry/exit thresholds. The percentile bounds are recomputed every day on a trailing 540-day window. This is the only way to honestly simulate "what would have happened if I'd been running this in real time".
Verification: how to know there's no lookahead bias
tests/test_no_lookahead.py runs the same backtest twice — once with clean data, once with all data after a midpoint replaced with garbage — and asserts the two trade lists are byte-identical up to the midpoint. If a future-dependent statistic ever leaks in, the test fails immediately. Look at it before trusting the backtest output.
Examples
See examples/ for runnable scripts:
- 01_quick_start.py — 5 minutes, synthetic data
- 02_synthetic_series.py — Ornstein-Uhlenbeck (mean-reverting) and GBM (trending) as ground truth — check that filters classify them correctly
- 03_real_crypto_pair.py — ETH/BTC via
ccxt, full pipeline - 04_walk_forward_explained.py — visual comparison with naive in-sample backtest
Limitations
We're explicit about where this fails. See full discussion at pairscan.io/methodology:
- Hurst R/S has variance — sensitive to
max_lagchoice - ADF assumes stationary residuals — structural breaks mislead it
- Tests are descriptive, not predictive
- Sample size matters: < 200 days = noise, < 540 days = use with caution
- Real execution adds slippage, taxes, exchange downtime — none modeled
Contributing
PRs welcome, especially:
- Performance improvements (vectorization, Numba)
- Additional tests (edge cases, numerical stability)
- Examples on different asset classes (FX, commodities, equities)
See CONTRIBUTING.md.
License
MIT — do whatever you want, attribution appreciated.
Citation
If you use this in research:
@software{pairscan_rmr,
author = {Pairscan},
title = {pairscan-rmr: Ratio mean-reversion testing for asset pairs},
url = {https://github.com/pairscan/ratio-mean-reversion},
year = {2026}
}
Acknowledgments
- Hurst, H.E. (1951). Long-term storage capacity of reservoirs. Transactions of the American Society of Civil Engineers, 116, 770–799.
- Dickey, D.A. & Fuller, W.A. (1979). Distribution of the Estimators for Autoregressive Time Series with a Unit Root. JASA, 74, 427–431.
- Gatev, E., Goetzmann, W.N. & Rouwenhorst, K.G. (2006). Pairs Trading: Performance of a Relative-Value Arbitrage Rule. Review of Financial Studies, 19(3), 797–827.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pairscan_rmr-0.1.0.tar.gz.
File metadata
- Download URL: pairscan_rmr-0.1.0.tar.gz
- Upload date:
- Size: 34.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bc3b2adaa84f5d4ab66216b88905293cd7bfa3ac3b47ce5a1a030289ddbbad1e
|
|
| MD5 |
08135e3e1bf05b7fa5ddbfec9570e0a6
|
|
| BLAKE2b-256 |
3de648a72025ffc03cb97418dc1a374e277ee2fdb98b0fcc2f1e9629fe3bf184
|
Provenance
The following attestation bundles were made for pairscan_rmr-0.1.0.tar.gz:
Publisher:
publish.yml on pairscan/ratio-mean-reversion
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
pairscan_rmr-0.1.0.tar.gz -
Subject digest:
bc3b2adaa84f5d4ab66216b88905293cd7bfa3ac3b47ce5a1a030289ddbbad1e - Sigstore transparency entry: 1429176351
- Sigstore integration time:
-
Permalink:
pairscan/ratio-mean-reversion@2206f11567af4ffb300f8a6c5d4e1d858e20e8c0 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/pairscan
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@2206f11567af4ffb300f8a6c5d4e1d858e20e8c0 -
Trigger Event:
release
-
Statement type:
File details
Details for the file pairscan_rmr-0.1.0-py3-none-any.whl.
File metadata
- Download URL: pairscan_rmr-0.1.0-py3-none-any.whl
- Upload date:
- Size: 19.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
43585f34fbed8fae845e582d795c0b0b4a4bb7540cf675717f60d5622300d770
|
|
| MD5 |
4ba691e989263c52246c263ec94d06ad
|
|
| BLAKE2b-256 |
87deda49d71556722873b63641f0b44a3935762114c4b2bb69c420015a03d541
|
Provenance
The following attestation bundles were made for pairscan_rmr-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on pairscan/ratio-mean-reversion
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
pairscan_rmr-0.1.0-py3-none-any.whl -
Subject digest:
43585f34fbed8fae845e582d795c0b0b4a4bb7540cf675717f60d5622300d770 - Sigstore transparency entry: 1429176404
- Sigstore integration time:
-
Permalink:
pairscan/ratio-mean-reversion@2206f11567af4ffb300f8a6c5d4e1d858e20e8c0 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/pairscan
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@2206f11567af4ffb300f8a6c5d4e1d858e20e8c0 -
Trigger Event:
release
-
Statement type: