A generic, plugin-based ML framework for Polymarket prediction markets.

These details have not been verified by PyPI

Project links

Project description

pmlab

A generic, plugin-based ML framework for Polymarket prediction markets.

Installation · Quickstart · Architecture · Writing a Plugin · CLI · API Reference · Contributing

What is pmlab?

pmlab is a Python library that lets you build, backtest, and paper-trade machine learning models for any Polymarket prediction market — weather, sports, politics, crypto, or anything else.

The framework is completely domain-agnostic: you write a small plugin that tells pmlab how to find markets, generate features, and resolve outcomes. pmlab handles everything else — edge calculation, walk-forward backtesting, champion promotion with a hard gate, paper trading with stale-signal guards, and settlement.

Battle-tested design — the architecture is extracted from two live production systems:

polymarket-tmax-lab — weather/temperature markets (active since 2025)
f1-polymarket-lab — Formula 1 race outcome markets

Installation

pip install pmlab

Or with uv:

uv add pmlab

Requirements: Python 3.12+

Quickstart

1. Implement a plugin

from pmlab import MarketPlugin, MarketSpec, OutcomeBin

class MyPlugin(MarketPlugin):
    family = "my_markets"

    def discover_markets(self, **kwargs) -> list[MarketSpec]:
        return [...]

    def fetch_features(self, spec: MarketSpec, horizon: str, **kwargs) -> dict[str, float]:
        return {"feature_trend": 0.65, "feature_volume": 1200.0}

    def fetch_truth(self, spec: MarketSpec, **kwargs) -> float | str | None:
        return 31.5

    def build_training_row(self, spec: MarketSpec, horizon: str, **kwargs) -> dict | None:
        features = self.fetch_features(spec, horizon)
        truth = self.fetch_truth(spec)
        if truth is None:
            return None
        winning_label = spec.resolve_winning_bin(float(truth))
        return {"market_id": spec.market_id, "decision_horizon": horizon,
                "winning_label": winning_label, "market_price": 0.3, **features}

2. Run a backtest

import pandas as pd
from pmlab import LGBMForecaster
from pmlab.backtest.rolling_origin import rolling_origin_eval

panel = pd.read_parquet("data/historical_panel.parquet")
model = LGBMForecaster()
result = rolling_origin_eval(panel, model, stride=30, min_train_rows=100)

print(f"Total trades: {len(result.trades)}")
print(f"Total PnL:    {result.trades['realized_pnl'].sum():.2f}")
print(f"Hit rate:     {(result.trades['realized_pnl'] > 0).mean():.1%}")

3. Evaluate calibration

from pmlab import brier_decomposition, reliability_data

# Murphy (1973) decomposition: BS = reliability - resolution + uncertainty
result = brier_decomposition(y_true, y_prob)
print(f"Brier score:  {result.brier_score:.4f}")
print(f"Skill score:  {result.skill_score:.4f}")
print(f"Reliability:  {result.reliability:.4f}")
print(f"Resolution:   {result.resolution:.4f}")

# Data for a reliability diagram
centers, mean_pred, frac_pos = reliability_data(y_true, y_prob, n_bins=10)

4. Promote a champion

from pmlab import ChampionManifest, HoldoutGateResult

gate = HoldoutGateResult.evaluate(
    trades=result.trades,
    required_segments=["Buenos Aires", "Atlanta", "Dallas"],
    min_trades_per_segment=40,
    min_pnl_per_segment=0.0,
)
print(f"Gate decision: {gate.decision}")  # "GO" or "NO_GO"

if gate.decision == "GO":
    manifest = ChampionManifest.publish(
        model=model,
        gate=gate,
        output_dir="artifacts/public_models",
        plugin_family="my_markets",
    )
    print(f"Allowed segments: {manifest.get_allowed_segments()}")
# If gate.decision == "NO_GO", publish() raises ValueError — by design.

5. Paper trade with Kelly sizing

from pmlab import EdgeSignal, PaperBroker, kelly_stake_size

# Kelly-sized position (quarter-Kelly, max 5% of bankroll per trade)
stake = kelly_stake_size(
    win_prob=0.65,
    entry_price=0.35,
    bankroll=200.0,
    fraction=0.25,
    max_exposure=0.05,
)

broker = PaperBroker(
    trades_path="artifacts/ops/forward_paper_trades.json",
    allowed_segments=manifest.get_allowed_segments(),
    flat_stake=stake,
)
new_trades = broker.record(signals)

6. Generate an HTML report

from pmlab import generate_report

trades = broker.load_trades()
generate_report(
    trades,
    output_path="reports/session_report.html",
    title="May 2026 Paper Trading",
    brier_score=result_diagnostics.brier_score,
)

Opens a self-contained dark-themed HTML with equity curve, per-segment breakdown, and full trade log.

7. Go live (dry-run first)

from pmlab import LiveBroker

# Test without sending real orders
with LiveBroker(api_key="...", api_secret="...", api_passphrase="...", dry_run=True) as broker:
    receipt = broker.place_order(token_id="tok_abc", side="BUY", price=0.35, size=14.28)
    print(receipt.status)  # "dry_run"

# Check balance before going live
with LiveBroker(api_key="...", api_secret="...", api_passphrase="...") as broker:
    balance = broker.get_balance()
    print(f"Available: ${balance:.2f} USDC")

Architecture

Polymarket API
      │
      ▼
MarketPlugin.discover_markets()  ──►  list[MarketSpec]
      │
      ├── fetch_features(spec, horizon)       ──►  dict[str, float]
      ├── fetch_truth(spec)                   ──►  float | str | None
      └── build_training_row(spec, horizon)   ──►  dict | None
                │
    [pmlab.features]  ← add_lags, rolling_stats, encode_cyclical, onehot, clip_outliers
                │
                ▼
        rolling_origin_eval(panel, model)      ◄── LGBMForecaster
                │
                ├── BacktestMetrics  (PnL, hit_rate, avg_edge)
                ├── brier_decomposition()      ◄── calibration diagnostics
                │
                └── HoldoutGate.evaluate(required_segments)
                            │
                          GO │ NO_GO → ValueError (hard gate)
                            │
                  ChampionManifest.publish(model, gate)
                            │
                     champion.json + champion.pkl
                            │
              ┌─────────────┴──────────────┐
              │                            │
        PaperBroker                   LiveBroker
        (paper trades)                (real CLOB orders)
        flat or kelly sizing          dry_run=True to test
              │
        SettlementEngine.settle_all(specs)
              │
        generate_report() → HTML report

Module map

Package	Responsibility
`pmlab.core`	`MarketSpec`, `OutcomeBin`, `Position`, `compute_edge`, `estimate_fee`, `flat_stake_size`, `kelly_fraction`, `kelly_stake_size`
`pmlab.features`	`add_lags`, `add_rolling_stats`, `encode_cyclical`, `encode_onehot`, `clip_outliers`
`pmlab.plugins`	`MarketPlugin` ABC, `PluginRegistry`, reference plugins
`pmlab.plugins.weather_tmax`	Reference implementation — temperature markets
`pmlab.plugins.sports_f1`	Categorical outcome plugin — F1 race markets
`pmlab.markets`	`GammaClient`, `ClobClient`, `AsyncGammaClient`, `AsyncClobClient`, `DiskCache`
`pmlab.backtest`	`rolling_origin_eval`, `HoldoutGateResult`, `BacktestMetrics`
`pmlab.modeling`	`MarketForecaster` ABC, `LGBMForecaster`, `ChampionManifest`, `brier_decomposition`, `reliability_data`
`pmlab.execution`	`EdgeSignal`, `PaperBroker`, `SettlementEngine`, `LiveBroker`
`pmlab.reports`	`generate_report` — self-contained HTML report
`pmlab.workspace`	`WorkspaceContext` — multi-workspace path isolation

Writing a Plugin

See the full guide at docs/plugin-authoring.md.

The four methods

class MarketPlugin(ABC):
    family: str  # unique identifier

    def discover_markets(self, **kwargs) -> list[MarketSpec]: ...
    def fetch_features(self, spec: MarketSpec, horizon: str, **kwargs) -> dict[str, float]: ...
    def fetch_truth(self, spec: MarketSpec, **kwargs) -> float | str | None: ...
    def build_training_row(self, spec: MarketSpec, horizon: str, **kwargs) -> dict | None: ...

    # Optional — override if your data source has a finalization lag:
    def is_truth_final(self, spec: MarketSpec, **kwargs) -> bool: ...

Dependency injection (recommended for testing)

class MyPlugin(MarketPlugin):
    family = "my_markets"

    def __init__(self, gamma_client=None):
        self._gamma = gamma_client or GammaClient()

# In tests — inject a mock, never hit the network:
mock_gamma = MagicMock()
plugin = MyPlugin(gamma_client=mock_gamma)

Feature Engineering

pmlab.features provides common transforms so plugins don't reimplement them:

from pmlab import add_lags, add_rolling_stats, encode_cyclical, encode_onehot, clip_outliers

# Lag features (with optional group-by for panel data)
df = add_lags(df, cols=["temp"], lags=[1, 3, 7], group_by="city")

# Rolling statistics
df = add_rolling_stats(df, cols=["temp"], windows=[7, 14], stats=["mean", "std"])

# Cyclical encoding (e.g. hour-of-day, day-of-year)
df["hour_sin"], df["hour_cos"] = encode_cyclical(df["hour"], period=24)

# One-hot encoding (drop_first=True avoids multicollinearity)
df = encode_onehot(df, cols=["city"], drop_first=True)

# Outlier clipping (IQR or z-score)
df = clip_outliers(df, cols=["temp", "humidity"], method="iqr", iqr_factor=3.0)

All transforms return a copy — originals are never mutated.

Async Market Access

For scanning many markets in parallel, use the async clients:

import asyncio
from pmlab import AsyncGammaClient, AsyncClobClient

async def scan():
    async with AsyncGammaClient() as gamma:
        markets = await gamma.fetch_markets(tag="temperature", keyword="highest")

    token_ids = [m["conditionId"] for m in markets]
    async with AsyncClobClient(concurrency=20) as clob:
        prices = await clob.fetch_prices(token_ids)

    return markets, prices

markets, prices = asyncio.run(scan())

TTL disk cache

Avoid redundant API calls during repeated scans:

from pmlab import GammaClient, DiskCache

cache = DiskCache(cache_dir=".pmlab_cache", ttl_seconds=3600)
client = GammaClient(cache=cache)

markets = client.fetch_markets(tag="temperature")  # fetched from API
markets = client.fetch_markets(tag="temperature")  # served from cache

Position Sizing

from pmlab import flat_stake_size, kelly_fraction, kelly_stake_size

# Flat stake — always risk $5 regardless of edge
shares = flat_stake_size(flat_stake=5.0, entry_price=0.35)

# Kelly fraction — what % of bankroll to wager?
f = kelly_fraction(win_prob=0.65, entry_price=0.35, fraction=0.25)  # quarter-Kelly

# Kelly stake — USDC amount, capped at max_exposure % of bankroll
stake = kelly_stake_size(
    win_prob=0.65,
    entry_price=0.35,
    bankroll=500.0,
    fraction=0.25,
    max_exposure=0.05,  # never more than 5% of bankroll per trade
)

Bundled Plugins

`WeatherTmaxPlugin`

Handles "Highest temperature in [city] on [date]?" markets. Features: ECMWF forecast temperature, ensemble spread, lead time, city baseline. Truth: official observations via Wunderground / NOAA / CWA.

`SportsF1Plugin`

Handles "F1 [GP] winner?" and similar categorical race markets. Categorical bins — winning label is a driver/team name string.

CLI Reference

pmlab version                                              # print version
pmlab status                                               # paper trade status
pmlab scan-markets --plugin weather_tmax                   # discover open markets
pmlab record-trades --plugin weather_tmax --min-edge 0.20  # record paper trades
pmlab settle-trades --plugin weather_tmax                  # settle resolved positions
pmlab backtest --plugin weather_tmax --stride 30           # walk-forward backtest
pmlab promote-champion model.pkl --gate-path gate.json \   # promote if gate is GO
               --plugin weather_tmax

Workspace isolation

scripts/pmlab-workspace ops_daily pmlab scan-markets --plugin weather_tmax
scripts/pmlab-workspace historical_real pmlab backtest --plugin weather_tmax --stride 30

API Reference

Core

Symbol	Description
`MarketSpec`	Market descriptor: bins, token IDs, resolution logic
`OutcomeBin`	A single labeled outcome bin (label, lo, hi)
`Position`	Open trade: spec, entry_price, stake, direction
`settle_position(pos, outcome)`	Compute realized PnL for a settled position
`compute_edge(model_prob, market_price, fee_bps)`	After-cost probability edge
`estimate_fee(stake, bps)`	USDC fee for a trade
`flat_stake_size(stake, price)`	Share count for a flat-stake bet
`kelly_fraction(win_prob, price, fraction)`	Fractional Kelly bankroll fraction
`kelly_stake_size(win_prob, price, bankroll, ...)`	Kelly-sized USDC stake with cap

Features

Symbol	Description
`add_lags(df, cols, lags, group_by)`	Add lag columns (e.g. `temp_lag7`)
`add_rolling_stats(df, cols, windows, stats)`	Add rolling mean/std/min/max/median
`encode_cyclical(series, period)`	Sin/cos encoding for periodic features
`encode_onehot(df, cols, drop_first)`	One-hot encode categorical columns
`clip_outliers(df, cols, method)`	Clip outliers via IQR or z-score

Markets

Symbol	Description
`GammaClient`	Sync Gamma API client (with optional DiskCache)
`ClobClient`	Sync CLOB price client
`AsyncGammaClient`	Async Gamma API client (httpx.AsyncClient)
`AsyncClobClient`	Async CLOB client with semaphore concurrency
`DiskCache`	TTL disk cache (MD5-keyed JSON, auto-purge)

Modeling

Symbol	Description
`MarketForecaster`	Abstract base — implement `fit`, `predict_proba`, `save`, `load`
`LGBMForecaster`	LightGBM binary/multiclass forecaster
`ChampionManifest`	Publish/load champion model with hard NO_GO gate
`brier_decomposition(y_true, y_prob)`	Murphy (1973) Brier score decomposition
`reliability_data(y_true, y_prob)`	Reliability diagram data (bin centers, mean pred, frac pos)
`BrierDecomposition`	Dataclass: uncertainty, resolution, reliability, skill_score

Execution

Symbol	Description
`EdgeSignal`	Typed output of a market scan (edge, direction, horizon)
`PaperBroker`	Record paper trades with segment gate, stale guard, dedup
`SettlementEngine`	Settle open trades via plugin truth resolution
`LiveBroker`	Real CLOB order execution (place, cancel, balance, dry_run)
`OrderReceipt`	Return value of `LiveBroker.place_order`

Reports

Symbol	Description
`generate_report(trades, output_path, ...)`	Self-contained HTML report with equity curve SVG

Key Design Decisions

Hard gate on champion promotion — ChampionManifest.publish() raises ValueError if gate.decision != "GO". Not configurable. Investigate failures; don't bypass the gate.

champion.json is the source of truth — PaperBroker reads allowed_segments from champion.json, not from any intermediate benchmark file that could be overwritten by a failed retrain.

No-lookahead guarantee — rolling_origin_eval trains strictly on rows with decision_date < eval_date. This is asserted in the test suite.

Settlement waits for finalization — SettlementEngine only settles when plugin.is_truth_final(spec) returns True. Data sources with finalization lags can override this method.

paper → live transition — Use LiveBroker(dry_run=True) to validate signals against the real CLOB API without sending orders. Flip dry_run=False only when validators pass.

Development

git clone https://github.com/ArtBreguez/polymarket-lab
cd polymarket-lab
uv sync --extra dev

uv run pytest                              # all tests
uv run pytest --cov=src/pmlab             # with coverage report
uv run ruff check src/ tests/             # lint
uv run ruff check src/ tests/ --fix       # auto-fix
uv run mypy src/                          # type check
uv build                                  # build wheel + sdist

Project Status

Module	Status	Coverage
`core` (PnL, edge, fees, sizing, Kelly)	✅ Stable	100%
`features` (lags, rolling, cyclical, onehot)	✅ Stable	100%
`plugins` (ABC, registry)	✅ Stable	100%
`plugins/weather_tmax`	✅ Stable	96%
`plugins/sports_f1`	✅ Skeleton	94%
`backtest` (rolling_origin, gate, metrics)	✅ Stable	94–100%
`modeling` (LGBM, calibration, champion, diagnostics)	✅ Stable	95–100%
`execution` (paper broker, settlement, live broker)	✅ Stable	84–99%
`markets` (Gamma, CLOB, async, cache)	✅ Stable	84–98%
`reports` (HTML report)	✅ Stable	96%
`workspace`	✅ Stable	100%
`cli`	🔧 Shell	91%

Tests: 237 passing · Coverage: 95.7% · Python: 3.12

Contributing

See CONTRIBUTING.md for the full guide.

Quick version:

Fork the repo
Create a branch: git checkout -b feat/my-plugin
Write your plugin in src/pmlab/plugins/<family>/
Add tests in tests/plugins/<family>/ — TDD required
Run uv run pytest && uv run ruff check src/ tests/
Open a pull request

License

MIT — see LICENSE for details.

Built to trade · Designed to be extended · Tested to be trusted

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.0

May 11, 2026

This version

0.2.0

May 11, 2026

0.1.0

May 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pmlab-0.2.0.tar.gz (105.7 kB view details)

Uploaded May 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pmlab-0.2.0-py3-none-any.whl (53.3 kB view details)

Uploaded May 11, 2026 Python 3

File details

Details for the file pmlab-0.2.0.tar.gz.

File metadata

Download URL: pmlab-0.2.0.tar.gz
Upload date: May 11, 2026
Size: 105.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.7 {"installer":{"name":"uv","version":"0.11.7","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for pmlab-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`4385e4f05aeb7c0811edabf514b6940406f93c759a66dd09c1787cac49226958`
MD5	`0e9123c2d208b231c875276fb8b09360`
BLAKE2b-256	`412cc40abe5e74b2a3e4ddef10cf4464c2e8e5bd25f5e97998b1e73579b3ccbb`

See more details on using hashes here.

File details

Details for the file pmlab-0.2.0-py3-none-any.whl.

File metadata

Download URL: pmlab-0.2.0-py3-none-any.whl
Upload date: May 11, 2026
Size: 53.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.7 {"installer":{"name":"uv","version":"0.11.7","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for pmlab-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`30505a2afe66a24ed703f11ca61859e5ba7993ebd6bcbe8aa219665c988694d3`
MD5	`72b2e295830ea24fbebdc14fc95f5e50`
BLAKE2b-256	`98b1a4dd5723184f7542ba6f5d96d690efeba324bf2bad9b8676af6891939da6`

See more details on using hashes here.

pmlab 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

pmlab

What is pmlab?

Installation

Quickstart

1. Implement a plugin

2. Run a backtest

3. Evaluate calibration

4. Promote a champion

5. Paper trade with Kelly sizing

6. Generate an HTML report

7. Go live (dry-run first)

Architecture

Module map

Writing a Plugin

The four methods

Dependency injection (recommended for testing)

Feature Engineering

Async Market Access

TTL disk cache

Position Sizing

Bundled Plugins

WeatherTmaxPlugin

SportsF1Plugin

CLI Reference

Workspace isolation

API Reference

Core

Features

Markets

Modeling

Execution

Reports

Key Design Decisions

Development

Project Status

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`WeatherTmaxPlugin`

`SportsF1Plugin`