Skip to main content

Numinous Crunch Starter Package

Project description

Numinous Crunch Challenge

A real-time binary event forecasting competition powered by Numinous (Bittensor Subnet 6) and hosted on CrunchDAO.

Numinous is a forecasting protocol that aggregates agents into superhuman LLM forecasters. In this competition, models predict the probability that real-world events — sourced from Polymarket — resolve "Yes". Predictions are scored using the Brier score, a strictly proper scoring rule that rewards calibrated, honest probabilities.

Install

pip install crunch-numinous

What You Must Predict

For each event, you receive structured data and must return a probability between 0.0 and 1.0 that the event resolves "Yes":

# Input: event data pushed to your model (aligned with Numinous Subnet 6)
{
    "event_id": "numinous-12345",
    "run_id": "run-abc",
    "track": "MAIN",
    "event_type": "llm",
    "title": "Will X happen by Y?",
    "description": "...",               # Optional
    "cutoff": "2026-03-16T00:00:00Z",   # Optional, ISO 8601
    "metadata": {"market_type": "LLM", "topics": ["Finance"]}
}

# Output: your probability forecast
{"event_id": "numinous-12345", "prediction": 0.72, "reasoning": "Based on..."}
  • prediction = 1.0 → certain "Yes"
  • prediction = 0.0 → certain "No"
  • prediction = 0.5 → maximum uncertainty

Predictions are clipped to [0.01, 0.99] during scoring.

Scoring

Your final score is a weighted combination of Brier score (prediction accuracy) and reasoning quality. The weights depend on the track and event category.

Weight distribution

Pool Track Weight
Global Brier MAIN 5%
Geopolitics Brier MAIN 5%
Reasoning MAIN 25%
Global Brier SIGNAL 30%
Geopolitics Brier SIGNAL 15%
Reasoning SIGNAL 20%

See emission weights.

Brier score

Prediction accuracy is evaluated using the Brier score:

$$ \text{Brier} = (\text{prediction} - \text{outcome})^2 $$

Lower is better.

Score Meaning
0.00 Perfect prediction
0.25 Always predicting 0.5 (no information)
1.00 Worst possible (predicted 1.0, outcome was 0)

The Brier score is strictly proper — the optimal strategy is to report your honest probability estimate.

Missing predictions are imputed as 0.5 → scored at 0.25.

Geopolitics events have a separate Brier pool because their resolution dates are often far away — Numinous requests intermediate predictions and uses Polymarket probabilities to score them.

Reasoning scoring

The reasoning field returned by your model is scored by an LLM and has significant weight in the final score (25% on MAIN, 20% on SIGNAL). Models that simply relay market probabilities without genuine reasoning will be penalized.

The reasoning is evaluated on 5 criteria: sources used, evidence extracted, combination & weighting, uncertainties / counterpoints, and mapping to final probabilities. See the full evaluation prompt for details.

Tracks (MAIN / SIGNAL)

Each event specifies a track that restricts which resources your model can access:

Track Resources Weight
MAIN All gateway endpoints Lower (35% total)
SIGNAL Restricted subset (see config) Higher (65% total)

The track field is included in the event dict passed to _predict(). For SIGNAL events, do not call unauthorized gateway endpoints — the gateway enforces this and your prediction will fail.

Create Your Tracker

A tracker is a model that receives event data and returns probability forecasts. The predict() method receives the full event dict directly.

To participate, subclass TrackerBase and implement _predict():

from numinous.tracker import TrackerBase


class MyModel(TrackerBase):

    def _predict(self, event):
        event_id = event.get("event_id", "unknown")
        run_id = event.get("run_id")
        track = event.get("track")

        # Your logic here
        prediction = 0.5

        return {
            "event_id": event_id,
            "prediction": prediction,
            "reasoning": "..."  # Optional, can be None
        }

Available Event Fields

Inside _predict(), the event dict contains:

Field Type Description
event_id str Unique event identifier
run_id str Run identifier — must be forwarded to gateway calls for cost tracking
track str "MAIN" or "SIGNAL" — determines which gateway resources are available
event_type str Market type, lowercased (e.g. "llm", "sports", "crypto")
title str The question being asked
description str | None Additional context and resolution criteria
cutoff str | None ISO 8601 resolution deadline
metadata dict Event metadata: market_type, topics, trigger_name, polymarket_market_id

Example

See the quickstart notebook to get started.

Gateway

Your model has no direct internet access in production. All external calls (LLMs, search, OSINT...) go through the gateway. You will need to provide your own API keys (most providers offer a free tier)

  • In production: SANDBOX_PROXY_URL is set automatically
  • Locally: you use a public gateway, identical to the one used in production

The official Numinous documentation contains a list of all available endpoints.

Use the gateway in your tracker

import os
import httpx

# Specify your OpenAI's API Key
OPENAI_API_KEY = ...

# Get the URL of the Gateway
GATEWAY_URL = os.environ.get("SANDBOX_PROXY_URL", "https://public-gateway.numinous.competition.crunchdao.com")

response = httpx.post(
    f"{GATEWAY_URL}/api/gateway/openai/responses",
    json={
        # IMPORTANT: Always forward the `run_id` to the Gateway otherwise the model will fail.
        "run_id": run_id,

        "model": "gpt-5-mini",
        "input": [
            {
                "role": "user",
                "content": "Will BTC hit 100k?"
            }
        ],
    },
    headers={
        # IMPORTANT: Send the API Key header to the Gateway.
        "x-openai-api-key": OPENAI_API_KEY,
    },
    timeout=30,
)

Authentication

Your model must submit the different providers' API keys that it needs to contact them. For example, if you want to use the OpenAI endpoint, you must include an OpenAI API key and pass it as the header x-openai-api-key.

To prevent the wrong API key being sent to the wrong provider, each of them has a unique header name:

Provider Header Name
Chutes x-chutes-api-key
Desearch x-dearch-api-key
Lightning Rod x-lightning-rod-api-key
Lunar Crush x-lunar-crush-api-key
Numinous Indicia (No API key required)
Numinous Signals x-numinous-signals-api-key
OpenAI x-openai-api-key
OpenRouter x-openrouter-api-key
Perplexity x-perplexity-api-key
Public Data (Name will depend on the service)*
Unusual Whales x-unusual-whales-api-key
Vericore x-vericore-api-key

[!NOTE] The header names for Public Data are always based on the data source you are trying to access. For example, if you want to use api.stlouisfed.org, the header will be named x-fred-api-key.

You can find a list of all the data sources here (JSON format).

Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crunch_numinous-1.0.1.tar.gz (249.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

crunch_numinous-1.0.1-py3-none-any.whl (18.7 kB view details)

Uploaded Python 3

File details

Details for the file crunch_numinous-1.0.1.tar.gz.

File metadata

  • Download URL: crunch_numinous-1.0.1.tar.gz
  • Upload date:
  • Size: 249.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for crunch_numinous-1.0.1.tar.gz
Algorithm Hash digest
SHA256 521c8055c49788f02728c133e6095b92148f568e7721c7cd85e00c090b010df6
MD5 4c0be4128afe43ae56efce2c420ed3f6
BLAKE2b-256 8dd88bcb5974e738986e14a204a9ef095d9d001bcd7909f787e025c6678cd62e

See more details on using hashes here.

File details

Details for the file crunch_numinous-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for crunch_numinous-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6624f3860edadfd946d6e55918682c25bdcb9e63d98b67d806949308195e968b
MD5 097b2463184830c08c80f1e34c4efc3e
BLAKE2b-256 e1903d38199c063c882d8729584da889dfdf187e7c6f3305c03e27a3d83deece

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page