Learning-to-rank scheduling library for Python job queues.

These details have not been verified by PyPI

Project links

Project description

chronoq-ranker

Learning-to-rank scheduling library. Predict which queued job finishes fastest and run it first.

Trained LightGBM LambdaRank over 15 features, online incremental retraining, drift detection. Standalone ML library — zero runtime dependencies on Celery, Redis, FastAPI, or any queueing framework. Works with Celery via the companion chronoq-celery package, or with any task system via the predict_scores(candidates) API.

Install

pip install chronoq-ranker

Why this exists

Python task queues — Celery, RQ, Dramatiq, and others — all dispatch in FIFO or static-priority order. On heavy-tail workloads (LLM inference, media transcoding, data pipelines), a 20 ms resize job waits behind a 1.8 s transcode for no good reason.

Ten-plus years of systems research says learned scheduling beats FIFO on these workloads. Chronoq packages that research as a reusable Python library: record job completion telemetry, let a LightGBM LambdaRank model learn the duration structure, then use predict_scores() to rank pending jobs before dispatch.

Quick start

from chronoq_ranker import TaskRanker, TaskCandidate

ranker = TaskRanker(storage="sqlite:///telemetry.db")

# Feed telemetry as jobs complete
ranker.record(task_type="resize",    payload_size=2048,  actual_ms=312.4)
ranker.record(task_type="transcode", payload_size=8000,  actual_ms=1780.1)
# ... enough records trigger auto-retraining from heuristic → LambdaRank ...

# Rank pending jobs shortest-first
scored = ranker.predict_scores([
    TaskCandidate(task_id="j1", task_type="transcode", payload_size=8000),
    TaskCandidate(task_id="j2", task_type="resize",    payload_size=500),
])
# scored[0] is the job LambdaRank predicts finishes fastest
for s in scored:
    print(f"rank={s.rank}  {s.task_id}  score={s.score:.4f}")

What you get

LightGBM LGBMRanker with lambdarank objective — pairwise learning with NDCG gain
Online incremental retraining — warm-start via init_model; full refit every N incrementals to bound drift accumulation
15-feature default extractor (task_type, payload_size, hour_of_day, day_of_week, queue_depth, queue_depth_same_type, recent_mean_ms_this_type, recent_p95_ms_this_type, recent_count_this_type, time_since_last_retrain_s, worker_count_busy, worker_count_idle, prompt_length, user_tier, retry_count) — or supply your own FeatureExtractor
Pluggable storage — "memory://" for tests, "sqlite:///path.db" for persistence
Drift detection — PSI over numeric features (warn >0.2, flag >0.3)
Rank-oriented evaluation — Spearman ρ, Kendall τ, pairwise accuracy; MAE is secondary

Evidence

Validated on 4 workload traces (synthetic Pareto, BurstGPT real LLM inference, Google Borg 2011 cluster-batch, Azure Functions 2019 serverless).

On the synthetic Pareto trace: +32% mean JCT / +17.5% p99 vs FCFS at ρ=0.7, within 13.4% of a clairvoyant SJF-oracle. On BurstGPT (real LLM inference): LambdaRank tracks the SJF-oracle upper bound within 5.1% at p99. On Google Borg: +14–22% mean JCT at ρ ≥ 0.8 where queue-ordering decisions dominate.

All results reproducible with one command (make bench). Byte-identical results.json across macOS and Windows (SHA-256 verified). Full methodology and per-trace tables in BENCHMARKS.md.

Configuration

from chronoq_ranker import RankerConfig, TaskRanker

config = RankerConfig(
    cold_start_threshold=50,               # records before promoting to LambdaRank
    retrain_every_n=100,                   # auto-retrain trigger interval
    incremental_rounds=10,                 # warm-start boosting rounds per incremental fit
    full_refit_every_n_incrementals=20,    # force full refit every N incrementals
    min_groups=20,                         # minimum pairwise groups per fit
    num_leaves=31,                         # LightGBM num_leaves
    n_estimators=500,                      # LightGBM n_estimators
    learning_rate=0.05,                    # LightGBM learning rate
    storage_uri="sqlite:///telemetry.db",
)
ranker = TaskRanker(config=config)

Custom feature extractor

The default extractor ships 15 features tuned for general-purpose task queues. Roll your own by subclassing FeatureExtractor:

from chronoq_ranker import FeatureExtractor, FeatureSchema, TaskRanker

class MyExtractor(FeatureExtractor):
    schema = FeatureSchema(
        version="my-v1",
        numeric=["payload_size"],
        categorical=["task_type"],
    )

    def extract(self, candidate, context=None):
        return {
            "task_type": candidate.task_type,
            "payload_size": float(candidate.features.get("payload_size", 0)),
        }

    def extract_from_record(self, record):
        return {
            "task_type": record.task_type,
            "payload_size": float(record.payload_size),
        }

ranker = TaskRanker(feature_extractor=MyExtractor())

Integration

Celery: chronoq-celery provides a 2-line LearnedScheduler drop-in with fifo / shadow / active modes
Any other task system: call ranker.predict_scores(candidates) before dispatch, ranker.record(...) after completion

Honest limitations

p99 starvation at ρ ≥ 0.8: SJF-family tradeoff — short-first bias indefinitely delays long jobs at the tail. Pair with aging in production. An aging-aware scheduler is planned for v0.3.0.
Workload-dependent wins: on traces where even a clairvoyant oracle cannot improve p99 (narrow duration variance, single task type), the ranker also cannot. The bench harness is a diagnostic tool for this — see the Azure Functions result in BENCHMARKS.md.
Pre-1.0 API: breaking changes are allowed in minor-version bumps under the project's semver policy; deprecation shims land one minor ahead with a CHANGELOG "Breaking" entry.

Runtime dependencies

lightgbm>=4.3, numpy>=1.26, pydantic>=2.0, scikit-learn>=1.4, loguru>=0.7. No Celery, Redis, FastAPI, or other queueing framework imports.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0.post1

Apr 24, 2026

0.2.0

Apr 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chronoq_ranker-0.2.0.post1.tar.gz (24.9 kB view details)

Uploaded Apr 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chronoq_ranker-0.2.0.post1-py3-none-any.whl (30.0 kB view details)

Uploaded Apr 24, 2026 Python 3

File details

Details for the file chronoq_ranker-0.2.0.post1.tar.gz.

File metadata

Download URL: chronoq_ranker-0.2.0.post1.tar.gz
Upload date: Apr 24, 2026
Size: 24.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.10

File hashes

Hashes for chronoq_ranker-0.2.0.post1.tar.gz
Algorithm	Hash digest
SHA256	`4e49360cd0f9ac9818c265abda10c0dfb5f2732af5745252f139fd9ed18d8e9f`
MD5	`bbc62aef43c382b05aa1a0386cd766c7`
BLAKE2b-256	`f0848a8cbee2a232080af3b611dada3710d97d852bb82edc315089f518f3ef8f`

See more details on using hashes here.

File details

Details for the file chronoq_ranker-0.2.0.post1-py3-none-any.whl.

File metadata

Download URL: chronoq_ranker-0.2.0.post1-py3-none-any.whl
Upload date: Apr 24, 2026
Size: 30.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.10

File hashes

Hashes for chronoq_ranker-0.2.0.post1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`48bf02bd23efa9788f83712dc2c5443344611b4929ececab58297c6fad37a26a`
MD5	`d2fb715a835a029ed928387db8ef2ed6`
BLAKE2b-256	`378d672d4e8a1d41514a234af0ddd974f1d33d56bcde79d74dd2aa8d4253cc1a`

See more details on using hashes here.

chronoq-ranker 0.2.0.post1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

chronoq-ranker

Install

Why this exists

Quick start

What you get

Evidence

Configuration

Custom feature extractor

Integration

Honest limitations

Runtime dependencies

Links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes