Temporal partitioning and backtesting utilities for time-correlated datasets.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

MarcosMuraro

These details have not been verified by PyPI

Project description

Jano

Jano logo

Jano is a Python library for defining temporal partitions and backtesting schemes over time-correlated datasets.

The missing layer between ML models and production temporal validation.

Documentation: marmurar.github.io/jano

It is designed for cases where a plain train_test_split() is not enough: transactional data, production simulations, repeated retraining, walk-forward validation, model monitoring, rule evaluation, or any experiment where the ordering of time matters.

The core accepts pandas.DataFrame, numpy.ndarray and polars.DataFrame inputs through a unified API. Jano keeps native pandas, NumPy and Polars paths for partition planning when that is safe, and falls back to pandas materialization for reporting and user-facing slices.

The project is named after Janus, the Roman god of beginnings, transitions and thresholds. That framing fits the library well: Jano helps define how a dataset moves from training periods into evaluation periods, fold after fold.

Reproducible external datasets

Jano examples should be reproducible without committing large datasets to Git. Dataset metadata is versioned in datasets/registry.json, while downloaded files stay local under data/raw/, which is ignored by the repository.

List available datasets:

python scripts/download_dataset.py --list

Download one locally:

python scripts/download_dataset.py bike_sharing_hourly --extract

The current registry includes Bike Sharing, BTS Airline On-Time Performance, NYC TLC Yellow Taxi, Household Power and Rossmann Store Sales datasets for regression, classification, ordinal-cost, larger benchmark and gold-example workflows.

Rossmann is the recommended gold example because it is a panel dataset with one row per store and day. It makes the difference between random splitting, chronological holdout, walk-forward simulation and retraining-policy comparison easy to inspect:

python scripts/download_dataset.py rossmann_store_sales --extract

This command requires the Kaggle CLI and local Kaggle credentials. The executable notebook lives at notebooks/rossmann_temporal_validation.ipynb. If Kaggle credentials are not available, the notebook uses a deterministic Rossmann-like fallback so Jano can still be executed end to end without committing large data files.

Why Jano exists

Many machine learning datasets are not just tabular; they are structured over time and often across multiple entities such as users, routes, sellers or products. In those settings, a more faithful view of the data is not "a bag of independent rows" but a temporally ordered process.

Standard evaluation tooling usually assumes observations are i.i.d. enough that a static split is acceptable. That assumption breaks quickly when time matters: future information leaks into training, performance estimates become optimistic, and offline validation stops reflecting what really happens in production.

Most train/test utilities answer a simple question:

"How do I split this dataset once?"

Jano is meant to answer a richer one:

"How would this system have behaved over time if I had trained, retrained and evaluated it under a specific temporal policy?"

That difference is the core of the project. Jano treats evaluation as a temporal simulation rather than a static partition. Instead of defining one split, it defines a policy over time: train window, evaluation horizon, shift between iterations and optional leakage-control gaps. Running that policy produces a sequence of causally valid folds rather than one aggregate estimate.

That also makes it a useful way to evidence drift in simulation results, because temporal shifts in behavior, performance or calibration become visible fold after fold.

That makes it useful not only for machine learning, but for any workflow where the data is time-dependent:

Backtesting predictive models on transactional data.
Simulating daily or weekly retraining in production.
Comparing rolling versus expanding windows.
Introducing explicit gaps between training and evaluation periods.
Defining train/test or train/validation/test partitions with durations, row counts or percentages.
Surfacing drift in simulation outcomes by making temporal changes explicit across folds.

It supports:

single, rolling and expanding strategies.
train_test and train_val_test layouts.
Segment sizes defined as durations like "30D", row counts like 5000, or fractions like 0.7.
Calendar-aligned duration windows with calendar_frequency="D" when you want complete days instead of elapsed-time windows anchored at the first timestamp.
Optional gaps before validation or test segments.
Plain index output through split().
Rich fold objects through iter_splits().
Simulation summaries, HTML timeline reports and plot-ready chart data through describe_simulation().
An adaptive partition engine that keeps pandas, NumPy and Polars inputs native for planning when it is safe, and falls back to pandas when stability is more important.

Example: random splits vs temporal validation

sklearn.model_selection.train_test_split is useful for random i.i.d.-style evaluation. It is the wrong abstraction when the model will be trained on the past and asked to predict the future.

The first snippet assumes scikit-learn is installed only to illustrate the common baseline. Jano itself does not require scikit-learn.

import pandas as pd
from sklearn.model_selection import train_test_split

frame = pd.DataFrame(
    {
        "timestamp": pd.date_range("2025-01-01", periods=120, freq="D"),
        "feature": range(120),
        "target": [0] * 80 + [1] * 40,
    }
)

train_random, test_random = train_test_split(
    frame,
    test_size=0.2,
    shuffle=True,
    random_state=7,
)

print(train_random["timestamp"].max() > test_random["timestamp"].min())
# True: train contains dates later than some test rows.

With Jano, the evaluation is a temporal policy:

from jano import TemporalPartitionSpec, WalkForwardPolicy

policy = WalkForwardPolicy(
    time_col="timestamp",
    partition=TemporalPartitionSpec(
        layout="train_test",
        train_size="60D",
        test_size="14D",
        gap_before_test="1D",
    ),
    step="14D",
    strategy="rolling",
)

plan = policy.plan(frame, title="Production-like temporal validation")
print(plan.to_frame()[["iteration", "train_end", "test_start", "test_end"]])

That makes the temporal contract inspectable before training: train only sees the past, test moves forward, and optional gaps model label or data availability latency.

Example: run a full simulation without manual iteration

import pandas as pd

from jano import TemporalPartitionSpec, WalkForwardPolicy

frame = pd.DataFrame(
    {
        "timestamp": pd.date_range("2024-01-01", periods=60, freq="D"),
        "feature": range(60),
        "target": range(100, 160),
    }
)

policy = WalkForwardPolicy(
    time_col="timestamp",
    partition=TemporalPartitionSpec(
        layout="train_test",
        train_size="30D",
        test_size="1D",
    ),
    step="1D",
    strategy="rolling",
)

result = policy.run(frame, title="One month in production")

print(result.total_folds)
print(result.engine_metadata.to_dict())
print(result.summary.to_frame().head())
print(result.chart_data.segment_stats)

By default, engine="auto" lets Jano choose the safest fast path for partitioning:

Example: run a model over the walk-forward policy

import numpy as np

from jano import TemporalPartitionSpec, WalkForwardPolicy, WalkForwardRunner

def mae(y_true, y_pred):
    return float(np.mean(np.abs(np.asarray(y_true) - np.asarray(y_pred))))

def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2)))

policy = WalkForwardPolicy(
    time_col="timestamp",
    partition=TemporalPartitionSpec(
        layout="train_test",
        train_size="30D",
        test_size="7D",
    ),
    step="7D",
    strategy="rolling",
)

runner = WalkForwardRunner(
    model=model,
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    retrain="periodic",
    retrain_interval=2,
    metrics={"mae": mae, "rmse": rmse},
)

run = runner.run(policy, frame)

print(run.to_frame().head())
print(run.summary())
print(run.metric_trajectory().head())
print(run.retrain_events())

report_data = run.report_data(include_predictions=False)

Example: run a temporal system with update policies

Not every temporal workflow updates itself through fit() and predict(). For RAG refreshes, prompt-set updates or fine-tuning jobs, the more natural contract is "update the current system state, then evaluate that state on the next temporal window".

import numpy as np
import pandas as pd

from jano import (
    PeriodicRetrain,
    SystemEvaluationResult,
    SystemUpdateResult,
    TemporalPartitionSpec,
    TemporalSystemRunner,
    WalkForwardPolicy,
)

class MeanTargetSystem:
    def update(self, train_frame: pd.DataFrame):
        mean_target = float(train_frame["target"].mean())
        return SystemUpdateResult(
            state=mean_target,
            metadata={"train_target_mean": mean_target},
        )

    def evaluate(self, state, test_frame: pd.DataFrame):
        predictions = np.repeat(float(state), len(test_frame))
        mae = float(np.mean(np.abs(test_frame["target"] - predictions)))
        return SystemEvaluationResult(
            metrics={"mae": mae},
            metadata={"prediction_mean": float(state)},
        )

runner = TemporalSystemRunner(
    system=MeanTargetSystem(),
    update_policy=PeriodicRetrain(2),
    metric_directions={"mae": "min"},
    primary_metric="mae",
)

run = runner.run(policy, frame)

print(run.to_frame().head())
print(run.metric_trajectory().head())
print(run.update_events())

Example: evaluate observation-driven online updates

Jano always operates on temporal processes, but the clock does not have to be a calendar interval. Online evaluation can advance by observed events or micro-batches: predict the next tick, observe its target, update the model, and repeat. This keeps the splitter core unchanged while extending the simulation layer.

from jano import OnlineTemporalRunner, PartialFitUpdateStrategy

runner = OnlineTemporalRunner(
    model=model,
    time_col="timestamp",
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    initial_train_size="30D",
    update_size=1,
    metrics={"mae": mae, "rmse": rmse},
    update_strategy=PartialFitUpdateStrategy(),
)

run = runner.run(frame)

print(run.to_frame().head())
print(run.metric_trajectory().head())
print(run.summary())

Use update_size=1 for one temporal tick per observed event, an integer such as update_size=100 for row micro-batches, or a duration such as update_size="1D" for calendar-based micro-batches.

You can also mark retraining inflection points with a user-defined trigger. The trigger receives the online history observed so far and the latest scored batch:

def should_retrain(history, latest):
    if latest["mae"] > 0.15:
        return {
            "retrain": True,
            "reason": "mae crossed production tolerance",
            "score": latest["mae"],
        }
    return False

runner = OnlineTemporalRunner(
    model=model,
    time_col="timestamp",
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    initial_train_size="30D",
    update_size=100,
    metrics={"mae": mae},
    retrain_trigger=should_retrain,
)

run = runner.run(frame)
print(run.retrain_checkpoints())

Jano does not decide the drift formula for you. It records the exact temporal checkpoint where your trigger fired, including batch number, timestamps, row counts, metrics and optional trigger metadata.

PartialFitUpdateStrategy is for estimators that support real incremental updates. For standard fit/predict estimators, use RefitUpdateStrategy:

from jano import OnlineTemporalRunner, RefitUpdateStrategy

runner = OnlineTemporalRunner(
    model=model,
    time_col="timestamp",
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    initial_train_size="30D",
    update_size="1D",
    metrics={"mae": mae},
    update_strategy=RefitUpdateStrategy(max_train_rows=10_000),
)

That refits the estimator after each observed batch, optionally keeping only the most recent max_train_rows rows to approximate bounded production history.

You can also compare several observation-driven policies over the same stream:

from jano import OnlineUpdatePolicy, OnlineUpdatePolicyStudy, RefitUpdateStrategy

study = OnlineUpdatePolicyStudy(
    model=model,
    time_col="timestamp",
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    initial_train_size="30D",
    policies=[
        OnlineUpdatePolicy("every-event", update_size=1, update_strategy=RefitUpdateStrategy()),
        OnlineUpdatePolicy("every-100-events", update_size=100, update_strategy=RefitUpdateStrategy()),
        OnlineUpdatePolicy("daily", update_size="1D", update_strategy=RefitUpdateStrategy()),
    ],
    metrics={"mae": mae},
)

comparison = study.run(frame)

print(comparison.to_frame())
print(comparison.find_optimal_policy(metric="mae", update_cost_weight=0.01))

This shifts the question from "which date should trigger retraining?" to "how much new evidence should trigger retraining, and is the predictive gain worth the update cost?".

Supported retrain modes are:

retrain=True or retrain="always" to refit on every fold.
retrain=False or retrain="never" to train once and benchmark a fixed model.
retrain="periodic" with retrain_interval=K to refit every K folds.
retrain_policy=DriftBasedRetrain(...) when the next retrain decision should depend on previously observed fold metrics.
retrain_policy=FunctionRetrainPolicy(...) when the retrain decision is a custom function of fold history, dates, costs or external thresholds.

Evaluation profiles separate how a run is measured from when a model is retrained. Jano does not implement metric formulas. Pass the loss or score functions that match your problem and declare whether lower or higher is better:

import numpy as np

from jano import EvaluationProfile, FunctionRetrainPolicy, WalkForwardRunner

def daily_cost(y_true, y_pred):
    return float(np.mean(np.abs(y_true - y_pred)))

def retrain_rule(context):
    if context.history.empty:
        return True
    latest = context.history["daily_cost"].iloc[-1]
    limit = limit_for_date(context.split.boundaries["test"].end)
    return latest > limit

runner = WalkForwardRunner(
    model=model,
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    evaluation=EvaluationProfile(
        metrics={"daily_cost": daily_cost},
        metric_directions={"daily_cost": "min"},
        primary_metric="daily_cost",
    ),
    retrain_policy=FunctionRetrainPolicy(retrain_rule),
)

Convenience profiles such as RegressionProfile, ClassificationProfile, OrdinalClassificationProfile and RankingProfile keep the API explicit without splitting the runner into problem-specific classes.

Runner results are intentionally data-first rather than dashboard-first:

run.fold_summary() returns temporal fold geometry and retraining metadata.
run.metric_trajectory() returns metrics in long format, ready for plotting.
run.retrain_events() returns only folds where the estimator was refit.
run.predictions_frame() returns row-level test predictions.
run.report_data() / run.to_dict() return structured dictionaries for notebooks, agents, dashboards or presentation tools.

Built-in scenarios compose the core for common production-style questions without changing the runner. For example, estimate prediction bands by fold with a user-owned band estimator:

from jano import estimate_prediction_band_by_fold


class FixedWidthBand:
    def estimate(self, context):
        return {
            "lower": context.predictions - 5.0,
            "upper": context.predictions + 5.0,
            "artifacts": {"method": "fixed_width"},
        }


result = estimate_prediction_band_by_fold(
    frame,
    estimator=model,
    band_estimator=FixedWidthBand(),
    time_col="timestamp",
    target_col="target",
    feature_cols=["feature_a", "feature_b"],
    train_size="90D",
    test_size="7D",
    step="7D",
    metrics={"mae": mae},
)

print(result.to_frame().head())
print(result.predictions_frame().head())

The band_estimator is user code. It can use scikit-learn KFold, a bootstrap routine, conformal prediction, quantile regression or any other method to produce lower and upper arrays for each Jano test fold.

pandas inputs stay pandas, Polars inputs use Polars column extraction, and NumPy arrays use array indexing. You can force a path with engine="pandas", engine="polars" or engine="numpy" when you need deterministic behavior for a pipeline.

If you want to inspect the full simulation geometry before materializing folds, plan it first:

plan = policy.plan(frame, title="One month in production")
print(plan.total_folds)
print(plan.to_frame().head())

filtered = plan.exclude_windows(
    train=[("2025-12-20", "2026-01-05")],
).select_from_iteration(5)

result = filtered.materialize()

That plan frame includes the explicit iteration index, segment boundaries and row counts for each fold.

You can also anchor a simulation to a specific date and limit how many folds are materialized:

policy = WalkForwardPolicy(
    time_col="timestamp",
    partition=TemporalPartitionSpec(
        layout="train_test",
        train_size="15D",
        test_size="4D",
    ),
    step="1D",
    strategy="rolling",
    start_at="2025-09-01",
    max_folds=15,
)

result = policy.run(frame, title="15 daily retraining iterations")

The recommended walk-forward surface also supports end_at when you want to constrain the simulation to a bounded time window before folds are generated.

When a single timestamp is not enough, WalkForwardPolicy, TemporalSimulation and TemporalBacktestSplitter can also receive a TemporalSemanticsSpec. That lets you keep one column as the reported timeline while using different timestamp columns to decide whether train, validation or test rows are actually eligible. This is useful for production-style leakage control, for example when a target only becomes available at arrived_at even if the operational timeline is anchored on departured_at.

For numpy.ndarray inputs, use integer column references:

import numpy as np

values = np.array(
    [
        ["2025-09-01", 1.2, 10],
        ["2025-09-02", 1.5, 11],
        ["2025-09-03", 1.1, 12],
    ],
    dtype=object,
)

splitter = TemporalBacktestSplitter(
    time_col=0,
    partition=TemporalPartitionSpec(
        layout="train_test",
        train_size="2D",
        test_size="1D",
    ),
    step="1D",
    strategy="single",
)

Example: manual control with the low-level splitter

from jano import TemporalBacktestSplitter, TemporalPartitionSpec

splitter = TemporalBacktestSplitter(
    time_col="timestamp",
    partition=TemporalPartitionSpec(
        layout="train_val_test",
        train_size=0.6,
        validation_size=0.2,
        test_size=0.2,
    ),
    step=0.2,
    strategy="single",
)

for split in splitter.iter_splits(frame):
    print(split.summary())

Example: keep the same test window and grow train backward

This is a special use case. It is useful when you want to study whether more training history really improves the same test slice.

from jano import TrainHistoryPolicy

policy = TrainHistoryPolicy(
    "timestamp",
    cutoff="2025-09-15",
    train_sizes=["7D", "14D", "21D", "28D"],
    test_size="4D",
)

result = policy.evaluate(
    frame,
    model=model,
    target_col="target",
    feature_cols=["feature_1", "feature_2"],
    metrics={"mae": mae, "rmse": rmse},
)

print(result.to_frame()[["train_size", "rmse"]])
print(result.find_optimal_train_size(metric="rmse", tolerance=0.01))

That pattern keeps test fixed while train expands toward the past. It is a practical way to study data efficiency or to estimate how much history is actually needed.

The opposite special case is also common: keep train fixed and move test forward day by day to estimate how long a model or rule keeps its performance without retraining. The two patterns answer different questions:

fixed test + growing train: how much history do I actually need?
fixed train + moving test: for how long does performance hold after deployment?

Example of the second pattern:

from jano import DriftMonitoringPolicy

policy = DriftMonitoringPolicy(
    "timestamp",
    cutoff="2025-09-15",
    train_size="30D",
    test_size="3D",
    step="1D",
    max_windows=10,
)

result = policy.evaluate(
    frame,
    model=model,
    target_col="target",
    feature_cols=["feature_1", "feature_2"],
    metrics={"mae": mae, "rmse": rmse},
)

print(result.to_frame()[["window", "test_start", "rmse"]])
print(result.find_drift_onset(metric="rmse", threshold=0.15, baseline="first"))

Example: optimize training history inside each walk-forward iteration

This is the next-level composed question: if each outer test window is allowed to choose its own optimal training history, how much history is needed on average?

from jano import RollingTrainHistoryPolicy, TemporalPartitionSpec

policy = RollingTrainHistoryPolicy(
    "timestamp",
    partition=TemporalPartitionSpec(
        layout="train_test",
        train_size="30D",
        test_size="1D",
    ),
    step="1D",
    strategy="rolling",
    max_folds=10,
    train_sizes=["5D", "10D", "15D", "30D"],
)

result = policy.evaluate(
    frame,
    model=model,
    target_col="target",
    feature_cols=["feature_1", "feature_2"],
    metrics={"rmse": rmse},
    metric="rmse",
    tolerance=0.01,
)

print(result.to_frame().head())
print(result.summary())

Example: different feature groups can require different history depths

The supervised fold can stay fixed while feature engineering still asks for different lookback windows per feature group.

from jano import FeatureLookbackSpec

split = next(splitter.iter_splits(frame))
lookbacks = FeatureLookbackSpec(
    default_lookback="15D",
    group_lookbacks={"lag_features": "65D"},
    feature_groups={"lag_features": ["lag_30", "lag_60"]},
)

history = split.slice_feature_history(
    frame,
    lookbacks,
    time_col="timestamp",
    segment_name="train",
)

recent_context = history["__default__"]
lag_context = history["lag_features"]

This is useful when recent features only need a short window while lagged or seasonal features need much deeper historical context for the same model.

Example: describe a simulation as HTML

summary = splitter.describe_simulation(frame, title="Walk-forward simulation")
html = splitter.describe_simulation(frame, output="html")
chart_data = splitter.describe_simulation(frame, output="chart_data")

print(summary.total_folds)
print(summary.to_frame().head())
print(chart_data.segment_stats)

That gives you three ways to consume the same simulation:

summary for tabular metadata and export helpers,
html for a standalone visual report,
chart_data for direct Python plotting without reparsing HTML.

The generated report shows each fold across the dataset timeline, with richer summary cards, clearer segment labels and row counts per partition.

MCP server

Jano also ships an optional local MCP server so AI agents can use the library through a small, explicit tool surface instead of generating Python ad hoc.

Current MCP tools:

preview_local_dataset
inspect_local_dataset
inspect_and_recommend_local_dataset
suggest_temporal_partition_policy
validate_temporal_partition_policy
compare_temporal_partition_strategies
plan_walk_forward_simulation
run_walk_forward_simulation
run_walk_forward_baseline_model
compare_retrain_policy_baselines
find_train_history_window_baseline
monitor_decay_baseline

Install it in a Python 3.10+ environment:

python -m pip install "jano[mcp]"

Run it locally over stdio:

jano-mcp

Or use the module entrypoint:

python -m jano.mcp_server

Example MCP client configuration:

{
  "mcpServers": {
    "jano": {
      "command": "jano-mcp"
    }
  }
}

The MCP layer is intentionally opinionated: it exposes dataset inspection, policy suggestions, plan validation, walk-forward simulation, baseline-model execution and baseline temporal studies first, while the full Python library remains available when you need custom composition. The fastest first call for an agent is usually inspect_and_recommend_local_dataset, which bundles inspection and a conservative starting policy in one response.

The MCP server does not attempt to transport arbitrary Python systems or callables. If you need TemporalSystemRunner, WalkForwardRunner with custom metrics, or project-owned update logic such as reindexing a RAG system or refreshing prompt sets, use the Python API directly and keep MCP for dataset inspection, planning and baseline studies.

This is meant for MCP-aware coding assistants such as Claude Code, Claude Desktop, Cursor, Codex runtimes with MCP support, and other local agent environments. The server runs locally and reads only the file paths you provide to its tools; Jano does not upload datasets anywhere by itself.

AI-ready usage

Jano includes three surfaces intended to make the project easier for AI agents to use and extend:

Architecture notes in docs/architecture/ explain the project layers, accepted decisions, specs and open RFCs.
The canonical agent guide in docs/ai/jano-agent-guide.md explains which Jano API to use for common temporal validation tasks.
Tool-specific adapters provide lightweight entry points for Codex, Claude and Cursor:
- skills/jano/SKILL.md
- CLAUDE.md
- .cursor/rules/jano.mdc

Use the MCP server when an agent should execute Jano operations over local datasets. Use the skill or agent guide when an agent needs to reason about Jano, write code with the library or modify the repository safely.

Installation

Install the current release from PyPI:

python -m pip install jano

To use Polars inputs directly:

python -m pip install "jano[polars]"

For local development:

python -m pip install -e ".[dev]"
python -m pytest --cov=jano --cov-report=term-missing
python -m sphinx -b html docs docs/_build/html

Jano also exposes its runtime version through jano.__version__.

Release flow

The repository includes a dedicated GitHub Actions workflow for PyPI publication through trusted publishing.

The release path is:

Update jano/_version.py.
Run python -m pytest -q.
Run python -m build and python -m twine check dist/*.
Push a tag like v0.4.0.

That tag triggers the Publish workflow, which builds the wheel and source distribution and publishes them to PyPI.

In parallel, the repository also includes a GitHub Release workflow that can create a GitHub Release and attach the built wheel and source distribution for any v* tag.

Zenodo DOI

Jano includes repository metadata for Zenodo in .zenodo.json and citation metadata in CITATION.cff.

To mint a DOI for the project:

Log in to Zenodo with the GitHub account that owns or administers this repository.
Open the Zenodo GitHub integration page.
Click Sync now.
Enable the marmurar/jano repository.
Create a new GitHub Release for the next version tag.
Wait for Zenodo to archive the release and assign the DOI.
Add the generated DOI badge and DOI URL back to this README and the Sphinx docs.

Current Zenodo DOI: 10.5281/zenodo.20301006.

Continuous integration and coverage

The repository includes:

GitHub Actions for tests across multiple Python versions.
GitHub Pages publication for Sphinx documentation.
Coverage reporting with pytest-cov.
Codecov upload and status tracking.
A coverage gate set to 99%.

Status

Jano is an early public project with a usable core and an API that is still being refined as the simulation layer grows.

The low-level temporal partitioning surface is the most stable part of the library: TemporalBacktestSplitter, TemporalPartitionSpec, TemporalSimulation, WalkForwardPolicy and plan() are the foundation for manual fold iteration, auditability and simulation planning.

The higher-level execution and study APIs, including WalkForwardRunner, retrain policies, train-history studies and drift-monitoring helpers, are intentionally evolving. They are covered by tests and documented, but naming and ergonomics may still change while Jano is being shaped into a broader temporal experimentation framework.

Current distribution and quality signals:

PyPI package: jano.
Latest tested release line: 0.4.x.
Test suite: 152 passed.
Coverage gate: 99% minimum.
Current measured coverage: 99.30%.
Documentation: marmurar.github.io/jano.

For production use, pin an explicit version and review release notes before upgrading. For experimentation, temporal validation design work and prototype evaluation pipelines, the project is ready to use.

Citation

If you use Jano in research, technical reports, benchmarks or production validation work, please cite the project with this BibTeX entry:

@software{muraro_jano_2026,
  author       = {Muraro, Marcos Manuel},
  title        = {Jano: Temporal Simulation and Backtesting Toolkit for Time-Dependent Machine Learning Systems},
  year         = {2026},
  version      = {0.4.2},
  url          = {https://github.com/marmurar/jano},
  repository   = {https://github.com/marmurar/jano},
  doi          = {10.5281/zenodo.20301006},
  license      = {MIT},
  note         = {Python toolkit for temporal simulation, walk-forward validation, backtesting, and retraining-policy analysis for time-dependent machine learning systems}
}

The same citation metadata is also available in CITATION.cff.

Authors

Marcos Manuel Muraro

Contributing

Feedback and design discussion are especially valuable right now. If you are using temporal backtesting for ML, analytics, operations or experimentation, that context can help shape the API in the right direction.

Project direction

Jano is being reshaped as a small, explicit temporal partitioning toolkit with an interface inspired by sklearn.model_selection.

The design goals are:

Clear, composable temporal partition definitions.
Low hidden state and predictable behavior.
Compatibility with pandas-first workflows.
A splitter-style API that can evolve toward stronger scikit-learn interoperability.
Rich split objects for inspection, auditability and simulation.

Star history

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

MarcosMuraro

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.4.2

Jun 13, 2026

0.4.1

May 20, 2026

0.4.0

May 19, 2026

0.3.1

Apr 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jano-0.4.2.tar.gz (115.0 kB view details)

Uploaded Jun 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

jano-0.4.2-py3-none-any.whl (79.8 kB view details)

Uploaded Jun 13, 2026 Python 3

File details

Details for the file jano-0.4.2.tar.gz.

File metadata

Download URL: jano-0.4.2.tar.gz
Upload date: Jun 13, 2026
Size: 115.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for jano-0.4.2.tar.gz
Algorithm	Hash digest
SHA256	`f752bb02c8e047b1b11e1fe113816747f7e13672731a3d9a750166d3f9f3c7e8`
MD5	`5ea295789a0c38377168389c39eacaa4`
BLAKE2b-256	`5e40b62ac9f0b7372385a771a79380392ad16080f7ebd32be246c60127545265`

See more details on using hashes here.

Provenance

The following attestation bundles were made for jano-0.4.2.tar.gz:

Publisher: publish.yml on marmurar/jano

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: jano-0.4.2.tar.gz
- Subject digest: f752bb02c8e047b1b11e1fe113816747f7e13672731a3d9a750166d3f9f3c7e8
- Sigstore transparency entry: 1807326080
- Sigstore integration time: Jun 13, 2026
Source repository:
- Permalink: marmurar/jano@bbb0b87bf73c0ce94467a4dcbe0b2003971a5d08
- Branch / Tag: refs/tags/v0.4.2
- Owner: https://github.com/marmurar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@bbb0b87bf73c0ce94467a4dcbe0b2003971a5d08
- Trigger Event: push

File details

Details for the file jano-0.4.2-py3-none-any.whl.

File metadata

Download URL: jano-0.4.2-py3-none-any.whl
Upload date: Jun 13, 2026
Size: 79.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for jano-0.4.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ad973a8a4f242dc345377637793c732968762c0df5a31e9a01528f8d03b4114f`
MD5	`7c73c33d10eddd3f9dbcce5c9ff29d5c`
BLAKE2b-256	`3b3fb269ef1289cf3f08eea7246965f854f035c17c7fc4e9982bf06f63aac68c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for jano-0.4.2-py3-none-any.whl:

Publisher: publish.yml on marmurar/jano

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: jano-0.4.2-py3-none-any.whl
- Subject digest: ad973a8a4f242dc345377637793c732968762c0df5a31e9a01528f8d03b4114f
- Sigstore transparency entry: 1807326369
- Sigstore integration time: Jun 13, 2026
Source repository:
- Permalink: marmurar/jano@bbb0b87bf73c0ce94467a4dcbe0b2003971a5d08
- Branch / Tag: refs/tags/v0.4.2
- Owner: https://github.com/marmurar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@bbb0b87bf73c0ce94467a4dcbe0b2003971a5d08
- Trigger Event: push

jano 0.4.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

Jano

Reproducible external datasets

Why Jano exists

Example: random splits vs temporal validation

Example: run a full simulation without manual iteration

Example: run a model over the walk-forward policy

Example: run a temporal system with update policies

Example: evaluate observation-driven online updates

Example: manual control with the low-level splitter

Example: keep the same test window and grow train backward

Example: optimize training history inside each walk-forward iteration

Example: different feature groups can require different history depths

Example: describe a simulation as HTML

MCP server

AI-ready usage

Installation

Release flow

Zenodo DOI

Continuous integration and coverage

Status

Citation

Authors

Contributing

Project direction

Star history

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance