PyNRPF implementation package for reverse power flow detection and correction.

PyNRPF

PyNRPF is a Python package (pynrpf) for reverse power flow detection and correction, providing inference workflows and m8_xgb model training.

Install

python -m venv .venv
.\.venv\Scripts\Activate.ps1    # Windows PowerShell; on macOS/Linux: source .venv/bin/activate
pip install --upgrade pip
pip install -e .[dev]

User Journey

Step 1: Choose your workflow

  • Inference only:
    • Use run_inference(...).
    • If model is m8_xgb, provide a trained bundle URI.
  • Train + inference (m8_xgb):
    • Use train_m8_xgb(...) first.
    • Take returned artifact_uri.
    • Put it into pynrpf_inference.artifacts.m8_pretrained_bundle_uri.
    • Run run_inference(...).

Step 2: Keep one config file (pipeline.yaml)

PyNRPF reads only:

  • pynrpf_inference
  • pynrpf_training (for training)

Other pipeline keys (tables, orchestration, write targets) are ignored by PyNRPF.
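As an illustration, a minimal pipeline.yaml might look like the following. Only the top-level blocks and the fields documented in this README (selected_model, the columns mapping, artifacts.m8_pretrained_bundle_uri) are taken from the docs; the physical column names and the bundle path are hypothetical placeholders:

```yaml
# Hypothetical sketch of a minimal pipeline.yaml.
pynrpf_inference:
  selected_model: m8_xgb          # see list_models() for valid ids
  columns:                        # logical -> physical column mapping
    site: site_id
    timestamp: interval_start
    net_load: net_load_kw
    solar: solar_kw
  artifacts:
    # Placeholder; point this at a real trained bundle for m8_xgb.
    m8_pretrained_bundle_uri: dbfs:/Volumes/my_catalog/pynrpf/bundle.pkl

pynrpf_training: {}               # only needed when running train_m8_xgb(...)

# Any other keys (tables, orchestration, write targets) are ignored by PyNRPF.
```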

Step 3: Run training and inference

import yaml
from pynrpf import run_inference, train_m8_xgb

with open("config/pipeline.yaml", "r", encoding="utf-8") as f:
    cfg = yaml.safe_load(f)

train_out = train_m8_xgb(train_df, cfg)

cfg.setdefault("pynrpf_inference", {}).setdefault("artifacts", {})
cfg["pynrpf_inference"]["artifacts"]["m8_pretrained_bundle_uri"] = train_out["artifact_uri"]

result = run_inference(score_df, cfg)
scored_df = result["data"]
summary = result["summary"]

If you already have a trained bundle, skip train_m8_xgb(...) and set m8_pretrained_bundle_uri directly in YAML.

API Classification

Core APIs (most users)

  • run_inference(data, config)
  • train_m8_xgb(data, config)
  • load_config(config)
  • list_models()

Additional APIs (advanced/helpers)

  • load_artifact_bundle(location)
  • save_artifact_bundle(bundle, location)
  • build_pipeline_config(model_id, include_training)
  • generate_pipeline_config(output_path, model_id, include_training, overwrite)
  • generate_model_scaffold(model_id, output_dir, overwrite, include_tests, include_pipeline_config)

Detailed API Function Guide

Core: run_inference(data, config)

What it does:

  • Validates and standardizes input data.
  • Selects model from config (m7_dtr or m8_xgb).
  • Runs model inference and returns scored data plus operational summary.

Use when:

  • You want corrected net load + flags on new data.

Input:

  • data: pandas DataFrame or Spark DataFrame.
  • config: mapping or YAML path. Can be:
    • pure inference config, or
    • full pipeline config containing pynrpf_inference.
  • Required logical columns (configured under columns):
    • site, timestamp, net_load, solar.

Output:

  • data: same table type as input (pandas in, pandas out; Spark in, Spark out).
  • summary: row counts and monitoring stats.
  • model: resolved model id.
  • input_type: "pandas" or "spark".
  • m7_dtr note: strict day flags remain threshold-based, while interval corrections and corrected net load use a relaxed, threshold-free minima span, so day and interval flags may diverge.

Common errors:

  • Missing required columns.
  • Unsupported model id.
  • m8_xgb without artifacts.m8_pretrained_bundle_uri.
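The first of these errors can be caught caller-side before invoking the API. A stdlib-only sketch, where missing_logical_columns is an illustrative helper (not part of the pynrpf API) and the required logical names come from this README:

```python
# Illustrative pre-flight check; not part of pynrpf itself.
REQUIRED_LOGICAL = ("site", "timestamp", "net_load", "solar")

def missing_logical_columns(frame_columns, columns_cfg):
    """Return required logical names whose configured physical column
    is absent from the input frame's columns. Logical names with no
    mapping are assumed to match the physical name directly."""
    missing = []
    for logical in REQUIRED_LOGICAL:
        physical = columns_cfg.get(logical, logical)
        if physical not in frame_columns:
            missing.append(logical)
    return missing

# Example: config maps two logical names; "solar" is absent entirely.
cols_cfg = {"site": "site_id", "timestamp": "ts"}
frame_cols = ["site_id", "ts", "net_load"]
print(missing_logical_columns(frame_cols, cols_cfg))  # ['solar']
```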

Core: train_m8_xgb(data, config)

What it does:

  • Trains both internal models:
    • xgb1_day (day classifier)
    • xgb2_timestamp (interval classifier)
  • Writes a versioned artifact bundle and manifest.
  • Returns artifact URIs + validation metrics.

Use when:

  • You need to create or refresh m8_xgb artifacts for inference.

Input:

  • data: interval-level pandas/Spark DataFrame containing:
    • inference columns (site, timestamp, net_load, solar)
    • day label column
    • interval label column
  • config: mapping or YAML path containing:
    • pynrpf_inference
    • pynrpf_training

Output:

  • bundle: in-memory artifact dictionary.
  • bundle_schema: currently pynrpf.m8_xgb.bundle.v2.
  • artifact_uri: bundle file URI to use for inference.
  • artifact_dir_uri, manifest_uri.
  • validation_metrics for both stages.

Common errors:

  • Missing day/interval labels.
  • Invalid training split window.
  • Invalid threshold values.
  • Unsupported training model id (currently only m8_xgb).

Core: load_config(config)

What it does:

  • Loads and validates inference config.
  • Accepts mapping or YAML path.
  • If full pipeline config is provided, extracts pynrpf_inference.
  • Applies defaults and normalizes model selection fields.

Use when:

  • You want to inspect/validate final effective inference config before execution.
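The "accept either shape" behavior described above can be pictured with a stdlib-only sketch. extract_inference_cfg is illustrative; the real load_config additionally validates the config and applies defaults:

```python
# Illustrative sketch of accepting either a pure inference config or a
# full pipeline config; the real load_config does more (validation,
# defaults, model-selection normalization).
def extract_inference_cfg(cfg):
    # Full pipeline config: pull out the pynrpf_inference block.
    if "pynrpf_inference" in cfg:
        return cfg["pynrpf_inference"]
    # Otherwise treat the mapping as a pure inference config.
    return cfg

pipeline = {"pynrpf_inference": {"selected_model": "m7_dtr"}, "tables": {}}
pure = {"selected_model": "m7_dtr"}
print(extract_inference_cfg(pipeline) == extract_inference_cfg(pure))  # True
```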

Core: list_models()

What it does:

  • Returns currently registered inference model ids.

Use when:

  • You want to see which model names are valid for selected_model.

Additional: load_artifact_bundle(location)

What it does:

  • Reads and deserializes a pickle artifact bundle.
  • Supports local paths, file://, dbfs:/, and http(s):// for reads.

Use when:

  • You want to inspect/debug a trained artifact payload.
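The supported read schemes suggest dispatch along these lines. This is a standard-library sketch of how a reader might classify locations, not the actual pynrpf implementation:

```python
from urllib.parse import urlparse

# Hypothetical classifier for the location schemes listed above;
# not the actual pynrpf reader.
def read_strategy(location):
    scheme = urlparse(location).scheme
    if scheme in ("http", "https"):
        return "http"
    if scheme == "dbfs":
        return "dbfs"
    # "file://..." URIs and bare local paths both resolve to local reads.
    return "local"

print(read_strategy("dbfs:/Volumes/x/bundle.pkl"))  # dbfs
print(read_strategy("https://host/bundle.pkl"))     # http
print(read_strategy("models/bundle.pkl"))           # local
```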

Additional: save_artifact_bundle(bundle, location)

What it does:

  • Serializes and writes bundle payload to a local or DBFS/Volumes-backed path.

Use when:

  • You need explicit one-file bundle writes outside training API orchestration.

Additional: build_pipeline_config(model_id, include_training)

What it does:

  • Builds an in-memory pipeline-style config dictionary with pynrpf_inference.
  • Optionally includes pynrpf_training (currently only for m8_xgb).

Use when:

  • You want a Python-first config object without writing a file.

Additional: generate_pipeline_config(output_path, model_id, include_training, overwrite)

What it does:

  • Writes a pipeline YAML template to disk using the same schema as build_pipeline_config(...).

Use when:

  • You want a starter config file for Databricks/notebook use.

Additional: generate_model_scaffold(model_id, output_dir, overwrite, include_tests, include_pipeline_config)

What it does:

  • Creates a starter plugin module under src/pynrpf/plugins/.
  • Optionally creates a plugin test and pipeline config template.
  • Auto-wires model import/export and registry entries:
    • src/pynrpf/plugins/__init__.py
    • src/pynrpf/registry.py

Use when:

  • You want to add a new model quickly and start editing logic immediately.

API Input/Output Schemas (Quick View)

Input:

  • run_inference:
    • data (pandas/Spark) + config
  • train_m8_xgb:
    • labeled interval data + training/inference config blocks

Output:

{
  "run_inference": {
    "data": "<same type as input>",
    "summary": {...},
    "model": "<model_id>",
    "input_type": "pandas|spark"
  },
  "train_m8_xgb": {
    "bundle_schema": "pynrpf.m8_xgb.bundle.v2",
    "artifact_uri": "<base>/m8_xgb/<utc_ts>/bundle.pkl",
    "validation_metrics": {
      "xgb1_day": {...},
      "xgb2_timestamp": {...}
    }
  }
}
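The <base>/m8_xgb/<utc_ts>/bundle.pkl layout above can be reproduced with a small sketch. The base path and the exact UTC timestamp format are assumptions for illustration, not the package's documented format:

```python
from datetime import datetime, timezone

# Illustrative reconstruction of the artifact_uri layout shown above.
# The timestamp format here is an assumption.
def bundle_uri(base, model_id="m8_xgb", now=None):
    ts = (now or datetime.now(timezone.utc)).strftime("%Y%m%dT%H%M%SZ")
    return f"{base.rstrip('/')}/{model_id}/{ts}/bundle.pkl"

uri = bundle_uri("dbfs:/Volumes/pynrpf/artifacts",
                 now=datetime(2025, 1, 2, 3, 4, 5, tzinfo=timezone.utc))
print(uri)  # dbfs:/Volumes/pynrpf/artifacts/m8_xgb/20250102T030405Z/bundle.pkl
```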

m8_xgb Notes

m8_xgb is a two-stage model family:

  • xgb1_day (day-level)
  • xgb2_timestamp (interval-level)

Training consumes one interval-level labeled dataset and internally builds both feature schemas.
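One way to picture the two-stage flow is day-level gating of interval-level predictions: an interval is flagged only when both its day classifier and its interval classifier fire. This is a hypothetical sketch; the actual combination logic inside m8_xgb may differ:

```python
# Hypothetical two-stage gating sketch; the real m8_xgb combination
# logic may differ.
def combine_flags(day_flags, interval_flags):
    """Flag an interval only if its day is flagged AND the interval
    classifier fires. day_flags is the xgb1_day prediction broadcast
    per interval; interval_flags is the xgb2_timestamp prediction."""
    return [d and i for d, i in zip(day_flags, interval_flags)]

day = [True, True, False, False]
interval = [True, False, True, False]
print(combine_flags(day, interval))  # [True, False, False, False]
```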

Artifact contract details:

  • docs/m8_xgb_artifact_contract.md

Scaffold Helpers

Generate starter model logic/test/config files:

from pynrpf import generate_model_scaffold

created = generate_model_scaffold("m9_custom", output_dir=".")
print(created)

generate_model_scaffold(...) now auto-wires:

  • src/pynrpf/plugins/__init__.py import/export list
  • src/pynrpf/registry.py model registry entry

Generate only a pipeline config template:

from pynrpf import generate_pipeline_config

generate_pipeline_config(
    output_path="config/pynrpf_pipeline_m8_xgb.yaml",
    model_id="m8_xgb",
    include_training=True,
    overwrite=True,
)

Conference Publication Archive

Publication artifacts are isolated and frozen under:

  • publication/1_conference_paper

Archive run instructions:

  • publication/1_conference_paper/README.md

Continuous Integration and Release

  • ci: lint, tests, build
  • release: publish on version tags (v*)
  • publication_archive_smoke (nightly): non-blocking publication notebook smoke

Download files


Source Distribution

pynrpf-0.3.0.tar.gz (36.7 kB)

Built Distribution


pynrpf-0.3.0-py3-none-any.whl (33.5 kB)

File details

Details for the file pynrpf-0.3.0.tar.gz.

File metadata

  • Download URL: pynrpf-0.3.0.tar.gz
  • Size: 36.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pynrpf-0.3.0.tar.gz:

  • SHA256: c705be392b9d30ded58753e2be460fa87babea9f710d8571871c7f4aac1a9193
  • MD5: 9d63bfa0dc860a1ff79e2d06455f9543
  • BLAKE2b-256: 240ad9b8bd9deda0352cb683700e8384a5d19ae0702c5342019d06dcee3071eb

Provenance

The following attestation bundles were made for pynrpf-0.3.0.tar.gz:

Publisher: release.yml on mssamhan31/PyNRPF

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pynrpf-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: pynrpf-0.3.0-py3-none-any.whl
  • Size: 33.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pynrpf-0.3.0-py3-none-any.whl:

  • SHA256: d7b9112851bed87c25b2ecb67e79f34988d3cc9dda7d9cdc1352d8c830fe204b
  • MD5: 68ee39fad40674fe435c31e531a77b66
  • BLAKE2b-256: 0a0d4b5788111c9ea32c64316b5d1c7911f533ccbdd6120926ad2298896fd4b6

Provenance

The following attestation bundles were made for pynrpf-0.3.0-py3-none-any.whl:

Publisher: release.yml on mssamhan31/PyNRPF

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
