
ModelPreflight

Preflight checks for LLM prototypes.

A tiny local gateway for LLM smoke tests, provider failover, and cheap prototype checks before you wire an LLM into something bigger.

| If you want to... | Start here |
| --- | --- |
| Get one green check quickly | 60-second start |
| Try it without keys | No-key demo path |
| Configure provider groups once | Machine-local config |
| Run project smoke cases | Smoke tests |
| Fan out a one-off prompt | Pro Mode |
| Use it as a Python helper | Library usage |

ModelPreflight keeps provider setup machine-local and keeps smoke cases project-local. It gives prototypes stable model-group aliases, simple failover, and JSONL audit logs without becoming a benchmark harness or hosted gateway.

60-second start

# One-off run without installing:
uvx model-preflight --help

# In a persistent tool or project environment:
uv tool install model-preflight
# or:
pipx install model-preflight

Pick one provider, set one key, and run one live check:

mpf init --provider nvidia
export NVIDIA_NIM_API_KEY=...
mpf doctor --live
mpf demo

Expected signal:

  • mpf init --provider nvidia writes your machine-local config and prints next: mpf doctor --live.
  • mpf doctor --live prints a deployments table, then live check ok: group=....
  • mpf demo prints JSON with "passed": true and an empty "failures": [] list.
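
If you want to script that first green check, here is a minimal Python sketch (it assumes mpf demo prints its JSON report to stdout, which the bullets above suggest but do not guarantee):

import json
import subprocess

# Run the demo and fail loudly if the reported shape drifts.
proc = subprocess.run(["mpf", "demo"], capture_output=True, text=True, check=True)
report = json.loads(proc.stdout)  # assumes stdout is the JSON report
assert report["passed"] is True
assert report["failures"] == []
print("demo check ok")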

Add checks to a project:

cd my-project
mpf init-project
mpf run

Expected signal:

  • mpf init-project writes evals/smoke.jsonl, writes .model-preflight/README.md, and updates .gitignore.
  • mpf run prints JSON results for the starter cases. Every passing case has "passed": true.
  • A failing case exits non-zero and includes strings under "failures" so you know what drifted.

Both mpf and model-preflight are installed as console scripts.

ModelPreflight catches missing keys, broken provider routes, prompt formatting regressions, output-shape drift, accidental model/provider changes, and "this worked yesterday" prototype failures before you wire the LLM call into something larger.

No-key demo path

Use the minimal offline preset when you want to test the CLI and project workflow without a provider account:

mpf init --preset minimal
mpf doctor --live
mpf demo
mpf init-project
mpf run

What this proves:

  • Config loading works without secrets.
  • The CLI can run a live-style check through the offline echo provider.
  • Project bootstrap works by creating evals/smoke.jsonl.
  • Smoke scoring works when mpf run returns JSON where every case has "passed": true.

What it does not prove: remote provider auth, quota, latency, or model quality. Use the OpenRouter path below for that.

Machine-local config

ModelPreflight reads provider routes from ~/.config/model-preflight/config.yaml by default. Override the path with either --config or MODEL_PREFLIGHT_CONFIG.

mpf init --provider openrouter
mpf doctor
mpf models
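
If you script around the CLI, the path resolution above is worth mirroring; a minimal sketch, assuming the --config flag takes precedence over the environment variable (the usual convention, not confirmed here):

import os
from pathlib import Path

def resolve_config_path(cli_value: str | None = None) -> Path:
    # Assumed precedence: --config flag, then MODEL_PREFLIGHT_CONFIG, then the default.
    if cli_value:
        return Path(cli_value).expanduser()
    env_value = os.environ.get("MODEL_PREFLIGHT_CONFIG")
    if env_value:
        return Path(env_value).expanduser()
    return Path.home() / ".config" / "model-preflight" / "config.yaml"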

Provider setup is discoverable from the CLI:

mpf providers list
mpf providers guide nvidia
mpf providers guide openrouter
mpf providers test nvidia
mpf providers test openrouter

NVIDIA Build / NIM is the primary high-capability open/open-weight endpoint option. OpenRouter is still the lowest-friction discovery option because one API key can route to many model providers through an OpenAI-compatible API.

Use either primary path:

mpf init --provider nvidia
export NVIDIA_NIM_API_KEY=...
mpf doctor --provider nvidia --live

mpf init --provider openrouter
export OPENROUTER_API_KEY=...
mpf doctor --provider openrouter --live

| Provider | Best for | Env var | Setup |
| --- | --- | --- | --- |
| NVIDIA Build / NIM | Primary high-capability open/open-weight endpoint pool | NVIDIA_NIM_API_KEY | API keys |
| OpenRouter | One-key first run with broad model access | OPENROUTER_API_KEY | Authentication docs |
| Groq | Fast repeated calls once first-run setup works | GROQ_API_KEY | Groq console |
| Cerebras | Fast inference experiments when current dev-tier limits fit | CEREBRAS_API_KEY | Cerebras inference docs |
| Mistral | First-party Mistral model-family smoke checks | MISTRAL_API_KEY | Mistral API keys |

Secondary/overflow pool to add manually once the primary pool works: Google Gemini/Gemma, Cloudflare Workers AI, GitHub Models, Hugging Face Inference Providers, and SambaNova. These are documented in docs/PROVIDER_PRESETS.md, but not packaged as first-run presets yet because auth shape, model IDs, and free/dev limits are more account-specific.

The default config creates logical groups, then maps each group to one or more LiteLLM deployments:

router:
  num_retries: 1
  timeout_seconds: 60
  default_group: free_reasoning
  audit_jsonl: null
artifacts_dir: ~/.cache/model-preflight/artifacts

deployments:
  - name: nvidia_nim_nemotron_3_super
    provider: nvidia
    group: free_reasoning
    model: nvidia_nim/nvidia/nemotron-3-super-120b-a12b
    api_key_env: NVIDIA_NIM_API_KEY
    enabled: true
    required: true
    status: best_effort
    setup_url: https://build.nvidia.com/settings/api-keys
    rpm: 10
    tier: reasoning
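
The group-to-deployments mapping is plain YAML, so it is easy to inspect outside the CLI; a sketch using PyYAML (a third-party library this package may or may not use internally), reading the default path:

from collections import defaultdict
from pathlib import Path

import yaml  # third-party: pyyaml

config_path = Path.home() / ".config" / "model-preflight" / "config.yaml"
config = yaml.safe_load(config_path.read_text())

# Collect the models behind each logical group alias.
groups: dict[str, list[str]] = defaultdict(list)
for deployment in config.get("deployments", []):
    groups[deployment["group"]].append(deployment["model"])

for group, models in sorted(groups.items()):
    print(f"{group}: {models}")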

Provider presets are best-effort starter data, not authoritative claims about free availability. User-local config wins over bundled defaults, optional/disabled providers do not block first-run checks, and endpoint names, quotas, pricing, and behavior can change without this package knowing.
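
"User-local config wins" implies the usual deep-merge pattern; a sketch of what that could look like (the package's actual merge logic is not shown here):

def merge_configs(bundled: dict, user: dict) -> dict:
    # User-local values override bundled defaults; nested sections merge recursively.
    merged = dict(bundled)
    for key, value in user.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_configs(merged[key], value)
        else:
            merged[key] = value
    return merged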

Smoke tests

Smoke cases are JSONL files owned by the project that is doing the prototype work.

{"id":"basic-ok","prompt":"Return only: ok","expected_substrings":["ok"]}
{"id":"avoid-word","prompt":"Answer yes without using the word nope","forbidden_substrings":["nope"]}

Run them with:

mpf run
# or:
mpf run path/to/smoke_cases.jsonl

mpf run prints JSON results and exits non-zero if any case fails.
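
The pass/fail rule is plain substring checking; a sketch of how one case could be scored, using the field names from the JSONL format above (the package's exact scoring code is not shown here):

def score_case(case: dict, response: str) -> list[str]:
    # An empty list means the case passed; each string describes one failure.
    failures = []
    for needle in case.get("expected_substrings", []):
        if needle not in response:
            failures.append(f"missing expected substring: {needle!r}")
    for needle in case.get("forbidden_substrings", []):
        if needle in response:
            failures.append(f"found forbidden substring: {needle!r}")
    return failures

print(score_case({"expected_substrings": ["ok"]}, "ok"))  # []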

Pro Mode

mpf pro fans out a one-off prompt, then synthesizes a final answer through a judge group.

mpf pro "Suggest three robust JSON schemas for this toy extraction task" --n 8

Defaults:

| Option | Default | Role |
| --- | --- | --- |
| --n | 8 | number of sampled answers |
| --sample-group | free_fast | fanout group |
| --judge-group | free_reasoning | synthesis group |

Fanout multiplies live provider calls. Keep --n low while testing, use restricted provider keys where available, and review provider dashboards when running against paid endpoints.

Library usage

from model_preflight import ModelGateway, load_config, pro_mode

gateway = ModelGateway(load_config())

print(gateway.text("Return only: ok", group="free_reasoning"))

result = pro_mode(gateway, "Solve this toy puzzle", n=8)
print(result["final"])

The library API is intentionally thin:

  • load_config() reads the same machine-local config as the CLI.
  • ModelGateway wraps LiteLLM Router with stable group aliases and audit logging.
  • pro_mode() runs fanout plus synthesis for one-off prototype prompts.
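
A natural use is a preflight test that runs before anything expensive; a sketch using only the calls documented above (pytest is an assumption about your test setup):

import pytest

from model_preflight import ModelGateway, load_config

@pytest.fixture(scope="session")
def gateway() -> ModelGateway:
    return ModelGateway(load_config())

def test_group_answers_at_all(gateway):
    # One cheap live call through the stable group alias before wiring
    # the LLM into anything bigger.
    text = gateway.text("Return only: ok", group="free_reasoning")
    assert "ok" in text.lower()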

Audit artifacts

By default, ModelPreflight writes audit logs under:

~/.cache/model-preflight/artifacts/audit.jsonl

Each live call should be traceable enough to debug provider drift: timestamp, logical group, prompt or case metadata, latency, and (when available) the resolved provider/model, token usage, and response id.
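
Because the log is JSONL, ad-hoc analysis takes a few lines; a sketch that summarizes latency per group (the field names "group" and "latency_s" are assumptions, so check your own audit.jsonl first):

import json
from collections import defaultdict
from pathlib import Path

audit_path = Path.home() / ".cache" / "model-preflight" / "artifacts" / "audit.jsonl"

# Assumed field names: "group" and "latency_s"; adjust to match your log.
latencies: dict[str, list[float]] = defaultdict(list)
for line in audit_path.read_text().splitlines():
    record = json.loads(line)
    if "group" in record and "latency_s" in record:
        latencies[record["group"]].append(float(record["latency_s"]))

for group, values in sorted(latencies.items()):
    print(f"{group}: n={len(values)} avg={sum(values) / len(values):.2f}s")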

Non-goals

ModelPreflight is not a model leaderboard, a formal benchmark framework, a hosted inference gateway, a provider catalog authority, or proof that an endpoint is free, fast, or available today.
