Model-agnostic natural-language → GMAT mission-script generation, retrieval-grounded and lint-validated.
Project description
gmat-copilot
Turn a natural-language request into a GMAT mission .script — grounded in the GMAT
documentation, validated against a static linter, and produced through a model you choose.
Status: Retrieval-grounded generation, the static lint gate, the model-agnostic provider abstraction, the two-layer evaluation suite, and the CLI are all in place — and, behind the optional
[gmat]extra, a GMAT dry-run, a bounded repair loop, and a provenance sidecar close the loop from an intent to a script validated against a real GMAT.
gmat-copilot is a library and a CLI for NASA's General Mission Analysis Tool.
Generation is retrieval-grounded: a request is answered against relevant GMAT help pages, sample
scripts, GmatFunctions, and a curated set of domain notes, so the model writes against real syntax
rather than from memory. Every draft is checked by the
gmat-script linter before it is returned.
Install
pip install gmat-copilot
The base install is light and GMAT-free. Add the provider you use as an extra:
pip install "gmat-copilot[anthropic]" # or [openai], or [ollama]
GitHub Models (the free-tier path the eval and CI use) needs no extra — it works on the base install.
The dynamic GMAT dry-run is its own optional extra; it needs a discoverable GMAT install (see Close the loop):
pip install "gmat-copilot[gmat]"
Use it
There is no default model — you choose one explicitly as provider:model. With none chosen, the
tool lists the providers it can reach from your configured credentials rather than picking for you.
from gmat_copilot import draft
result = draft(
"A 500 km circular Earth orbit at 51.6 degrees inclination; "
"propagate one day and report altitude and semi-major axis.",
model="anthropic:claude-...",
)
print(result.script) # the generated GMAT .script
print(result.lint.clean) # did it lint clean?
result.save("mission.script")
From the command line:
gmat-copilot "a sun-synchronous orbit at 700 km" --model anthropic:claude-... -o mission.script
gmat-copilot validate mission.script
The script is written to -o (default mission.script; -o - for stdout) and a concise lint
summary is printed. Strict mode (the default) exits non-zero if the draft does not lint clean; pass
--permissive to write the best-effort draft anyway. gmat-copilot draft "<intent>" is an alias of
the bare form. API keys are read from the environment (ANTHROPIC_API_KEY, OPENAI_API_KEY, …),
never committed.
Close the loop
With the [gmat] extra installed, a lint-clean draft can be loaded — and, with a Target/Optimize
solver, run — in a real GMAT, and a bounded repair loop can feed any failure back to the model:
gmat-copilot "a Hohmann transfer to GEO" --model anthropic:claude-... \
--dry-run --repair 2 --provenance
# lint: clean; dry-run: ok; retries: 1 -> wrote mission.script (+ mission.script.copilot.json)
Here the first draft failed the dry-run and one repair pass produced a runnable script (retries: 1).
--dry-runloads (and runs, where a solver is present) the draft in GMAT after it lints clean, catching the runtime errors a static parse cannot. It needs the[gmat]extra and a discoverable GMAT install; without them the flag fails with a clear message, and the default path is unaffected.--repair Nregenerates a failing draft up toNtimes, feeding the lint (and, with--dry-run, runtime) diagnostics back each round. The default0is a single pass.--provenancewrites a.copilot.jsonsidecar next to the script — the request, the per-attempt draft history, and the outcome — so a generated mission records how it was produced.
In your editor
The same engine is available in VS Code through the GMAT Copilot extension — install it from the VS Code Marketplace or Open VSX:
- Draft a Mission from a Description… — type a prompt, then review the generated script as a diff against the active file and apply it only on accept. Nothing is written silently and nothing is auto-applied; in strict mode a draft that does not lint clean is not applied at all.
- Lint (and the optional dry-run) findings land in the Problems panel as inline diagnostics.
- The provider/model is explicit — there is no default — via a Select the Provider and Model… quick-pick over the providers your credentials can reach.
The extension is a thin client over the engine; all .script language features (highlighting,
lint-on-type, hover, formatting) come from the
GMAT Script extension, which it depends on. See the
VS Code docs for the commands, settings, and
the apply-to-current-file flow.
Validation contract
Validation runs in two tiers, static then dynamic:
- Static lint gate — always on, GMAT-free, instant. Strict (the default) rejects a script that reports any error or warning (every warning-level rule is a hard GMAT load error); permissive returns the best-effort script with every diagnostic attached.
- Dynamic GMAT dry-run — optional, behind the
[gmat]extra. On a script that lints clean, GMAT loads it (and runs it when a solver is present) to catch the runtime errors a static parse cannot. It is a strictly additive backstop; the strict/permissive contract is unchanged.
Generation and the lint gate need no GMAT install — only the dry-run tier does.
What gmat-copilot is not
- Not a GMAT replacement or a mission optimiser — it writes and validates the script; GMAT runs it. The dry-run checks that a script loads and runs; it is not a way to execute missions for their results.
- Not a correctness guarantee — the lint gate catches malformed scripts, not wrong physics. Always review and run generated scripts.
- Not an auto-applier — in the editor a draft is shown as a reviewable diff and written only when you accept it; it never edits your file unattended.
- Not a model vendor — it ships no model, recommends none, and never silently falls back to one.
- Not a hosted service — the only thing gmat-copilot hosts is the leaderboard, a presentation-only board that scores no submissions live. The library and CLI run entirely on your machine.
Documentation
Full docs are at https://astro-tools.github.io/gmat-copilot/ — getting started, the provider/auth model, the validation contract, the repair loop, the result schema and the provenance sidecar, the VS Code extension, the evaluation protocol, the leaderboard, the corpus and its licences, worked examples (draft a Hohmann transfer, close the loop, read the provenance, reproduce the eval, add a provider, drive it from VS Code, reproduce a leaderboard entry), an API reference, and the design decisions.
The per-model leaderboard is hosted as a static Hugging Face Space —
https://huggingface.co/spaces/astro-tools/gmat-copilot-leaderboard — ranking provider:models
on the evaluation suite. It ranks on a never-committed held-out set (the headline) with the committed
public set shown alongside as the reproducibility anchor, so overfitting the public prompts buys no
rank. Any model can be entered by PRing a recorded bundle that reproduces its public score offline;
the leaderboard docs cover how to read the
board and submit an entry.
License
MIT — see LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gmat_copilot-0.3.0.tar.gz.
File metadata
- Download URL: gmat_copilot-0.3.0.tar.gz
- Upload date:
- Size: 4.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
26e870914a893ff844ac3b32e9880ccda8283805acae5cc2441a2d8dd6bffd15
|
|
| MD5 |
20fd94606f67ab158f56b111e3103cb1
|
|
| BLAKE2b-256 |
db4ef81cdd836250e14319568b508d5205c47dea06eb2dff49b737bd87ef9c29
|
Provenance
The following attestation bundles were made for gmat_copilot-0.3.0.tar.gz:
Publisher:
release.yml on astro-tools/gmat-copilot
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gmat_copilot-0.3.0.tar.gz -
Subject digest:
26e870914a893ff844ac3b32e9880ccda8283805acae5cc2441a2d8dd6bffd15 - Sigstore transparency entry: 1871598228
- Sigstore integration time:
-
Permalink:
astro-tools/gmat-copilot@36832e1a5e88d3e1e1e54bf96a492b2e1467f5f9 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/astro-tools
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@36832e1a5e88d3e1e1e54bf96a492b2e1467f5f9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file gmat_copilot-0.3.0-py3-none-any.whl.
File metadata
- Download URL: gmat_copilot-0.3.0-py3-none-any.whl
- Upload date:
- Size: 3.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
83469809adbc761afb4222c412b3ad9516f17a2cfd9b3c4b5e89c2b59e4446f8
|
|
| MD5 |
79d3ded6241c1132f5b20601699b3b18
|
|
| BLAKE2b-256 |
aaf2237682e9903f5d277c4426c995c8b499003b28464bc686b54b39ee707a85
|
Provenance
The following attestation bundles were made for gmat_copilot-0.3.0-py3-none-any.whl:
Publisher:
release.yml on astro-tools/gmat-copilot
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gmat_copilot-0.3.0-py3-none-any.whl -
Subject digest:
83469809adbc761afb4222c412b3ad9516f17a2cfd9b3c4b5e89c2b59e4446f8 - Sigstore transparency entry: 1871598313
- Sigstore integration time:
-
Permalink:
astro-tools/gmat-copilot@36832e1a5e88d3e1e1e54bf96a492b2e1467f5f9 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/astro-tools
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@36832e1a5e88d3e1e1e54bf96a492b2e1467f5f9 -
Trigger Event:
push
-
Statement type: