End-to-end differentiable JAX implementation of VMEC2000 for fixed and free-boundary equilibria.

These details have not been verified by PyPI

Project description

vmec-jax

End-to-end differentiable JAX implementation of VMEC2000 for fixed-boundary and free-boundary ideal-MHD equilibria.

Install

pip install vmec-jax

Developer (editable) install:

git clone https://github.com/uwplasma/vmec_jax
pip install -e vmec_jax/

Choosing CPU or GPU

vmec_jax follows the JAX backend you select. If you installed CPU-only JAX, runs use CPU. If you installed GPU-enabled JAX and select a GPU backend, runs use GPU; vmec_jax does not silently force those runs back to CPU.

# Check what JAX will use.
python -c "import jax; print(jax.default_backend()); print(jax.devices())"

# Force CPU for one command.
JAX_PLATFORMS=cpu vmec_jax input.nfp4_QH_warm_start

# Force an accelerator backend after installing GPU-enabled JAX.
JAX_PLATFORM_NAME=gpu vmec_jax input.nfp4_QH_warm_start

# For NVIDIA CUDA specifically, this is also valid.
JAX_PLATFORMS=cuda vmec_jax input.nfp4_QH_warm_start

From Python, leave solver_device unset to inherit JAX's default backend, or pass solver_device="cpu" / solver_device="gpu" explicitly:

import vmec_jax as vj

run_gpu = vj.run_fixed_boundary("input.nfp4_QH_warm_start", solver_device="gpu")
run_cpu = vj.run_fixed_boundary("input.nfp4_QH_warm_start", solver_device="cpu")

For GPU runs, vmec_jax defaults XLA_PYTHON_CLIENT_PREALLOCATE=false before JAX import so the allocator grows on demand. This avoids GPU memory contention between optimization workers and was faster in the exact-Jacobian GPU profile. Set XLA_PYTHON_CLIENT_PREALLOCATE=true before import if you explicitly want JAX's default preallocation behavior.

Usage

Run the solver (VMEC2000-style CLI):

vmec_jax input.nfp4_QH_warm_start        # → wout_nfp4_QH_warm_start.nc

Generate diagnostic plots from any wout_*.nc (four-panel output, replicates vmecPlot2.py):

vmec_jax --plot wout_nfp4_QH_warm_start.nc           # saves in same directory
vmec_jax --plot wout_nfp4_QH_warm_start.nc --outdir figures/

From Python:

import vmec_jax as vj

# Run a fixed-boundary solve
run = vj.run_fixed_boundary("input.nfp4_QH_warm_start")

# Run a free-boundary solve
freeb = vj.run_free_boundary("input.cth_like_free_bdy_lasym_small")

# Plot any wout file (produces *_VMECparams.pdf, *_poloidal_plot.png, *_VMECsurfaces.pdf, *_VMEC_3Dplot.png)
vj.plot_wout("wout_nfp4_QH_warm_start.nc", outdir="figures/")

Run tests:

pytest -q

Showcase (single-grid)

All figures below use the same single-grid run settings: NS_ARRAY=151, NITER_ARRAY=5000, FTOL_ARRAY=1e-14, NSTEP=500.


`ITERModel` cross-section (VMEC2000 vs vmec_jax)	`LandremanPaul2021_QA_lowres` cross-section (VMEC2000 vs vmec_jax)

`ITERModel` iota (VMEC2000 vs vmec_jax)	`LandremanPaul2021_QA_lowres` iota (VMEC2000 vs vmec_jax)

`ITERModel` 3D LCFS	`LandremanPaul2021_QA_lowres` 3D LCFS

`ITERModel` \|B\| on LCFS	`LandremanPaul2021_QA_lowres` \|B\| on LCFS

Cold vs warm runtime: the cold bar includes XLA JIT compilation on the first call (one-time cost per process); the warm bar is the steady-state solve time for subsequent calls in the same process. VMEC2000 has no compilation overhead, so it is always effectively cold. vmec_jax enables JAX's persistent compilation cache by default under ~/.cache/vmec_jax/jax_cache so repeated cold-process runs can reuse compiled kernels; set VMEC_JAX_COMPILATION_CACHE=0 to disable it or VMEC_JAX_COMPILATION_CACHE_DIR=/path/to/cache to choose a different location.

Quasi-helical symmetry optimization (discrete-adjoint)

examples/optimization/qh_fixed_resolution_jax.py demonstrates an end-to-end fixed-boundary QH optimization using the built-in exact discrete-adjoint Jacobian — no finite differences, no SIMSOPT dependency.

The script is intentionally written in the same teaching style as SIMSOPT's QH_fixed_resolution.py: choose the VMEC resolution directly in Python, choose the active boundary coefficients directly, build the objective blocks directly in the script, and choose the outer optimizer explicitly. Nothing relies on a hidden SIMSOPT wrapper layer.

Discrete adjoint: rather than perturbing each boundary DOF separately (finite differences), vmec_jax records a checkpoint tape of the VMEC iteration and propagates all parameter tangents through it in one batched forward pass (jax.vmap(jax.jvp(...))). The Jacobian is exact (machine precision) and its cost is roughly 1–2 forward solves regardless of the number of DOFs — vs. n_DOFs forward solves for finite differences. → Detailed explanation · SIMSOPT comparison

python examples/optimization/qh_fixed_resolution_jax.py   # MAX_MODE=2 by default

Key top-level controls in the script:

VMEC_MPOL, VMEC_NTOR: solver resolution
MAX_MODE: boundary parameterization richness
OBJECTIVE_TUPLES: explicit aspect + QS residual blocks
METHOD: "gauss_newton" or "scipy"
SCIPY_TR_SOLVER: SciPy trust-region linear solver ("lsmr" by default for the QA/QH examples)
USE_MODE_CONTINUATION: staged solves for higher-mode runs
USE_ESS, ALPHA: optional exponential spectral scaling

When max_mode exceeds the modes present in the input file, vmec_jax automatically extends the boundary to include the requested harmonics at zero amplitude (vj.extend_boundary_for_max_mode), matching SIMSOPT's fixed_range() behaviour. All runs use consistent VMEC resolution mpol = ntor = 5 so the initial QS metric is normalised identically across max_mode values.

`max_mode`	DOFs	Policy	QS initial	QS final	Reduction	Objective final	Wall time ¹
1	8	continuation, no ESS	0.303	0.214	30 %	`0.216`	~133 s
2	24	continuation, no ESS	0.303	`3.19e-3`	99 %	`3.19e-3`	~746 s
3	48	continuation + ESS	0.303	`9.51e-4`	99.7 %	`9.51e-4`	~952 s

¹ Wall time on Apple M-series (warm-cache subsequent runs are faster).

With only 8 DOFs (max_mode=1) the boundary deformation space is too limited to reach a deep quasi-helical minimum. max_mode=2 already gives a strong QH solution, and the current max_mode=3 continuation+ESS run improves it further on the exact standalone path.

vmec_jax vs SIMSOPT: vmec_jax uses an exact discrete-adjoint Jacobian (one batched JVP pass ≈ 1–2 forward solves regardless of DOF count) while SIMSOPT + VMEC2000 uses finite differences (n_DOFs × 1 forward solve per Jacobian). For a detailed comparison of algorithms, runtimes, and memory, see docs/simsopt_comparison.rst.

max_mode = 1 (8 DOFs, 30 % QS reduction)	max_mode = 2 (24 DOFs, 99 % QS reduction)	max_mode = 3 (48 DOFs, 99.7 % QS reduction)

The |B| contour plots show quasi-helical alignment after optimization: contour lines become increasingly helical (aligned with m θ − n φ = const). The ζ axis spans one field period (0 → 2π/nfp).

The current exact standalone path keeps improving through max_mode=3, with the 48-DOF continuation+ESS run reaching ~9.5e-4 total objective and QS.

Regenerate plots after running the optimization:

python examples/optimization/plot_qh_optimization_results.py --output-dir results/qh_opt

Quasi-axisymmetric optimization (fixed-boundary)

examples/optimization/qa_fixed_resolution_jax_ess.py optimizes an nfp=2 QA equilibrium for aspect ratio, mean iota, and QA symmetry residuals.

Like the QH script, it exposes the problem construction directly in Python: VMEC resolution, active boundary DOFs, the three objective blocks, weights, continuation policy, ESS settings, and the outer optimizer are all top-level variables in the file.

python examples/optimization/qa_fixed_resolution_jax_ess.py   # MAX_MODE=2 by default

When max_mode exceeds the modes in the input file, vmec_jax automatically extends the boundary to include those harmonics at zero amplitude (vj.extend_boundary_for_max_mode). All runs use consistent VMEC resolution mpol = ntor = 5. Objectives: aspect ratio (target 6.0) + mean iota (target 0.41) + QA symmetry residuals. The current standalone QA path uses exact residuals + exact discrete-adjoint Jacobians with scipy.optimize.least_squares. For max_mode > 1, the script can use staged mode continuation: it solves the lower-mode QA problem first, then lifts that solution into the richer boundary space before running the final stage. The full sweep also tests direct-start and ESS variants.

`max_mode`	DOFs	Policy	Eval used	Aspect final	Mean iota final	QS final	Objective final	Wall time ¹
1	8	input deck, no ESS	27	6.0024	0.3942	`9.04e-3`	`9.29e-3`	~315 s
2	24	continuation, no ESS	52	6.0000	0.4095	`1.46e-4`	`1.46e-4`	~801 s
3	48	continuation, no ESS	64	6.0000	0.4099	`7.61e-6`	`7.62e-6`	~1150 s

¹ Wall time on Apple M-series.

On the latest fresh standalone rerun, staged continuation is decisive for QA. Direct-start max_mode=3 stays in a poor basin, while continuation reaches a deep QA minimum; the best displayed QA run is the no-ESS max_mode=3 continuation case.

max_mode = 1 (8 DOFs, exact SciPy + adjoint)	max_mode = 2 (24 DOFs, exact SciPy + adjoint, continuation)	max_mode = 3 (48 DOFs, exact SciPy + adjoint, continuation)

QA/QH/QP optimization policy sweep

The CPU panel below compares the exact standalone optimizer on three target symmetries: QA, QH, and QP. Columns increase the boundary space from max_mode = 1 to max_mode = 3. Rows compare staged mode continuation against direct-start mode expansion. Blue curves use unscaled boundary DOFs; orange curves use ESS with alpha = 2.5.

The main QA lesson is that direct max_mode=3 is not a VMEC convergence failure: rerunning its input.final with both vmec_jax and VMEC2000 converges to fsq ~ 1e-13, but it is a zero-iota stationary branch. The target-iota residual therefore stays at 0.41^2 = 0.1681. Staged continuation avoids that branch and reaches iota ~= 0.410 with much lower objective.

Recreate the continuation rows:

JAX_PLATFORMS=cpu python examples/optimization/generate_qs_ess_sweep.py --backend-label cpu --solver-device cpu --policy continuation --problems qa,qh,qp --modes 1,2,3 --ess both

Recreate the direct-start rows:

JAX_PLATFORMS=cpu python examples/optimization/generate_qs_ess_sweep.py --backend-label cpu --solver-device cpu --policy direct --problems qa,qh,qp --modes 1,2,3 --ess both

Render the README/docs panels and tables:

python examples/optimization/render_qs_ess_publication_panel.py

CPU wall-time summary for the plotted runs:

Problem	Policy	max_mode	ESS	Status	Final J	Aspect	Iota	nfev	Wall min
QA	continuation	1	no	ok	9.29e-03	6.002	0.3942	27	5.2
QA	continuation	1	yes	ok	9.29e-03	6.002	0.3942	27	4.7
QA	continuation	2	no	ok	1.46e-04	6.000	0.4095	52	13.3
QA	continuation	2	yes	ok	1.51e-04	6.000	0.4095	50	13.5
QA	continuation	3	no	ok	7.62e-06	6.000	0.4099	64	19.2
QA	continuation	3	yes	ok	2.16e-05	6.000	0.4099	71	25.1
QA	direct	2	no	ok	4.50e-04	5.999	0.4066	18	18.6
QA	direct	2	yes	stopped	1.58e-04	6.000	0.4095	40	14.9
QA	direct	3	no	ok	1.68e-01	6.000	-0.0000	5	2.2
QA	direct	3	yes	ok	1.68e-01	6.000	0.0000	5	1.2
QH	continuation	1	no	ok	2.16e-01	7.049	-	9	2.2
QH	continuation	1	yes	ok	2.16e-01	7.049	-	9	2.3
QH	continuation	2	no	ok	3.72e-03	7.001	-	28	8.5
QH	continuation	2	yes	ok	4.32e-03	7.000	-	29	6.3
QH	continuation	3	no	ok	1.37e-03	7.000	-	32	10.7
QH	continuation	3	yes	ok	1.38e-03	7.000	-	33	8.1
QH	direct	2	no	ok	3.45e-03	7.001	-	28	10.2
QH	direct	2	yes	ok	4.00e-03	7.001	-	20	5.6
QH	direct	3	no	ok	4.29e-03	6.999	-	15	9.5
QH	direct	3	yes	ok	3.27e-03	6.999	-	20	9.2
QP	continuation	1	no	stopped	6.00e-01	7.089	-0.3083	20	0.5
QP	continuation	1	yes	stopped	6.00e-01	7.089	-0.3083	20	0.5
QP	continuation	2	no	stopped	2.97e-01	7.077	-0.3097	28	0.9
QP	continuation	2	yes	stopped	4.43e-01	7.087	-0.3102	28	0.8
QP	continuation	3	no	ok	3.20e-01	7.077	-0.3023	26	1.0
QP	continuation	3	yes	stopped	2.74e-01	7.063	-0.3105	36	1.4
QP	direct	1	no	stopped	6.00e-01	7.089	-0.3083	20	0.5
QP	direct	1	yes	stopped	6.00e-01	7.089	-0.3083	20	0.5
QP	direct	2	no	ok	4.60e-02	7.006	-0.5828	19	0.7
QP	direct	2	yes	stopped	5.67e-02	7.013	-0.3097	20	0.7
QP	direct	3	no	ok	5.35e-01	7.064	-1.1401	17	0.7
QP	direct	3	yes	ok	9.60e-02	7.057	-0.3092	20	0.8

Performance vs parity

Default runs select the fastest stable path for each input automatically.
Use --parity (or performance_mode=False in Python) to force the conservative VMEC2000 loop.
Use --solver-mode accelerated to force the optimized fixed-boundary controller.
For GPU benchmarking, compare both first-process and cache-warm timings; the first GPU process pays XLA compilation, while later processes reuse the persistent cache automatically.

Details, profiling guidance, and parity methodology:

docs/performance.rst
docs/validation.rst
tools/diagnostics/parity_manifest.toml + tools/diagnostics/parity_sweep_manifest.py

CLI reference

vmec_jax input.*                run the equilibrium solver → wout_*.nc
vmec_jax --plot wout.nc         generate diagnostic plots (4 output files)
vmec_jax --parity input.*       force conservative VMEC2000 loop
vmec_jax --help                 full option list

VMEC++ notes

The current runtime benchmark compares vmec_jax against VMEC2000. VMEC++ is not included in this benchmark.

When VMEC++ is available, it can be added to the runtime plot via --cpu-summary entries with backend=vmecpp. Some inputs are not supported or do not converge under the same single-grid settings:

VMEC++ unsupported inputs (lasym=True):

LandremanSenguptaPlunk_section5p3_low_res
basic_non_stellsym_pressure
cth_like_free_bdy_lasym_small
up_down_asymmetric_tokamak

VMEC++ known non-convergence on these lasym=False cases under the same single-grid settings:

DIII-D_lasym_false
LandremanPaul2021_QA_reactorScale_lowres
LandremanPaul2021_QH_reactorScale_lowres
LandremanSengupta2019_section5.4_B2_A80
cth_like_fixed_bdy

CLI output and `NSTEP`

The VMEC-style iteration loop prints every NSTEP iterations. Larger NSTEP means fewer print callbacks and faster runs.

To disable live printing:

export VMEC_JAX_SCAN_PRINT=0

Quiet runs (--quiet or verbose=False) default the scan path to minimal history mode to reduce host/device traffic. Override with:

export VMEC_JAX_SCAN_MINIMAL=0  # keep full scan diagnostics even when quiet

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.0.3

Apr 24, 2026

0.0.2

Apr 21, 2026

0.0.1

Apr 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vmec_jax-0.0.3.tar.gz (554.1 kB view details)

Uploaded Apr 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vmec_jax-0.0.3-py3-none-any.whl (487.7 kB view details)

Uploaded Apr 24, 2026 Python 3

File details

Details for the file vmec_jax-0.0.3.tar.gz.

File metadata

Download URL: vmec_jax-0.0.3.tar.gz
Upload date: Apr 24, 2026
Size: 554.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vmec_jax-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`5f4ad3a7c27426f59a1afd1153d5ecafbbe3ce053b7b127eb543a7597127ed14`
MD5	`eb90eb915765e9440a947095e27427b4`
BLAKE2b-256	`a6f42fb69967283e35cf5f64f0a8000b4c7433aa5c52d0108c89629b36bb2b4a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vmec_jax-0.0.3.tar.gz:

Publisher: publish-pypi.yml on uwplasma/vmec_jax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vmec_jax-0.0.3.tar.gz
- Subject digest: 5f4ad3a7c27426f59a1afd1153d5ecafbbe3ce053b7b127eb543a7597127ed14
- Sigstore transparency entry: 1375776610
- Sigstore integration time: Apr 24, 2026
Source repository:
- Permalink: uwplasma/vmec_jax@f7dcd4a80771acdc741aa669098d6cc19315e1d1
- Branch / Tag: refs/tags/v0.0.3
- Owner: https://github.com/uwplasma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@f7dcd4a80771acdc741aa669098d6cc19315e1d1
- Trigger Event: release

File details

Details for the file vmec_jax-0.0.3-py3-none-any.whl.

File metadata

Download URL: vmec_jax-0.0.3-py3-none-any.whl
Upload date: Apr 24, 2026
Size: 487.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vmec_jax-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`461e3db51ac3286499c5d7f33cb0af69f7960742de467b2d84756e2ebacc018a`
MD5	`cfd4d79d557fd5c03d9e856c390c317e`
BLAKE2b-256	`a0acd7ffab685949d38378103ff42ca71af01f4f17ef961ec1470ed87b6c47db`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vmec_jax-0.0.3-py3-none-any.whl:

Publisher: publish-pypi.yml on uwplasma/vmec_jax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vmec_jax-0.0.3-py3-none-any.whl
- Subject digest: 461e3db51ac3286499c5d7f33cb0af69f7960742de467b2d84756e2ebacc018a
- Sigstore transparency entry: 1375776646
- Sigstore integration time: Apr 24, 2026
Source repository:
- Permalink: uwplasma/vmec_jax@f7dcd4a80771acdc741aa669098d6cc19315e1d1
- Branch / Tag: refs/tags/v0.0.3
- Owner: https://github.com/uwplasma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@f7dcd4a80771acdc741aa669098d6cc19315e1d1
- Trigger Event: release

vmec-jax 0.0.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

vmec-jax

Install

Choosing CPU or GPU

Usage

Showcase (single-grid)

Quasi-helical symmetry optimization (discrete-adjoint)

Quasi-axisymmetric optimization (fixed-boundary)

QA/QH/QP optimization policy sweep

Performance vs parity

CLI reference

VMEC++ notes

CLI output and `NSTEP`

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

vmec-jax 0.0.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

vmec-jax

Install

Choosing CPU or GPU

Usage

Showcase (single-grid)

Quasi-helical symmetry optimization (discrete-adjoint)

Quasi-axisymmetric optimization (fixed-boundary)

QA/QH/QP optimization policy sweep

Performance vs parity

CLI reference

VMEC++ notes

CLI output and NSTEP

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

CLI output and `NSTEP`