Neoclassical transport solver with CPU/GPU and differentiable JAX workflows

Project description

sfincs_jax

sfincs_jax solves the radially local, linearized drift-kinetic equation on a flux surface — the same physics as SFINCS Fortran v3 — in pure JAX. One input.namelist plus one geometry file gives neoclassical particle/heat fluxes, parallel flows, bootstrap current, and transport matrices for stellarators and tokamaks, on CPU or GPU, with end-to-end automatic differentiation for sensitivities and optimization. Outputs, per-species result tables, and console prints are pinned field-by-field against SFINCS Fortran v3.

Installation

pip install sfincs_jax

The solver tiers (block-tridiagonal Legendre elimination, recycled GCROT, implicit differentiation) live in the external solvax library, which installs automatically as a core dependency.

Optional extras:

GPU: install the matching CUDA build of JAX, e.g. pip install -U "jax[cuda12]".

Large public equilibrium files (W7-X, HSX) are not shipped in the wheel; they are fetched from a GitHub release on first use and cached under ~/.cache/sfincs_jax/data. Prefetch with python -m sfincs_jax.validation.data_fetch; see the installation docs for offline/cache options and for building the SFINCS Fortran v3 reference executable with conda-provided PETSc/MUMPS.

Quickstart

Run a small circular-tokamak deck through the canonical driver (this mirrors examples/run_tokamak.py, which also builds the namelist from Python dicts and plots the results):

from pathlib import Path
from sfincs_jax.run import run_profile

deck = Path("input.namelist")
deck.write_text("""\
&geometryParameters
  geometryScheme = 1  ! circular tokamak: BHat = 1 + 0.1 cos(theta)
  inputRadialCoordinate = 3
  rN_wish = 0.3
  B0OverBBar = 1.0  GHat = 1.0  IHat = 0.0  iota = 1.31
  epsilon_t = 0.1  epsilon_h = 0.0  psiAHat = 0.045  aHat = 0.1
/
&speciesParameters
  Zs = 1  mHats = 1.0  nHats = 1.0  THats = 0.5
  dNHatdrHats = -6.0  dTHatdrHats = -3.0
/
&physicsParameters
  Delta = 4.5694d-3  alpha = 1.0  nu_n = 8.4774d-3
  Er = 0.0  collisionOperator = 1  ! pitch-angle scattering
/
&resolutionParameters
  Ntheta = 15  Nzeta = 1  Nxi = 8  NL = 4  Nx = 6
  solverTolerance = 1d-10
/
""")

run = run_profile(deck, solve_method="auto", out_path=Path("sfincsOutput.h5"))
print("particle flux:", float(run.moments["particleFlux_vm_psiHat"][0]))
print("bootstrap current <j.B>:", float(run.moments["FSABjHat"]))

run_profile prints the Fortran-parity console flow (banner, grids, solve progress, per-species results table), writes sfincsOutput.h5/.nc keyed by the SFINCS output names, and returns the state vector, solver statistics, and all moments in memory. The CLI equivalent is sfincs_jax input.namelist --out sfincsOutput.h5, and sfincs_jax --plot sfincsOutput.h5 builds a PDF diagnostics panel.

Performance vs SFINCS Fortran v3

Measured head-to-head on the same machine (MacBook, Apple M4, 24 GB) and the same deck: HSX_PASCollisions_DKESTrajectories, RHSMode=1, at Ntheta=25, Nzeta=51, Nxi=100, Nx=5 — 744,610 unknowns. The Fortran reference is the conda PETSc 3.23 + MUMPS 5.8.2 build of SFINCS v3; sfincs_jax uses the tier-1 truncated Legendre block elimination from solvax.

Runtime and peak memory: sfincs_jax vs SFINCS Fortran v3 on the 744k-unknown HSX PAS case

With the matched Nxi-for-x ramp discretization, sfincs_jax solves in 27.2 s at 0.93 GB — 17x faster than 1-rank Fortran (463.6 s, 3.98 GB) and 8.4x faster than Fortran's best measured parallel floor (229.5 s / 2.86 GB at 2 ranks; 4 and 8 ranks are slower on this machine), at roughly 30% of the memory. With uniform Nxi it takes 44.3 s at 1.16 GB; an RTX A4000 GPU takes 45.0 s (the Legendre scan is serial and A4000 FP64 is 1/32 rate).
The Fortran strong-scaling baseline on the same case: 463.6 s (1 rank), 229.5 s (2 ranks, 101% efficiency), 240.9 s (4 ranks), 270.5 s (8 ranks).
At the full production resolution of this case (2.5 M unknowns), neither code fits a global sparse factorization in 24 GB; the truncated Legendre elimination needs only O(K m^2) memory (~0.3 GB here, vs ~91 GB for the full-band factor) and is the locally viable direct path.
The direct solve is more converged than the Fortran reference: Fortran's own electron FSABFlow scatters 51% across its 1/2/4/8-rank runs (KSP rtol=1e-6 solver noise), while sfincs_jax matches the closest Fortran run to 2e-10 and sits inside Fortran's spread on every quantity.

Scope: this is one measured 744k-unknown HSX PAS case; further cases are promoted here as each vertical slice lands with its own evidence. Regenerate the figure from the recorded values with python tools/benchmarks/readme_figures.py; rerun the measurement with python tools/benchmarks/tier1_hsx_head_to_head.py. Full tables, provenance, and known issues: docs/performance.rst.

Parity with SFINCS Fortran v3

Every canonical module was admitted against the reference implementation (Fortran golden outputs, tiny-grid PETSc matrix dumps, or the retained legacy implementation) at pinned tolerances that run in CI:

Measured parity envelopes of the canonical stack

The scheme-1 monoenergetic transportMatrix[0,1] element is pinned to upstream's expected value because that element is tolerance-unstable in the Fortran build itself; the sfincs_jax direct solve reproduces the expected value to 4.2e-6 by construction.

Functionality: sfincs_jax vs SFINCS Fortran v3

Every SFINCS Fortran v3 physics family is supported, each admitted with Fortran-parity evidence (golden outputs, tiny-grid PETSc matrix dumps) pinned in CI:

Capability	sfincs_jax	SFINCS Fortran v3
RHSMode 1 (fluxes, flows, bootstrap current)	✅	✅
RHSMode 2 / 3 (thermal and monoenergetic transport matrices)	✅	✅
Pitch-angle scattering + full Fokker-Planck (Rosenbluth) collisions	✅	✅
Full and DKES trajectory models; radial electric field	✅	✅
Constraint schemes −1…4	✅	✅
Geometry: analytic 1–4, VMEC 5, Boozer `.bc` 11/12, namelist spectrum 13	✅	✅
Non-stellarator-symmetric VMEC (`lasym`)	✅	✅
`Phi1`/quasineutrality (kinetic + collision coupling, `readExternalPhi1`)	✅	✅
Tangential magnetic drifts (`magneticDriftScheme` 0–9)	✅	✅
Speed grids `xGridScheme` 1–8; `xDotDerivativeScheme` −2…11	✅	✅
`export_f`, HDF5/NetCDF output, Fortran-format stdout	✅	✅
Ambipolar radial-electric-field root solve	✅	✅
Exact gradients of any output w.r.t. any input (`jax.grad`, implicit differentiation)	✅	❌ (fixed adjoint pairs via RHSMode 4/5)
Differentiable ambipolar root and `Phi1` state (implicit function theorem)	✅	❌
GPU execution	✅	❌
Variational upper/lower bounds on transport coefficients (convergence certificates)	✅	❌
`.npz` output, versioned solver traces, geometry-only output	✅	❌
Warm starts + Krylov recycling across scans and optimizer iterations	✅	❌
Automatic memory-based solver-tier selection	✅	❌
MPI multi-node execution	❌ (single-node multicore + GPU)	✅

Beyond the Fortran-v3 feature set, several JAX-only research capabilities have landed: momentum-conserving flow corrections (sfincs_jax.momentum_correction), a monoenergetic (collisionality, electric field) database mode with energy convolution (sfincs_jax.monoenergetic), batched multi-Er/multi-surface GPU scans (sfincs_jax.batch), and the variational transport-coefficient bounds above. Still in development on the research roadmap: an extended-collisionality (improved Sugama) model operator and a differentiable bounce-averaged fast model. See docs/feature_matrix.rst for the matrix.

Optimization showcase

examples/optimize_QA_bootstrap.py compares two precise quasi-axisymmetric (QA) stellarator boundaries: one shaped for QA alone, one whose bootstrap current is lowered while its QA is held fixed. Stage A shapes a circular torus into a precise QA equilibrium with vmec_jax.optimize.least_squares (implicit Jacobian): a max_mode 1->2 continuation drives the two-term quasisymmetry ratio residual down ~4 orders of magnitude at aspect ratio 6.00 and mean iota 0.42. That precise-QA boundary is config 1 (QS ratio residual 1.1e-4, <j.B>/sqrt(<B^2>) 6.4e-3). Stage B holds that residual under a hard one-sided cap while lowering the bootstrap current -- <j.B> from the sfincs_jax kinetic solve -- so config 2 is precise QA too. One jax.value_and_grad differentiates the whole chain: boundary coefficients -> vmec_jax fixed-boundary equilibrium (implicit-adjoint VJP) -> differentiable Boozer transform (booz_xform_jax) -> sfincs_jax kinetic solve (tier-2 GCROT with implicit differentiation, warm-started and recycled) -> FSABjHat. QA makes the guiding-centre drifts tokamak-like and the bootstrap current large, so at held QA the reduction is a genuine, modest Pareto gain, not a free lunch: <j.B>/sqrt(<B^2>) drops to 5.7e-3 (1.12x lower) with the QS ratio residual held at 1.1e-4 vs 2.6e-4 (both sub-3e-4 precise QA), aspect ratio ~6 and mean iota above 0.41, in roughly thirteen minutes on a laptop CPU (mostly one-time XLA compilation). The end-to-end gradient is finite-difference verified (kinetic segment ~3e-6; full chain ~5e-3, limited by the host solver's termination noise, not by autodiff).

Precise QA vs precise QA held + bootstrap: bootstrap current and two-term QS residual for both configs, plus a 3D render of |B| on the bootstrap-optimized boundary

Examples

Six pedagogic scripts on the canonical API live at the top of examples/ — no main(), parameters at the top, printed progress, a plot, and output files written and read back:

run_tokamak.py — build a namelist in Python, solve, read HDF5/NetCDF back.
run_w7x.py — W7-X Boozer geometry with full Fokker-Planck collisions (tier-2 Krylov).
transport_coefficients.py — monoenergetic transport matrices and a collisionality scan.
ambipolar_er_scan.py — scan Er, bracket and solve the ambipolar root.
gradients_tour.py — jax.grad through the solve, verified against finite differences.
optimize_QA_bootstrap.py — the flagship optimization above.

The wider examples/ tree (tutorial notebooks, parity/benchmark drivers, upstream SFINCS decks) is mapped in examples/README.md.

Documentation

pip install -e ".[docs]"
sphinx-build -b html -W docs docs/_build/html

Entry points: docs/index.rst (landing + quickstart), docs/examples.rst, docs/performance.rst (measured evidence and known issues), docs/inputs.rst / docs/outputs.rst (namelist and output references), and docs/system_equations.rst (the equations solved).

Known issues

Nxi_for_x ramps embed the truncated degrees of freedom as identity-pinned rows in the matrix-free operator (the Fortran code packs them out of its matrix). The direct tier solves each (species, x) subsystem with its own packed Legendre count — the exact Fortran discretization — and the differentiable path pins the truncated rows; gradients through the ramped route match finite differences to 1e-6 relative in the regression tests, and every solve raises at execution time if a forward or adjoint solve fails to converge.
The scheme-1 monoenergetic transportMatrix[0,1] element is ill-conditioned in the upstream configuration itself (tolerance-unstable in the Fortran build); parity for it is pinned to upstream's expected value. Near-singular structured eliminations (for example a collisionless nu_n = 0 deck) fall back automatically from the direct tier to the preconditioned Krylov tier.

Testing

pytest -q

License

See LICENSE.

Project details

Release history Release notifications | RSS feed

This version

1.2.0

Jul 13, 2026

1.1.7

Jun 13, 2026

1.1.6

Jun 10, 2026

1.1.5

Jun 10, 2026

1.1.4

May 16, 2026

1.1.3

May 15, 2026

1.1.2

May 10, 2026

1.1.1

May 10, 2026

1.1.0

Apr 29, 2026

1.0.7

Apr 26, 2026

1.0.6

Apr 26, 2026

1.0.5

Apr 25, 2026

1.0.4

Apr 24, 2026

1.0.3

Apr 24, 2026

1.0.2

Apr 24, 2026

1.0.1

Apr 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sfincs_jax-1.2.0.tar.gz (594.5 kB view details)

Uploaded Jul 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sfincs_jax-1.2.0-py3-none-any.whl (364.8 kB view details)

Uploaded Jul 13, 2026 Python 3

File details

Details for the file sfincs_jax-1.2.0.tar.gz.

File metadata

Download URL: sfincs_jax-1.2.0.tar.gz
Upload date: Jul 13, 2026
Size: 594.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for sfincs_jax-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`1e9c2bf2aba8ee2b56bd52f8acb2f33976d45b897cae9677aa2eb58dda1d41f1`
MD5	`539a80372b2e9b940410809772d65676`
BLAKE2b-256	`264ee2186192454e4f97ce39a8524e8a0374592c49118c21948a9ba149bd5d50`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sfincs_jax-1.2.0.tar.gz:

Publisher: publish-pypi.yml on uwplasma/sfincs_jax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sfincs_jax-1.2.0.tar.gz
- Subject digest: 1e9c2bf2aba8ee2b56bd52f8acb2f33976d45b897cae9677aa2eb58dda1d41f1
- Sigstore transparency entry: 2164641079
- Sigstore integration time: Jul 13, 2026
Source repository:
- Permalink: uwplasma/sfincs_jax@537b21a7261c9ca780affaf770c303cfc053e7ec
- Branch / Tag: refs/tags/v1.2.0
- Owner: https://github.com/uwplasma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@537b21a7261c9ca780affaf770c303cfc053e7ec
- Trigger Event: push

File details

Details for the file sfincs_jax-1.2.0-py3-none-any.whl.

File metadata

Download URL: sfincs_jax-1.2.0-py3-none-any.whl
Upload date: Jul 13, 2026
Size: 364.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for sfincs_jax-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`820854e9653083bb4005cee812aeff509999290f18b65e15d1482a1cbf5b7c87`
MD5	`e38f8ed7fb3c78b25a5efcda80b80bb1`
BLAKE2b-256	`ba4f6341761dba1c0cf8dd14ffa055c7caf2b0744aea80394a5850baad6b8706`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sfincs_jax-1.2.0-py3-none-any.whl:

Publisher: publish-pypi.yml on uwplasma/sfincs_jax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sfincs_jax-1.2.0-py3-none-any.whl
- Subject digest: 820854e9653083bb4005cee812aeff509999290f18b65e15d1482a1cbf5b7c87
- Sigstore transparency entry: 2164641094
- Sigstore integration time: Jul 13, 2026
Source repository:
- Permalink: uwplasma/sfincs_jax@537b21a7261c9ca780affaf770c303cfc053e7ec
- Branch / Tag: refs/tags/v1.2.0
- Owner: https://github.com/uwplasma
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@537b21a7261c9ca780affaf770c303cfc053e7ec
- Trigger Event: push

sfincs-jax 1.2.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

sfincs_jax

Installation

Quickstart

Performance vs SFINCS Fortran v3

Parity with SFINCS Fortran v3

Functionality: sfincs_jax vs SFINCS Fortran v3

Optimization showcase

Examples

Documentation

Known issues

Testing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance