Testing and evaluation suite for the GRDL library, using real-world data

GRDL-TE: Testing, Evaluation & Benchmarking

GRDL-TE is fully compatible with the GRDL v0.5.1 release.

GRDL-TE is the validation and benchmarking suite for GRDL (the GEOINT Rapid Development Library). It serves two purposes:

Validation — tests GRDL's public API against real-world satellite data with 3-level validation (format, quality, integration).
Benchmarking — profiles GRDL workflows and individual components, aggregates metrics across runs, and persists results for regression detection and cross-hardware comparison.

GRDL-TE is a consumer of GRDL — it only imports the public API. It never modifies GRDL internals.

Architecture

grdx/
├── grdl/             # Core library — readers, filters, transforms, geolocation
├── grdl-runtime/     # Workflow execution engine (DAG orchestration, YAML pipelines)
├── grdk/             # GUI toolkit (Orange3 widgets, napari viewers)
└── grdl-te/          # This package — validation tests + benchmark profiling

Layer	Package	Role
Library	`grdl`	Modular building blocks for GEOINT image processing
Runtime	`grdl-runtime`	Headless workflow executor, YAML pipeline loader
T&E	`grdl-te`	Correctness validation and performance profiling against `grdl`

Setup

Environment

GRDL-TE shares the grdl conda environment with all GRDX repositories:

conda activate grdl

Installation

# Core — models, store, component benchmarks, all tests
pip install -e .

# With workflow benchmarking (requires grdl-runtime)
pip install -e ".[benchmarking]"

# With dev tools (pytest-benchmark, pytest-xdist)
pip install -e ".[dev]"

Dependencies

Package	Version	Purpose
`grdl`	latest	Library under test
`pytest`	>=7.0	Test framework
`pytest-cov`	>=4.0	Coverage reporting
`numpy`	>=1.21	Array operations
`h5py`	>=3.0	HDF5 format support
`rasterio`	>=1.3	GeoTIFF/NITF support (via GDAL)

Optional:

Package	Install extra	Purpose
`grdl-runtime`	`benchmarking`	Active workflow benchmarking
`pytest-benchmark`	`dev`	Benchmark comparison
`pytest-xdist`	`dev`	Parallel test execution (`-n auto`)

Validation Suite

Three-level validation against real-world satellite data (more than 700 tests across 51 test files):

Level	Scope	Examples
L1 — Format	Reader instantiation, metadata, shape/dtype, chip reads, resource cleanup	SICD complex64 dtype, GeoTIFF COG tiling
L2 — Quality	CRS projection, value ranges, NoData masking, format-specific features	UTM zone validation, 15-bit reflectance ceilings, SAR speckle statistics
L3 — Integration	Multi-component pipelines (chip, normalize, tile, detect)	ChipExtractor → Normalizer → batch validation

Tests skip gracefully when data is absent (pytest.skip with download instructions). When data is present, each test produces a genuine pass or fail, so a missing file is never reported as a pass.

Each data/<dataset>/README.md contains download instructions, expected file properties, and format specifications.
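The skip-when-absent behaviour can be sketched with plain pytest. This is a minimal illustration, not the suite's actual fixture code: the `require_data` helper, the `landsat` folder name, and the `*.TIF` pattern are assumptions made for the example.

```python
from pathlib import Path

import pytest

# Assumed layout: data/<dataset>/ relative to the working directory.
DATA_ROOT = Path("data")


def require_data(dataset: str, pattern: str) -> Path:
    """Return the first file matching `pattern`, or skip the calling test."""
    folder = DATA_ROOT / dataset
    matches = sorted(folder.glob(pattern)) if folder.is_dir() else []
    if not matches:
        # Skipping (rather than passing) keeps an absent file out of the pass count.
        pytest.skip(
            f"No {pattern} in {folder}; see {folder / 'README.md'} "
            "for download instructions"
        )
    return matches[0]


@pytest.mark.requires_data
def test_landsat_file_is_not_empty():
    path = require_data("landsat", "*.TIF")  # hypothetical dataset folder
    # These assertions only run when the file is really there.
    assert path.stat().st_size > 0
```

Because the skip is raised inside the helper, every test that calls it reports as skipped with the same message when the dataset has not been downloaded.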

Running Tests

conda activate grdl

# Full suite (missing data files skip cleanly)
pytest

# Specific reader
pytest tests/validation/test_io_geotiff.py -v        # Landsat
pytest tests/validation/test_io_nitf.py -v            # Umbra SICD
pytest tests/validation/test_io_sentinel1.py -v       # Sentinel-1
pytest tests/validation/test_io_nisar.py -v            # NISAR
pytest tests/validation/test_io_eo_nitf.py -v          # EO NITF (RPC/RSM)
pytest tests/validation/test_io_sicd.py -v             # Dedicated SICD
pytest tests/validation/test_io_viirs.py -v            # Dedicated VIIRS

# Geolocation tests
pytest tests/validation/test_geolocation_base.py tests/validation/test_geolocation_utils.py -v
pytest tests/validation/test_geolocation_affine_real.py -v

# Processing tests
pytest tests/validation/test_detection_cfar.py -v
pytest tests/validation/test_decomposition_halpha.py -v
pytest tests/validation/test_decomposition_pauli.py -v
pytest tests/validation/test_sar_image_formation.py -v
pytest tests/validation/test_sar_sublook_dominance.py -v

# Benchmarking infrastructure tests
pytest tests/benchmarking/ -v

# By marker
pytest -m landsat                     # All Landsat tests
pytest -m viirs                       # All VIIRS tests
pytest -m geolocation                 # All geolocation tests
pytest -m integration                 # Only Level 3 integration tests
pytest -m "nitf and not slow"         # NITF tests, skip slow ones
pytest -m benchmark                   # Benchmarking infrastructure tests
pytest -m sar                         # SAR processing tests
pytest -m detection                   # Detection algorithm tests
pytest -m decomposition               # Polarimetric decomposition tests
pytest -m interpolation               # Interpolation tests

# Skip all data-dependent tests
pytest -m "not requires_data"

Test Markers

Marker	Purpose
`landsat`	Landsat 8/9 tests (GeoTIFFReader)
`viirs`	VIIRS VNP09GA tests (HDF5Reader)
`sentinel2`	Sentinel-2 tests (JP2Reader)
`nitf`	Umbra SICD tests (NITFReader)
`cphd`	CPHD format tests
`crsd`	CRSD format tests
`sidd`	SIDD format tests
`sentinel1`	Sentinel-1 SLC tests
`aster`	ASTER L1T tests
`biomass`	BIOMASS L1 tests
`terrasar`	TerraSAR-X/TanDEM-X tests
`nisar`	NISAR HDF5 SAR tests (NISARReader)
`eo_nitf`	EO NITF with RPC/RSM tests (EONITFReader)
`sicd`	Dedicated SICDReader tests
`writers`	IO writer roundtrip tests
`transforms`	Detection geometry transform tests
`geolocation`	Geolocation utility and coordinate transform tests
`elevation`	Elevation model tests
`requires_data`	Test requires real data files in `data/`
`slow`	Long-running test (large file reads, full pipelines)
`integration`	Level 3 tests (ChipExtractor, Normalizer, Tiler workflows)
`benchmark`	Performance benchmark tests
`sar`	SAR-specific processing tests
`image_formation`	SAR image formation tests
`detection`	Detection model tests
`cfar`	CFAR detector tests
`decomposition`	Polarimetric decomposition tests
`ortho`	Orthorectification tests
`coregistration`	CoRegistration tests
`interpolation`	Interpolation algorithm tests
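Custom markers have to be declared before `pytest --strict-markers` will accept them. One common way is a `pytest_configure` hook in conftest.py, sketched below for a few of the markers in the table. Whether GRDL-TE registers its markers this way or in pyproject.toml is not stated here, so treat this as an assumption.

```python
# conftest.py (sketch): register a subset of the markers listed above.
MARKERS = {
    "landsat": "Landsat 8/9 tests (GeoTIFFReader)",
    "requires_data": "Test requires real data files in data/",
    "slow": "Long-running test (large file reads, full pipelines)",
    "integration": "Level 3 tests (ChipExtractor, Normalizer, Tiler workflows)",
}


def pytest_configure(config):
    """pytest hook: declare each marker so it can be selected with -m."""
    for name, description in MARKERS.items():
        config.addinivalue_line("markers", f"{name}: {description}")
```

Once registered, the markers combine with boolean expressions, as in `pytest -m "nitf and not slow"`.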

Benchmarking

CLI Benchmark Suite

Run the full benchmark suite from the command line:

python -m grdl_te                              # medium arrays, 10 iterations
python -m grdl_te --size small -n 5            # quick run
python -m grdl_te --size large -n 20           # thorough run
python -m grdl_te --only filters intensity     # specific benchmark groups
python -m grdl_te --skip-workflow              # component benchmarks only
python -m grdl_te --store-dir ./results        # custom output directory
python -m grdl_te --report                     # print report to terminal
python -m grdl_te --report ./reports/          # save report to directory
python -m grdl_te --report ./my_report.txt     # save report to file

Array size presets:

Preset	Dimensions
`small`	512 x 512
`medium`	2048 x 2048
`large`	4096 x 4096
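To get a feel for what each preset costs in memory, the arithmetic can be sketched as follows. The dtypes are assumptions chosen for illustration (float32 for real-valued imagery, complex64 for SAR); the dtype the suite actually generates is not stated here.

```python
PRESETS = {"small": 512, "medium": 2048, "large": 4096}   # side length in pixels
BYTES_PER_PIXEL = {"float32": 4, "complex64": 8}          # assumed dtypes


def footprint_mib(side: int, dtype: str = "float32") -> float:
    """Memory for one square single-band array, in MiB."""
    return side * side * BYTES_PER_PIXEL[dtype] / 2**20


for name, side in PRESETS.items():
    print(
        f"{name:>6}: {footprint_mib(side):5.0f} MiB float32, "
        f"{footprint_mib(side, 'complex64'):5.0f} MiB complex64"
    )
```

Each step up in preset multiplies the pixel count by 16 (small to medium) or 4 (medium to large), so a single large float32 array is 64 MiB before any intermediate copies made by the component under test.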

Benchmark groups (13):

Group	Coverage
`filters`	Mean, Gaussian, Median, Min, Max, StdDev, Lee, ComplexLee, PhaseGradient
`intensity`	ToDecibels, PercentileStretch
`decomposition`	Pauli, DualPolHAlpha, SublookDecomposition
`detection`	CA-CFAR, GO-CFAR, SO-CFAR, OS-CFAR, DetectionSet, Fields
`sar`	MultilookDecomposition, CSIProcessor
`image_formation`	CollectionGeometry, PolarGrid, PFA, RDA, StripmapPFA, FFBP, SubaperturePartitioner
`ortho`	Orthorectifier, OutputGrid, OrthoPipeline, compute_output_resolution
`coregistration`	Affine, FeatureMatch, Projective
`io`	22 readers/writers (GeoTIFF, HDF5, NITF, JP2, SICD, CPHD, CRSD, SIDD, Sentinel-1/2, ASTER, BIOMASS, TerraSAR-X)
`geolocation`	Affine, GCP, SICD, Sentinel-1 SLC, NoGeolocation
`interpolation`	Lanczos, KaiserSinc, Lagrange, Farrow, Polyphase, ThiranDelay
`data_prep`	ChipExtractor, Tiler, Normalizer
`pipeline`	Sequential Pipeline composition

Active Workflow Benchmarking

Run a grdl-runtime Workflow N times, aggregate per-step metrics, and persist results:

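from grdl_rt import Workflow
from grdl_rt.api import load_workflow
from grdl_te.benchmarking import ActiveBenchmarkRunner, BenchmarkSource, JSONBenchmarkStore

# SICDReader, SublookDecomposition, and ToDecibels are GRDL components and must
# also be imported from grdl before this snippet runs; their module paths are
# not given here, so check the GRDL documentation for your installed version.

# Results are written through this store (JSON files under .benchmarks/).
store = JSONBenchmarkStore()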

# ==== Pass a declared workflow ====
wf = (
    Workflow("SAR Pipeline", modalities=["SAR"])
    .reader(SICDReader)
    .step(SublookDecomposition, num_looks=3)
    .step(ToDecibels)
)
runner = ActiveBenchmarkRunner(wf, iterations=10, warmup=2, store=store)
record = runner.run(source="image.nitf", prefer_gpu=True)

# ==== Load a YAML workflow ====
wf = load_workflow("path/to/my_workflow.yaml")
source = BenchmarkSource.synthetic("medium")

runner = ActiveBenchmarkRunner(
    workflow=wf, source=source, iterations=5, warmup=1, store=store,
)
record = runner.run()

# record.total_wall_time.mean, .stddev, .p95
# record.step_results[0].wall_time_s.mean
# record.hardware.cpu_count, .gpu_devices

Component Benchmarking

Profile individual GRDL functions outside of a workflow context:

import numpy as np

from grdl.data_prep import Normalizer
from grdl_te.benchmarking import ComponentBenchmark

image = np.random.rand(4096, 4096).astype(np.float32)
norm = Normalizer(method='minmax')

bench = ComponentBenchmark(
    name="Normalizer.minmax.4k",
    fn=norm.normalize,
    setup=lambda: ((image,), {}),
    iterations=20,
    warmup=3,
)
record = bench.run()
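The roles of `setup`, `warmup`, and `iterations` can be illustrated with a small standalone timing loop. This is a sketch of the general technique only, not ComponentBenchmark's implementation; `time_component` is a hypothetical name, and the assumption that `setup` is called once per iteration is made for the example.

```python
import time
from statistics import mean


def time_component(fn, setup, iterations=20, warmup=3):
    """Call fn(*args, **kwargs) repeatedly and return per-iteration wall times."""
    samples = []
    for i in range(warmup + iterations):
        args, kwargs = setup()              # inputs are built outside the timed region
        start = time.perf_counter()
        fn(*args, **kwargs)
        elapsed = time.perf_counter() - start
        if i >= warmup:                     # warmup runs are executed but discarded
            samples.append(elapsed)
    return samples


# Usage: time the built-in sorted() on a small list.
samples = time_component(sorted, lambda: (([3, 1, 2],), {}), iterations=5, warmup=1)
print(f"{len(samples)} samples, mean {mean(samples):.2e} s")
```

Returning `(args, kwargs)` from `setup` mirrors the `lambda: ((image,), {})` shape in the example above, and discarding warmup runs keeps one-off costs such as cache fills out of the reported numbers.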

Benchmark Data Sources

from grdl_te.benchmarking import BenchmarkSource

# Synthetic data (lazy generation with caching)
source = BenchmarkSource.synthetic("medium")   # 2048x2048
source = BenchmarkSource.synthetic("small")    # 512x512
source = BenchmarkSource.synthetic("large")    # 4096x4096

# Real data file
source = BenchmarkSource.from_file("path/to/image.nitf")

# Existing array (my_array is an array you already hold in memory)
source = BenchmarkSource.from_array(my_array)
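"Lazy generation with caching" means the array is only built when first requested and is then reused. A minimal sketch of that pattern is shown below. `LazySyntheticSource` is a hypothetical stand-in, not the BenchmarkSource class, and the uniform float32 noise and the seed are assumptions.

```python
from functools import cached_property

import numpy as np

SIZES = {"small": 512, "medium": 2048, "large": 4096}


class LazySyntheticSource:
    """Hypothetical stand-in: builds its array on first access, then reuses it."""

    def __init__(self, preset: str, seed: int = 0):
        self.side = SIZES[preset]
        self.seed = seed

    @cached_property
    def array(self) -> np.ndarray:
        # Runs once per instance; later accesses return the stored result.
        rng = np.random.default_rng(self.seed)
        return rng.random((self.side, self.side), dtype=np.float32)


source = LazySyntheticSource("small")   # nothing allocated yet
first = source.array                    # generated here
second = source.array                   # served from the cache
print(first.shape, first is second)
```

Deferring generation keeps construction cheap, and reusing the same array across iterations means every timed run sees identical input.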

Result Storage

Results are stored as JSON files in .benchmarks/:

.benchmarks/
  index.json              # lightweight index for fast filtering
  records/
    <uuid>.json           # full BenchmarkRecord per run

Each BenchmarkRecord captures the HardwareSnapshot (CPU, RAM, GPU, platform), per-step AggregatedMetrics (min, max, mean, median, stddev, p95), and raw per-iteration measurements for lossless post-hoc analysis.
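The six aggregated statistics can be derived from the raw per-iteration samples with the standard library alone. The sketch below is illustrative: the dictionary keys are assumptions rather than the actual BenchmarkRecord schema, and p95 uses the nearest-rank definition, which may differ from the one GRDL-TE uses.

```python
import json
import math
import statistics
import uuid


def aggregate(samples):
    """Summarise raw timings as min, max, mean, median, stddev, and p95."""
    ordered = sorted(samples)
    # Nearest-rank percentile: smallest value with at least 95% of samples at or below it.
    rank = max(1, math.ceil(95 * len(ordered) / 100))
    return {
        "min": ordered[0],
        "max": ordered[-1],
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "stddev": statistics.stdev(ordered) if len(ordered) > 1 else 0.0,
        "p95": ordered[rank - 1],
    }


samples = [0.12, 0.11, 0.13, 0.12, 0.40, 0.11, 0.12, 0.13, 0.12, 0.11]
record = {
    "id": str(uuid.uuid4()),            # hypothetical field names throughout
    "wall_time_s": aggregate(samples),
    "raw_samples": samples,             # kept so later analysis is not limited to the summary
}
print(json.dumps(record["wall_time_s"], indent=2))
```

Keeping the raw samples next to the summary is what makes the analysis lossless: a different percentile or an outlier filter can be applied later without rerunning the benchmark.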

Example Workflow

The workflows/ directory contains example grdl-runtime workflow definitions. comprehensive_benchmark_workflow.yaml defines a 28-step SAR processing pipeline covering complex speckle filtering, phase gradient analysis, amplitude conversion, CFAR detection, and conditional orthorectification.

benchmark_examples.py demonstrates active workflow benchmarking with ActiveBenchmarkRunner at multiple array scales.

Project Status

Component coverage: 78/78 (100%)

All public GRDL components have both a dedicated benchmark in suite.py and a correctness validation test in tests/validation/. See BENCHMARK_COVERAGE_GAPS.md for the full inventory.

Metric	Value
Benchmarked components	78/78
Benchmark groups	13
Validation test files	51
Benchmark infrastructure tests	6
YAML workflow steps	28
Array size presets	small (512), medium (2048), large (4096)

Active Development

Passive Monitoring — ExecutionHook for capturing metrics from production workflows
Regression Detection — cross-run comparison with configurable thresholds
Cross-Hardware Prediction — collect results from different machines, predict performance on new hardware
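As an illustration of what threshold-based regression detection could look like, the sketch below compares per-step mean times from two runs. It is not the project's implementation (that feature is still in development); the function name, the input shape, the 10% default, and the timing values are all assumptions.

```python
def find_regressions(baseline, current, threshold=0.10):
    """Return steps whose mean time grew by more than `threshold` (a fraction)."""
    regressions = {}
    for step, base_mean in baseline.items():
        new_mean = current.get(step)
        if new_mean is None or base_mean <= 0:
            continue                        # step missing or baseline unusable
        change = (new_mean - base_mean) / base_mean
        if change > threshold:
            regressions[step] = change
    return regressions


# Hypothetical mean wall times in seconds, keyed by step name.
baseline = {"SublookDecomposition": 1.20, "ToDecibels": 0.050}
current = {"SublookDecomposition": 1.50, "ToDecibels": 0.051}

for step, change in find_regressions(baseline, current).items():
    print(f"{step}: {change:+.0%} relative to baseline")
```

A relative threshold keeps fast and slow steps on the same footing. Comparing runs from different machines would also need the hardware snapshot taken into account, which this sketch ignores.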

Dependency Management

Source of Truth: `pyproject.toml`

All dependencies are defined in pyproject.toml. Keep these files synchronized:

pyproject.toml — source of truth for versions and dependencies
requirements.txt — regenerate with pip freeze > requirements.txt after updating pyproject.toml

Note: GRDL-TE is a validation suite rather than a general-purpose library. Releases are nonetheless published to PyPI through the publish.yml workflow, as the provenance attestations for 0.4.1 record.

Updating Dependencies

Update dependencies in pyproject.toml (add new packages, change versions, create/rename extras)
Install dependencies: pip install -e ".[all,dev]" (or appropriate extras for your work)
If requirements.txt exists, regenerate it: pip freeze > requirements.txt
Commit both files

See CLAUDE.md for detailed dependency management guidelines.

License

MIT License — see LICENSE for details.

Release history

Version	Release date
0.4.1 (this version)	Apr 27, 2026
0.4.0	Apr 7, 2026
0.1.0	Feb 27, 2026

Download files

Source Distribution

grdl_te-0.4.1.tar.gz (84.3 kB)

Uploaded Apr 27, 2026 Source

Built Distribution

grdl_te-0.4.1-py3-none-any.whl (99.3 kB)

Uploaded Apr 27, 2026 Python 3

File details

Details for the file grdl_te-0.4.1.tar.gz.

File metadata

Download URL: grdl_te-0.4.1.tar.gz
Upload date: Apr 27, 2026
Size: 84.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for grdl_te-0.4.1.tar.gz
Algorithm	Hash digest
SHA256	`56a3d559540b965e9bf2cddfb8e34930cbb26c1f553ec8bb98e31ab93234a74f`
MD5	`4e912230ed682879b76a60ad5ec46516`
BLAKE2b-256	`fcf965eff215b4d2bf2d76f5a4b7ff91a5de057375d7ef3fc9984161df527307`

Provenance

The following attestation bundles were made for grdl_te-0.4.1.tar.gz:

Publisher: publish.yml on GEOINT/grdl-te

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: grdl_te-0.4.1.tar.gz
- Subject digest: 56a3d559540b965e9bf2cddfb8e34930cbb26c1f553ec8bb98e31ab93234a74f
- Sigstore transparency entry: 1393458905
- Sigstore integration time: Apr 27, 2026
Source repository:
- Permalink: GEOINT/grdl-te@45d0cef63b6565a9330e3fbc0d4edd9dac7eab0f
- Branch / Tag: refs/tags/v0.4.1
- Owner: https://github.com/GEOINT
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@45d0cef63b6565a9330e3fbc0d4edd9dac7eab0f
- Trigger Event: release

File details

Details for the file grdl_te-0.4.1-py3-none-any.whl.

File metadata

Download URL: grdl_te-0.4.1-py3-none-any.whl
Upload date: Apr 27, 2026
Size: 99.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for grdl_te-0.4.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c3547fb703ae08404f254a93ecca9cd54bf5872aa297fd04c2d54f3ed8cfc8ce`
MD5	`a3d02501fd55cec543a326066acd7c02`
BLAKE2b-256	`404ec44faaff35f7f44c5c729454482677b91139f11edf569a958e50e580a0f9`

Provenance

The following attestation bundles were made for grdl_te-0.4.1-py3-none-any.whl:

Publisher: publish.yml on GEOINT/grdl-te

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: grdl_te-0.4.1-py3-none-any.whl
- Subject digest: c3547fb703ae08404f254a93ecca9cd54bf5872aa297fd04c2d54f3ed8cfc8ce
- Sigstore transparency entry: 1393458909
- Sigstore integration time: Apr 27, 2026
Source repository:
- Permalink: GEOINT/grdl-te@45d0cef63b6565a9330e3fbc0d4edd9dac7eab0f
- Branch / Tag: refs/tags/v0.4.1
- Owner: https://github.com/GEOINT
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@45d0cef63b6565a9330e3fbc0d4edd9dac7eab0f
- Trigger Event: release
