Icarus: data-driven heat flux prediction from infrared thermography using POD, DMD, and machine learning
Project description
icarus
Data-driven heat flux prediction from infrared thermography.
icarus provides a full pipeline from raw IR camera data to trained
heat flux prediction models using Proper Orthogonal Decomposition (POD),
Dynamic Mode Decomposition (DMD), and artificial neural networks.
It implements the methodology from:
Investigating the efficacy of data-driven techniques and machine learning algorithms to predict heat transfer characteristics (Twum-Barima, 2025)
The best-performing approach (Model C: POD modal mapping) achieved R² = 0.729 on a 17M-sample flow boiling dataset — a 69 % improvement over the linear baseline.
Installation
Install directly from GitHub:
pip install git+https://github.com/twumbarimaraymond1-coder/icarus
Or from source (recommended for development):
git clone https://github.com/twumbarimaraymond1-coder/icarus
cd icarus
pip install -e ".[dev]"
Requirements: Python ≥ 3.9, NumPy, SciPy, scikit-learn, Optuna, Matplotlib.
The package name on install is icarus-thermal; the import name is icarus.
Quickstart
import icarus as tf
# Load your dataset (.mat, .h5, .npz supported)
data = tf.data.loader.load(
"experiment.mat",
temperature_key="T",
heatflux_key="qL2",
)
# Or load from numpy arrays directly
import numpy as np
data = tf.data.loader.from_arrays(T, q, dt=2.5e-4)
# Run the full pipeline (POD modal strategy, best performance)
pipeline = tf.Pipeline(
strategy="modal", # "raw" | "gradient" | "modal"
n_pod_modes=5,
spatial_crop=5,
trim_frames=43,
optimise_hyperparams=True,
n_trials=30,
)
pipeline.fit(data)
# Evaluate
metrics = pipeline.evaluate()
# [test] R² = 0.7293 RMSE = 25,959 W/m² MAE = 20,656 W/m²
# Predict on new data
q_predicted = pipeline.predict(T_new) # shape [ny, nx, nt]
Three model strategies
| Strategy | Features | Notes |
|---|---|---|
"raw" (Model A) |
Temperature only | Baseline |
"gradient" (Model B) |
T + dT/dt + dT/dx + dT/dy | Modest improvement |
"modal" (Model C) |
POD modal contributions | Best: R² = 0.729 |
The modal strategy works by:
- Decomposing the temperature field into dominant POD modes
- Learning a mapping from temperature modal coefficients → heat flux modal coefficients
- Reconstructing the full heat flux field from the predicted coefficients
Cross-dataset generalisation (multi-dataset workflow)
Beyond the single-dataset Pipeline, icarus includes a dataset registry for
training across multiple experiments and testing on a fully held-out one
(leave-one-surface-out). This answers the stronger question "does the
temperature→heat-flux modal coupling transfer to an experiment the model has
never seen?" — not merely to unseen timesteps of the same experiment.
from icarus.registry.dataset import DatasetRegistry, DatasetEntry
from icarus.registry.extractor import FeatureExtractor
from icarus.registry.trainer import MultiDatasetTrainer
reg = DatasetRegistry("~/.icarus/datasets")
for ds_id, path in [("D001", "surface1.mat"),
("D002", "surface2.mat"),
("D003", "surface3.mat")]:
reg.register(DatasetEntry(ds_id, "water", "flow_boiling", "patch",
"MyLab", raw_path=path))
ext = FeatureExtractor(reg, n_pod_modes=5)
for ds_id in ("D001", "D002", "D003"):
ext.process(ds_id)
trainer = MultiDatasetTrainer(reg, n_pod_modes=5)
trainer.cross_dataset_fit(train_ids=["D001", "D002"], test_id="D003")
metrics = trainer.evaluate() # fluctuation + absolute-field R²/RMSE
A runnable end-to-end script is provided in
examples/cross_dataset_real.py
(real .mat files) and
examples/cross_dataset_eval.py (synthetic).
Spectral POD (frequency-resolved structures)
Where POD ranks modes by energy alone, SPOD produces modes coherent at a single frequency — separating boiling structures by timescale (nucleation, bubble departure, microlayer). The high-level API mirrors the Quickstart:
import icarus as tf
# Low-memory load of one field (reads only the heater layer of 4-D temperature)
field, dt = tf.load_field("MODEL_~1.MAT", field="heatflux")
# Fit Spectral POD straight from the [ny, nx, nt] field
spod = tf.SPOD(block_size=1024).fit_field(field, dt=dt, spatial_crop=5, trim_frames=43)
print(spod.dominant_frequencies(n=4)) # candidate dominant timescales (Hz)
spod.plot_spectrum("spectrum.png") # energy vs frequency
spod.plot_mode(spod.dominant_frequencies()[0], "mode.png") # structure at top peak
A runnable version is in examples/spod_analysis.py.
Metrics: fluctuation vs absolute R²
MultiDatasetTrainer.evaluate() reports two test metrics, and the
distinction matters when comparing results:
- Fluctuation R² (returned as
"test"/"test_fluctuation") is computed on the mean-subtracted heat-flux field — the quantity the POD modal model actually predicts. It measures how well the temperature→heat-flux modal coupling is captured. This is the honest headline number. - Absolute R² (
"test_absolute") adds the per-pixel time-mean field back to both truth and prediction. It is always more flattering, because the large quasi-static spatial mean dominates the variance.
When citing or comparing results from this package, state which metric you are using.
Assumptions & conventions (read before using your own data)
- Array convention is
[ny, nx, nt]for all 3-D fields. Time-major flattening (transpose(2, 0, 1)before reshape) is used throughout so that temporal train/test splits are genuine past→future splits. - 4-D temperature arrays
[ny, nx, nz, nt]are reduced by taking z-layer 0, assumed to be the heater surface. If your surface is at a different layer, slice before loading (from_arrays(T[:, :, k, :], q)). - MATLAB v7.3 files (HDF5-based) are handled automatically, including MATLAB's reversed axis storage order.
- Default variable names are
T(temperature),qL2(heat flux), andTimeStep(scalar dt in seconds) — all overridable via keyword arguments toload()/FeatureExtractor.process(). - Units are assumed to be kelvin and W/m²; RMSE/MAE are reported in the units of the heat-flux input.
- POD
modal_contributions()returnsU^T X_cscaling (no extra σ multiplication); see docstrings before composing with your own SVD code.
Individual components
You can also use the modules independently:
from icarus.decomposition.pod import POD
from icarus.data.preprocessor import Preprocessor
# Preprocessing
pre = Preprocessor()
out = pre.fit_transform(data)
X_c = Preprocessor.to_matrix(out["T_c"]) # [n_pix, nt]
# POD
pod = POD(n_modes=10)
pod.fit(X_c)
print(f"First 5 modes capture {pod.cumulative_energy_[4]:.1%} of variance")
# Modal contributions
contribs = pod.modal_contributions(X_c) # [n_pix, nt, n_modes]
# Visualisation
from icarus.visualisation.plots import plot_pod_modes, plot_cumulative_energy
ny, nx = out["T"].shape[:2]
plot_cumulative_energy(pod)
plot_pod_modes(pod, ny=ny, nx=nx, n_modes=5)
from icarus.decomposition.dmd import DMD
# DMD forecasting
dmd = DMD(energy_threshold=0.99, dt=2.5e-4)
dmd.fit(X_c_train)
X_forecast = dmd.forecast_from(X_c_train[:, -1], n_steps=1200)
Visualisation
from icarus.visualisation.plots import (
plot_field,
plot_pod_modes,
plot_cumulative_energy,
plot_scatter,
plot_model_summary,
)
# Single field
plot_field(q[:, :, 100], title="Heat flux at t=100")
# Full model evaluation summary (6-panel figure)
plot_model_summary(
q_true_field, q_pred_field,
y_true_flat, y_pred_flat,
metrics_train, metrics_test,
r2_t=r2_t, rmse_t=rmse_t,
model_name="Model C — POD Modal",
)
Running tests
pytest tests/ -v
Project structure
icarus/
├── data/
│ ├── loader.py # .mat, .h5, .npz, numpy array loading
│ └── preprocessor.py # cropping, mean-centering, reshaping
├── decomposition/
│ ├── pod.py # POD via SVD
│ └── dmd.py # DMD forecasting
├── features/
│ └── engineer.py # gradient and modal feature construction
├── models/
│ └── neural.py # MLP with Bayesian optimisation
├── metrics/
│ └── evaluation.py # R², RMSE, MAE
├── visualisation/
│ └── plots.py # spatial fields, modes, diagnostics
└── pipeline/
└── runner.py # end-to-end Pipeline
Data availability
The flow-boiling experimental datasets used to develop and validate this
package were produced at Loughborough University and are not redistributed
in this repository; they may be available from the authors / Loughborough
University on reasonable request. All code paths can be exercised without
them: examples/quickstart.py and examples/cross_dataset_eval.py generate
synthetic data, and the test suite (pytest tests/) is fully self-contained.
Citation
If you use icarus in academic work, please cite it (see CITATION.cff):
Twum-Barima, R. (2026). icarus: data-driven heat flux prediction from infrared thermography (v0.1.0) [Computer software]. https://github.com/twumbarimaraymond1-coder/icarus
Known limitations
- Experimental datasets are not included in this repository (see Data availability).
- The reported Model C R² = 0.729 is dataset-specific and should be revalidated on independent datasets before being cited as a general result.
- The default ANN search space (
"medium") is designed for moderate-sized datasets with 5 POD modes. Larger mode counts or datasets may requirehyperparam_search_space="large"and more Optuna trials. - Current models use scikit-learn MLPs. Future versions may include PyTorch models for larger-scale training and GPU acceleration.
- DMD forecasting accuracy degrades over longer horizons — it is suited to short-horizon prediction only.
Contributing
Contributions welcome — particularly additional datasets, fluid-specific
pre-trained models, and improved DMD variants. See CONTRIBUTING.md.
Licence
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file icarus_thermal-0.4.0.tar.gz.
File metadata
- Download URL: icarus_thermal-0.4.0.tar.gz
- Upload date:
- Size: 53.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
771a17c5463d6ad3dabbd0a5cd1efbe56a4cb6f3d23bbbc51380fcceb6f6aaa4
|
|
| MD5 |
ecc228235c278c3c440a8668fe1fe789
|
|
| BLAKE2b-256 |
37fd49f3aa411d0bb8e46d4d589800681d0945128dc5668b26113eedb85b2623
|
File details
Details for the file icarus_thermal-0.4.0-py3-none-any.whl.
File metadata
- Download URL: icarus_thermal-0.4.0-py3-none-any.whl
- Upload date:
- Size: 48.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ba7c8f548610fe80c16393f3dbf808d216231cd1ea3d141b40606f83a3c05bf8
|
|
| MD5 |
1ddc126768f385ac71d2762093722f0a
|
|
| BLAKE2b-256 |
f47a7bd4a80048b3cca9a7da04a711a40c8faadadcff744dd591271c9d3fef29
|