TerraCodec
Compressing Optical Earth Observation Data
TerraCodec (TEC) is a family of pretrained neural compression models for optical Sentinel-2 Earth Observation imagery. Models compress multispectral images and seasonal time series using learned latent representations and entropy coding.
Compared to classical codecs (JPEG2000, WebP, HEVC), TerraCodec achieves 3–10× higher compression at comparable reconstruction quality on multispectral EO imagery. Temporal models further improve compression by exploiting redundancy across seasonal sequences.
📄 Paper: https://arxiv.org/abs/2510.12670
🤗 Models: https://huggingface.co/embed2scale
Installation
```shell
pip install terracodec
```
Requirements: Python ≥ 3.10, PyTorch ≥ 2.0
All pretrained checkpoints are automatically downloaded from HuggingFace on first use.
Models
TerraCodec includes image codecs and temporal codecs for EO data.
Image Codecs
| Model | Description |
|---|---|
| TEC-FP | Factorized-prior model. Smallest, strong baseline. |
| TEC-ELIC | Enhanced entropy model with spatial + channel context. Better rate–distortion, slightly larger. |
Temporal Codecs
| Model | Description |
|---|---|
| TEC-TT | Temporal Transformer for multispectral time series data. Predicts latent distributions from previous frames. |
| FlexTEC | Flexible-rate extension of TEC-TT. One checkpoint covers many compression levels via latent repacking and token prediction. |
FlexTEC Examples
One model, multiple quality levels: by varying the token budget at inference, FlexTEC provides different compression/quality trade-offs. Early tokens encode global structure; additional tokens progressively refine details.
Pretrained Checkpoints
Image Compression
| Checkpoint | Architecture | Training Data | λ values |
|---|---|---|---|
| `terracodec_v1_fp_s2l2a` | TEC-FP | Sentinel-2 L2A | 0.5, 2, 10, 40, 200 |
| `terracodec_v1_elic_s2l2a` | TEC-ELIC | Sentinel-2 L2A | 0.5, 2, 10, 40, 200 |
Low λ → higher compression. High λ → higher quality.
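Each checkpoint ships with a discrete set of λ values, so a desired trade-off has to be snapped to a supported one. A minimal sketch of such a lookup — `nearest_lambda` is a hypothetical helper, not part of the terracodec API:

```python
# Supported λ values, mirroring the table above.
AVAILABLE_LAMBDAS = [0.5, 2, 10, 40, 200]

def nearest_lambda(target: float) -> float:
    """Return the supported λ closest to the desired trade-off value.

    Hypothetical convenience helper; not part of the terracodec API.
    """
    return min(AVAILABLE_LAMBDAS, key=lambda lam: abs(lam - target))
```

For example, `nearest_lambda(15)` resolves to `10`, the closest available setting.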
Temporal Compression
| Checkpoint | Architecture | Training Data | λ values |
|---|---|---|---|
| `terracodec_v1_tt_s2l2a` | TEC-TT | Sentinel-2 L2A (seasonal) | 0.4, 1, 5, 20, 100, 200, 700 |
| `terracodec_v1_tt_s2l1c` | TEC-TT | Sentinel-2 L1C | 5, 20, 100 |
The L1C model was used for the declouding experiments in the paper.
Flexible-Rate
| Checkpoint | Architecture | Quality range |
|---|---|---|
| `flextec_v1_s2l2a` | FlexTEC | 1–16 (low = high compression) |
Loading Models
Standalone Usage
Image codec — pass compression as a λ value:
```python
from terracodec import terracodec_v1_fp_s2l2a

model = terracodec_v1_fp_s2l2a(
    pretrained=True,
    compression=10,
)
```
Temporal codec — pass compression as a λ value:
```python
from terracodec import terracodec_v1_tt_s2l2a

model = terracodec_v1_tt_s2l2a(
    pretrained=True,
    compression=20,
)
```
FlexTEC — one model for many compression levels, quality is specified at inference time (see below):
```python
from terracodec import flextec_v1_s2l2a

model = flextec_v1_s2l2a(
    pretrained=True,
)
```
Alternative: TerraTorch Integration
TerraCodec models are also available through the TerraTorch model registry.
Install TerraTorch via pip. To ensure compatibility, we recommend installing TerraTorch from the main branch until v1.3 is released:
```shell
pip install terracodec "terratorch @ git+https://github.com/terrastackai/terratorch@main"
```
Models can then be instantiated directly via the registry:
```python
from terratorch import FULL_MODEL_REGISTRY

model = FULL_MODEL_REGISTRY.build(
    "terracodec_v1_fp_s2l2a",
    pretrained=True,
    compression=10,
)
```
Input Format
Tensor shapes
| Codec type | Shape | Example |
|---|---|---|
| Image codecs | `[B, C, H, W]` | `[1, 12, 256, 256]` |
| Temporal codecs | `[B, T, C, H, W]` | `[1, 4, 12, 256, 256]` |
- 12 spectral bands (Sentinel-2 L2A) or 13 bands (L1C)
- Spatial size: 256×256 recommended. TEC-FP accepts arbitrary sizes; all other models expect 256×256.
- Temporal models: pretrained on four seasonal frames, but they accept an arbitrary number of timesteps at inference; more frames increase inference cost.
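A minimal sketch of correctly shaped dummy inputs (random values, useful only for shape checking):

```python
import torch

# Image codecs expect [B, C, H, W]: 1 sample, 12 L2A bands, 256x256 pixels.
image_batch = torch.randn(1, 12, 256, 256)

# Temporal codecs expect [B, T, C, H, W]: here T=4 seasonal frames.
sequence = torch.randn(1, 4, 12, 256, 256)
```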
Normalization
All models are pretrained on SSL4EO-S12 v1.1. Inputs should be standardized per spectral band using the SSL4EO-S12 v1.1 L2A statistics:
```python
mean = torch.tensor([793.243, 924.863, 1184.553, 1340.936, 1671.402, 2240.082,
                     2468.412, 2563.243, 2627.704, 2711.071, 2416.714, 1849.625])
std = torch.tensor([1160.144, 1201.092, 1219.943, 1397.225, 1400.035, 1373.136,
                    1429.170, 1485.025, 1447.836, 1652.703, 1471.002, 1365.307])
```
For L1C models (13 bands), use the corresponding L1C statistics:

```python
mean = torch.tensor([1607.345, 1393.068, 1320.225, 1373.963, 1562.536, 2110.071,
                     2392.832, 2321.154, 2583.77, 838.712, 21.753, 2205.112, 1545.798])
std = torch.tensor([786.523, 849.702, 875.318, 1143.578, 1126.248, 1161.98,
                    1273.505, 1246.79, 1342.755, 576.795, 45.626, 1340.347, 1145.036])
```
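The per-band standardization itself can be sketched as follows, broadcasting the statistics over the spatial dimensions (the raw input values here are illustrative random data, not real reflectances):

```python
import torch

# SSL4EO-S12 v1.1 L2A statistics from above (12 bands).
mean = torch.tensor([793.243, 924.863, 1184.553, 1340.936, 1671.402, 2240.082,
                     2468.412, 2563.243, 2627.704, 2711.071, 2416.714, 1849.625])
std = torch.tensor([1160.144, 1201.092, 1219.943, 1397.225, 1400.035, 1373.136,
                    1429.170, 1485.025, 1447.836, 1652.703, 1471.002, 1365.307])

# Illustrative stand-in for a raw [B, C, H, W] Sentinel-2 tile.
x = torch.rand(1, 12, 256, 256) * 4000.0

# Standardize each band: reshape stats to [1, C, 1, 1] so they broadcast.
x_norm = (x - mean.view(1, -1, 1, 1)) / std.view(1, -1, 1, 1)
```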
Inference
Forward pass (fast, no bitstream)
```python
# Image codec
reconstruction = model(inputs)

# Temporal codec
reconstruction, _ = model(sequence)
```
Compress / decompress (true bitstream)
```python
# Image or temporal codec
compressed = model.compress(inputs)
reconstruction = model.decompress(**compressed)
print(compressed["bits"])  # total bits
```
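The reported bit count can be converted into the usual bits-per-pixel (bpp) metric. A minimal sketch, assuming bpp is counted per spatial pixel (conventions vary, e.g. whether bands or frames are folded in):

```python
def bits_per_pixel(total_bits: float, batch: int, height: int, width: int,
                   frames: int = 1) -> float:
    """Total bits divided by the number of spatial pixels coded."""
    return total_bits / (batch * frames * height * width)

# e.g. 65,536 bits for one 256x256 tile -> 1.0 bpp
```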
FlexTEC
```python
# Quality 1–16: lower = higher compression
compressed = model.compress(sequence, quality=8)
reconstruction = model.decompress(compressed)
```
Examples & Notebooks
The notebooks/ directory contains end-to-end examples:
| Notebook | Description |
|---|---|
| `terracodec_fp_usage.ipynb` | TEC-FP image codec walkthrough |
| `terracodec_elic_usage.ipynb` | TEC-ELIC image codec walkthrough |
| `terracodec_tt_usage.ipynb` | TEC-TT temporal codec walkthrough |
Example Sentinel-2 images are in examples/.
To run these examples, clone and set up the repo:

```shell
git clone https://github.com/IBM/TerraCodec.git
cd TerraCodec
python -m venv venv
source venv/bin/activate
pip install -e .                 # install terracodec and its dependencies
pip install -r requirements.txt  # install packages for data loading
```
FAQ
Reconstruction quality is poor
- Check preprocessing — verify band order, reflectance scaling, and per-band normalization.
- GPU nondeterminism — entropy coding is sensitive to nondeterministic GPU operations. Enable deterministic mode:

```python
import os
import torch

os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":16:8")
torch.backends.cudnn.benchmark = False
torch.use_deterministic_algorithms(True)
```

If CPU and GPU results differ, nondeterminism is likely the cause.
Citation
```bibtex
@article{terracodec2025,
  title   = {TerraCodec: Neural Codecs for Earth Observation},
  author  = {Costa Watanabe, Julen and Wittmann, Isabelle and Blumenstiel, Benedikt},
  journal = {arXiv preprint arXiv:2510.12670},
  year    = {2025}
}
```
License
Apache 2.0 — see LICENSE.
Acknowledgments
This research is carried out as part of the Embed2Scale project and is co-funded by the EU Horizon Europe program under Grant Agreement No. 101131841. Additional funding for this project has been provided by the Swiss State Secretariat for Education, Research and Innovation (SERI) and UK Research and Innovation (UKRI).