Skip to main content

SAR data processing and focusing utilities with dataloader for Sentinel-1 products

Project description

Maya4 Logo

Multi-Level SAR Processing & PyTorch DataLoader

Unveiling the layers of Synthetic Aperture Radar data from Sentinel-1 missions

Python Version PyTorch License: GPL v3 Code style: black HF Organization

OverviewInstallationQuick StartProcessing LevelsCitation


🎯 Overview

Maya4 is a production-ready Python package and dataset organization dedicated to curating and providing multi-level intermediate SAR representations from Sentinel-1 acquisitions, spanning the entire processing chain from Level 0 (raw) to Level 1 (focused imagery).

The Māyā Philosophy

The name Maya4 draws inspiration from the Māyā veil in philosophy, where reality is hidden behind successive layers—just as radar echoes undergo multiple transformations before forming a final SAR image. Each processing level reveals a different aspect of the electromagnetic interaction with Earth's surface.

Why Maya4?

  • 🎚️ Multi-Level Access: Complete processing chain from raw echoes to focused imagery
  • 🚀 Performance: Zarr-based storage with intelligent chunk caching and lazy loading
  • 🔧 Flexibility: Access any intermediate representation for research and experimentation
  • ☁️ Cloud-Native: Native Hugging Face Hub integration with 68TB+ of curated data
  • 📊 ML-Ready: PyTorch-compatible dataloaders optimized for pre-training workflows
  • 🌍 Geographic-Aware: Built-in support for location-based clustering and filtering

🌐 Processing Levels

Maya4 Steps

Maya4 exposes the complete SAR processing chain through intermediate signal representations:

Level Abbrev. Description Purpose / Value
📡 Raw raw Unprocessed radar echoes as recorded by Sentinel-1 Baseline data; enables full custom SAR processing
🎚️ Range Compressed rc Echoes compressed in range via matched filtering Improved SNR; isolates scatterers along range
🎯 Range Cell Migration Corrected rcmc Motion-compensated with corrected range migration Preserves geometric fidelity; enables azimuth focusing
🖼️ Azimuth Compressed ac Fully focused SAR image in slant-range geometry Standard Level-1 product; interpretable imagery

Each level represents a distinct transformation in the SAR focusing pipeline, allowing researchers to:

  • Experiment with custom processing algorithms
  • Pre-train deep learning models on intermediate representations
  • Analyze signal characteristics at different processing stages
  • Develop novel focusing techniques

📦 Pre-Training Datasets

Maya4 provides curated Pre-Training (PT) datasets in cloud-native Zarr format:

Dataset Split Contents Acquisition Mode Size Hub Link
PT1 Multi-level SAR data Stripmap 17 TB 🤗 Maya4/PT1
PT2 Multi-level SAR data Stripmap 17 TB 🤗 Maya4/PT2
PT3 Multi-level SAR data Stripmap 17 TB Coming Soon
PT4 Multi-level SAR data Stripmap 17 TB 🤗 Maya4/PT4
Total 68 TB

Data provided by the Copernicus Sentinel-1 mission (ESA)


✨ Features

Core Capabilities

  • Multi-Level Data Access
    Complete processing chain from raw to focused

  • Zarr Backend
    Scalable, chunked storage for 68TB+ datasets

  • Normalization Suite
    MinMax, Z-Score, Robust, and Adaptive strategies

  • HuggingFace Integration
    Direct loading from Maya4 Hub repositories

Advanced Features

  • Geographic Clustering
    Balanced sampling by location distribution

  • Positional Encoding
    Built-in transformer-compatible embeddings

  • Flexible Patch Modes
    Rectangular and parabolic extraction

  • Lazy Loading
    Memory-efficient processing of massive datasets


📦 Installation

Quick Install

# Using PDM (recommended)
pdm install

# Using pip
pip install -e .

Environment-Specific Installation

Jupyter Environment
pdm install -G jupyter_env

Includes Jupyter notebook and lab dependencies for interactive development.

Geospatial Features
pdm install -G geospatial

Adds geographic processing tools and coordinate system support.

Development Setup
pdm install -G dev

Installs testing, linting, and development utilities.

Complete Installation
pdm install -G :all

Installs all optional dependencies for full functionality.

Requirements

  • Python 3.8+
  • PyTorch 2.0+
  • CUDA (optional, for GPU acceleration)

📖 Citation

If you use Maya4 datasets or tools in your research, please cite:

@software{maya4_2024,
  author       = {Del Prete, Roberto and Maya4 Organization},
  title        = {Maya4: Multi-Level SAR Processing and Intermediate Representations},
  year         = {2024},
  publisher    = {Hugging Face},
  howpublished = {\url{https://huggingface.co/Maya4}},
  note         = {68TB+ curated Sentinel-1 Stripmap data spanning processing levels from raw to focused imagery}
}

Made with ❤️ by the Maya4 Team

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

maya4-0.1.2.tar.gz (196.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

maya4-0.1.2-py3-none-any.whl (198.0 kB view details)

Uploaded Python 3

File details

Details for the file maya4-0.1.2.tar.gz.

File metadata

  • Download URL: maya4-0.1.2.tar.gz
  • Upload date:
  • Size: 196.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.26.2 CPython/3.12.9 Darwin/24.5.0

File hashes

Hashes for maya4-0.1.2.tar.gz
Algorithm Hash digest
SHA256 c9937ff57381aeb485849ea3596d115176c95668531800394b3a458c696fd10f
MD5 7ae9304ea1ed3b657481a581fe310b3e
BLAKE2b-256 683c61c45c95ceeaf42f7ebe3d3603defc2d534b2099e771c2d9ae87a0f3fd1e

See more details on using hashes here.

File details

Details for the file maya4-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: maya4-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 198.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.26.2 CPython/3.12.9 Darwin/24.5.0

File hashes

Hashes for maya4-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 661d52f48a22783cc55941ddfa808f6ba316897bf2e54eb8c83448fb4544e4b3
MD5 eed666a4fb001ad8ab3f6755b6216c59
BLAKE2b-256 04e6f31a744e8199e84208fa632905fdda24807cc609bd3954169c297f945440

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page