Production-grade Earth Observation, Geospatial, Remote Sensing, and SAR Python library

These details have not been verified by PyPI

Project links

Project description

Unbihexium

Production-Grade Geospatial AI Library for Earth Observation

Executive Summary

Unbihexium is a production-grade, enterprise-ready Python library for geospatial artificial intelligence, Earth observation analytics, and remote sensing workflows. The library provides a unified, extensible framework encompassing 520 pre-trained models with 515 million total parameters across 4 variant tiers (tiny, base, large, mega), 12 capability domains, and comprehensive tooling for end-to-end geospatial analysis pipelines.

The library is named after the theoretical chemical element with atomic number 126, symbolizing the comprehensive and foundational nature of this framework in bridging Earth observation data with artificial intelligence capabilities.

Key Differentiators

Feature	Unbihexium	Traditional GIS	Cloud AI Services
Offline Capable	Yes	Yes	No
Model Count	520	0	10-50
Open Source	Apache-2.0	Varies	No
Self-Hosted	Yes	Yes	No
GPU Acceleration	Yes	Limited	Yes
Edge Deployment	Yes	No	No
Custom Training	Yes	No	Limited

Model Zoo Overview
System Architecture
Capability Matrix
Mathematical Foundations
Installation
Quick Start
Performance Metrics
Documentation
Security and Compliance
Contributing
Citation
License

Model Zoo Overview

Comprehensive Model Statistics

The Unbihexium Model Zoo represents a comprehensive collection of 520 production-ready models organized into 130 base architectures across 4 variant tiers. Each model has been trained on curated datasets and validated for production deployment.

Variant Specifications

Variant	Count	Resolution	Base Channels	Parameter Range	Average Parameters	Total Parameters
tiny	130	64 x 64 px	32	49,667 - 258,754	133,755	17,388,189
base	130	128 x 128 px	64	191,491 - 1,029,506	530,267	68,934,749
large	130	256 x 256 px	96	425,475 - 2,312,258	1,189,538	154,639,901
mega	130	512 x 512 px	128	751,619 - 4,107,010	2,111,567	274,503,645

Total Parameter Count

The aggregate parameter count across all variants:

$$ P_{total} = \sum_{v \in \mathcal{V}} \sum_{m=1}^{130} P_{v,m} = 515,466,484 $$

Where $\mathcal{V} = {tiny, base, large, mega}$ represents the set of variant tiers.

Task Distribution

pie title Model Distribution by Task (520 Total)
    "Regression" : 188
    "Segmentation" : 128
    "Detection" : 76
    "Terrain" : 52
    "Enhancement" : 44
    "Index" : 28
    "Super Resolution" : 4

Detailed Task Statistics

Task	Models per Variant	Total Models	Min Parameters	Max Parameters	Average Parameters	Primary Metric
Regression	47	188	67,329	1,065,473	498,942	R-squared
Segmentation	32	128	143,266	4,107,010	1,307,290	mIoU
Detection	19	76	143,201	2,269,059	1,064,595	mAP@0.5
Terrain	13	52	186,177	2,956,545	1,387,041	RMSE
Enhancement	11	44	186,243	2,956,803	1,387,203	PSNR
Index	7	28	186,243	2,956,803	1,387,203	MAE
Super Resolution	1	4	49,667	751,619	354,563	PSNR

Parameter Scaling Analysis

The relationship between variant parameters follows a consistent scaling pattern:

$$ \frac{P_{base}}{P_{tiny}} \approx 3.96, \quad \frac{P_{large}}{P_{base}} \approx 2.24, \quad \frac{P_{mega}}{P_{large}} \approx 1.78 $$

This scaling relationship can be approximated by:

$$ P_{variant} = P_{tiny} \times \left(\frac{C_{variant}}{C_{tiny}}\right)^{\alpha} $$

Where $C$ represents the base channel count and $\alpha \approx 2.0$ for convolutional architectures.

System Architecture

High-Level Architecture Diagram

graph TB
    subgraph "Input Layer"
        A1["Satellite Imagery<br/>Sentinel-1/2, Landsat-8/9, WorldView, Pleiades"]
        A2["Aerial Photography<br/>UAV, Aircraft, Balloon"]
        A3["Vector Data<br/>GeoJSON, Shapefile, GeoPackage, KML"]
        A4["Tabular Data<br/>CSV, Parquet, Arrow, HDF5"]
    end
    
    subgraph "Core Framework"
        B1[Pipeline Orchestrator]
        B2[Capability Registry]
        B3[Model Zoo Manager]
        B4["Inference Engine<br/>ONNX Runtime"]
    end
    
    subgraph "Processing Modules"
        C1[Tiling Engine]
        C2[Preprocessing]
        C3[Postprocessing]
        C4[Georeferencing]
    end
    
    subgraph "Output Layer"
        D1[GeoTIFF Rasters]
        D2[Vector Features]
        D3[Analysis Reports]
        D4[Metrics JSON]
    end
    
    A1 --> B1
    A2 --> B1
    A3 --> B1
    A4 --> B1
    
    B1 --> B2
    B2 --> B3
    B3 --> B4
    
    B4 --> C1
    C1 --> C2
    C2 --> C3
    C3 --> C4
    
    C4 --> D1
    C4 --> D2
    C4 --> D3
    C4 --> D4

Component Architecture

Pipeline Orchestrator

The Pipeline Orchestrator serves as the central coordination component, managing workflow execution, resource allocation, and stage sequencing.

sequenceDiagram
    participant User
    participant Orchestrator
    participant Registry
    participant ModelZoo
    participant Inference
    participant Output
    
    User->>Orchestrator: submit_pipeline(config)
    Orchestrator->>Registry: resolve_capabilities()
    Registry->>ModelZoo: get_models(capability_ids)
    ModelZoo->>ModelZoo: verify_checksums()
    ModelZoo->>Inference: load_models()
    Inference->>Inference: warm_up()
    loop For each tile
        Orchestrator->>Inference: process_tile(data)
        Inference->>Output: write_result(prediction)
    end
    Output->>User: return results

Model Architecture Details

Architecture	Task Types	Layer Configuration	Parameters (mega)	Receptive Field
UNet	Detection, Segmentation	3-level encoder-decoder with skip connections	2.3M	256 px
Siamese	Change Detection	Dual-stream encoder with shared weights	4.1M	256 px
MLP	Regression, Risk Assessment	6-layer fully-connected with BatchNorm	1.0M	N/A
CNN	Enhancement, Index	6-layer convolutional with residual connections	3.0M	128 px
SRCNN	Super Resolution	Feature extraction + PixelShuffle upsampling	752K	64 px

Data Flow Architecture

flowchart LR
    subgraph Input
        I1[GeoTIFF]
        I2[JPEG2000]
        I3[NetCDF]
    end
    
    subgraph Preprocessing
        P1[Normalization]
        P2[Tiling]
        P3[Augmentation]
    end
    
    subgraph Inference
        M1[Model Loading]
        M2[Batch Processing]
        M3[GPU Acceleration]
    end
    
    subgraph Postprocessing
        O1[Stitching]
        O2[Georeferencing]
        O3[Vectorization]
    end
    
    I1 --> P1
    I2 --> P1
    I3 --> P1
    P1 --> P2
    P2 --> P3
    P3 --> M1
    M1 --> M2
    M2 --> M3
    M3 --> O1
    O1 --> O2
    O2 --> O3

Mathematical Foundations

Convolutional Neural Network Theory

The fundamental operation in our convolutional architectures is the 2D convolution:

$$ (f * g)(x, y) = \sum_{i=-k}^{k} \sum_{j=-k}^{k} f(i, j) \cdot g(x-i, y-j) $$

Where $f$ is the input feature map, $g$ is the convolutional kernel, and $k$ is the kernel radius.

Batch Normalization

All architectures employ batch normalization for training stability:

$$ \hat{x}_i = \frac{x_i - \mu_B}{\sqrt{\sigma_B^2 + \epsilon}} $$

$$ y_i = \gamma \hat{x}_i + \beta $$

Where $\mu_B$ and $\sigma_B^2$ are the batch mean and variance, and $\gamma$, $\beta$ are learned parameters.

Activation Functions

The primary activation function is ReLU:

$$ \text{ReLU}(x) = \max(0, x) $$

For certain layers, we employ GELU for smoother gradients:

$$ \text{GELU}(x) = x \cdot \Phi(x) = x \cdot \frac{1}{2}\left[1 + \text{erf}\left(\frac{x}{\sqrt{2}}\right)\right] $$

Loss Functions

Cross-Entropy Loss (Segmentation)

$$ \mathcal{L}{CE} = -\frac{1}{N} \sum{i=1}^{N} \sum_{c=1}^{C} y_{i,c} \log(\hat{y}_{i,c}) $$

Dice Loss (Segmentation)

$$ \mathcal{L}{Dice} = 1 - \frac{2 \sum{i} p_i g_i + \epsilon}{\sum_{i} p_i + \sum_{i} g_i + \epsilon} $$

Mean Squared Error (Regression)

$$ \mathcal{L}{MSE} = \frac{1}{N} \sum{i=1}^{N} (y_i - \hat{y}_i)^2 $$

Focal Loss (Detection)

$$ \mathcal{L}_{FL} = -\alpha_t (1 - p_t)^\gamma \log(p_t) $$

Where $\gamma$ is the focusing parameter (typically 2.0) and $\alpha_t$ is the class balancing weight.

Evaluation Metrics

Intersection over Union (IoU)

$$ \text{IoU} = \frac{| A \cap B | }{ | A \cup B |} = \frac{\text{TP}}{\text{TP} + \text{FP} + \text{FN}} $$

Mean Average Precision (mAP)

$$ \text{mAP} = \frac{1}{| C | } \sum_{c \in C} \text{AP}(c) = \frac{1}{ | C |} \sum_{c \in C} \int_0^1 P(R) , dR $$

Peak Signal-to-Noise Ratio (PSNR)

$$ \text{PSNR} = 10 \cdot \log_{10}\left(\frac{\text{MAX}I^2}{\text{MSE}}\right) = 20 \cdot \log{10}\left(\frac{\text{MAX}_I}{\sqrt{\text{MSE}}}\right) $$

Structural Similarity Index (SSIM)

$$ \text{SSIM}(x, y) = \frac{(2\mu_x\mu_y + c_1)(2\sigma_{xy} + c_2)}{(\mu_x^2 + \mu_y^2 + c_1)(\sigma_x^2 + \sigma_y^2 + c_2)} $$

Capability Matrix

Domain Coverage Summary

The library implements 12 primary capability domains with 130 individual base models:

ID	Domain	Models	Primary Tasks	Production Status
01	AI Products	13	Super-resolution, Detection, Segmentation	Production
02	Tourism and Data Processing	10	Route planning, Spatial analysis	Production
03	Vegetation Indices and Flood/Water	12	NDVI, NDWI, NBR, Flood risk	Production
04	Environment and Forestry	14	Deforestation, Forest density	Production
05	Asset Management and Energy	12	Pipeline monitoring, Site selection	Production
06	Urban and Agriculture	18	Urban planning, Crop classification	Production
07	Risk and Defense (Neutral)	15	Hazard analysis, Maritime awareness	Production
08	Value-Added Imagery	4	DSM, DEM, Orthorectification	Production
09	Benefits Narrative	0	Documentation only	N/A
10	Satellite Imagery Features	6	Stereo, Pansharpening	Production
11	Resolution and Metadata QA	4	Quality assurance	Production
12	Radar and SAR	8	Amplitude, Phase, InSAR	Production

Capability Distribution Visualization

xychart-beta
    title "Models per Capability Domain"
    x-axis [D01, D02, D03, D04, D05, D06, D07, D08, D09, D10, D11, D12]
    y-axis "Model Count" 0 --> 20
    bar [13, 10, 12, 14, 12, 18, 15, 4, 0, 6, 4, 8]

Installation

System Requirements

Component	Minimum	Recommended	Optimal	Notes
Python	3.10	3.12	3.12	3.13 supported
RAM	8 GB	16 GB	32 GB	Per concurrent pipeline
Disk	5 GB	50 GB	200 GB	Model cache space
GPU	None	RTX 3060	A100	10-50x inference speedup
CPU Cores	4	8	16+	Parallel preprocessing
OS	Linux, Windows, macOS	Ubuntu 22.04 LTS	Ubuntu 22.04 LTS	Best tested

Installation Methods

Standard Installation (PyPI)

# Basic installation
pip install unbihexium

# With optional dependencies
pip install unbihexium[gpu]      # GPU acceleration
pip install unbihexium[dev]      # Development tools
pip install unbihexium[docs]     # Documentation
pip install unbihexium[test]     # Testing utilities
pip install unbihexium[all]      # All optional dependencies

Conda Installation

# Create environment
conda create -n unbihexium python=3.12
conda activate unbihexium

# Install package
conda install -c conda-forge unbihexium

# With GPU support
conda install -c conda-forge unbihexium cudatoolkit=11.8

Development Installation

# Clone repository
git clone https://github.com/unbihexium-oss/unbihexium.git
cd unbihexium

# Create virtual environment
python -m venv .venv
source .venv/bin/activate  # Linux/macOS
.venv\Scripts\activate     # Windows

# Install in development mode
pip install -e ".[dev,test,docs]"

# Run tests
pytest tests/

Docker Installation

# Pull official image
docker pull ghcr.io/unbihexium-oss/unbihexium:latest

# Run container
docker run -it --gpus all \
    -v $(pwd)/data:/data \
    -v $(pwd)/output:/output \
    ghcr.io/unbihexium-oss/unbihexium:latest

# Docker Compose
docker-compose up -d

Verification

# Verify installation
unbihexium --version

# Run self-test
unbihexium self-test

# List available models
unbihexium zoo list --count

# Check GPU availability
unbihexium device status

Quick Start

CLI Usage

# List all models with detailed statistics
unbihexium zoo list --verbose

# Filter models by task type
unbihexium zoo list --task detection --variant mega

# Download a specific model with verification
unbihexium zoo download ship_detector_base --verify

# Run single-image inference
unbihexium infer ship_detector_base \
    --input satellite_image.tif \
    --output detections.tif \
    --confidence 0.5

# Run batch inference on directory
unbihexium infer building_detector_large \
    --input data/images/ \
    --output results/ \
    --batch-size 8 \
    --workers 4

# Run a complete pipeline
unbihexium pipeline run detection \
    --config pipeline_config.yaml \
    --input data/ \
    --output results/ \
    --progress

Python API

from unbihexium import Pipeline, Config
from unbihexium.zoo import get_model, list_models, download_model

# Discover available models
models = list_models(task="detection", variant="mega")
print(f"Found {len(models)} detection models")

for model in models[:5]:
    print(f"  - {model.id}: {model.params:,} parameters")

# Download model if not cached
model_path = download_model("ship_detector_mega", verify=True)

# Load model for inference
model = get_model("ship_detector_mega")
print(f"Loaded model with {model.num_parameters:,} parameters")

# Create pipeline with configuration
config = Config(
    tile_size=512,
    overlap=64,
    batch_size=4,
    device="cuda:0",
    precision="fp16"
)

pipeline = Pipeline.from_config(
    capability="ship_detection",
    variant="mega",
    config=config
)

# Run inference
results = pipeline.run("satellite_image.tif")

# Access predictions
for detection in results.detections:
    print(f"Class: {detection.label}")
    print(f"Confidence: {detection.score:.4f}")
    print(f"Bounding Box: {detection.bbox}")
    print(f"Centroid: {detection.centroid}")

# Export results
results.to_geojson("detections.geojson")
results.to_shapefile("detections.shp")
results.to_geotiff("detections.tif")

Performance Metrics

Throughput Analysis

Processing throughput depends on hardware configuration and model variant:

$$ T = \frac{N_{tiles} \times S_{tile}^2}{t_{total}} \quad [\text{pixels/second}] $$

Where $N_{tiles}$ is the number of tiles, $S_{tile}$ is the tile dimension, and $t_{total}$ is total processing time.

xychart-beta
    title "Inference Throughput by Hardware (tiles/sec)"
    x-axis [tiny, base, large, mega]
    y-axis "Tiles per Second" 0 --> 600
    bar "CPU (8 cores)" [100, 25, 6, 2]
    bar "GPU (RTX 3080)" [400, 100, 25, 6]
    bar "GPU (A100)" [600, 200, 50, 12]

Memory Requirements

Total memory consumption follows:

$$ M_{total} = M_{base} + M_{model} + N_{batch} \times M_{tile} $$

Variant	Model Size	Runtime Memory	Batch Size 1	Batch Size 8	Batch Size 16
tiny	500 KB	50 MB	100 MB	200 MB	350 MB
base	2 MB	100 MB	200 MB	500 MB	900 MB
large	5 MB	200 MB	500 MB	1.5 GB	2.8 GB
mega	15 MB	500 MB	1.5 GB	4 GB	7.5 GB

Latency Analysis

Operation	tiny	base	large	mega
Model Load (cold)	50 ms	100 ms	200 ms	500 ms
Model Load (warm)	5 ms	10 ms	20 ms	50 ms
Single Tile (CPU)	10 ms	40 ms	160 ms	500 ms
Single Tile (GPU)	2 ms	8 ms	30 ms	100 ms
Batch 8 Tiles (GPU)	8 ms	32 ms	120 ms	400 ms

Documentation

Section	Description	Link
Getting Started	Installation, quickstart, configuration	docs/getting_started/
Tutorials	Step-by-step guides and examples	docs/tutorials/
API Reference	Complete Python API documentation	docs/reference/api.md
CLI Reference	Command-line interface documentation	docs/reference/cli.md
Architecture	System design and internals	docs/architecture/
Capabilities	Domain encyclopedia (12 documents)	docs/capabilities/
Model Zoo	Model catalog and usage guides	docs/model_zoo/
Security	Security practices and compliance	docs/security/
Operations	Deployment and operations	docs/operations/

Security and Compliance

Security Controls

Control	Implementation	Status	Verification
Dependency Scanning	Dependabot, Safety, pip-audit	Active	Daily
Static Analysis	CodeQL, Bandit, Semgrep	Active	Every PR
Model Integrity	SHA256 checksums	Active	On download
Supply Chain	SBOM generation, SLSA Level 3	Active	Every release
Secrets Management	GitHub Secrets, no hardcoding	Enforced	Pre-commit
Container Scanning	Trivy, Grype	Active	Every build

Compliance Certifications

Standard	Status	Scope
Apache-2.0 License	Compliant	Full codebase
GDPR	Compliant	No PII collection
CCPA	Compliant	No PII collection
EAR	Reviewed	Non-controlled items
SOC 2 Type II	In Progress	Enterprise deployment

Contributing

We welcome contributions from the community. Please review:

CONTRIBUTING.md - Contribution guidelines
CODE_OF_CONDUCT.md - Community standards
GOVERNANCE.md - Project governance
SECURITY.md - Security reporting

Development Workflow

Fork the repository
Create a feature branch
Make changes with tests
Run linting and tests locally
Submit a pull request
Address review feedback
Merge after approval

Citation

@software{unbihexium2025,
  author       = {Unbihexium OSS Foundation},
  title        = {Unbihexium: Production-Grade Geospatial AI Library},
  year         = {2025},
  version      = {1.0.0},
  publisher    = {GitHub},
  url          = {https://github.com/unbihexium-oss/unbihexium},
  doi          = {10.5281/zenodo.0000000},
  license      = {Apache-2.0},
  note         = {520 models, 515M parameters, 12 capability domains}
}

License

Licensed under the Apache License, Version 2.0. See LICENSE.txt for the full license text.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Unbihexium - Element 126 - Bridging Earth Observation and Artificial Intelligence

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.1

Dec 21, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unbihexium-1.0.1.tar.gz (2.8 MB view details)

Uploaded Dec 21, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

unbihexium-1.0.1-py3-none-any.whl (118.2 kB view details)

Uploaded Dec 21, 2025 Python 3

File details

Details for the file unbihexium-1.0.1.tar.gz.

File metadata

Download URL: unbihexium-1.0.1.tar.gz
Upload date: Dec 21, 2025
Size: 2.8 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.1

File hashes

Hashes for unbihexium-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`2e0723bc2a6f0fdb95fa13848e25ce365e90b191d726c0f8636d05cffc5a857b`
MD5	`6d75363c1efd61f062db84bda1f381d2`
BLAKE2b-256	`e43688c977cf8a916a621cf709fa44d5b99481654a1072dd56064894e108b156`

See more details on using hashes here.

File details

Details for the file unbihexium-1.0.1-py3-none-any.whl.

File metadata

Download URL: unbihexium-1.0.1-py3-none-any.whl
Upload date: Dec 21, 2025
Size: 118.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.1

File hashes

Hashes for unbihexium-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`751f4e4397fac71930fa81f9de408a3cbed17d643605494591c427e55934ffa1`
MD5	`ce79692c9d1d15e547b047982a92f6c3`
BLAKE2b-256	`48f84c898e2ca4ebf5bcf8e349fc598b48b23a12676a8eeac48ad3ac87fb50cb`

See more details on using hashes here.

unbihexium 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Unbihexium

Executive Summary

Key Differentiators

Table of Contents

Model Zoo Overview

Comprehensive Model Statistics

Variant Specifications

Total Parameter Count

Task Distribution

Detailed Task Statistics

Parameter Scaling Analysis

System Architecture

High-Level Architecture Diagram

Component Architecture

Pipeline Orchestrator

Model Architecture Details

Data Flow Architecture

Mathematical Foundations

Convolutional Neural Network Theory

Batch Normalization

Activation Functions

Loss Functions

Cross-Entropy Loss (Segmentation)

Dice Loss (Segmentation)

Mean Squared Error (Regression)

Focal Loss (Detection)

Evaluation Metrics

Intersection over Union (IoU)

Mean Average Precision (mAP)

Peak Signal-to-Noise Ratio (PSNR)

Structural Similarity Index (SSIM)

Capability Matrix

Domain Coverage Summary

Capability Distribution Visualization

Installation

System Requirements

Installation Methods

Standard Installation (PyPI)

Conda Installation

Development Installation

Docker Installation

Verification

Quick Start

CLI Usage

Python API

Performance Metrics

Throughput Analysis

Memory Requirements

Latency Analysis

Documentation

Security and Compliance

Security Controls

Compliance Certifications

Contributing

Development Workflow

Citation

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes