The Open-Source Geospatial Intelligence Protocol & Engine. Give any AI model a spatial mind.

These details have not been verified by PyPI

Project links

Project description

GeoSpark

The Open-Source Geospatial Intelligence Protocol & Engine

Give any AI model a spatial mind. Open source. Run anywhere.

Current LLMs fail at spatial reasoning — achieving 0% on geodesic distance computation and ~48% (random chance) on topological reasoning across five model families in our benchmarks. GeoSpark fixes this.

The Problem

Ask any LLM: "Is the Louvre inside the 7th arrondissement of Paris?"

It will confidently guess — and get it wrong most of the time. LLMs have no geometric engine, no coordinate system awareness, and no way to verify spatial claims. They hallucinate distances, confuse containment with proximity, and silently swap lat/lon.

The Solution

GeoSpark gives AI models ground-truth spatial reasoning through a standardized protocol:

from geospark import Engine
from geospark.protocol import SpatialQuery, SpatialOperation

engine = Engine(tools=["geocoder", "terrain"])

# Geocode a location (not guessing — real coordinates)
result = engine.execute(SpatialQuery(
    operation=SpatialOperation.GEOCODE,
    metadata={"query": "Eiffel Tower, Paris"}
))

# Check spatial relationships (100% accurate, not LLM guessing)
from geospark.engine.spatial_reasoner import SpatialReasoner

park = {"type": "Polygon", "coordinates": [[[2.29, 48.85], [2.30, 48.85], [2.30, 48.86], [2.29, 48.86], [2.29, 48.85]]]}
point = {"type": "Point", "coordinates": [2.295, 48.855]}

SpatialReasoner.check_relationship(park, point, "contains")  # True — ground truth

Key Features

GeoSpark Protocol (GSP) — A standardized JSON protocol for spatial queries. Like MCP, but for geospatial.
Spatial Reasoning Engine — Topology, distance, CRS transforms, buffering, area calculations. All geometrically correct.
MCP Server — Use GeoSpark as a tool in Claude, ChatGPT, or any MCP-compatible AI assistant.
Pluggable Tools — Geocoding, satellite imagery (STAC), terrain/elevation, routing, spectral indices, change detection.
GeoSpark Bench — 535 benchmark questions across 5 suites proving LLMs fail 70%+ on spatial tasks. See results →
GeoSpark Flows — DAG-based workflow automation with conditional routing and pre-built templates.
Spatial Knowledge Graph — Entity-relation graph with BFS traversal, auto-relate, and natural language queries.
Plugin System — Community plugin ecosystem with manifest-based discovery, lifecycle hooks, and dependency management.
Zero-Cost Stack — OpenRouter free models + Supabase free tier. Full spatial AI at $0/month.

Quick Start

pip install geospark-ai

As a Python library

from geospark import Engine
from geospark.engine.spatial_reasoner import SpatialReasoner

# Spatial relationship check
SpatialReasoner.check_relationship(polygon_a, polygon_b, "intersects")

# Distance calculation (geodesic, not Euclidean)
SpatialReasoner.calculate_distance(
    {"type": "Point", "coordinates": [2.2945, 48.8584]},   # Eiffel Tower
    {"type": "Point", "coordinates": [2.3376, 48.8606]},   # Louvre
)
# Returns: ~3,300 meters (actual geodesic distance)

As an MCP Server (for Claude Desktop)

pip install geospark-ai[mcp]
geospark-mcp  # Starts stdio MCP server with 6 spatial tools

Add to your Claude Desktop config (~/.claude/claude_desktop_config.json):

{
  "mcpServers": {
    "geospark": { "command": "geospark-mcp" }
  }
}

Natural language spatial questions

from geospark import Engine

engine = Engine(tools=["geocoder", "terrain"])
result = engine.ask("How far is the Eiffel Tower from Big Ben?")
print(result.spatial_context.summary)
# Automatically geocodes both locations + computes geodesic distance

Tries local Ollama first (free, fast), falls back to OpenRouter.

CLI

geospark geocode "Tokyo Tower, Japan"
geospark elevation 35.6586 139.7454
geospark distance 48.8566 2.3522 51.5074 -- -0.1278  # Paris → London
geospark ask "Is Tokyo closer to Seoul or Beijing?"
geospark tools    # List available tools
geospark info     # System info

Try the Live API (no install needed)

Explore all 11 endpoints interactively at geospark.terrascout.app/docs

# Quick test
curl -X POST https://geospark.terrascout.app/api/v1/distance \
  -H "Content-Type: application/json" \
  -d '{"lat_a": 48.8566, "lon_a": 2.3522, "lat_b": 51.5074, "lon_b": -0.1278}'

Run the Benchmark

# Run GeoSpark Bench on topological reasoning
python -m geospark.bench run --benchmark geotopo

# Run all benchmarks
python -m geospark.bench run

# List available benchmarks
python -m geospark.bench list

Architecture

┌─────────────────────────────────────────────────┐
│                   User / LLM                    │
│         (Claude, ChatGPT, Ollama, ...)          │
└──────────┬──────────────────────┬───────────────┘
           │ MCP                  │ REST API
           v                      v
┌──────────────────────────────────────────────────┐
│              GeoSpark Protocol (GSP)             │
│         Standardized JSON query/result           │
└──────────┬───────────────────────────────────────┘
           │
           v
┌──────────────────────────────────────────────────┐
│             Spatial Reasoning Engine              │
│  Topology · Distance · CRS · Buffer · Centroid   │
│  Planner · Cache · Temporal · Aggregator         │
└──────────┬───────────────────────────────────────┘
           │
    ┌──────┴──────┬──────────┬──────────┬──────────┐
    v             v          v          v          v
┌────────┐ ┌──────────┐ ┌────────┐ ┌────────┐ ┌────────┐
│Geocoder│ │Satellite │ │Terrain │ │Routing │ │Change  │
│        │ │(STAC,    │ │(Elev.) │ │(OSRM)  │ │Detect. │
│        │ │NDVI, EVI)│ │        │ │        │ │        │
└────────┘ └──────────┘ └────────┘ └────────┘ └────────┘
           │
    ┌──────┴──────┬──────────┬──────────┐
    v             v          v          v
┌────────┐ ┌──────────┐ ┌────────┐ ┌────────┐
│ Flows  │ │Knowledge │ │Plugins │ │Spatial │
│(DAG    │ │Graph     │ │(Commun │ │RAG     │
│Runner) │ │(BFS,NL)  │ │ity)   │ │        │
└────────┘ └──────────┘ └────────┘ └────────┘

Benchmark Results

GeoSpark Bench v1.0 — 535 questions across 5 benchmarks, evaluated on 5 LLM families (Qwen, Llama, Gemma, Mistral, Phi) via Ollama.

Baseline: LLM Alone (No Tools)

Benchmark	Qwen 2.5 7B	Llama 3.1 8B	Gemma 2 9B	Mistral 7B	Phi-3.5 3.8B	Mean
GeoDistance	0%	0%	30%	0%	0%	6%
GeoTopo	45%	50%	50%	50%	45%	48%
GeoChange	90%	65%	80%	85%	75%	79%
GeoReason	85%	65%	90%	75%	70%	77%
GeoMultimodal	30%	35%	30%	35%	35%	33%

With GeoSpark Tool Augmentation

Benchmark	Qwen 2.5 7B	Llama 3.1 8B	Mistral 7B	Improvement (best)
GeoDistance	70%	10%	0%	+70%
GeoReason	100%	65%	80%	+15%
GeoTopo	50%	50%	50%	+5%

Key findings:

0% on distance across 4/5 models — LLMs cannot compute geodesic distances from coordinates
48% on topology — random chance on binary questions, confirming no spatial predicate capability
79% on change detection — knowledge-based spatial reasoning works; the deficit is strictly computational
70% with tools (Qwen 2.5 7B) — tool augmentation fixes the computational gap
100% on reasoning (Qwen 2.5 7B) — structured prompting solves multi-step spatial chains

Full results: Benchmark Report | Run your own: python -m geospark.bench run

Why GeoSpark?

Problem	Without GeoSpark	With GeoSpark
"Is point A inside region B?"	LLM guesses (30% accuracy)	Ground-truth topology check (100%)
"How far is A from B?"	LLM can't compute (0% accuracy)	Geodesic calculation in meters (100%)
"What changed here since 2020?"	LLM hallucinates	Real satellite change detection
CRS confusion	Silent errors	Automatic detection & transformation
"Which landmark is closest?"	LLM guesses wrong (0%)	Exact nearest-neighbor computation (100%)

Project Status

Phase	Status	Tests	Description
Phase 0 — Foundation	Complete	50	Protocol, engine, CRS, tools, CLI, MCP, Docker, CI/CD
Phase 1 — Launch	Complete	96	Bench v0.1, baselines, demo notebook, GitHub repo
Phase 2 — Ecosystem	Complete	249	8 tools, RAG, memory, planner, cache, 4 LLM integrations
Phase 3 — Platform	Complete	446	Bench v1.0, Flows, Knowledge Graph, Plugin System
Phase 4 — Scale	In Progress	446	Live API, Docker deploy, API auth, benchmarking

See CONTRIBUTING.md for development guidelines.

Development

# Clone and setup
git clone https://github.com/Maz2580/geospark.git
cd geospark
python -m venv .venv
source .venv/bin/activate  # or .venv\Scripts\activate on Windows
pip install -e ".[dev]"

# Run tests
pytest tests/ -v

# Lint & format
ruff check geospark/ tests/
ruff format geospark/ tests/

# Type check
mypy geospark/

Contributing

See CONTRIBUTING.md for guidelines.

Live API

GeoSpark is deployed and accessible at geospark.terrascout.app — 11 endpoints with interactive Swagger documentation.

Author

Created by Mazdak Ghasemi Tootkaboni (University of Melbourne)

ORCID: 0000-0001-8084-5270
GitHub: @Maz2580

License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.0

Apr 4, 2026

0.4.0

Apr 4, 2026

0.3.0

Mar 31, 2026

This version

0.2.0

Mar 30, 2026

0.1.0

Mar 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geospark_ai-0.2.0.tar.gz (209.8 kB view details)

Uploaded Mar 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

geospark_ai-0.2.0-py3-none-any.whl (196.3 kB view details)

Uploaded Mar 30, 2026 Python 3

File details

Details for the file geospark_ai-0.2.0.tar.gz.

File metadata

Download URL: geospark_ai-0.2.0.tar.gz
Upload date: Mar 30, 2026
Size: 209.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for geospark_ai-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`5e1715dee08aa93ee61d8469314140f32dd9a78867fcf6035b644c3bf3718605`
MD5	`3f30bd6287137afc50242768cd9474e8`
BLAKE2b-256	`486ace37d9121a320ec8e0e1b5bf7b85819db5301a97203feccf857a68ec9a26`

See more details on using hashes here.

File details

Details for the file geospark_ai-0.2.0-py3-none-any.whl.

File metadata

Download URL: geospark_ai-0.2.0-py3-none-any.whl
Upload date: Mar 30, 2026
Size: 196.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for geospark_ai-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0af50b1f91d4a3446785188eef7b4134b61a94a8f4c40d87668fd14412a0a891`
MD5	`63254379defd791003ce0b0764d85cd1`
BLAKE2b-256	`299eab67a3e8c18953b261f52cb8f8c88d94ce031ee930018772a87d881b0bdf`

See more details on using hashes here.

geospark-ai 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

GeoSpark

The Problem

The Solution

Key Features

Quick Start

As a Python library

As an MCP Server (for Claude Desktop)

Natural language spatial questions

CLI

Try the Live API (no install needed)

Run the Benchmark

Architecture

Benchmark Results

Baseline: LLM Alone (No Tools)

With GeoSpark Tool Augmentation

Why GeoSpark?

Project Status

Development

Contributing

Live API

Author

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes