Map geometries to environmental semantics

geoenv

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

geoenv is a Python library that maps geospatial geometries, such as points and polygons, to standardized environmental terms. It’s like reverse geocoding, but for environments.

Motivation

Finding datasets based on their environmental context is a challenge in data synthesis. The process often relies on vague or inconsistent metadata. This variability slows data discovery and leaves search results incomplete, which is a barrier to reliable, large-scale analysis.

geoenv helps address this challenge by using a dataset’s originating location as a consistent and objective starting point. It can programmatically map the geometry of this location to standardized environmental terms, providing a scalable and repeatable method for generating interoperable metadata. This approach aims to enrich datasets with uniform, semantic metadata, making them potentially easier to discover, query, and integrate at scale.

Key Features

  • Automated Semantic Annotation: Supplements inconsistent, manual descriptions with standardized environmental terms from controlled vocabularies.
  • Structured, Interoperable Output: Generates GeoJSON objects enriched with formal terms from ENVO (by default).
  • Global Coverage: Provides worldwide coverage for terrestrial, coastal, and marine environments using high-resolution data sources.
  • Extensible: Designed to accommodate new data sources or vocabularies for specific research needs.

Know of a useful data source or vocabulary? Suggest it!

Quick Start

Install from PyPI:

pip install geoenv

Resolve a point on land:

import asyncio
from geoenv.data_sources import WorldTerrestrialEcosystems
from geoenv.geometry import Geometry
from geoenv.resolver import Resolver

# Define a geometry in GeoJSON format (Point or Polygon)
geometry = Geometry(
    {
        "type": "Point",
        "coordinates": [
            -122.622364,
            37.905931
        ]
    }
)

# Configure the resolver with a data source (there can be multiple)
resolver = Resolver(data_source=[WorldTerrestrialEcosystems()])

# Resolve the geometry to environmental descriptions. Concurrent resolution 
# to multiple data sources is supported via asyncio.
response = asyncio.run(resolver.resolve(geometry))
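
As an aside on the concurrency mentioned in the comment above, the general asyncio pattern can be sketched without geoenv at all. This is only an illustration of how several awaitable lookups run concurrently with `asyncio.gather`; `fake_source`, `resolve_all`, and the source names are hypothetical stand-ins, and this is not a description of how `Resolver` is implemented.

```python
import asyncio


async def fake_source(name, delay):
    """Hypothetical stand-in for one data source lookup (not part of geoenv)."""
    await asyncio.sleep(delay)  # simulates waiting on a remote service
    return {"dataSource": name}


async def resolve_all():
    # gather() schedules both lookups concurrently and returns the results
    # in the order the awaitables were passed, not in completion order.
    return await asyncio.gather(
        fake_source("source_a", 0.02),
        fake_source("source_b", 0.01),
    )


results = asyncio.run(resolve_all())
print([r["dataSource"] for r in results])
```

Because the waits overlap, the total time is close to the slowest lookup rather than the sum of all of them. The `response` from the Quick Start call is what the rest of this section describes.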

The response is a GeoJSON Feature with structured environments mapped to ENVO (by default):

{
  "type": "Feature",
  "identifier": null,
  "geometry": {
    "type": "Point",
    "coordinates": [
      -122.622364,
      37.905931
    ]
  },
  "properties": {
    "description": null,
    "environment": [
      {
        "type": "Environment",
        "dataSource": {
          "identifier": "https://doi.org/10.5066/P9DO61LP",
          "name": "WorldTerrestrialEcosystems"
        },
        "dateCreated": "2025-03-07 15:53:09",
        "properties": {
          "temperature": "Warm Temperate",
          "moisture": "Moist",
          "landCover": "Cropland",
          "landForm": "Mountains",
          "climate": "Warm Temperate Moist",
          "ecosystem": "Warm Temperate Moist Cropland on Mountains"
        },
        "mappedProperties": [
          {
            "label": "temperate",
            "uri": "http://purl.obolibrary.org/obo/ENVO_01000206"
          },
          {
            "label": "humid air",
            "uri": "http://purl.obolibrary.org/obo/ENVO_01000828"
          },
          {
            "label": "area of cropland",
            "uri": "http://purl.obolibrary.org/obo/ENVO_01000892"
          },
          {
            "label": "mountain range",
            "uri": "http://purl.obolibrary.org/obo/ENVO_00000080"
          }
        ]
      }
    ]
  }
}
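
To work with this output, the mapped terms can be pulled out of the Feature with plain dictionary access. The following is a minimal sketch that assumes the response has already been serialized to JSON text with the structure shown above; how the `response` object itself exposes that JSON is not covered here, and `mapped_terms` is a hypothetical helper, not part of geoenv. The embedded JSON is an abbreviated copy of the example response.

```python
import json

# Abbreviated copy of the example response (two of the four mapped terms).
response_json = """
{
  "type": "Feature",
  "geometry": {"type": "Point", "coordinates": [-122.622364, 37.905931]},
  "properties": {
    "environment": [
      {
        "dataSource": {"name": "WorldTerrestrialEcosystems"},
        "mappedProperties": [
          {"label": "temperate",
           "uri": "http://purl.obolibrary.org/obo/ENVO_01000206"},
          {"label": "area of cropland",
           "uri": "http://purl.obolibrary.org/obo/ENVO_01000892"}
        ]
      }
    ]
  }
}
"""


def mapped_terms(feature):
    """Return (data source name, label, uri) for each mapped term in a Feature dict."""
    terms = []
    properties = feature.get("properties") or {}
    # "environment" holds one entry per data source that returned a result.
    for env in properties.get("environment") or []:
        source = (env.get("dataSource") or {}).get("name")
        for term in env.get("mappedProperties") or []:
            terms.append((source, term["label"], term["uri"]))
    return terms


feature = json.loads(response_json)
terms = mapped_terms(feature)
for source, label, uri in terms:
    print(f"{source}: {label} <{uri}>")
```

Keeping the data source name next to each term makes it possible to tell results apart when several data sources contribute to the same Feature.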

Related Projects

The Global Ecosystems Atlas is a project that provides a comprehensive, harmonized open resource on the world's ecosystems. It standardizes diverse geospatial datasets by mapping them to the IUCN Global Ecosystem Typology, a hierarchical classification of environments.

Contributing

We welcome contributions! If you know of a useful data source or vocabulary, have ideas for new features, or find a bug, please open an issue to start a discussion.

License

This project is licensed under the terms of the MIT license.
