Skip to main content

Map geometries to environmental semantics

Project description

geoenv

Map geometries to environmental semantics

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public. CI CD codecov PyPI - Version

geoenv is a Python library that maps geospatial geometries, such as points and polygons, to environmental terms in vocabularies/ontologies (e.g. ENVO). It’s like reverse geocoding, but for environments.

Features

  • Broad scale environmental context: Provides consistent broad scale environmental context supplementing local scale environmental descriptions.
  • Global Coverage: Provides worldwide resolution of terrestrial, coastal, and marine environments.
  • GeoJSON Output: Outputs data as a GeoJSON Feature, for integration with other tools and libraries.
  • Concurrent Data Resolution: Leverages asyncio to query multiple geospatial data sources concurrently, providing fast results.
  • Modular and Extensible: Designed with a modular architecture to facilitate integration of new data sources and vocabularies.

Quick Start

Install from PyPI:

$ pip install geoenv

Resolve a point location to environmental descriptions:

import asyncio
from geoenv.geometry import Geometry
from geoenv.resolver import Resolver
from geoenv.data_sources import (WorldTerrestrialEcosystems,
                                 EcologicalMarineUnits,
                                 EcologicalCoastalUnits)

# Define a geometry in GeoJSON format (Point or Polygon)
geometry = Geometry(
    {
        "type": "Point",
        "coordinates": [
            -122.622364,
            37.905931
        ]
    }
)

# Set up the resolver. When the location's environment is not known, 
# multiple data sources are included to cover potential environment 
# types.
resolver = Resolver(
    data_source=[
        WorldTerrestrialEcosystems(),
        EcologicalMarineUnits(),
        EcologicalCoastalUnits(),
    ]
)

# Resolve the geometry to environmental descriptions. The resolver 
# queries multiple data sources concurrently using `asyncio`.
response = asyncio.run(resolver.resolve(geometry))

# Access response data.
print(response.data)

The response is a GeoJSON Feature with environmental terms mapped to ENVO (by default). Only resolved environments are included:

{
  "type": "Feature",
  "identifier": null,
  "geometry": {
    "type": "Point",
    "coordinates": [
      -122.622364,
      37.905931
    ]
  },
  "properties": {
    "description": null,
    "environment": [
      {
        "type": "Environment",
        "dataSource": {
          "identifier": "https://doi.org/10.5066/P9DO61LP",
          "name": "WorldTerrestrialEcosystems"
        },
        "dateCreated": "2025-03-07 15:53:09",
        "properties": {
          "temperature": "Warm Temperate",
          "moisture": "Moist",
          "landCover": "Cropland",
          "landForm": "Mountains",
          "climate": "Warm Temperate Moist",
          "ecosystem": "Warm Temperate Moist Cropland on Mountains"
        },
        "mappedProperties": [
          {
            "label": "temperate",
            "uri": "http://purl.obolibrary.org/obo/ENVO_01000206"
          },
          {
            "label": "humid air",
            "uri": "http://purl.obolibrary.org/obo/ENVO_01000828"
          },
          {
            "label": "area of cropland",
            "uri": "http://purl.obolibrary.org/obo/ENVO_01000892"
          },
          {
            "label": "mountain range",
            "uri": "http://purl.obolibrary.org/obo/ENVO_00000080"
          }
        ]
      }
    ]
  }
}

Motivation

Finding datasets based on their environmental context is a challenge in data synthesis. The process often relies on vague or inconsistent metadata. This variability presents a barrier to reliable, large-scale analysis due to time lost in data discovery and incomplete search results.

geoenv helps address this challenge by using a dataset’s originating location as a consistent and objective starting point. It can programmatically map the geometry of this location to standardized environmental terms, providing a scalable and repeatable method for generating interoperable metadata. This approach aims to enrich datasets with uniform, semantic metadata, making them potentially easier to discover, query, and integrate at scale.

Related Projects

The Global Ecosystems Atlas is a project that provides a comprehensive, harmonized open resource on the world's ecosystems. It standardizes diverse geospatial datasets by mapping them to the IUCN Global Ecosystem Typology, a hierarchical classification of environments.

Contributing

We welcome contributions! If you know of a useful data source or vocabulary, and have ideas for new features, or find a bug, please open an issue to start a discussion.

License

This project is licensed under the terms of the MIT license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geoenv-0.5.0.tar.gz (36.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geoenv-0.5.0-py3-none-any.whl (47.5 kB view details)

Uploaded Python 3

File details

Details for the file geoenv-0.5.0.tar.gz.

File metadata

  • Download URL: geoenv-0.5.0.tar.gz
  • Upload date:
  • Size: 36.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for geoenv-0.5.0.tar.gz
Algorithm Hash digest
SHA256 3c8de22adbabbef6b3125d487698a9d45e3c42e1818bd1125eb18530fc3aa222
MD5 ec0f8e82f4bf47a2affa32a1e298fb11
BLAKE2b-256 3a2bd527118828d6d364a943a8d5a753a6ce3c25da696c9811d1d24a20e2869f

See more details on using hashes here.

File details

Details for the file geoenv-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: geoenv-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 47.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for geoenv-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1a43d0bfa726b7baf921316bc73ebc241112d43a0a9d7428b41046446f7679e2
MD5 66611f9606a2f8abc0f9e10d3bc18b77
BLAKE2b-256 60679a441c6b17246186880b6f86408f14ae9a307975fd6520ac98fcd95e4f92

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page