Skip to main content

Earth observation framework for scaled-up processing in Python

Project description

Package version Supported Python versions Build Status Docs status License Overall downloads Last month downloads Code coverage

eo-grow

Earth observation framework for scaled-up processing in Python.

Analyzing Earth Observation (EO) data is complex and solutions often require custom tailored algorithms. In the EO domain most problems come with an additional challenge: How do we apply the solution on a larger scale?

Working with EO data is made easy by the eo-learn package, while the eo-grow package takes care of running the solutions at a large scale. In eo-grow an EOWorkflow based solution is wrapped in a pipeline object, which takes care of parametrization, logging, storage, multi-processing, EOPatch management and more. However, pipelines are not necessarily bound to EOWorkflow execution and can be used for other tasks such as training ML models.

Features of eo-grow include:

  • Direct use of EOWorkflow procedures.
  • Parametrizing workflows by using validated configuration files, making executions easy to reproduce and adjust.
  • Easy use of both local and AWS S3 storage with no required code adaptation.
  • Splitting large areas of interest into grids and defining collections of EOPatches.
  • Workflows can be run either single-process, multi-process, or even on multiple machines (by using ray clusters).
  • Execution reports and customizable logging.
  • Options for skipping already processed data when re-running a pipeline.
  • Offers a CLI interface for running pipelines, validating configuration files, and generating templates.
  • A collection of basic pipelines, with methods that can be overridden to tailor to a large amount of use-cases.

General Structure Overview

The core object of eo-grow is the Pipeline. Each pipeline has a run_procedure method, which is executed after the pipeline is set up. By default, the run_procedure executes an EOWorkflow which is built by the (user-defined) build_workflow method.

Each pipeline is linked to so called managers:

  • StorageManager handles loading and saving of files,
  • AreaManager defines the area of interest and how it should be split into EOPatches,
  • LoggingManager provides control over logging.

eo-grow-structure

Managers and pipelines usually require a large amount of parameters (setting storage paths, configuring log parameters, etc.), which are provided in .json configuration files. Each eo-grow object contains a special Schema class, which is a pydantic model describing the parameters of the object. Config files are then validated before execution to catch issues early. Templates for config files can be generated with the eogrow-template CLI command.

To make config files easier to write eo-grow uses a simple config language that supports importing other configs, variables, and more.

Install

PyPi distribution

The eo-grow package requires Python version >= 3.8 and can be installed with

pip install eo-grow

Command Line Interface

Running pipelines is easiest by using the CLI provided by eo-grow. For all options use the --help flag with each command.

  • eogrow <config> executes the pipeline defined in the <config> file.
  • eogrow-validate <config> only performs validation of the <config> file.
  • eogrow-test <config> initializes the pipeline/object but does not run it. Useful for testing if managers are set correctly or for generating area-split grids.
  • eogrow-ray <cluster> <config> executes the pipeline defined in <config> on the active Ray cluster defined by the <cluster> file.
  • eogrow-template <import path> <template> generates a template config for the object specified by the <import path> and saves it to the <template> file (or outputs it directly if <template> is not provided).

Documentation

For more information on the package visit readthedocs.

Explanatory examples can be found here.

More details on the config language used by eo-grow can be found here.

Questions and Issues

Feel free to ask questions about the package and its use cases at Sentinel Hub forum or raise an issue on GitHub.

License

See LICENSE.

Acknowledgements

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 101004112. The project is co-funded also by the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 101086461.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

eo_grow-1.7.14.tar.gz (85.5 kB view details)

Uploaded Source

Built Distribution

eo_grow-1.7.14-py3-none-any.whl (116.1 kB view details)

Uploaded Python 3

File details

Details for the file eo_grow-1.7.14.tar.gz.

File metadata

  • Download URL: eo_grow-1.7.14.tar.gz
  • Upload date:
  • Size: 85.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.8.10

File hashes

Hashes for eo_grow-1.7.14.tar.gz
Algorithm Hash digest
SHA256 18f800136fc080a64ca642ba718fb7bdab2a7c62a9b77c4cf797a875625f54e0
MD5 917f56fcf79aaaa51897d4d58af0b1ef
BLAKE2b-256 89ecdbdc966c1867e9717999237f9ca600a2d35fa51fb8d672e97c6bd25f176f

See more details on using hashes here.

File details

Details for the file eo_grow-1.7.14-py3-none-any.whl.

File metadata

  • Download URL: eo_grow-1.7.14-py3-none-any.whl
  • Upload date:
  • Size: 116.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.8.10

File hashes

Hashes for eo_grow-1.7.14-py3-none-any.whl
Algorithm Hash digest
SHA256 23a05c422285e8b2b80a8ed9d007f135e8c8d338ef3ec6b1abd95f6efa1b6d25
MD5 ecb57bde90e8dc9cdcf72326fb1cd0ef
BLAKE2b-256 3e47401d9b39722ff12a69e8ab0ae733849f0c9704d73ac3586b629f7097b3de

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page