AWS-native STAC-based processing pipeline

# Cirrus

[![build-status-image]][build-status] [![coverage-status-image]][codecov] [![pypi-version]][pypi]

Cirrus is a [STAC](https://stacspec.org/)-based geospatial processing pipeline platform, implemented using a scalable architecture deployed on AWS. Cirrus provides the generic infrastructure for processing, allowing a user to focus on implementing the specific processing logic for their data.

![architecture-overview](docs/src/cirrus/images/arch-overview.png)

As input, Cirrus takes a STAC ItemCollection along with a process definition block. That input is called a “payload” and follows the Payload model defined in the [stac-task](https://github.com/stac-utils/stac-task) package, with slightly tighter requirements on the presence and content of the process definition block.
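
As a rough illustration of the payload shape described above, the following sketch builds a minimal payload and checks that it has an item list and a process definition block. The `process` and `workflow` key names, the list form of `process`, and the `check_payload` helper are assumptions made for illustration; the stac-task and Cirrus documentation are the authority on the actual required fields.

```python
from typing import Any


def check_payload(payload: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the basic shape looks right.

    Hypothetical helper, not part of Cirrus or stac-task.
    """
    problems = []
    # A STAC ItemCollection is a GeoJSON FeatureCollection of Items.
    if payload.get("type") != "FeatureCollection":
        problems.append("payload 'type' should be 'FeatureCollection'")
    if not isinstance(payload.get("features"), list):
        problems.append("payload 'features' should be a list of STAC Items")
    # Assumed key name for the process definition block.
    if not payload.get("process"):
        problems.append("payload is missing a non-empty 'process' block")
    return problems


example_payload = {
    "type": "FeatureCollection",
    "features": [
        {
            "type": "Feature",
            "stac_version": "1.0.0",
            "id": "example-item",
            "geometry": None,
            "properties": {"datetime": "2024-01-01T00:00:00Z"},
            "links": [],
            "assets": {},
        }
    ],
    # Assumed structure: one process definition naming a workflow.
    "process": [{"workflow": "example-workflow"}],
}

print(check_payload(example_payload))
```

A payload without a `process` block would be reported as a problem by this check, which mirrors the tighter requirement Cirrus places on that block.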

An input payload is run through a workflow that generates one or more output STAC Items. These output Items are added to the Cirrus static STAC catalog in S3, and are also broadcast via an SNS topic. Subscriptions to that topic can trigger additional workflows or external processes, such as indexing into a STAC API catalog (e.g., [stac-server](https://github.com/stac-utils/stac-server)).
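
A subscriber to that topic could look something like the sketch below: a Lambda-style handler that unpacks SNS records and collects the STAC Items they carry. The `Records[].Sns.Message` layout is the standard SNS-to-Lambda event format; that each message body is a single JSON-encoded STAC Item is an assumption here, and `items_from_sns_event` and `handler` are hypothetical names.

```python
import json
from typing import Any


def items_from_sns_event(event: dict[str, Any]) -> list[dict[str, Any]]:
    """Extract STAC Items from an SNS-triggered Lambda event.

    Assumes each SNS message body is one JSON-encoded STAC Item.
    """
    items = []
    for record in event.get("Records", []):
        message = json.loads(record["Sns"]["Message"])
        # STAC Items are GeoJSON Features; skip anything else.
        if isinstance(message, dict) and message.get("type") == "Feature":
            items.append(message)
    return items


def handler(event: dict[str, Any], context: Any = None) -> dict[str, Any]:
    """Hypothetical entry point: report which Items were received."""
    item_ids = [item["id"] for item in items_from_sns_event(event)]
    # A real subscriber might index the Items into a STAC API here.
    return {"item_ids": item_ids}


sample_event = {
    "Records": [
        {"Sns": {"Message": json.dumps({"type": "Feature", "id": "item-1"})}},
        {"Sns": {"Message": json.dumps({"note": "not an item"})}},
    ]
}
print(handler(sample_event))
```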

Cirrus workflows range from simply publishing unmodified input Items to complex transformations of input Items and the generation of wholly new output Items. The current state of each payload in a processing pipeline is tracked in a state database, which prevents duplicate processing and lets a user follow any input payload through the pipeline.
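
The idea of using tracked state to avoid duplicate processing can be sketched with a small in-memory stand-in. This is not the Cirrus state database or its API: the `InMemoryStateDB` class, its methods, and the three state names are all hypothetical and only illustrate the claim-before-processing pattern.

```python
from enum import Enum
from typing import Optional


class State(str, Enum):
    # Illustrative states only; the real set of states may differ.
    PROCESSING = "PROCESSING"
    COMPLETED = "COMPLETED"
    FAILED = "FAILED"


class InMemoryStateDB:
    """Toy stand-in for a state database, keyed by payload ID."""

    def __init__(self) -> None:
        self._states: dict[str, State] = {}

    def claim(self, payload_id: str) -> bool:
        """Mark a payload as PROCESSING; return False if it is a duplicate."""
        current = self._states.get(payload_id)
        if current in (State.PROCESSING, State.COMPLETED):
            return False  # already in flight or done, so skip it
        self._states[payload_id] = State.PROCESSING
        return True

    def finish(self, payload_id: str, success: bool) -> None:
        """Record the outcome of a processing attempt."""
        self._states[payload_id] = State.COMPLETED if success else State.FAILED

    def get(self, payload_id: str) -> Optional[State]:
        """Look up the current state, or None if the payload is unknown."""
        return self._states.get(payload_id)


db = InMemoryStateDB()
if db.claim("payload-1"):
    db.finish("payload-1", success=True)
print(db.get("payload-1"), db.claim("payload-1"))
```

In this sketch a failed payload can be claimed again, while one that is in progress or completed cannot, which is one plausible policy among several.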

As shown in the architecture overview above, users submit data to Cirrus through _feeders_. A feeder is a program that retrieves or generates STAC metadata, combines it with processing parameters, and passes the result into Cirrus as a payload.
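
A feeder's core job might be sketched as follows: take some STAC Items, attach processing parameters, and serialize the result as a payload. The `build_payload` function, the `workflow` and `tasks` key names, and the parameter values are assumptions for illustration. How the serialized payload is delivered to a Cirrus deployment is left out, because it depends on that deployment.

```python
import json
from typing import Any, Iterable, Optional


def build_payload(
    items: Iterable[dict[str, Any]],
    workflow: str,
    task_params: Optional[dict[str, Any]] = None,
) -> dict[str, Any]:
    """Combine STAC Items with processing parameters into one payload.

    Hypothetical helper; key names under 'process' are assumed.
    """
    return {
        "type": "FeatureCollection",
        "features": list(items),
        "process": [{"workflow": workflow, "tasks": task_params or {}}],
    }


# STAC metadata the feeder found or generated (a minimal stand-in Item).
found_items = [{"type": "Feature", "id": "scene-001", "properties": {}}]

payload = build_payload(
    found_items,
    workflow="example-workflow",
    task_params={"example-task": {"resolution": 10}},
)

# The serialized body is what a feeder would hand off to Cirrus.
body = json.dumps(payload)
print(body)
```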

## Cirrus Development

If you are developing new code for cirrus-geo, check out the [Contributing Guide](CONTRIBUTING.md).

## Documentation

Documentation for deploying, using, and customizing Cirrus is available in the [docs](https://cirrus-geo.github.io/cirrus-geo/).

## About

Cirrus is an open-source pipeline for processing geospatial data in AWS. Cirrus was developed by [Element 84](https://element84.com/), originally under a [NASA ACCESS project](https://earthdata.nasa.gov/esds/competitive-programs/access) called [Community Tools for Analysis of NASA Earth Observation System Data in the Cloud](https://earthdata.nasa.gov/esds/competitive-programs/access/eos-data-cloud).

[build-status-image]: https://github.com/cirrus-geo/cirrus-geo/actions/workflows/python-test.yml/badge.svg
[build-status]: https://github.com/cirrus-geo/cirrus-geo/actions/workflows/python-test.yml
[coverage-status-image]: https://img.shields.io/codecov/c/github/cirrus-geo/cirrus-geo/master.svg
[codecov]: https://codecov.io/github/cirrus-geo/cirrus-geo?branch=master
[pypi-version]: https://img.shields.io/pypi/v/cirrus-geo.svg
[pypi]: https://pypi.org/project/cirrus-geo/
