Skip to main content

No project description provided

Project description

STAC-geoparquet

Convert STAC items between JSON, GeoParquet, pgstac, and Delta Lake.

Purpose

The STAC spec defines a JSON-based schema. But it can be hard to manage and search through many millions of STAC items in JSON format. For one, JSON is very large on disk. And you need to parse the entire JSON data into memory to extract just a small piece of information, say the datetime and one asset of an Item.

GeoParquet can be a good complement to JSON for many bulk-access and analytic use cases. While STAC Items are commonly distributed as individual JSON files on object storage or through a STAC API, STAC GeoParquet allows users to access a large number of STAC items in bulk without making repeated HTTP requests.

For analytic questions like "find the items in the Sentinel-2 collection in June 2024 over New York City with cloud cover of less than 20%" it can be much, much faster to find the relevant data from a GeoParquet source than from JSON, because GeoParquet needs to load only the relevant columns for that query, not the full data.

See the STAC-GeoParquet specification for details on the exact schema of the written Parquet files.

Documentation

Documentation website

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stac_geoparquet-0.6.0.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

stac_geoparquet-0.6.0-py3-none-any.whl (32.4 kB view details)

Uploaded Python 3

File details

Details for the file stac_geoparquet-0.6.0.tar.gz.

File metadata

  • Download URL: stac_geoparquet-0.6.0.tar.gz
  • Upload date:
  • Size: 1.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for stac_geoparquet-0.6.0.tar.gz
Algorithm Hash digest
SHA256 8c83752859e64104844b7c0226f269df1c5a6cfc610d9c505340be6e09996d9b
MD5 4960bc471e88cc3ee16b47f3a946bff8
BLAKE2b-256 658f67972d7959d4b32b411463ab6d3114829c8dd120926e9c4ec9b356256e8d

See more details on using hashes here.

File details

Details for the file stac_geoparquet-0.6.0-py3-none-any.whl.

File metadata

File hashes

Hashes for stac_geoparquet-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7a4bac40e074881a2c7827ce8b2148e4885131f689d9845ac7441996147a0f55
MD5 754a9d20de725b25946d7b36ac31281d
BLAKE2b-256 af76193084ad93b37f1f2ccc54ac5b70e07f3fbef4fd8f703cd999c18285976f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page