geoparquet-io

Fast I/O and transformation tools for GeoParquet files using PyArrow and DuckDB.

📚 Full Documentation | Quick Start Tutorial

Features

  • Fast: Built on PyArrow and DuckDB for high-performance operations
  • Comprehensive: Sort, partition, enhance, and validate GeoParquet files
  • Spatial Indexing: Add bbox, H3 hexagonal cells, KD-tree partitions, and hierarchical admin divisions
  • Best Practices: Automatic optimization following the GeoParquet 1.1 specification
  • Flexible: CLI and Python API for any workflow
  • Tested: Extensive test suite across Python 3.9-3.13 and all platforms

Installation

pip install geoparquet-io

See the Installation Guide for other options (uv, from source) and requirements.
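
To confirm that the install succeeded, the distribution metadata can be queried from Python. This is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of geoparquet-io.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="geoparquet-io"):
    """Return the installed version string of a distribution, or None if it is absent.

    Hypothetical helper for illustration; it only reads package metadata
    and does not import the package itself.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version()
print("geoparquet-io:", found if found is not None else "not installed")
```

A `None` result usually means the package was installed into a different environment than the one running the interpreter.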

Quick Start

# Inspect file structure and metadata
gpio inspect myfile.parquet

# Check file quality and best practices
gpio check all myfile.parquet

# Add bounding box column for faster queries
gpio add bbox input.parquet output.parquet

# Sort using Hilbert curve for spatial locality
gpio sort hilbert input.parquet output_sorted.parquet

# Partition by admin boundaries
gpio partition admin buildings.parquet output_dir/ --dataset gaul --levels continent,country
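
The bbox and Hilbert steps above can be illustrated in plain Python. This is a minimal sketch of the underlying idea only, not how gpio implements it (the tool itself runs on PyArrow and DuckDB). It assumes each feature is a hypothetical `(name, coords)` pair, and the helper names `bbox`, `hilbert_index`, and `hilbert_sort` are invented here for illustration. Features are ordered by the position of their bounding-box center along a Hilbert curve, so features that are close in space tend to end up close in the output.

```python
def bbox(coords):
    """Return (xmin, ymin, xmax, ymax) for a non-empty sequence of (x, y) pairs."""
    xs, ys = zip(*coords)
    return (min(xs), min(ys), max(xs), max(ys))


def hilbert_index(x, y, order):
    """Map integer cell (x, y) on a 2**order by 2**order grid to its
    distance along the Hilbert curve (the classic iterative xy-to-d form)."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if x & s else 0
        ry = 1 if y & s else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip so the next, finer level sees a canonical orientation.
        if ry == 0:
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s >>= 1
    return d


def hilbert_sort(features, order=16):
    """Sort (name, coords) features by the Hilbert index of their bbox center.

    Illustrative assumption: bbox centers are snapped to a grid that covers
    the combined extent of all features.
    """
    if not features:
        return []
    boxes = [bbox(coords) for _, coords in features]
    xmin = min(b[0] for b in boxes)
    ymin = min(b[1] for b in boxes)
    xmax = max(b[2] for b in boxes)
    ymax = max(b[3] for b in boxes)
    n = 1 << order
    # Guard against a zero-size extent (all centers on one line or point).
    width = (xmax - xmin) or 1.0
    height = (ymax - ymin) or 1.0

    def key(item):
        box = item[0]
        cx = (box[0] + box[2]) / 2
        cy = (box[1] + box[3]) / 2
        gx = min(n - 1, int((cx - xmin) / width * n))
        gy = min(n - 1, int((cy - ymin) / height * n))
        return hilbert_index(gx, gy, order)

    return [feature for _, feature in sorted(zip(boxes, features), key=key)]
```

Storing features in this order means a query for a small region touches fewer, more contiguous row groups, and a per-feature bbox lets a reader skip rows without decoding geometries.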

For more examples and detailed usage, see the Quick Start Tutorial and User Guide.

Contributing

Contributions are welcome! See CONTRIBUTING.md for development setup, coding standards, and how to submit changes.

License

Apache 2.0 - See LICENSE for details.
