Geobuf

Geobuf is a compact binary geospatial format for lossless compression of GeoJSON.

Note well: this project has been transferred by Mapbox to the new pygeobuf organization.

Advantages over using GeoJSON directly (in this revised version):

  • Very compact: typically makes GeoJSON 6-8 times smaller.
  • Still smaller after gzip: gzipped Geobuf is typically 2-2.5 times smaller than gzipped GeoJSON.
  • Easy incremental parsing — you can get features out as you read them, without the need to build in-memory representation of the whole data.
  • Partial reads — you can read only the parts you actually need, skipping the rest.
  • Trivial concatenation: you can concatenate many Geobuf files together and they will form a valid combined Geobuf file.
  • Potentially faster encoding/decoding compared to native JSON implementations (e.g. in web browsers).
  • Can still accommodate any GeoJSON, including extensions with arbitrary properties.

Think of this as an attempt to design a simple, modern Shapefile successor that works seamlessly with GeoJSON.

Unlike Mapbox Vector Tiles, it aims for lossless compression of datasets — without tiling, projecting coordinates, flattening geometries or stripping properties.

pygeobuf

This repository is the first encoding/decoding implementation of this new major version of Geobuf (in Python). It serves as a prototyping playground, with faster implementations in JS and C++ planned for the future.

Sample compression sizes

file                 normal       gzipped
us-zips.json         101.85 MB    26.67 MB
us-zips.pbf          12.24 MB     10.48 MB
us-zips.topo.json    15.02 MB     3.19 MB
us-zips.topo.pbf     4.85 MB      2.72 MB
idaho.json           10.92 MB     2.57 MB
idaho.pbf            1.37 MB      1.17 MB
idaho.topo.json      1.9 MB       612 KB
idaho.topo.pbf       567 KB       479 KB
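
To relate these figures to the ratios quoted in the advantages list, the arithmetic can be sketched as follows. The sizes are copied from the GeoJSON and Geobuf rows above; the names SIZES_MB and ratio are made up for this illustration and are not part of the package.

```python
# Sizes in MB, copied from the sample table: (normal, gzipped).
SIZES_MB = {
    "us-zips.json": (101.85, 26.67),
    "us-zips.pbf": (12.24, 10.48),
    "idaho.json": (10.92, 2.57),
    "idaho.pbf": (1.37, 1.17),
}


def ratio(dataset, column):
    """How many times smaller the .pbf file is than the .json file.

    column 0 compares normal sizes, column 1 compares gzipped sizes.
    """
    json_size = SIZES_MB[f"{dataset}.json"][column]
    pbf_size = SIZES_MB[f"{dataset}.pbf"][column]
    return json_size / pbf_size


for name in ("us-zips", "idaho"):
    print(f"{name}: {ratio(name, 0):.1f}x normal, {ratio(name, 1):.1f}x gzipped")
# us-zips: 8.3x normal, 2.5x gzipped
# idaho: 8.0x normal, 2.2x gzipped
```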

Usage

Installation:

pip install geobuf

Command line:

geobuf encode < example.json > example.pbf
geobuf decode < example.pbf > example.pbf.json

As a module:

import geobuf

pbf = geobuf.encode(my_json) # GeoJSON -> Geobuf string
my_json = geobuf.decode(pbf) # Geobuf string -> GeoJSON

The encode function accepts a dict-like object, for example the result of json.loads(json_str).
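
As a rough sketch of how this might look for a file on disk, the helper below loads GeoJSON with json.load and writes the encoded result. The helper name geojson_file_to_geobuf and its encode parameter are hypothetical; the only library call used is geobuf.encode as shown above, and the code assumes its result is a bytes object that can be written in binary mode.

```python
import json


def geojson_file_to_geobuf(src_path, dst_path, encode=None):
    """Read GeoJSON from src_path and write the encoded data to dst_path.

    encode defaults to geobuf.encode; it is a parameter only so that the
    helper can be tried out without the package installed.
    Returns the number of bytes written.
    """
    if encode is None:
        import geobuf  # requires `pip install geobuf`
        encode = geobuf.encode

    with open(src_path, encoding="utf-8") as src:
        data = json.load(src)  # a dict, the kind of object encode accepts

    pbf = encode(data)
    with open(dst_path, "wb") as dst:  # binary mode: assumes encode returns bytes
        dst.write(pbf)
    return len(pbf)
```

With the package installed, geojson_file_to_geobuf("example.json", "example.pbf") should be roughly equivalent to the geobuf encode command shown earlier.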

Both encode.py and geobuf.encode accept two optional arguments:

  • precision — max number of digits after the decimal point in coordinates, 6 by default.
  • dimensions — number of dimensions in coordinates, 2 by default.
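
To make the precision limit concrete, here is a small sketch of what "max number of digits after the decimal point" means for a coordinate pair. It uses plain Python rounding and is not taken from the geobuf source; round_coords and is_preserved are hypothetical names. The assumption is that coordinates with more digits than precision will not come back unchanged after decoding.

```python
def round_coords(coords, precision=6):
    """Round each coordinate value to `precision` digits after the decimal point."""
    return [round(value, precision) for value in coords]


def is_preserved(coords, precision=6):
    """True if rounding to `precision` digits leaves the coordinates unchanged."""
    return round_coords(coords, precision) == list(coords)


print(round_coords([-122.4194187, 37.7749312]))   # [-122.419419, 37.774931]
print(is_preserved([-122.419419, 37.774931]))     # True
print(is_preserved([-122.4194187, 37.7749312]))   # False
```

If a dataset carries more than six decimal digits and they matter, passing a larger precision to encode is the documented way to keep them.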

Tests

py.test -v

The tests run through all .json files in the fixtures directory, comparing each original GeoJSON with an encoded/decoded one.
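
The round-trip comparison described here can be sketched as below. This is not the project's actual test code: roundtrip_matches and check_fixtures are hypothetical helpers, the encoder and decoder are passed in as arguments, and strict equality between the original and decoded objects is an assumption about how the comparison is made.

```python
import json


def roundtrip_matches(geojson, encode, decode):
    """True if decode(encode(geojson)) equals the original object."""
    return decode(encode(geojson)) == geojson


def check_fixtures(paths, encode, decode):
    """Return the paths whose GeoJSON does not survive an encode/decode round trip."""
    failures = []
    for path in paths:
        with open(path, encoding="utf-8") as f:
            original = json.load(f)
        if not roundtrip_matches(original, encode, decode):
            failures.append(path)
    return failures
```

With the package installed, a call such as check_fixtures(glob.glob("fixtures/*.json"), geobuf.encode, geobuf.decode) would return an empty list when every fixture round-trips cleanly (this needs import glob and import geobuf; the fixtures path is assumed from the description above).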

Generating from geobuf.proto

To (re-)generate the geobuf_pb2.py file, run the following command:

protoc geobuf.proto --python_out=geobuf
