Geobuf is a compact binary geospatial format for lossless compression of GeoJSON.
Project description
Geobuf
Geobuf is a compact binary geospatial format for lossless compression of GeoJSON.
Note well: this project has been transferred by Mapbox to the new pygeobuf organization.
Advantages over using GeoJSON directly (in this revised version):
- Very compact: typically makes GeoJSON 6-8 times smaller.
- Smaller even when comparing gzipped sizes: 2-2.5x compression for GeoJSON.
- Easy incremental parsing — you can get features out as you read them, without the need to build in-memory representation of the whole data.
- Partial reads — you can read only the parts you actually need, skipping the rest.
- Trivial concatenation: you can concatenate many Geobuf files together and they will form a valid combined Geobuf file.
- Potentially faster encoding/decoding compared to native JSON implementations (i.e. in Web browsers).
- Can still accommodate any GeoJSON, including extensions with arbitrary properties.
Think of this as an attempt to design a simple, modern Shapefile successor that works seamlessly with GeoJSON.
Unlike Mapbox Vector Tiles, it aims for lossless compression of datasets — without tiling, projecting coordinates, flattening geometries or stripping properties.
pygeobuf
This repository is the first encoding/decoding implementation of this new major version of Geobuf (in Python). It serves as a prototyping playground, with faster implementations in JS and C++ coming in future.
Sample compression sizes
normal | gzipped | |
---|---|---|
us-zips.json | 101.85 MB | 26.67 MB |
us-zips.pbf | 12.24 MB | 10.48 MB |
us-zips.topo.json | 15.02 MB | 3.19 MB |
us-zips.topo.pbf | 4.85 MB | 2.72 MB |
idaho.json | 10.92 MB | 2.57 MB |
idaho.pbf | 1.37 MB | 1.17 MB |
idaho.topo.json | 1.9 MB | 612 KB |
idaho.topo.pbf | 567 KB | 479 KB |
Usage
Installation:
pip install geobuf
Command line:
geobuf encode < example.json > example.pbf
geobuf decode < example.pbf > example.pbf.json
As a module:
import geobuf
pbf = geobuf.encode(my_json) # GeoJSON -> Geobuf string
my_json = geobuf.decode(pbf) # Geobuf string -> GeoJSON
The encode
function accepts a dict-like object, for example the result of json.loads(json_str)
.
Both encode.py
and geobuf.encode
accept two optional arguments:
- precision — max number of digits after the decimal point in coordinates,
6
by default. - dimensions — number of dimensions in coordinates,
2
by default.
Tests
py.test -v
The tests run through all .json
files in the fixtures
directory,
comparing each original GeoJSON with an encoded/decoded one.
Generating from geobuf.proto
To (re-)generate the geobuf_pb2.py
file, you can run the following
commands:
protoc geobuf.proto --python_out=geobuf
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file geobuf-2.0.0.tar.gz
.
File metadata
- Download URL: geobuf-2.0.0.tar.gz
- Upload date:
- Size: 46.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 96a347cd8443cc53a65f34f958b45b0e45c4cca8f1ca3b72078a3aec97605a72 |
|
MD5 | f91d457d4c383cce8dfdc08a603e533b |
|
BLAKE2b-256 | eda9166da74c1e3b8c6b9644c244de75fa5e001b01fe06311721c84471b9296e |
File details
Details for the file geobuf-2.0.0-py3-none-any.whl
.
File metadata
- Download URL: geobuf-2.0.0-py3-none-any.whl
- Upload date:
- Size: 9.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 01e89318861a5f80a2980c1fe602c1f378439cbfd3a0d99f1991550956c4ecf6 |
|
MD5 | e093be1c1ecab2a0ea59c48ce15e201a |
|
BLAKE2b-256 | f3ece174665e4b824ecfad1555746a5bc13eb26de85c66ec61f55eb097c433f0 |