Skip to main content

CLI tools such as validation and file format conversion for vecorel.

Project description

Vecorel CLI

A command-line interface (CLI) for working with Vecorel files.

Getting Started

In order to make working with Vecorel easier we have developed command-line interface (CLI) tools such as inspection, validation and file format conversions.

Installation

Using Pixi (Recommended)

This project uses Pixi for dependency management. Install Pixi first, then:

# Clone the repository and navigate to it
git clone https://github.com/vecorel/cli.git
cd cli

# Install all dependencies
pixi install

# Run the CLI
pixi run vec

Note: This project recently migrated to Pixi. If you were using the old pip-based workflow, see MIGRATION.md for details.

Using pip

Alternatively, you can install from PyPI with Python 3.10 or any later version:

pip install vecorel-cli

Execute a command

After the installation you should be able to run the following command: vec (or pixi run vec if using Pixi)

You should see usage instructions and available commands for the CLI.

Vecorel CLI supports various commands to work with the files:

Commands

Validation

To validate a Vecorel GeoParquet or GeoJSON file, you can for example run:

  • GeoJSON: vec validate example.json --collection collection.json
  • GeoParquet: vec validate example.parquet --data

Check vec validate --help for more details.

The validator also supports remote files.

  • http:// or https://: no further configuration is needed.
  • s3://: With Pixi, run pixi install -e s3 or with pip, run pip install vecorel-cli[s3] and you may need to set environment variables. Refer to the s3fs credentials documentation for how to define credentials.
  • gs://: With Pixi, run pixi install -e gcs or with pip, run pip install vecorel-cli[gcs]. By default, gcsfs will attempt to use your default gcloud credentials or, attempt to get credentials from the google metadata service, or fall back to anonymous access.

Create Vecorel GeoParquet from GeoJSON

To create a Vecorel-compliant GeoParquet for a Vecorel-compliant set of GeoJSON files containing Features or FeatureCollections, you can for example run:

  • vec create-geoparquet geojson/example.json -o example.parquet -c geojson/collection.json

Check vec create-geoparquet --help for more details.

Create Vecorel GeoJSON from GeoParquet

To create one or multiple Vecorel-compliant GeoJSON file(s) for a Vecorel-compliant GeoParquet file, you can for example run:

  • GeoJSON FeatureCollection: vec create-geojson example.parquet -o dest-folder

  • GeoJSON Features (with indentation and max. 100 features): vec create-geojson example.parquet -o dest-folder -n 100 -i 2 -f

    vec create-geojson example.parquet -o dest-folder -n 100 -i 2 -f

Check vec create-geojson --help for more details.

Inspect Vecorel GeoParquet file

To look into a Vecorel GeoParquet file to get a rough understanding of the content, the following can be executed:

  • vec describe example.parquet

Check vec describe --help for more details.

Merge Vecorel GeoParquet files

Merges multiple Vecorel datasets to a combined Vecorel dataset:

  • vec merge ec_ee.parquet ec_lv.parquet -o merged.parquet -e https://vecorel.org/hcat-extension/v0.1.0/schema.yaml -i ec:hcat_name -i ec:hcat_code -i ec:translated_name

Check vec merge --help for more details.

Create JSON Schema from Vecorel Schema

To create a JSON Schema for a Vecorel Schema YAML file, you can for example run:

  • vec jsonschema example.json --id=https://vecorel.org/specification/v0.1.0/geojson/schema.json -o schema.json

Check vec jsonschema --help for more details.

Validate a Vecorel Schema

To validate a Vecorel Schema YAML file, you can for example run:

  • vec validate-schema schema/schema.yaml

Check vec validate-schema --help for more details.

Improve a Vecorel Parquet file

Various "improvements" can be applied to a Vecorel GeoParquet file. The commands allows to

  • change the CRS (--crs)
  • change the GeoParquet version (-gp1) and compression (-pc)
  • add/fill missing perimeter/area values (-sz)
  • fix invalid geometries (-g)
  • rename columns (-r)

Example:

  • vec improve file.parquet -o file2.parquet -g -sz -r old=new -pc zstd

Check vec improve --help for more details.

Update an extension template with new names

Once you've created and git cloned a new extension, you can use the CLI to update all template placeholders with proper names.

For example, if your extension is meant to have

  • the title "Administrative Division Extension",
  • the prefix admin (e.g. field admin:country_code or admin:subdivision_code),
  • is hosted at https://github.io/vecorel/administrative-division-extension (organization: vecorel, repository /administrative-division-extension),
  • and you run Vecorel in the folder of the extension.

Then the following command could be used:

  • vec rename-extension . -t "Administrative Division" -p admin -s administrative-division-extension -o vecorel

Check vec rename-extension --help for more details.

Converter for existing datasets

The CLI ships various converters for existing datasets.

To get a list of available converters/datasets with title, license, etc. run:

  • vec converters

Use any of the IDs from the list to convert an existing dataset to Vecorel:

  • vec convert de_nrw

See Implement a converter for details about how to

Development

This project uses Pixi for dependency management and development workflows.

# Install all dependencies including development tools
pixi install -e dev

# Install the package in editable mode
pixi run install-dev

# Run tests
pixi run test

# Format and lint code
pixi run format
pixi run lint

# Run all checks (lint, format, test)
pixi run check

# Install and run pre-commit
pixi run pre-commit-install
pixi run pre-commit-run

Implement a converter

The following high-level description gives an idea how to implement a converter in Vecorel CLI:

  1. Create a new file in vecorel_cli/datasets based on the template.py
  2. Fill in the required variables / test it / run it
  3. Add missing dependencies into the appropriate feature group in pixi.toml (or setup.py for pip users)
  4. Add the converter to the list above
  5. Create a PR to submit your converter for review

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vecorel_cli-0.2.5.tar.gz (67.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vecorel_cli-0.2.5-py3-none-any.whl (74.3 kB view details)

Uploaded Python 3

File details

Details for the file vecorel_cli-0.2.5.tar.gz.

File metadata

  • Download URL: vecorel_cli-0.2.5.tar.gz
  • Upload date:
  • Size: 67.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for vecorel_cli-0.2.5.tar.gz
Algorithm Hash digest
SHA256 87ff26600ad60fcc0bb57d60b34c979f0df4b2ac0d565353ae2d6fba22f84d06
MD5 5bd6df4e87fd0e5f7bd98007c0b64cb9
BLAKE2b-256 e4d436dec1922c1c45d1929e2b68a4c9b9f0378febd1eeed5cfb7c9a552347f3

See more details on using hashes here.

File details

Details for the file vecorel_cli-0.2.5-py3-none-any.whl.

File metadata

  • Download URL: vecorel_cli-0.2.5-py3-none-any.whl
  • Upload date:
  • Size: 74.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for vecorel_cli-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 9e7c9b499bcb0fb63d3e41005b8d8dd3bf596ea7e4e1528c9b925aa87a665b97
MD5 e98559a3c4d6ec7f763dc9d44ee17700
BLAKE2b-256 c767cd842ad641a94da9be50024f3bbf722cf48de282918518a7dcde14c5f992

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page