Singer tap for Geo, built with the Meltano Singer SDK.

tap-geo

tap-geo is a Singer tap for Geospatial datasets.

Built with the Meltano Tap SDK for Singer Taps.

Capabilities

  • catalog
  • state
  • discover
  • activate-version
  • about
  • stream-maps
  • schema-flattening
  • batch

Supported Python Versions

  • 3.10
  • 3.11
  • 3.12
  • 3.13
  • 3.14

Settings

| Setting | Required | Default | Description |
|---|---|---|---|
| files | True | None | List of file configs to parse |
| stream_maps | False | None | Config object for stream maps capability. For more information check out Stream Maps. |
| stream_maps.else | False | None | Currently, only setting this to __NULL__ is supported. This will remove all other streams. |
| stream_map_config | False | None | User-defined config values to be used within map expressions. |
| faker_config | False | None | Config for the Faker instance variable fake used within map expressions. Only applicable if the plugin specifies faker as an additional dependency (through the singer-sdk faker extra or directly). |
| faker_config.seed | False | None | Value to seed the Faker generator for deterministic output: https://faker.readthedocs.io/en/master/#seeding-the-generator |
| faker_config.locale | False | None | One or more LCID locale strings to produce localized output for: https://faker.readthedocs.io/en/master/#localization |
| flattening_enabled | False | None | 'True' to enable schema flattening and automatically expand nested properties. |
| flattening_max_depth | False | None | The max depth to flatten schemas. |
| batch_config | False | None | Configuration for BATCH message capabilities. |
| batch_config.encoding | False | None | Specifies the format and compression of the batch files. |
| batch_config.encoding.format | False | None | Format to use for batch files. |
| batch_config.encoding.compression | False | None | Compression format to use for batch files. |
| batch_config.storage | False | None | Defines the storage layer to use when writing batch files. |
| batch_config.storage.root | False | None | Root path to use when writing batch files. |
| batch_config.storage.prefix | False | None | Prefix to use when writing batch files. |

A full list of supported settings and capabilities is available by running: tap-geo --about
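
To make the flattening_enabled and flattening_max_depth settings more concrete, here is a minimal sketch of what depth-limited flattening of a nested record can look like. It is an illustration only, not the SDK's implementation: the function name flatten_record, the "__" separator, and the choice to leave deeper values untouched are assumptions made for this example.

```python
def flatten_record(record, max_depth, parent_key="", sep="__", level=0):
    """Flatten nested dicts up to max_depth levels (hypothetical helper).

    Values nested deeper than max_depth are kept as-is in this sketch.
    """
    flat = {}
    for key, value in record.items():
        new_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict) and level < max_depth:
            # Still within the allowed depth: expand the nested object.
            flat.update(flatten_record(value, max_depth, new_key, sep, level + 1))
        else:
            flat[new_key] = value
    return flat


record = {"id": 1, "properties": {"name": "A", "address": {"city": "Rome"}}}
shallow = flatten_record(record, max_depth=1)
deep = flatten_record(record, max_depth=2)
```

With max_depth=1 only the first level of nesting is expanded, so the address object stays nested under a single key; with max_depth=2 it is expanded as well.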

Installation

Install from GitHub:

uv tool install git+https://github.com/celine-eu/tap-geo.git@main

Configuration

Accepted Config Options

See also meltano.yml for a working configuration

Provide a list of file configs, each with the following fields:

  • paths: list of files in glob format (required)
  • table_name: name of the destination table; defaults to the filename
  • primary_keys: list of columns to use as primary keys
  • geometry_format: store geospatial information as "wkt" (default) or "geojson"
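
The two geometry_format values correspond to two common text encodings of the same geometry. The sketch below shows one point in both forms. It only handles Point geometries, the helper point_to_wkt is hypothetical, and how tap-geo itself serializes the geometry column is not shown in this README.

```python
import json


def point_to_wkt(geometry):
    """Convert a GeoJSON Point dict to a WKT string (Points only)."""
    if geometry.get("type") != "Point":
        raise ValueError("this sketch only handles Point geometries")
    x, y = geometry["coordinates"][:2]
    return f"POINT ({x} {y})"


geojson_point = {"type": "Point", "coordinates": [12.4964, 41.9028]}

as_geojson = json.dumps(geojson_point)   # GeoJSON: a JSON object with type and coordinates
as_wkt = point_to_wkt(geojson_point)     # WKT: a compact text form
```

WKT is compact and widely accepted by spatial databases, while GeoJSON keeps the geometry as structured JSON.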

Example config

config:
  files:
    - paths:
        - "data/osm/*.osm"
        - "data/osm/**/*.pbf"
      table_name: osm_data
      primary_keys: ["id"]
      geometry_format: "wkt"

    - paths:
        - "data/shapes/**/*.shp"
      table_name: shapes
      skip_fields: ["temp_field"]
      expose_fields: ["col_name", "col_2"]
      geometry_format: "geojson"

    - paths:
        - "data/buildings.geojson"
      table_name: buildings
      primary_keys: ["building_id"]

    # e.g. use docker compose up to test locally
    - paths:
        - "s3://local-data/buildings.geojson"
      table_name: buildings
      primary_keys: ["building_id"]
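
As a rough illustration of how a file config with glob patterns could be resolved for local files, the following sketch expands each pattern and picks a table name. It is not the tap's actual code: resolve_files is a hypothetical helper, and it assumes that "defaults to the filename" means the file name without its extension.

```python
import glob
import tempfile
from pathlib import Path


def resolve_files(file_config):
    """Expand glob patterns and pair each match with a table name."""
    resolved = []
    for pattern in file_config["paths"]:
        # recursive=True lets "**" match nested directories.
        for match in sorted(glob.glob(pattern, recursive=True)):
            # Assumption: the default table name is the file stem.
            table = file_config.get("table_name") or Path(match).stem
            resolved.append((match, table))
    return resolved


# Small demo against a temporary directory tree.
with tempfile.TemporaryDirectory() as tmp:
    nested = Path(tmp) / "shapes" / "regions"
    nested.mkdir(parents=True)
    (nested / "italy.shp").touch()

    pattern = f"{tmp}/shapes/**/*.shp"
    unnamed = resolve_files({"paths": [pattern]})
    named = resolve_files({"paths": [pattern], "table_name": "shapes"})
```

In the demo, the config without table_name falls back to the file stem, while the config with table_name uses the given name for every matched file.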

To use S3-based storage, provide the following environment variables:

  • S3_ACCESS_KEY_ID, S3_SECRET_ACCESS_KEY: access key/secret pair
  • S3_ENDPOINT_URL: custom S3 endpoint, such as MinIO or another S3-compatible interface

Example:

S3_ACCESS_KEY_ID=minioadmin S3_SECRET_ACCESS_KEY=minioadmin S3_ENDPOINT_URL=http://localhost:19000 meltano run tap-geo target-jsonl
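
Before launching a pipeline against S3 paths, it can help to check that the variables are actually set. The following is a small pre-flight sketch; missing_s3_vars is a hypothetical helper and is not part of tap-geo. It treats all three variables as expected, which may be stricter than the tap requires.

```python
import os

# Variable names taken from the list above.
S3_VARS = ("S3_ACCESS_KEY_ID", "S3_SECRET_ACCESS_KEY", "S3_ENDPOINT_URL")


def missing_s3_vars(env=None, names=S3_VARS):
    """Return the names of expected variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in names if not env.get(name)]


if __name__ == "__main__":
    missing = missing_s3_vars()
    if missing:
        print("Missing S3 settings:", ", ".join(missing))
```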

Configure using environment variables

This Singer tap automatically imports environment variables from the .env file in the working directory when --config=ENV is provided. A config value is picked up whenever a matching environment variable is set, either in the terminal context or in the .env file.
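
The idea of "matching" environment variables can be sketched as follows. This assumes the common Singer SDK convention of an upper-case plugin-name prefix (here TAP_GEO_) followed by the setting name; the prefix, the helper config_from_env, and the lack of any value parsing are assumptions of this example, so check tap-geo --about for the authoritative setting names.

```python
def config_from_env(env, prefix="TAP_GEO_"):
    """Collect settings from variables that start with the assumed prefix."""
    return {
        key[len(prefix):].lower(): value
        for key, value in env.items()
        if key.startswith(prefix)
    }


example_env = {
    "TAP_GEO_FLATTENING_ENABLED": "true",
    "TAP_GEO_FLATTENING_MAX_DEPTH": "2",
    "PATH": "/usr/bin",  # unrelated variables are ignored
}
settings = config_from_env(example_env)
```

Values stay as strings in this sketch; a real loader would also convert them to the types declared for each setting.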

Source Authentication and Authorization

Usage

You can easily run tap-geo by itself or in a pipeline using Meltano.

Executing the Tap Directly

tap-geo --version
tap-geo --help
tap-geo --config CONFIG --discover > ./catalog.json
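
The catalog written by --discover is a JSON document with a top-level "streams" array, as defined by the Singer specification. A short sketch for inspecting it is shown below; the sample catalog is made up for illustration and does not reflect the actual schema tap-geo emits.

```python
import json


def list_streams(catalog):
    """Return (stream id, sorted property names) for each stream."""
    return [
        (
            stream["tap_stream_id"],
            sorted(stream.get("schema", {}).get("properties", {})),
        )
        for stream in catalog.get("streams", [])
    ]


# Hypothetical catalog content, standing in for ./catalog.json.
sample_catalog = json.loads("""
{
  "streams": [
    {
      "tap_stream_id": "buildings",
      "schema": {"properties": {"building_id": {}, "geometry": {}}}
    }
  ]
}
""")
summary = list_streams(sample_catalog)
```

To inspect a real catalog, load it with json.load(open("catalog.json")) and pass the result to list_streams.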

Developer Resources

Follow these instructions to contribute to this project.

Initialize your Development Environment

Prerequisites:

  • Python 3.10+
  • uv

uv sync

Create and Run Tests

Create tests within the tests subfolder and then run:

uv run pytest

You can also test the tap-geo CLI interface directly using uv run:

uv run tap-geo --help

Testing with Meltano

Note: This tap will work in any Singer environment and does not require Meltano. Examples here are for convenience and to streamline end-to-end orchestration scenarios.

Next, install Meltano (if you haven't already) and any needed plugins:

# Install meltano
uv tool install meltano
# Initialize meltano within this directory
cd tap-geo
meltano install

Now you can test and orchestrate using Meltano:

# Test invocation:
meltano invoke tap-geo --version

# OR run a test ELT pipeline:
meltano run tap-geo target-jsonl

SDK Dev Guide

See the dev guide for more instructions on how to use the SDK to develop your own taps and targets.
