Singer tap for Geo, built with the Meltano Singer SDK.
tap-geo
tap-geo is a Singer tap for Geospatial datasets.
Built with the Meltano Tap SDK for Singer Taps.
Capabilities
- catalog
- state
- discover
- activate-version
- about
- stream-maps
- schema-flattening
- batch
Supported Python Versions
- 3.10
- 3.11
- 3.12
- 3.13
- 3.14
Settings
| Setting | Required | Default | Description |
|---|---|---|---|
| files | True | None | List of file configs to parse |
| stream_maps | False | None | Config object for stream maps capability. For more information check out Stream Maps. |
| stream_maps.else | False | None | Currently, only setting this to __NULL__ is supported. This will remove all other streams. |
| stream_map_config | False | None | User-defined config values to be used within map expressions. |
| faker_config | False | None | Config for the Faker instance variable fake used within map expressions. Only applicable if the plugin specifies faker as an additional dependency (through the singer-sdk faker extra or directly). |
| faker_config.seed | False | None | Value to seed the Faker generator for deterministic output: https://faker.readthedocs.io/en/master/#seeding-the-generator |
| faker_config.locale | False | None | One or more LCID locale strings to produce localized output for: https://faker.readthedocs.io/en/master/#localization |
| flattening_enabled | False | None | 'True' to enable schema flattening and automatically expand nested properties. |
| flattening_max_depth | False | None | The max depth to flatten schemas. |
| batch_config | False | None | Configuration for BATCH message capabilities. |
| batch_config.encoding | False | None | Specifies the format and compression of the batch files. |
| batch_config.encoding.format | False | None | Format to use for batch files. |
| batch_config.encoding.compression | False | None | Compression format to use for batch files. |
| batch_config.storage | False | None | Defines the storage layer to use when writing batch files |
| batch_config.storage.root | False | None | Root path to use when writing batch files. |
| batch_config.storage.prefix | False | None | Prefix to use when writing batch files. |
A full list of supported settings and capabilities is available by running: tap-geo --about
Installation
Install from GitHub:
uv tool install git+https://github.com/celine-eu/tap-geo.git@main
Configuration
Accepted Config Options
See also meltano.yml for a working configuration.
Provide a list of file configs under files. Each entry accepts the following fields:
- paths: list of files in glob format, required
- table_name: name of the destination table, defaults to the filename
- primary_keys: list of columns to use as primary keys
- geometry_format: store geospatial information as "wkt" (default) or "geojson"
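The defaults described above can be sketched in a few lines of Python. This is a minimal illustration, not the tap's own code: the helper name normalize_file_config is hypothetical, and deriving the default table name from the stem of the first path is an assumption (the tap may derive it per matched file instead).

```python
from pathlib import PurePosixPath

ALLOWED_GEOMETRY_FORMATS = {"wkt", "geojson"}


def normalize_file_config(entry: dict) -> dict:
    """Return a copy of one `files` entry with the documented defaults applied.

    Hypothetical helper for illustration only; not part of tap-geo.
    """
    paths = entry.get("paths")
    if not paths:
        # `paths` is the only field documented as required.
        raise ValueError("each file config needs a non-empty 'paths' list")

    geometry_format = entry.get("geometry_format", "wkt")
    if geometry_format not in ALLOWED_GEOMETRY_FORMATS:
        raise ValueError(f"unsupported geometry_format: {geometry_format!r}")

    # Assumption: with no table_name, fall back to the stem of the first path.
    default_name = PurePosixPath(paths[0]).stem
    return {
        **entry,
        "table_name": entry.get("table_name", default_name),
        "primary_keys": list(entry.get("primary_keys", [])),
        "geometry_format": geometry_format,
    }


if __name__ == "__main__":
    print(normalize_file_config({"paths": ["data/buildings.geojson"]}))
```

Checking a config this way before handing it to the tap surfaces a missing paths list or a mistyped geometry_format early.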
Example config
config:
  files:
    - paths:
        - "data/osm/*.osm"
        - "data/osm/**/*.pbf"
      table_name: osm_data
      primary_keys: ["id"]
      geometry_format: "wkt"
    - paths:
        - "data/shapes/**/*.shp"
      table_name: shapes
      skip_fields: ["temp_field"]
      expose_fields: ["col_name", "col_2"]
      geometry_format: "geojson"
    - paths:
        - "data/buildings.geojson"
      table_name: buildings
      primary_keys: ["building_id"]
    # e.g. use docker compose up to test locally
    - paths:
        - "s3://local-data/buildings.geojson"
      table_name: buildings
      primary_keys: ["building_id"]
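To preview which local files a paths list would match, the patterns can be expanded with Python's standard glob module. This sketch assumes the tap follows standard glob semantics, where `*` stays within one directory and `**` spans nested directories; the helper name expand_paths is hypothetical, and s3:// paths are out of scope here.

```python
import glob
import os
import tempfile


def expand_paths(patterns: list[str], root: str = ".") -> list[str]:
    """Expand glob patterns relative to root, returning sorted unique matches."""
    matches: set[str] = set()
    for pattern in patterns:
        # recursive=True makes "**" match zero or more nested directories.
        for hit in glob.glob(os.path.join(root, pattern), recursive=True):
            matches.add(os.path.relpath(hit, root))
    return sorted(matches)


if __name__ == "__main__":
    # Build a throwaway directory tree that mirrors the example config.
    with tempfile.TemporaryDirectory() as root:
        for rel in ("data/osm/a.osm", "data/osm/b.pbf", "data/osm/eu/c.pbf"):
            full = os.path.join(root, rel)
            os.makedirs(os.path.dirname(full), exist_ok=True)
            open(full, "w").close()
        print(expand_paths(["data/osm/*.osm", "data/osm/**/*.pbf"], root))
```

Note that `data/osm/**/*.pbf` also matches .pbf files directly under data/osm, because `**` can match zero directories.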
To use S3-based storage, provide the following environment variables:
- S3_ACCESS_KEY_ID, S3_SECRET_ACCESS_KEY: access key/secret pair
- S3_ENDPOINT_URL: custom S3 endpoint, such as MinIO or a compatible interface
Example:
S3_ACCESS_KEY_ID=minioadmin S3_SECRET_ACCESS_KEY=minioadmin S3_ENDPOINT_URL=http://localhost:19000 meltano run tap-geo target-jsonl
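A small pre-flight check can catch missing credentials before a pipeline run. The sketch below is hypothetical and not part of tap-geo; it assumes that only the key/secret pair is mandatory and that S3_ENDPOINT_URL is needed only for custom endpoints.

```python
from __future__ import annotations

import os
from collections.abc import Mapping

# Assumption: the key/secret pair is required, the endpoint URL is optional.
REQUIRED_S3_VARS = ("S3_ACCESS_KEY_ID", "S3_SECRET_ACCESS_KEY")


def missing_s3_vars(paths: list[str], env: Mapping[str, str] | None = None) -> list[str]:
    """Return the required S3 variables that are unset when any path uses s3://."""
    env = os.environ if env is None else env
    if not any(path.startswith("s3://") for path in paths):
        return []  # purely local paths need no S3 credentials
    return [name for name in REQUIRED_S3_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_s3_vars(["s3://local-data/buildings.geojson"])
    if missing:
        print("Missing environment variables:", ", ".join(missing))
```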
Configure using environment variables
When --config=ENV is passed, this Singer tap automatically imports environment variables from the
.env file in the working directory. A config value is picked up whenever a matching
environment variable is set, either in the terminal context or in the .env file.
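As a rough illustration of what "matching" could mean, the sketch below collects variables that carry a TAP_GEO_ prefix and maps them to lower-case setting names. The prefix follows the usual Singer SDK convention (plugin name upper-cased, dashes replaced by underscores) and is an assumption here, not something this README states. Values are kept as raw strings, and the function name settings_from_env is hypothetical.

```python
from __future__ import annotations

import os
from collections.abc import Mapping

# Assumption: plugin name upper-cased with "-" replaced by "_", plus a trailing "_".
ENV_PREFIX = "TAP_GEO_"


def settings_from_env(env: Mapping[str, str] | None = None) -> dict[str, str]:
    """Collect prefixed environment variables as {setting_name: raw string value}."""
    env = os.environ if env is None else env
    return {
        key[len(ENV_PREFIX):].lower(): value
        for key, value in env.items()
        if key.startswith(ENV_PREFIX)
    }


if __name__ == "__main__":
    print(settings_from_env({"TAP_GEO_FLATTENING_ENABLED": "true", "PATH": "/usr/bin"}))
```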
Source Authentication and Authorization
Usage
You can easily run tap-geo by itself or in a pipeline using Meltano.
Executing the Tap Directly
tap-geo --version
tap-geo --help
tap-geo --config CONFIG --discover > ./catalog.json
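The catalog.json written by the discover command is a Singer catalog: a JSON object with a streams array, where each stream carries a tap_stream_id and a JSON Schema. A minimal sketch for inspecting it follows; the function name list_streams is hypothetical.

```python
import json


def list_streams(catalog_path: str) -> list[tuple[str, list[str]]]:
    """Return (stream id, sorted property names) for each stream in a Singer catalog."""
    with open(catalog_path, encoding="utf-8") as fh:
        catalog = json.load(fh)
    result = []
    for stream in catalog.get("streams", []):
        # Each stream's schema is a JSON Schema object; its properties are the columns.
        props = stream.get("schema", {}).get("properties", {})
        result.append((stream["tap_stream_id"], sorted(props)))
    return result


if __name__ == "__main__":
    for stream_id, columns in list_streams("./catalog.json"):
        print(stream_id, columns)
```

Running it after discovery gives a quick check that each configured table_name shows up as a stream with the expected columns.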
Developer Resources
Follow these instructions to contribute to this project.
Initialize your Development Environment
Prerequisites:
- Python 3.10+
- uv
uv sync
Create and Run Tests
Create tests within the tests subfolder and then run:
uv run pytest
You can also test the tap-geo CLI interface directly using uv run:
uv run tap-geo --help
Testing with Meltano
Note: This tap will work in any Singer environment and does not require Meltano. Examples here are for convenience and to streamline end-to-end orchestration scenarios.
Next, install Meltano (if you haven't already) and any needed plugins:
# Install meltano
uv tool install meltano
# Initialize meltano within this directory
cd tap-geo
meltano install
Now you can test and orchestrate using Meltano:
# Test invocation:
meltano invoke tap-geo --version
# OR run a test ELT pipeline:
meltano run tap-geo target-jsonl
SDK Dev Guide
See the dev guide for more instructions on how to use the SDK to develop your own taps and targets.