Pre-delineated MERIT-Hydro watershed boundaries for ~60,000 gauging stations across 16 countries
Project description
watershed-retrieve
Instant access to ~60,000 pre-delineated watershed boundaries and river networks across 16 countries, served as GeoParquet. Supports multiple hydrofabrics (MERIT-Hydro and HydroSHEDS v1). No data download required — basins are fetched on demand from a public CDN and cached locally.
Background
This library is a community contribution to the RivRetrieve ecosystem. Where RivRetrieve provides observed streamflow time series for gauging stations worldwide, watershed-retrieve adds the corresponding watershed boundaries and river networks.
Delineation is available on two hydrofabrics:
- MERIT-Hydro — ~90m global hydrography (default)
- HydroSHEDS v1 — ~90m global hydrography, alternative dataset
The watershed delineation was performed using a Rust reimplementation of the algorithm described in mheberger/delineator. This is the same methodology used by CAMELS-DE (Loritz et al., 2024) to derive consistent catchment boundaries for 1582 gauging stations across Germany from MERIT Hydro.
See the original proposal: kratzert/RivRetrieve-Python#87.
Installation
pip install watershed-retrieve
Quick Start
import watershed_retrieve as wr
# Zero-config — data is fetched from R2 CDN and cached locally (MERIT-Hydro by default)
watershed = wr.get_watershed("portugal", "04K/04A")
# With river network
watershed, rivers = wr.get_watershed_with_rivers("portugal", "04K/04A")
# Bulk retrieval — all watersheds for a country
all_watersheds = wr.get_watersheds("portugal")
Selecting a hydrofabric
from watershed_retrieve import Fabric
# Use HydroSHEDS v1 instead of MERIT-Hydro
wr.configure(fabric=Fabric.HYDROSHEDS_V1)
# All subsequent calls use HydroSHEDS data
watershed = wr.get_watershed("portugal", "04K/04A")
# Switch back to MERIT-Hydro
wr.configure(fabric=Fabric.MERIT)
Local data directory
# Option 1: Environment variable
# export WATERSHED_RETRIEVE_DATA_DIR=/path/to/parquet/files
# Option 2: Programmatic
wr.configure("/path/to/parquet/files")
# Option 3: Explicit backend selection
from watershed_retrieve import Backend
wr.configure(backend=Backend.R2, cache_dir=Path("~/.my-cache"))
API Reference
Discovery
# List all supported countries
wr.available_countries()
# -> ['australia', 'brazil', 'canada', ..., 'usa']
# List gauge IDs for a country
wr.available_gauges("portugal")
# -> ['02G-02H', '02O-01H', ..., '16J-01H'] (~710 gauges)
Single Watershed
# Watershed boundary (GeoDataFrame, 1 row)
gdf = wr.get_watershed("portugal", "04K/04A")
# Watershed + river network (WatershedResult — unpackable NamedTuple)
result = wr.get_watershed_with_rivers("portugal", "04K/04A")
watershed, rivers = result
Bulk Retrieval
# All watersheds for a country
gdf = wr.get_watersheds("portugal") # -> GeoDataFrame (~710 rows)
# Subset by gauge IDs
gdf = wr.get_watersheds("portugal", ["04K/04A", "05G/01A"])
# With rivers
result = wr.get_watersheds_with_rivers("portugal")
result.watershed # GeoDataFrame
result.rivers # GeoDataFrame
Gauge ID Normalization
Slashes are automatically normalized to dashes:
wr.get_watershed("portugal", "04K/04A") # slash
wr.get_watershed("portugal", "04K-04A") # dash — equivalent
Errors
from watershed_retrieve import (
WatershedRetrieveError, # base class
CountryNotFoundError, # invalid country name
GaugeNotFoundError, # gauge ID not in dataset
DataNotFoundError, # parquet file missing
DataUnavailableError, # region exists but data not yet extracted
R2ConnectionError, # CDN fetch failed
)
DataUnavailableError is raised for regions where gauging stations are registered in RivRetrieve but basin delineation is pending (e.g., UK regions — the British Isles fall outside MERIT-Hydro coverage).
Supported Countries
| Country | MERIT-Hydro | HydroSHEDS v1 | Status |
|---|---|---|---|
| Australia | ~6,210 | ~6,240 | Available |
| Brazil | ~4,600 | ~4,610 | Available |
| Canada | ~7,630 | ~7,240 | Available |
| Chile | ~540 | ~530 | Available |
| Czech Republic | ~820 | ~820 | Available |
| France | ~5,330 | ~5,360 | Available |
| Germany | ~190 | ~190 | Available |
| Japan | ~820 | ~810 | Available |
| Lithuania | ~100 | ~100 | Available |
| Norway | ~4,540 | ~1,460 | Available |
| Poland | ~1,300 | ~1,300 | Available |
| Portugal | ~710 | ~710 | Available |
| Slovenia | ~710 | ~710 | Available |
| South Africa | ~1,290 | ~1,290 | Available |
| Spain | ~1,480 | ~1,490 | Available |
| UK (EA) | — | — | Pending — coverage gap |
| UK (NRFA) | — | — | Pending — coverage gap |
| USA | ~23,860 | ~23,700 | Available |
Development
# Install
git clone https://github.com/CooperBigFoot/watershed-retrieve.git
cd watershed-retrieve
uv sync
# Unit tests (no data or network needed)
uv run pytest tests/ -v -m "not integration and not network"
# Integration tests (requires local parquet data)
WATERSHED_RETRIEVE_DATA_DIR=/path/to/data uv run pytest tests/ -v -m integration
# Lint & format
uv run ruff check --fix src/ tests/
uv run ruff format src/ tests/
See CONTRIBUTING.md for full development guidelines.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file watershed_retrieve-1.1.1.tar.gz.
File metadata
- Download URL: watershed_retrieve-1.1.1.tar.gz
- Upload date:
- Size: 7.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.2 {"installer":{"name":"uv","version":"0.11.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
183bdc04d424818c895a1580acfc3a2a13a0f3b794f2e8e7acce2363c50e45ee
|
|
| MD5 |
5ab0529a2b6dc14c3de893f05c54a853
|
|
| BLAKE2b-256 |
52893b68bdf073548604e11ceb23433ea8aaaa1fc3dfb5ed4e46f33f9eebf625
|
File details
Details for the file watershed_retrieve-1.1.1-py3-none-any.whl.
File metadata
- Download URL: watershed_retrieve-1.1.1-py3-none-any.whl
- Upload date:
- Size: 9.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.2 {"installer":{"name":"uv","version":"0.11.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8c6e9fd83068f0b9ad6c026fd4bf64f0b19c70a665236580e50719674c6a6a8b
|
|
| MD5 |
690a43532e082c5f172de3073537e431
|
|
| BLAKE2b-256 |
8477aa229c38a8d0dc953701ad8139c8d92590ea7a29f91e7a4d890d20dd2870
|