Skip to main content

A open-data solution for WIS

Project description

hydro-opendata

image

📜 中文文档

Methods and paths for obtaining, managing, and utilizing open data for hydrological scientific computations.

Background

In the era of artificial intelligence, data-driven hydrological models have been extensively researched and applied. With the advancements in remote sensing technologies and the trend towards open data sharing, accessing data has become more straightforward with a plethora of options. For researchers, questions like what data is required, what data can be accessed, where to download it, how to read it, and how to process it, are crucial. This repository aims to address these concerns.

This repository primarily focuses on external open data, categorizing data types, and creating a list. It aims to build a data flow and its tech stack that can seamlessly "download-store-process-read-write-visualize" the data.

Overall Solution

Data Framework

Main Data Sources

From our current understanding, the external data suitable for hydrological modeling includes but is not limited to:

Primary Category Secondary Category Update Frequency Data Structure Example
Basic Geography Hydrological Elements Static Vector Watershed boundary, site
Terrain Static Raster DEM, flow direction, land use
Weather & Meteorology Reanalysis Dynamic Raster ERA5
Near Real-Time Dynamic Raster GPM
Forecast Rolling Raster GFS
Imagery Satellite Remote Sensing Dynamic Raster Landsat, Sentinel, MODIS
Street View Images Static Multimedia
Surveillance Videos Dynamic Multimedia
Drone Footage Dynamic Multimedia
Crowdsourced Data POI Static Vector Baidu Map
Social Networks Dynamic Multimedia Weibo
Hydrological Data River Flow Data Dynamic Tabular GRDC

Data can be categorized based on their update frequency into static and dynamic data.

From a structural perspective, data can be classified into vector, raster, and multimedia (unstructured data).

Structure and Functional Framework

Code Repository

wis-stac

Data inventory and its metadata. Returns a data list based on AOI.

wis-downloader

Downloads data from external sources. Depending on the data source, the download methods may vary, including:

  • Integration with official APIs, e.g., bmi_era5
  • Retrieving data download links, e.g., Herbie, MultiEarth, Satpy. Most cloud data platforms like Microsoft, AWS, etc., organize data mostly as stac.

wis-processor

Preprocesses the data, such as watershed averaging, feature extraction, etc.

Uses kerchunk to convert different format data to zarr format and stores it in MinIO server. This enables cross-file reading and enhances data reading efficiency.

wis-s3api

After data processing in MinIO, it supports cross-file reading. Just provide data type, time range, and spatial range parameters to fetch the data.

For remote sensing imagery, due to the vast amount of data, it's not feasible to download and read each file. One can use stac+stackstac to directly read Sentinel or Landsat data into an xarray dataset.

wis-gistools

Integrates commonly used GIS tools, such as Kriging interpolation, Thiessen polygons, etc.

Visualization

Use leafmap to display geospatial data within the Jupyter platform.

Others

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hydro_opendata-0.0.8.tar.gz (43.0 kB view details)

Uploaded Source

File details

Details for the file hydro_opendata-0.0.8.tar.gz.

File metadata

  • Download URL: hydro_opendata-0.0.8.tar.gz
  • Upload date:
  • Size: 43.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/6.3.0 pkginfo/1.8.3 requests/2.28.1 requests-toolbelt/0.9.1 tqdm/4.64.0 CPython/3.9.12

File hashes

Hashes for hydro_opendata-0.0.8.tar.gz
Algorithm Hash digest
SHA256 022da64234691c00e7ec2112f852aac01612865ef0261224f5637dd191ac2c11
MD5 4f10c91695c68266fb617a6cea016b4c
BLAKE2b-256 ad211402cab0051b7f1c1a8dfa549adf72529c1bd42df72b1b8d81e5a95a9fbb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page