Skip to main content

Extract, analyze, and plot hydrological timeseries for a site or watershed using the Google Earth Engine and USGS APIs

Project description

waterpyk

open in colab readthedocs badge PyPi badge Follow me on Twitter badge MIT license badge

A python package to analyze hydrological timeseries for a site or watershed using the Google Earth Engine and USGS APIs.

waterpyk is a simple package for extracting data from the Google Earth Engine (GEE) and USGS APIs and performing simple water-balance analyses. All you need to begin is a lat/long or a USGS gauge ID to start.

For more information on the root-zone water storage capacity and deficit, see Dralle et al., 2021, McCormick et al., 2021, and Rempe, McCormick et al. (in prep).

Check out the tutorial for basic waterpyk usage in GoogleColab here.

Get Started

waterpyk requires geopandas and ee. Read the workflow to install and authenticate ee (GEE) for Google Colab, pip, or conda. You will need to sign up for a GEE account if it is your first time.

Install waterpyk:

pip install waterpyk

To get all of the available data and plots for a site, simply supply a list of coordinates (such as [lat, long] or [gaugeID]).:

from waterpyk install main

coords = [LAT/LONG] # or
gage = [USGS_gage_ID]
yoursitename = main.StudyArea(COORDS_or_GAGE, layers = 'all')

A note on the layers parameter: Two default options are available for extracting data.:

layers = 'all'
layers = 'minimal'

‘All’ will extract precipitation (P), evapotranspiration (ET) from MODIS and PML, MODIS snow cover, MODIS landcover class, and SRTM elevation, including all relevant bands.

‘Minimal’ is the fastest option and extracts only the necessary data for the deficit (P, ET, and snow without extra MODIS products).

Alternatively, layers can also take a dataframe containing a row for each desired GEE asset. See the main and/or gee module docs for more information.

Note that most GEE assets should be extracted with no errors, however not all date formats have been tested, so new assets may throw errors and require manual extraction outside of waterpyk. For these cases, I suggest using the geemap package, which has many fantastic GEE tools and inspired some of this code as well!

Now that you’ve initialized your StudyArea object, all of the dataframes of extracted data will now be available in the form of:

yoursitename.daily_df_wide # includes deficit, wateryear deficit, and cumulative wateryear timeseries
yoursitename.streamflow
yoursitename.wateryear_totals
yoursitename.stats

Some single values are also available as attributes in this way, such as root-zone water storage capacity (Smax, mm) and and mean annual precipitation (MAP, mm):

yoursite.smax
yoursite.MAP

To plot, supply a string for which kind of plot (see below for the 5 options), such as:

yoursitename.plot(kind = 'timeseries')

Extra keyword arguments are available for the plots to change the color, line types, or included data for each kind. See the plots module for default kwargs.

Docs

For full documentation, see full documentation.

What Waterpyk Can Do

  • Download timeseries of GEE assets and:

    • Combining bands

    • Applying a scaling factor

    • Interpolating to daily

  • Download streamflow for gauged watersheds

  • Download watershed information, such as:

    • Boundary and flowline geometry (as GEE or geopandas format)

    • Metadata (such as CRS, name, and gauge ID)

    • Daily discharge (in native units, and converted to mm)

    • Centroid latitude (for calculation of Hargreaves PET)

    • URLs necessary to access all of the above directly

  • Calculate cumulative and total wateryear timeseries

  • Calculate the root-zone water storage (RWS) deficit

  • Create 4 figures, out of the box, with P, ET, Q, and deficit data, including:

    • Wateryear cumulative timeseries (kind = 'wateryear')

    • Daily timeseries, with options (kind = 'timeseries')

    • Plot & calculate Spearman correlation coefficient for summer ET and wateryear P (kind = 'spearman')

    • RWS capacity (Smax) relative to P distribution (kind = 'distribution')

    • RWS timeseries (kind = 'RWS')

Contact

This is a work in progress and is primarily intended for the Rempe Lab Group and co-authors. Feel free to report bugs, and note that major version updates may not be backwards compatible. For more information, contact Erica McCormick at erica.elmstead@gmail.com or the email address given on her homepage.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

waterpyk-2.1.0.tar.gz (39.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

waterpyk-2.1.0-py3-none-any.whl (41.0 kB view details)

Uploaded Python 3

File details

Details for the file waterpyk-2.1.0.tar.gz.

File metadata

  • Download URL: waterpyk-2.1.0.tar.gz
  • Upload date:
  • Size: 39.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 CPython/3.9.13

File hashes

Hashes for waterpyk-2.1.0.tar.gz
Algorithm Hash digest
SHA256 2a66ba1a40bda4a0b60a33587b9b8bfa25b3972bd520cdf3e136ecfd7e9e0878
MD5 866868ea7a24ef9e5593bea1f81fc916
BLAKE2b-256 e9594738b41c3ebc7fa25117a1047691b4c60fbb8b78557189ceaa117e4c6a42

See more details on using hashes here.

File details

Details for the file waterpyk-2.1.0-py3-none-any.whl.

File metadata

  • Download URL: waterpyk-2.1.0-py3-none-any.whl
  • Upload date:
  • Size: 41.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 CPython/3.9.13

File hashes

Hashes for waterpyk-2.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f93e9cb7423f3c7495e0f113d53d2dd37090e438f18e3f450e0643e73db172fe
MD5 55828694c9895f6f65264b1812329474
BLAKE2b-256 637c0d18399be885cf648810954a432a64105b755a93380f197df8f7389266ef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page