Skip to main content

pywatershed is a Python package for hydrologic modeling

Project description

pywatershed

ci-badge codecov-badge Documentation Status

PyPI Version PyPI Status PyPI Versions

Purpose

The purpose of this repository is to refactor and redesign the PRMS modeling system while maintaining its functionality. Code modernization is a step towards unification with MODFLOW 6 (MF6).

The following motivations are taken from our AGU poster from December 2022 which provides additional details on motivations, project status, and current directions of this project as of approximately January 2023.

Goals of the USGS Enterprise Capacity (EC) project include:

  • A sustainable integrated, hydrologic modeling framework for the U.S. Geological Survey (USGS)
  • Interoperable modeling across the USGS, partner agencies, and academia

Goals for EC Watershed Modeling:

  • Couple the Precipitation-Runoff Modeling System (PRMS, e.g. Regan et al, 2018)  with MODFLOW 6 (MF6, e.g. Langevin et al, 2017) in a sustainable way
  • Redesign PRMS to be more modern and flexible
  • Prioritize process representations in the current National Hydrological Model (NHM) based on PRMS 5.2.1

Prototype an EC watershed model: "pywatershed"

  • Redesign PRMS quickly in python
  • Couple to MF6 via BMI/XMI interface (Hughes et al, 2021; Hutton et al, 2020)
  • Establish a prototyping ground for EC codes that couples to the compiled framework: low cost proof of concepts (at the price of potentially less computational performance)
  • Enable process representation hypothesis testing
  • Use cutting-edge techniques and technologies to improve models
  • Machine learning, automatic differentiation
  • Address challenges of modeling across space and time scales
  • Transition prototype watershed model to compiled EC code

Installation and Python Environments

This installation assumes you do not need to run automated tests locally. See the following "Developer Installation" section for complete installation instructions.

To install the software you will need Python >= 3.8. We recommend installing the python package dependencies using anaconda or miniconda. Most users will likely want to create the pyws_nb conda environment by running

conda env create -f examples/examples_env.yml.

One could also do

pip install -r examples/exampes_env.txt

but this is not guaranteed.

Once the environment is established, activate the environment and install pywatershed

conda activate pyws_nb; cd pywatershed; pip install .

If you would like to compile the fortran computational kernels for certain physical process representations (not required), you'll need a fortran compiler and you will run

export PYWS_FORTRAN=true; cd pywatershed; pip install .

See Developer Requirements below for more details.

Developer Installation

Git is required if you plan to contribute code back to the repository.

C and Fortran compilers are required. We are currently using gnu (gcc, gfortran) 11 and 12 as well as intel (icc, ifort) 2021 on Windows, Linux, and MacOS (including Apple Silicon). Both of these are freely obtainable but the installation process varies widely. We are looking for a conda-based approach to obtaining compilers, but currently do not have a solution. Compilers are needed for two applications:

  1. Compiling and running C/Fortran PRMS code to generate testing/verification data
  2. Compiling (installing) and running fortran backends/kernels for some hydrological process representations in pywatershed

On Apple Silicon, the PRMS source code is only currently known to compile with intel while the fortran kernels in pywatershed only compile with gnu.

Python >= 3.8 is required. Three different python environments are specified within the repository. These are:

  • Minimal (for developing/testing), 'pyws': ci/requirements/environment.yml
  • Notebooks (~= minimal + jupyter), 'pyws_nb': examples/examples_env.yml
  • Documentation (only if you want to build the documentation), 'pyws-docs': ci/requirements/doc.yml

We recommend (because we test it in CI) using anacoda or miniconda to establish these environments with the following commands

conda env create -f path/to/env_of_choice.yml

which will create the environment with "name" specified on the first line of the file, given before the path to the file above.

More detailed python environment installation instructions using conda can be found in examples/00_python_virtual_env.ipynb.

There are also .txt equivalents that can be used for installing from pip, like so:

pip install -r env_of_choice.txt

though these are not comprehensive installs as with conda and not tested.

Once the python environment and dependencies are established and activated (conda activate env_of_choice), pywatershed is installed for development into that environment with the following command

cd pywatershed; pip install -e .

The numpy extension F2PY is used to provide fortran compiled kernels of core calculations to boost performance. F2PY is documented within numpy. This repository is configured NOT to compile on install by default. Currently, we have not established this compilation procedure for Windows. On linux and MacOS, compilation of fortran kernels on package installation is achieved by the following code:

export SETUPTOOLS_ENABLE_FEATURES="legacy-editable"
export CC=path/to/gcc  # for example
export FC=path/to/gfortran  # for example
export PYWS_FORTRAN=true
cd path/to/pywatershed
pip install -e .

Once the dependencies are available, we want to verify the software by running its test suite. The following testing procedures are also covered in the notebook examples/01_automated_testing.ipynb. To run the tests, we first need to generate the test data. This consists of running PRMS and then converting the output to netcdf:

cd path/to/pywatershed/test_data/scripts
pytest -v -n=4 test_run_domains.py
pytest -v -n=8 test_nc_domains.py

Finally, run the tests themselves,

cd path/to/pywatershed/autotest
pytest -v -n=8

All tests should pass, XPASS, or XFAIL. XFAIL is an expected failure.

Contributing

We welcome community development! Please file Issues and/or Pull Requests in the appropriate places on github. The continuous integration (CI) procedure is the first gate keeper for new code contribution. The CI procedure is defined by .github/workflows/ci.yaml. This includes running the formatting and linting packages isort, black, and flake8 in addition to generating the test data and running the tests in autotest/. New codes need new tests so they can be verified moving ahead in time.

Example Notebooks

Jupyter notebooks containing examples are found in the examples/ directory. Numbered notebooks are tested. Notebooks 00 and 01 walk the user through the setting the python environment and running the software tests. Notebook 02 demonstrates modeling with pywatershed. Non-numbered notebooks cover additional topics. These notebooks are note yet covered by testing and so may be expected to have some issues until they are added to testing.

Overview of Repository Contents

The contents of directories at this level is described. Therein you may discover another README.md for more information.

.github/    Github actions for deploying continuous integration (CI)
autotest/   pywatershed package testing using pytest
bin/        PRMS executables distributed
ci/         Python environments for CI
doc/        Package/code documentation source code
examples/   How to use the package, mostly jupyter notebooks
prms_src/   PRMS source used for generating executables in bin/
pywatershed/      Package source
reference/  Ancillary materials for development
resources/  Static stuff like images
test_data/  Data used for automated testing

Disclaimer

This information is preliminary or provisional and is subject to revision. It is being provided to meet the need for timely best science. The information has not received final approval by the U.S. Geological Survey (USGS) and is provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the information.

From: https://www2.usgs.gov/fsp/fsp_disclaimers.asp#5

This software is in the public domain because it contains materials that originally came from the U.S. Geological Survey, an agency of the United States Department of Interior. For more information, see the official USGS copyright policy

Although this software program has been used by the USGS, no warranty, expressed or implied, is made by the USGS or the U.S. Government as to the accuracy and functioning of the program and related program material nor shall the fact of distribution constitute any such warranty, and no responsibility is assumed by the USGS in connection therewith. This software is provided "AS IS."

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywatershed-0.1.2.tar.gz (181.1 kB view details)

Uploaded Source

Built Distribution

pywatershed-0.1.2-py3-none-any.whl (190.8 kB view details)

Uploaded Python 3

File details

Details for the file pywatershed-0.1.2.tar.gz.

File metadata

  • Download URL: pywatershed-0.1.2.tar.gz
  • Upload date:
  • Size: 181.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for pywatershed-0.1.2.tar.gz
Algorithm Hash digest
SHA256 416eabe0fc88a6839adf677bdfade070db6a7ebb19c69c80cd0322548de37318
MD5 48dd8e38a4e3831e4a0c3460de742db3
BLAKE2b-256 c244a6ed2baa97af5f41bbe14efa7264b3ff875464fb1cf25f136042aed83b07

See more details on using hashes here.

File details

Details for the file pywatershed-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: pywatershed-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 190.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for pywatershed-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 267b261aa787e62cb8eaef91c8e93c0ef8c5a1137be888641c1f094a2fb1df8e
MD5 b2d1c58aa3cd8f7880add136aa78c6ed
BLAKE2b-256 0cfdf292dfa5827bdc7abee0b3c8204be768dad8e813ca3224b99ff1f5420b7d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page