Skip to main content

New Mexico Water Data Integration Engine

Project description

New Mexico Unified Water Data: Data Integration Engine

Format code Publish Python 🐍 distributions 📦 to PyPI and TestPyPI CI/CD Dependabot Updates

NMWDI NMBGMR

This package provides a command line interface to New Mexico Water Data Initiaive's Data Integration Engine. This tool is used to integrate the water data from multiple sources.

Installation

pip install nmuwd

Sources

Data comes from the following sources. We are continuously adding new sources as we learn of them and they become available. If you have data that you would like to be part of the Data Integration Engine please get in touch at newmexicowaterdata@nmt.edu.

Usage

Parameter Data

To obtain parameter summary or time series data, use

die weave {parameter}

where {parameter} is the name of the parameter whose data is to be retrieved, followed by the desired output type, excluded data sources, date filters, and geographic filters. {parameter} is case-insensitive.

Available Parameters

waterlevels arsenic bicarbonate conductivity calcium carbonate chloride fluoride magnesium nitrate ph potassium silica sodium specific conductance sulfate tds uranium
bernco X - - - - - - - - - - - - - - - - -
bor - X - X X - X X X X X X X X - X X X
cabq X - - - - - - - - - - - - - - - - -
ebid X - - - - - - - - - - - - - - - - -
nmbgmr-amp X X X - X X X X X X X X X X X X X X
nmed-dwb - X X - X - X X X X X X X X X X X X
nmose-isc-seven-rivers X - X X X - X X X X X X X X X X X -
nmose-pod - - - - - - - - - - - - - - - - - -
nmose-roswell X - - - - - - - - - - - - - - - - -
nwis X - - - - - - - - - - - - - - - - -
pvacd X - - - - - - - - - - - - - - - - -
wqp X X X X X X X X X X X X X X X X X* X

*TDS data from WQP may contain duplicates. Duplicates are identified when they have the same ActivityIdentifier. If duplicates are identified, only one is kept as identified by its USGS pCode. The order of preference for the pCodes is: 70300, 70301, 70303.

**While conductivity and specific conductance are often used interchangeably, they are distinguished here by the methods with which they are determined. A record is defined as specific conductance if it was determined at the standard 25°C (e.g. EPA method 120.1), otherwise it is defined as conductivity

Output Type

The --output-type option is required and used to set the output type:

--output-type summary
  • A summary table consisting of location information as well as summary statistics for the parameter of interest for every location that has observations.
--output-type timeseries_unified
  • A single table consisting of time series data for all locations for the parameter of interest.
  • A single table of site data that contains information such as latitude, longitude, and elevation
--output-type timeseries_separated
  • Separate time series tables for all locations for the parameter of interest.
  • A single table of site data that contains information such as latitude, longitude, and elevation

The data is saved to a directory titled output in the current working directory. If the directory output already exists, then the output directory will be called output_1. If enumerated output directories already exist, then the output directory will be called output_{n} where n is equal to the greatest existing integer suffix +1.

A log of the inputs and processes, called die.log, is also saved to the output directory.

Summary Table

field/header description data type always present
source the organization/source for the site string Y
id the id of the site. The id is used as the key to join the site and timeseries tables string Y
name the colloquial name for the site string Y
usgs_site_id USGS site id string N
alternate_site_id alternate site id string N
latitude latitude in decimal degrees float Y
longitude longitude in decimal degrees float Y
horizontal_datum horizontal datum of the latitude and longitude. Defaults to WGS84 string Y
elevation* ground surface elevation of the site float Y
elevation_units the units of the ground surface elevation. Defaults to ft string Y
well_depth depth of well float N
well_depth_units units of well depth. Defaults to ft string N
parameter_name the name of the parameter whose measurements are reported in the table string Y
parameter_units units of the observation string Y
nrecords number of records at the site for the parameter integer Y
min the minimum observation float Y
max the maximum observation float Y
mean the mean value of the observations float Y
earliest_date date of the earliest record in YYYY-MM-DD string Y
earliest_time time of the earliest record in HH:MM:SS or HH:MM:SS.mmm string N
earliest_value value of the earliest recent record float Y
earliest_units units of the earliest record string Y
latest_date date of the latest record in YYYY-MM-DD string Y
latest_time time of the latest record in HH:MM:SS or HH:MM:SS.mmm string N
latest_value value of the latest recent record float Y
latest_units units of the latest record string Y

*CABQ elevation is calculated as [elevation at top of casing] - [stickup height]; if stickup height < 0 the measuring point is assumed to be beneath the ground surface

Sites Table

field/header description data type always present
source the organization/source for the site string Y
id the id of the site. The id is used as the key to join the site and timeseries tables string Y
name the colloquial name for the site string Y
latitude latitude in decimal degrees float Y
longitude longitude in decimal degrees float Y
elevation** ground surface elevation of the site float Y
elevation_units the units of the ground surface elevation. Defaults to ft string Y
horizontal_datum horizontal datum of the latitude and longitude. Defaults to WGS84 string Y
vertical_datum vertical datum of the elevation string N
usgs_site_id USGS site id string N
alternate_site_id alternate site id string N
formation geologic formation in which the well terminates string N
aquifer aquifer from which the well draws water string N
well_depth depth of well float N
well_depth_units units of well depth. Defaults to ft string N

**CABQ elevation is calculated as [elevation at top of casing] - [stickup height]; if stickup height < 0 the measuring point is assumed to be beneath the ground surface

Time Series Table(s)

field/header description data type always present
source the organization/source for the site string Y
id the id of the site. The id is used as the key to join the site and timeseries tables string Y
parameter_name the name of the parameter whose measurements are reported in the table string Y
parameter_value value of the observation float Y
parameter_units units of the observation string Y
date_measured date of measurement in YYYY-MM-DD string Y
time_measured time of measurement in HH:MM:SS or HH:MM:SS.mmm string N
source_parameter_name the name of the parameter from the source string Y
source_parameter_units the unit of measurement from the source string Y
conversion_factor the factor applied to the result to convert the measurement to standardized units float or int Y

Output Format

The --output-format option is used to determine the file format for the summary and sites tables. The available options are csv and geojson. If not specified, it defaults to csv.

Source Inclusion & Exclusion

The Data Integration Engine enables the user to obtain groundwater level and groundwater quality data from a variety of sources. Data from sources are automatically included in the output if available unless specifically excluded. The following flags are available to exclude specific data sources:

  • --no-bernco to exclude Bernalillo County (BernCo) data
  • --no-bor to exclude Bureau of of Reclamation (Bor) data
  • --no-cabq to exclude City of Albuquerque (CABQ) data
  • --no-ebid to exclude Elephant Butte Irrigation District (EBID) data
  • --no-nmbgmr-amp to exclude New Mexico Bureau of Geology and Mineral Resources (NMBGMR) Aquifer Mapping Program (AMP) data
  • --no-nmed-dwb to exclude New Mexico Environment Department (NMED) Drinking Water Bureau (DWB) data
  • --no-nmose-isc-seven-rivers to exclude New Mexico Office of State Engineer (NMOSE) Interstate Stream Commission (ISC) Seven Rivers data
  • --no-nmose-pod to exclude New Mexico Office of State Engineer (NMOSE) Point of Diversion (POD) data (though none except for well information is currently available)
  • --no-nmose-roswell to exclude New Mexico Office of State Engineer (NMOSE) Roswell data
  • --no-nwis to exclude USGS NWIS data
  • --no-pvacd to exclude Pecos Valley Artesian Convservancy District (PVACD) data
  • --no-wqp to exclude Water Quality Portal (WQP) data

Geographic Filters [In Development]

The following flags can be used to geographically filter data:

-- county {county name}
-- bbox 'x1 y1, x2 y2'
-- wkt {wkt polygon or multipolygon}

Date Filters [In Development]

The following flags can be used to filter by dates:

--start-date YYYY-MM-DD 
--end-date YYYY-MM-DD

Source Enumeration [In Development]

Use

die sources {parameter}

to print the sources that report that parameter to the terminal.

Sites

Use

die sites

to export site information only

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nmuwd-0.10.0.tar.gz (71.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nmuwd-0.10.0-py3-none-any.whl (92.9 kB view details)

Uploaded Python 3

File details

Details for the file nmuwd-0.10.0.tar.gz.

File metadata

  • Download URL: nmuwd-0.10.0.tar.gz
  • Upload date:
  • Size: 71.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for nmuwd-0.10.0.tar.gz
Algorithm Hash digest
SHA256 f4a07dd5a85d68e91fbe1b975d9b2159a05418baf438464201da590a3bce5b42
MD5 5b265c130c6683218c51daff61746f31
BLAKE2b-256 a7964e0b252a694af777bede228ddffe99bde43555bfcffbbee5763af287e281

See more details on using hashes here.

File details

Details for the file nmuwd-0.10.0-py3-none-any.whl.

File metadata

  • Download URL: nmuwd-0.10.0-py3-none-any.whl
  • Upload date:
  • Size: 92.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for nmuwd-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3d3bd23c083f2113608b50368cf6de43bba58dc08f9abd01fe533db569f266b5
MD5 02d30e9b2ac3e46f897b447b151a3dfa
BLAKE2b-256 56d69a113ab9e78615d22115d26194f58011b56e38583f06c063fa40abb79c91

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page