Skip to main content

Environmental data quality control

Project description

EnviroDataQC

This library provides a framework for assessing quality of environmental data.

Data is assessed with respect to:

  • Data Range
  • Data rate of change
  • Data flatlining

Additionally, special methods are provided for assessing wind speed and direction data. Data is classified as either suspicious or bad based on either default or custom user settings.

Installation

pip install envirodataqc

The configuration file is currently a python script in the package, so it may be easier to clone the repository rather than install from PYPI if you want to change the configuration.

Basic Use

Pass data (Pandas series) and measurement type to check_vals(). A Pandas dataframe is returned with three new columns: 'flags_range', 'flags_rate', 'flags_flat'. Measurement types supported are defined in 'QCconfig.py'.

Flags:

  • 0 : Good
  • 1 : Suspicious
  • 2 : Bad

Configuration

Change and/or add dictionaries defined in 'QCconfig.py'. Dictionary entries define "good" ranges and "suspicious" ranges for each flag category. Configuration ranges can be non-continuous and any overlap between "good" and "suspicious" ranges will be flagged as "good".

Wind specific checks

  • check_windsp_ratio - Check ratio of mean windspeed to max
  • check_windsp_withdir - Check if windspeed is consistent with direction
  • check_winddir_withsp - Check if direction is consistent with windspeed

Other Functions

  • check_gaps(index) - Given a Pandas datetime index outputs total time where gaps between timestamps are > 1hr.
  • daily_quality(data) - Given a Pandas series (index=timestamp, values=data flags), return a pandas series with a consolidated daily quality level (0,1,2)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

envirodataqc-0.3.1.tar.gz (8.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

envirodataqc-0.3.1-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file envirodataqc-0.3.1.tar.gz.

File metadata

  • Download URL: envirodataqc-0.3.1.tar.gz
  • Upload date:
  • Size: 8.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.7

File hashes

Hashes for envirodataqc-0.3.1.tar.gz
Algorithm Hash digest
SHA256 f56b99857ccb371e9f4e29dff0fa444a74b36296303a17a0e8a20e595300e8bf
MD5 a761a8bdc2ae47ac65aeb452581b797c
BLAKE2b-256 c39e9f6f785ffc355325025ed4b6607c7f8565e4a3aab6d87a4a744dd818297d

See more details on using hashes here.

File details

Details for the file envirodataqc-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: envirodataqc-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.0 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.7

File hashes

Hashes for envirodataqc-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ac646afb90c0b55323a0158cf5798c5c588b91d2da883a74769a1b7dd9a546ea
MD5 9127bcb5a8f898afe04de44d3cf331af
BLAKE2b-256 70ef133067f41da9648f816fb78fef581b5175dd64bedf6f827682826c991017

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page