Skip to main content

Geoprocessing routines for GIS

Project description

About PyGeoprocessing

PyGeoprocessing is a Python/Cython based library that provides a set of commonly used raster, vector, and hydrological operations for GIS processing. Similar functionality can be found in ArcGIS/QGIS raster algebra, ArcGIS zonal statistics, and ArcGIS/GRASS/TauDEM hydrological routing routines.

PyGeoprocessing was developed at the Natural Capital Project to create a programmable, open source, and free GIS processing library to support the ecosystem service software InVEST. PyGeoprocessing’s design prioritizes computation and memory efficient runtimes, easy installation and cross compatibility with other open source and proprietary software licenses, and a simplified set of orthogonal GIS processing routines that interact with GIS data via filename. Specifically the functionally provided by PyGeoprocessing includes

  • programmable raster algebra routine (vectorize_datasets)

  • routines for simplified raster creation and statistics

  • integration with vector based geometry in many of the routines

  • a simplified hydrological routing library including,
    • d-infinity flow direction

    • support for plateau drainage

    • weighted and unweighted flow accumulation

    • and weighted and unweighted flow distance

Dependencies include

  • cython>=0.20.2

  • numpy>=1.8.2

  • scipy>=0.13.3

  • shapely>=1.3.3

  • gdal>=1.10.1

Release History

0.3.0 (XXX)

  • Added functions to construct raster and vectors on disk from reasonable datatypes (numpy matrices for rasters, lists of Shapely geometries for vectors).

  • Fixed an issue where reproject_datasource_uri would add geometry that couldn’t be projected directly into the output datasource. Function now only adds geometries that transformed without error and reports if any features failed to transform.

  • Added file flushing and dataset swig deletion in reproject_datasource_uri to handle a race condition that might have been occuring.

  • Fixed an issue when “None” was passed in on new raster creation that would attempt to directly set that value as the nodata value in the raster.

  • Added basic filetype-specific assertions for many geospatial filetypes, and tests for these assertions. These assertions are exposed in pygeoprocessing.testing.

  • Pygeoprocessing package tests can be run by invoking python setup.py nosetests. A subset of tests may also be run from an installed pygeoprocessing distribution by calling pygeoprocessing.test().

  • Fixed an issue with reclassify dataset that would occur when small rasters whose first memory block would extend beyond the size of the raster thus passing in “0” values in the out of bounds area. Reclassify dataset identified these as valid pixels, even though vectorize_datsets would mask them out later. Now vectorize_datasets only passes memory blocks that contain valid pixel data to its kernel op.

  • Added support for very small AOIs that result in rasters less than a pixel wide. Additionally an all_touched flag was added to allow the ALL_TOUCHED=TRUE option to be passed to RasterizeLayer in the AOI mask calculation.

  • Added watershed delination routine to pygeoprocessing.routing.delineate_watershed. Operates on a DEM and point shapefile, optionally snaps outlet points to nearest stream as defined by a thresholded flow accumulation raster and copies the oulet point fields into the constructed watershed shapefile.

  • Fixing a memory leak in block caches that held on to dataset, band, and block references even after the object was destroyed.

  • Add an option to route_flux that lets the current pixel’s source be included in the flux, or not. Previous version would include on the source no matter what.

  • Now using natcap.versioner for versioning instead of local versioning logic.

0.2.2 (2015-05-07)

  • Adding MinGW-specific compiler flags for statically linking pygeoprocessing binaries against libstdc++ and libgcc. Fixes an issue on many user’s computers when installing from a wheel on the Python Package Index without having two needed DLLs on the PATH, resuling in an ImportError on pygeoprocessing.geoprocessing_core.pyd.

  • Fixing an issue with versioning where ‘dev’ was displayed instead of the version recorded in pygeoprocessing/__init__.py.

  • Adding all pygeoprocessing.geoprocessing functions to pygeoprocessing.__all__, which allows those functions to appear when calling help(pygeoprocessing).

  • Adding routing_core.pxd to the manifest. This fixes an issue where some users were unable to compiler pygeoprocessing from source.

0.2.1 (2015-04-23)

  • Fixed a bug on the test that determines if a raster should be memory blocked. Rasters were not getting square blocked if the memory block was row aligned. Now creates 256x256 blocks on rasters larger than 256x256.

  • Updates to reclassify_dataset_uri to use numpy.digitize rather than Python loops across the number of keys.

  • More informative error messages raised on incorrect bounding box mode.

  • Updated docstring on get_lookup_from_table to indicate the headers are case insensitive.

  • Added updates to align dataset list that report which dataset is being aligned. This is helpful for logging feedback when many datasets are passed in that don’t take long enough to get a report from the underlying reproject dataset function.

  • pygeoprocessing.routing.routing_core includes pxd to be `cimport`able from a Cython module.

0.2.0 (2015-04-14)

  • Fixed a library wide issue relating to the underlying numpy types of GDT_Byte Datasets. Now correctly identify the signed and unsigned versions and removed all instances where code used to mod byte data to unsigned data and correctly creates signed/unsigned byte datasets during resampling.

  • Removed extract_band_and_nodata function since it exposes the underlying GDAL types.

  • Removed reclassify_by_dictionary since reclassify_dataset_uri provided almost the same functionality and was widely used.

  • Removed the class OrderedDict that was not used.

  • Removed the function calculate_value_not_in_dataset since it loaded the entire dataset into memory and was not useful.

0.1.8 (2015-04-13)

  • Fixed an issue on reclassifying signed byte rasters that had negative nodata values but the internal type stored for vectorize datasets was unsigned.

0.1.7 (2015-04-02)

  • Package logger objects are now identified by python heirarchical package paths (e.g. pygeoprocessing.routing)

  • Fixed an issue where rasters that had undefined nodata values caused striping in the reclassify_dataset_uri function.

0.1.6 (2015-03-24)

  • Fixing LICENSE.TXT to .txt issue that keeps reoccuring.

0.1.5 (2015-03-16)

  • Fixed an issue where int32 dems with INT_MIN as the nodata value were being treated as real DEM values because of an internal cast to a float for the nodata type, but a cast to double for the DEM values.

  • Fixed an issue where flat regions, such as reservoirs, that could only drain off the edge of the DEM now correctly drain as opposed to having undefined flow directions.

0.1.4 (2015-03-13)

  • Fixed a memory issue for DEMs on the order of 25k X 25k, still may have issues with larger DEMs.

0.1.3 (2015-03-08)

  • Fixed an issue so tox correctly executes on the repository.

  • Created a history file to document current and previous releases.

  • Created an informative README.rst.

0.1.2 (2015-03-04)

  • Fixing issue that caused “LICENSE.TXT not found” during pip install.

0.1.1 (2015-03-04)

  • Fixing issue with automatic versioning scheme.

0.1.0 (2015-02-26)

  • First release on PyPI.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pygeoprocessing-0.3.0a8.zip (131.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pygeoprocessing-0.3.0a8-cp27-none-win32.whl (495.4 kB view details)

Uploaded CPython 2.7Windows x86

File details

Details for the file pygeoprocessing-0.3.0a8.zip.

File metadata

  • Download URL: pygeoprocessing-0.3.0a8.zip
  • Upload date:
  • Size: 131.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pygeoprocessing-0.3.0a8.zip
Algorithm Hash digest
SHA256 bdd6db3848cee4c15254d2313b65c0380661c20a7ae6ffb7856838b39110b361
MD5 b72f6159844e1c23a382dd9bd65a70fc
BLAKE2b-256 bb5ad67c68b2971c6de78e62651a392ddde57bcdec861d390250ba7316cfd553

See more details on using hashes here.

File details

Details for the file pygeoprocessing-0.3.0a8-cp27-none-win32.whl.

File metadata

File hashes

Hashes for pygeoprocessing-0.3.0a8-cp27-none-win32.whl
Algorithm Hash digest
SHA256 3887b7255016ca54f8e13b3991d8157bd2f73431569bf8e8a11c0b6ba821c724
MD5 8de0d8c8272d82daeef11dc28f0ef85a
BLAKE2b-256 54badef091a67ed9d8d91beff023b97de84e620f4761093c10155a6a75583268

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page