Geoprocessing routines for GIS
PyGeoprocessing is a Python/Cython based library that provides a set of commonly used raster, vector, and hydrological operations for GIS processing. Similar functionality can be found in ArcGIS/QGIS raster algebra, ArcGIS zonal statistics, and ArcGIS/GRASS/TauDEM hydrological routing routines.
PyGeoprocessing was developed at the Natural Capital Project to create a programmable, open source, and free GIS processing library to support the ecosystem service software InVEST. PyGeoprocessing’s design prioritizes computation and memory efficient runtimes, easy installation and cross compatibility with other open source and proprietary software licenses, and a simplified set of orthogonal GIS processing routines that interact with GIS data via filename. Specifically the functionally provided by PyGeoprocessing includes
- programmable raster algebra routine (vectorize_datasets)
- routines for simplified raster creation and statistics
- integration with vector based geometry in many of the routines
- a simplified hydrological routing library including,
- d-infinity flow direction
- support for plateau drainage
- weighted and unweighted flow accumulation
- and weighted and unweighted flow distance
- Added support for very small AOIs that result in rasters less than a pixel wide. Additionally an
all_touchedflag was added to allow the ALL_TOUCHED=TRUE option to be passed to RasterizeLayer in the AOI mask calculation.
- Added watershed delination routine to pygeoprocessing.routing.delineate_watershed. Operates on a DEM and point shapefile, optionally snaps outlet points to nearest stream as defined by a thresholded flow accumulation raster and copies the oulet point fields into the constructed watershed shapefile.
- Fixing a memory leak in block caches that held on to dataset, band, and block references even after the object was destroyed.
- Add an option to route_flux that lets the current pixel’s source be included in the flux, or not. Previous version would include on the source no matter what.
- Now using natcap.versioner for versioning instead of local versioning logic.
- Adding MinGW-specific compiler flags for statically linking pygeoprocessing binaries against libstdc++ and libgcc. Fixes an issue on many user’s computers when installing from a wheel on the Python Package Index without having two needed DLLs on the PATH, resuling in an ImportError on pygeoprocessing.geoprocessing_core.pyd.
- Fixing an issue with versioning where ‘dev’ was displayed instead of the version recorded in pygeoprocessing/__init__.py.
- Adding all pygeoprocessing.geoprocessing functions to pygeoprocessing.__all__, which allows those functions to appear when calling help(pygeoprocessing).
- Adding routing_core.pxd to the manifest. This fixes an issue where some users were unable to compiler pygeoprocessing from source.
- Fixed a bug on the test that determines if a raster should be memory blocked. Rasters were not getting square blocked if the memory block was row aligned. Now creates 256x256 blocks on rasters larger than 256x256.
- Updates to reclassify_dataset_uri to use numpy.digitize rather than Python loops across the number of keys.
- More informative error messages raised on incorrect bounding box mode.
- Updated docstring on get_lookup_from_table to indicate the headers are case insensitive.
- Added updates to align dataset list that report which dataset is being aligned. This is helpful for logging feedback when many datasets are passed in that don’t take long enough to get a report from the underlying reproject dataset function.
- pygeoprocessing.routing.routing_core includes pxd to be `cimport`able from a Cython module.
- Fixed a library wide issue relating to the underlying numpy types of GDT_Byte Datasets. Now correctly identify the signed and unsigned versions and removed all instances where code used to mod byte data to unsigned data and correctly creates signed/unsigned byte datasets during resampling.
- Removed extract_band_and_nodata function since it exposes the underlying GDAL types.
- Removed reclassify_by_dictionary since reclassify_dataset_uri provided almost the same functionality and was widely used.
- Removed the class OrderedDict that was not used.
- Removed the function calculate_value_not_in_dataset since it loaded the entire dataset into memory and was not useful.
- Fixed an issue on reclassifying signed byte rasters that had negative nodata values but the internal type stored for vectorize datasets was unsigned.
- Package logger objects are now identified by python heirarchical package paths (e.g. pygeoprocessing.routing)
- Fixed an issue where rasters that had undefined nodata values caused striping in the reclassify_dataset_uri function.
- Fixing LICENSE.TXT to .txt issue that keeps reoccuring.
- Fixed an issue where int32 dems with INT_MIN as the nodata value were being treated as real DEM values because of an internal cast to a float for the nodata type, but a cast to double for the DEM values.
- Fixed an issue where flat regions, such as reservoirs, that could only drain off the edge of the DEM now correctly drain as opposed to having undefined flow directions.
- Fixed a memory issue for DEMs on the order of 25k X 25k, still may have issues with larger DEMs.
- Fixed an issue so tox correctly executes on the repository.
- Created a history file to document current and previous releases.
- Created an informative README.rst.
- Fixing issue that caused “LICENSE.TXT not found” during pip install.
- Fixing issue with automatic versioning scheme.
- First release on PyPI.
Release history Release notifications
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size & hash SHA256 hash help||File type||Python version||Upload date|
|pygeoprocessing-0.3.0a4-cp27-none-win32.whl (862.0 kB) Copy SHA256 hash SHA256||Wheel||2.7|
|pygeoprocessing-0.3.0a4.zip (500.8 kB) Copy SHA256 hash SHA256||Source||None|