A Python library for n-dimensional Earth observation data processing
The main goal of this library is to generalize methods that work in lower dimensions to higher-dimensional data.
Multi-dimensional data often arises as spatio-temporal datacubes,
e.g. climate data or time series of geospatial satellite data.
Many data analysis methods are designed to work on single images
or time series at a single point.
nd makes it easy to broadcast these methods across a whole dataset,
adding additional features such as automatic parallelization.
- pixelwise change detection algorithms
- reprojection between coordinate systems
- machine learning algorithms
nd is built on
Internally, all data is passed around as
xarray Datasets and all provided methods expect this format as inputs.
xarray.Dataset is essentially a Python representation of the NetCDF file format and as such easily reads/writes NetCDF files.
nd is making heavy use of the
The GDAL library is only used via
rasterio as a compatibility layer to enable reading supported file formats.
nd.open_dataset may be used to read any NetCDF file or any GDAL-readable file into an
Read the Documentation for detailed user guides.
You can also have a look at these two example notebooks:
pip install nd
It is recommended that you have GDAL available before installation and also make sure to have the correct environment variable set:
export GDAL_DATA=$(gdal-config --datadir)
Note that the following algorithms require the
libgsl-dev C library to be installed:
What does this library add?
xarray provides all data structures required for dealing with
n-dimensional data in Python.
nd explicitly does not aim to add additional data structures or file formats.
Rather, the aim is to bring the various corners of the scientific ecosystem in Python closer together.
nd adds functionality to more seamlessly integrate libraries like
ndallows to reproject an entire multivariate and multi-temporal dataset between different coordinate systems by wrapping
ndprovides a wrapper for
scikit-learnestimators to easily apply classification algorithms to raster data.
nd contains a growing library of algorithms that are especially useful for spatio-temporal datacubes, for example:
change detection algorithms
xarray is our library of choice for representing geospatial raster data, this is also an attempt to promote the use of
xarray and the NetCDF file format in the Earth Observation community.
NetCDF (specifically NetCDF-4) is a highly efficient file format that was built on top of HDF5. It is capable of random access which ties in with indexing and slicing in
Because slices of a large dataset can be accessed independently, it becomes feasible to handle larger-than-memory file sizes. NetCDF-4 also supports data compression using
zlib. Random access capability for compressed data is maintained through data chunking.
Furthermore, NetCDF is designed to be fully self-descriptive. Crucially, it has a concept of named dimensions and coordinates, can store units and arbitrary metadata.
Feature requests and bug reports
For feature requests and bug reports please submit an issue on the Github repository.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size nd-0.2.tar.gz (99.9 MB)||File type Source||Python version None||Upload date||Hashes View|