Skip to main content

A pure python implementation of the Data Access Protocol.

Project description

pydap

Ubuntu CI MacOS CI Python PyPI conda forge black license pre-commit.ci status DOI

What is pydap?

pydap is an open-source implementation of the OPeNDAP protocol, written from scratch in pure Python. You can use pydap to access scientific data available on the many OPeNDAP servers publicly-available through the internet. Because pydap supports remote and lazy evaluation, you can access the data without having to download it; Instead, you work with special array and iterable objects that download data on-the-fly as necessary, saving bandwidth and time. The module also comes with a robust-but-lightweight OPeNDAP server, implemented as a WSGI application.

Why pydap?

Originally developed in the 2000s, pydap is one of the oldest open-source Python projects available and is routinely developed and maintained by the OPeNDAP community at large. In addition, pydap is a long-recognized backend engine (and dependency) for xarray and chances are you have used pydap in the past without knowing it.

Quickstart

pydap is a lightweight python package that you can use in either of the two modalities: a client and as a server. You can install the latest version using pip. After installing pip you can install pydap with this command:

    $ pip install pydap

This will install pydap together with all the required dependencies.

pydap is also available through Anaconda. Below we install pydap and its required dependencies, along with common additional packages in a fresh conda environment named "pydap":

    $ conda create -n pydap -c conda-forge python=3.11 pydap numpy jupyterlab ipython netCDF4 scipy matplotlib

Now you simply activate the pydap environment:

    $ conda activate pydap

[!NOTE] If you have mamba installed, you can replace conda in the commands with mamba.

You can now use pydap as a client and open any remotely served dataset, and pydap will download the accessed data on-the-fly as needed. For example consider this dataset currently hosted on OPeNDAP's Hyrax data server

    from pydap.client import open_url
    pyds = open_url('http://test.opendap.org:8080/opendap/catalog/ghrsst/20210102090000-JPL-L4_GHRSST-SSTfnd-MUR-GLOB-v02.0-fv04.1.nc', protocol='dap4')
    pyds.tree()
    .20210102090000-JPL-L4_GHRSST-SSTfnd-MUR-GLOB-v02.0-fv04.1.nc
    ├──time
    ├──lat
    ├──lon
    ├──analysed_sst
    ├──analysis_error
    ├──mask
    ├──sea_ice_fraction
    ├──dt_1km_data
    └──sst_anomaly
    pyds['sst_anomaly'].shape
    (1, 17999, 36000)

[!NOTE] In the example above, no data was downloaded, it was all lazily evaluated using OPeNDAP's DMR (DAP4) metadata representation. For more information, please check the documentation on using pydap as a client.

pydap also comes with a simple server, implemented as a WSGI application. To use it, you first need to install the server and optionally a data handler:

Running pydap as a Server

    $ pip install "pydap[server,netcdf]"

This will install the necessary dependencies for running pydap as a server, along with extra dependencies for handling netCDF4 dataset. Now create a directory for your server data.

To run the server just issue the command:

    $ pydap --data ./myserver/data/ --port 8001 --workers 4 --threads 4

This will start a standalone server running on the default http://localhost:8001/, serving netCDF files from ./myserver/data/ Since the server uses the WSGI standard, pydap uses by default one (1) worker and one (1) thread, but these can be defined by the user like in the case above (four (4) workers and four (4) threads). pydap can also easily be run behind Apache.

The server documentation has more information on how to better deploy pydap.

Documentation

For more information, see the pydap documentation.

Help and Community

If you need any help with pydap, open an issue in this repository. You can also email the mailing list. Finally, if you have a broader OPeNDAP access question, you can reach the OPeNDAP team on the OPeNDAP Discourse!

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydap-3.5.10.tar.gz (20.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pydap-3.5.10-py3-none-any.whl (2.5 MB view details)

Uploaded Python 3

File details

Details for the file pydap-3.5.10.tar.gz.

File metadata

  • Download URL: pydap-3.5.10.tar.gz
  • Upload date:
  • Size: 20.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pydap-3.5.10.tar.gz
Algorithm Hash digest
SHA256 3c7047e5ff8cda18c65e356875ff3c1ae5966e4a71a9f8ea6558385252266fc1
MD5 a09e45c6272d8014e33465a2b751c8a7
BLAKE2b-256 4a7f713d1bba63248dd2a2f701d36a0aae438604609be910e8692e9b30a7e25b

See more details on using hashes here.

File details

Details for the file pydap-3.5.10-py3-none-any.whl.

File metadata

  • Download URL: pydap-3.5.10-py3-none-any.whl
  • Upload date:
  • Size: 2.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pydap-3.5.10-py3-none-any.whl
Algorithm Hash digest
SHA256 d5ee8252d8852986605dc55e0d1c02f1998c15c04e860cffa516524eeabfcf8f
MD5 bcb9d623454f2f314783e7902744a8f8
BLAKE2b-256 10161616a533765f44e3babddfae8c594cf6374c47ed54fd65e2a057471e4f19

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page