nwp-consumer

Tool for aggregating NWP .grib files into .zarrs

Project description

NWP CONSUMER

Microservice for consuming NWP data.

A microservice for multi-source consumption of NWP data, storing it in a common format. Built with inspiration from the Hexagonal Architecture pattern, the nwp-consumer is currently packaged with adapters for pulling and converting .grib data from:

Similarly, the service can write to multiple sinks:

Local filesystem
AWS S3
HuggingFace Datasets

Its modular nature enables straightforward extension to alternate future sources.

Running the service

The service uses environment variables to configure sources and sinks in accordance with the Twelve-Factor App methodology. The program will inform you of missing env vars when using an adaptor, but you can also check the config for the given module, or use the env command.

Using Docker

This service is designed to be run as a Docker container. The Containerfile is the Dockerfile for the service. It is recommended to run it this way due to the dependency on external non-python binaries, which at the moment cannot be easily distributed in a PyPi package. To run, pull the latest version from ghcr.io via:

$ docker run \
  -v /path/to/datadir:/data \
  -e ENV_VAR=<value> \
  ghcr.io/openclimatefix/nwp-consumer:latest <command...>

Using the Python Package

Ensure the external dependencies are installed. Then, do one of the following:

Either

Install from PyPI with
```
$ pip install nwp-consumer
```

Clone the repository and install the package via

$ git clone git@github.com:openclimatefix/nwp-consumer.git
$ cd nwp-consumer
$ pip install .

Then run the service via

$ ENV_VAR=<value> nwp-consumer <command...>

CLI

Whether running via Docker or the Python package, available commands can be found with the command help or the --help flag. For example:

$ nwp-consumer --help
# or
$ docker run ghcr.io/openclimatefix/nwp-consumer:latest --help

Ubiquitous Language

The following terms are used throughout the codebase and documentation. They are defined here to avoid ambiguity.

InitTime - The time at which a forecast is initialised. For example, a forecast initialised at 12:00 on 1st January.
TargetTime - The time at which a predicted value is valid. For example, a forecast with InitTime 12:00 on 1st January predicts that the temperature at TargetTime 12:00 on 2nd January at position x will be 10 degrees.

Repository structure

Produced using exa:

$ exa --tree --git-ignore -F -I "*init*|test*.*"

./
├── Containerfile # The Dockerfile for the service
├── pyproject.toml # The build configuration for the service
├── README.md
└── src/
   ├── nwp_consumer/ # The main library package
   │  ├── cmd/
   │  │  └── main.py # The entrypoint to the service
   │  └── internal/ # Packages internal to the service. Like the 'lib' folder
   │     ├── config/ 
   │     │  └── config.py # Contains the configuration specification for running the service
   │     ├── inputs/ # Holds subpackages for each incoming data source
   │     │  ├── ceda/
   │     │  │  ├── _models.py
   │     │  │  ├── client.py # Contains the client and functions to map CEDA data to the service model
   │     │  │  └── README.md # Info about the CEDA data source
   │     │  └── metoffice/
   │     │     ├── _models.py
   │     │     ├── client.py # # Contains the client and functions to map MetOffice data to the service model
   │     │     └── README.md # Info about the MetOffice data source
   │     ├── models.py # Describes the internal data models for the service
   │     ├── outputs/ # Holds subpackages for each data sink
   │     │  ├── localfs/
   │     │  │  └── client.py # Contains the client for storing data on the local filesystem
   │     │  └── s3/
   │     │     └── client.py # Contains the client for storing data on S3
   │     └── service/ # Contains the business logic and use-cases of the application
   │        └── service.py # Defines the service class for the application, whose methods are the use-cases
   └── test_integration/

nwp-consumer is structured following principles from the hexagonal architecture pattern. In brief, this means a clear separation between the application's business logic - it's Core - and the Actors that are external to it. In this package, the core of the service is in internal/service/ and the actors are in internal/inputs/ and internal/outputs/. The service logic has no knowledge of the external actors, instead defining interfaces that the actors must implement. These are found in internal/models.py. The actors are then responsible for implementing these interfaces, and are dependency-injected in at runtime. This allows the service to be easily tested and extended. See further reading for more information.

Local development

Clone the repository, and create and activate a new python virtualenv for it. cd to the repository root.

Install the External and Python dependencies as shown in the sections below.

Taskfile

This repository bundles often used commands into a taskfile for convenience. To use these commands, ensure go-task is installed, easily done via homebrew.

You can then see the available tasks using

$ task -l

External dependencies

The cfgrib python library depends on the ECMWF cfgrib binary, which is a wrapper around the ECMWF ecCodes library. One of these must be installed on the system and accessible as a shared library.

On a MacOS with HomeBrew use

$ brew install eccodes

Or if you manage binary packages with Conda use

$ conda install -c conda-forge cfgrib

As an alternative you may install the official source distribution by following the instructions at https://confluence.ecmwf.int/display/ECC/ecCodes+installation

You may run a simple selfcheck command to ensure that your system is set up correctly:

$ python -m <eccodes OR cfgrib> selfcheck
Found: ecCodes v2.27.0.
Your system is ready.

Python requirements

Install the required python dependencies and make it editable with

$ pip install -e .

or use the taskfile

$ task install

This looks for requirements specified in the pyproject.toml file.

Note that these are the bare dependencies for running the application. If you want to run tests, you need the development dependencies as well, which can be installed via

$ pip install -e .[dev]

$ task install-dev

Where is the requirements.txt file?

There is no requirements.txt file. Instead, the project uses setuptool's pyproject.toml integration to specify dependencies. This is a new feature of setuptools and pip, and is the recommended way to specify dependencies. See the setuptools guide and the PEP621 specification for more information, as well as Further Reading.

Running tests

Ensure you have installed the Python requirements and the External dependencies.

Run the unit tests with

$ python -m unittest discover -s src/nwp_consumer -p "test_*.py"

$ task test-unit

and the integration tests with

$ python -m unittest discover -s test_integration -p "test_*.py"

$ task test-integration

See further reading for more information on the src directory structure.

Contributing and community

See the OCF Organisation Repo for details on contributing.
Find out more about OCF in the Meta Repo.
Follow OCF on Twitter.
Check out the OCF blog at https://openclimatefix.org/blog for updates.

Project details

Release history Release notifications | RSS feed

0.5.33

Oct 22, 2024

0.5.32

Oct 22, 2024

0.5.30

Oct 22, 2024

0.5.28

Oct 15, 2024

0.5.27

Oct 15, 2024

0.5.23

Aug 2, 2024

0.5.22

Aug 2, 2024

0.5.19

Jul 29, 2024

0.5.18

Jul 26, 2024

0.5.17

Jul 26, 2024

0.5.14

Jun 17, 2024

0.5.12

May 13, 2024

0.5.10

Apr 29, 2024

0.5.9

Apr 10, 2024

0.5.8

Apr 9, 2024

0.5.7

Mar 18, 2024

0.3.2

Feb 9, 2024

0.3.1

Feb 6, 2024

0.3.0

Feb 5, 2024

0.2.1

Feb 2, 2024

0.2.0

Jan 30, 2024

0.1.30

Dec 13, 2023

0.1.29

Dec 13, 2023

0.1.28

Dec 13, 2023

0.1.27

Dec 12, 2023

This version

0.1.26

Dec 11, 2023

0.1.23

Oct 17, 2023

0.1.22

Oct 17, 2023

0.1.21

Oct 5, 2023

0.1.20

Oct 4, 2023

0.1.19

Sep 26, 2023

0.1.18

Sep 26, 2023

0.1.17

Sep 26, 2023

0.1.16

Sep 15, 2023

0.1.15 yanked

Sep 15, 2023

Reason this release was yanked:

Broken entrypoint

0.1.14

Sep 15, 2023

0.1.13

Sep 15, 2023

0.0.1 yanked

Sep 15, 2023

Reason this release was yanked:

Test Upload

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

nwp_consumer-0.1.26-py3-none-any.whl (68.5 MB view hashes)

Uploaded Dec 11, 2023 Python 3

Hashes for nwp_consumer-0.1.26-py3-none-any.whl

Hashes for nwp_consumer-0.1.26-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3f0cc5bdb27ebfec9834e9c5073944008d654d76d40857f6205306a71121baa8`
MD5	`43d6e2a5d3f59ec730be35e7a727f95e`
BLAKE2b-256	`6005e6c6b77f34477d7693100800cbfed68d04ba44c1f2aa4425ba9c42633314`

nwp-consumer 0.1.26

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

NWP CONSUMER

Microservice for consuming NWP data.

Running the service

Using Docker

Using the Python Package

CLI

Ubiquitous Language

Repository structure

Local development

Taskfile

External dependencies

Python requirements

Running tests

Further reading

Contributing and community

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

nwp-consumer 0.1.26

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

NWP CONSUMER Microservice for consuming NWP data.

Running the service

Using Docker

Using the Python Package

CLI

Ubiquitous Language

Repository structure

Local development

Taskfile

External dependencies

Python requirements

Running tests

Further reading

Contributing and community

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

NWP CONSUMER

Microservice for consuming NWP data.