A high-performance storage and computation system for Blosc2 datasets

These details have not been verified by PyPI

Project links

Home

Project description

Caterva2: On-demand access to Blosc2/HDF5 data repositories

What is it?

Caterva2 is a service meant for serving Blosc2 and HDF5 datasets among authenticated users, work groups, or the public. There are several interfaces to Caterva2, including a web GUI, a REST API, a Python API, and a command-line client.

It can be used either remotely or locally, as a simple way to access datasets in a directory hierarchy, or to share them with other users in the same network.

The Python API is the recommended way for building your own Caterva2 clients, whereas the web client provides a more user-friendly interface for browsing and accessing datasets.

Caterva2 Clients

The main role of the Caterva2 package is to provide a simple and lightweight library to build your own Caterva2 clients. The variety of interfaces available allows you to choose the one that best fits your needs. For example, querying a dataset from source can be accomplished :

Via the web GUI using a browser
Via the Python API

import caterva2 as cat2
client = cat2.Client("https://cat2.cloud/demo")
print(client.get("@public/examples/numbers_color.b2nd")[2])

Via the command line client

$ cat2-client --server https://cat2.cloud/demo info @public/examples/numbers_color.b2nd

Via the REST API using a REST client like Postman or curl (see here).

In addition, as Caterva2 supports authentication, all client interfaces expose a way to log in and access private datasets. Administration of authenticated users may be done using the internal mechanics of Caterva2 (see section "User authentication" below).

Installation

You may install Caterva2 in several ways:

Pre-built wheel from PyPI:
```
python -m pip install caterva2
```

Developer setup:

git clone https://github.com/ironArray/Caterva2
cd Caterva2
python -m pip install -e .

When a client is used (web GUI, REST API, Python API, or command line) to query datasets, it will connect to a Caterva2 server service, which provides access to the datasets it serves. The server services may be managed via the command line by installing the caterva2 package with the [server] extra feature (we also wish to use the command line client, so we will also install the clients extra too):

python -m pip install caterva2 [server, clients]

In general, if you intend to run Caterva2 services, client programs, or the test suite, you need to enable the proper extra features by appending [feature1,feature2...] to the last argument of pip commands above. The following extras are supported:

server for running the Caterva2 server service
clients to use Caterva2 client programs (command-line or terminal)
blosc2-plugins to enable extra Blosc2 features like Btune or JPEG 2000 support
plugins to enable web GUI features like the tomography display
tests if you want to run the Caterva2 test suite

Testing

After installing with the [tests] extra, you can quickly check that the package is sane by running the test suite (that comes with the package):

$ python -m caterva2.tests
$ CATERVA2_SECRET=c2sikrit python -m caterva2.tests  # tests requiring authentication

You may also run tests from the root of the source code directory:

$ python -m pytest
$ CATERVA2_SECRET=c2sikrit python -m pytest  # tests requiring authentication

Tests will use a copy of Caterva2's root-example directory. After they finish, state files will be left under the _caterva2_tests directory for inspection (it will be re-created when tests are run again).

Quick start

(Find more detailed step-by-step tutorials in Caterva2 documentation.)

For this quick start, let's:

create a virtual environment and install Caterva2 with the [server,clients] extras (see above).
copy the configuration file caterva2.sample.toml to caterva2.toml, ~/.caterva2.toml, or /etc/caterva2.toml.
copy the server configuration file caterva2-server.sample.toml to caterva2-server.toml, ~/.caterva2-server.toml, or /etc/caterva2-server.toml.

Clients will search for configuration in the following order: current directory (./caterva2.toml), home directory (~/.caterva2.toml), and system-wide (/etc/caterva2.toml on Unix). You can also specify an alternative file with the --conf option. See also configuration.md in Caterva2 tutorials for more details. Server will look for caterva2-server.toml instead, either in the current directory, home directory, or system-wide; you can also specify an alternative file with the --conf option.

Then run the server:

CATERVA2_SECRET=c2sikrit cat2-server &  # server

The CATERVA2_SECRET environment variable is mandatory and is explained below in the following section.

Now, let's see the directories that have been created by the server:

tree _caterva2
_caterva2
└── state
    ├── db.json
    ├── db.sqlite
    ├── media
    ├── personal
    ├── public
    └── shared

6 directories, 2 files

We see that we have a state directory with several subdirectories. The personal directory is where we will store our personal datasets. The public directory is where we will store datasets that are shared with the public. The shared directory is where we will store datasets that are shared with other users. The media directory is where the server will store temporary media files (images, videos, etc.) that are used by the web GUI for internal purposes. The db.json and db.sqlite files are where the server will store its metadata.

For populating the server with datasets, let's copy the root-example directory from the Caterva2 source code:

$ cp -r root-example/ _caterva2/state/public/
$ tree _caterva2/state/public/
_caterva2/state/public/
├── dir1
│   ├── ds-2d.b2nd
│   └── ds-3d.b2nd
├── dir2
│   └── ds-4d.b2nd
├── ds-1d-b.b2nd
├── ds-1d-fields.b2nd
├── ds-1d.b2nd
├── ds-2d-fields.b2nd
├── ds-hello.b2frame
├── ds-sc-attr.b2nd
├── ex-noattr.h5
├── README.md
└── root-example.h5

3 directories, 12 files

Cool! Now we have a server with some datasets to play with. If you want to see the web GUI, open a browser and go to http://localhost:8000/?roots=@public for browsing these datasets.

User authentication

The Caterva2 server includes support for authenticating users. To enable it, run the server with the environment variable CATERVA2_SECRET set to some non-empty, secure string that will be used for various user management operations. Note that new accounts may be registered, but their addresses are not verified. Password recovery does not work either.

To create a user, you can use the cat2-admin adduser command. For example:

$ cat2-admin adduser user@example.com foobar11
User user@example.com with id b2f6f251-d2f0-4b8e-8d17-46ff65832e98 has been added.
User 'user@example.com' added successfully.
Password: foobar11

Client queries then require the same user credentials:

The user will be prompted to login when accessing the web client using a browser.
The Python API client can be authenticated in the following way:

client = cat2.Client("http://localhost:8000", ('user@example.com', 'foobar11'))

The command line client can be authenticated with the --user and --pass options (see below).

The command line client

Now that the services are running, we can use the cat2-client client to talk to the server. In another shell, let's list all the available roots in the system:

$ cat2-client --user "user@example.com" --pass "foobar11" roots
@public
@personal
@shared

To experiment a bit, let's upload a file from the root-examplefolder to the @personal root:

$ cat2-client --username user@example.com --password foobar11 upload root-example/ds-1d.b2nd @personal/ds-1d.b2nd
Dataset stored in @personal/ds-1d.b2nd

Now, let's list the datasets in the @personal root and see that the uploaded file appears:

$ cat2-client --username user@example.com --password foobar11 list @personal
ds-1d.b2nd

Let's ask the server for more info about the dataset:

$ cat2-client --username user@example.com --password foobar11 info @personal/ds-1d.b2nd
Getting info for @personal/ds-1d.b2nd
shape : [1000]
chunks: [100]
blocks: [10]
dtype : int64
nbytes: 7.81 KiB
cbytes: 4.90 KiB
ratio : 1.59x
mtime : 2025-10-03T12:08:03.962856Z
cparams:
  codec  : ZSTD (5)
  clevel : 1
  filters: [SHUFFLE]

This command returns dataset's metadata, including its shape, chunks, blocks, data type, and compression parameters.

You can also ask for the contents of the @public root:

$ cat2-client --username user@example.com --password foobar11 tree @public
├── README.md
├── dir1
│   ├── ds-2d.b2nd
│   └── ds-3d.b2nd
├── dir2
│   └── ds-4d.b2nd
├── ds-1d-b.b2nd
├── ds-1d-fields.b2nd
├── ds-1d.b2nd
├── ds-2d-fields.b2nd
├── ds-hello.b2frame
├── ds-sc-attr.b2nd
├── ex-noattr.h5
└── root-example.h5

We see the contents of the root-example directory that we copied before are available in the @public root too.

There are more commands available in the cat2-client client; ask for help with:

$ cat2-client --help

Docs

To see how to use the Python and REST API and web GUI, check out the Caterva2 documentation. You'll also find more information on how to use Caterva2, including tutorials, API references, and examples here.

That's all folks!

Project details

These details have not been verified by PyPI

Project links

Home

Release history Release notifications | RSS feed

This version

2025.12.3

Dec 3, 2025

2025.12.0.dev0 pre-release

Dec 3, 2025

2025.11.17.1

Nov 17, 2025

2025.11.17

Nov 17, 2025

2025.10.25

Oct 25, 2025

2025.8.7

Aug 7, 2025

2025.6.26

Jun 26, 2025

2025.5.2

May 2, 2025

2025.4.9

Apr 9, 2025

2025.2.20

Feb 20, 2025

2025.2.13

Feb 13, 2025

2025.1.30.1

Jan 30, 2025

2025.1.30

Jan 30, 2025

2024.7.1

Jul 1, 2024

2024.6.27

Jun 27, 2024

0.2.0

Mar 25, 2024

0.1

Feb 2, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

caterva2-2025.12.3.tar.gz (5.6 MB view details)

Uploaded Dec 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

caterva2-2025.12.3-py3-none-any.whl (919.5 kB view details)

Uploaded Dec 3, 2025 Python 3

File details

Details for the file caterva2-2025.12.3.tar.gz.

File metadata

Download URL: caterva2-2025.12.3.tar.gz
Upload date: Dec 3, 2025
Size: 5.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for caterva2-2025.12.3.tar.gz
Algorithm	Hash digest
SHA256	`43c24cbe606417b898a6fc6ec70d455f087b4e23416e1d1937507ceed233d5df`
MD5	`f3dfbf7262cd203ec85c796ab5422669`
BLAKE2b-256	`77fab84000ed5096a32501d83c2d35bb83134435ceaec7f85c912d9cf2c3802e`

See more details on using hashes here.

File details

Details for the file caterva2-2025.12.3-py3-none-any.whl.

File metadata

Download URL: caterva2-2025.12.3-py3-none-any.whl
Upload date: Dec 3, 2025
Size: 919.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for caterva2-2025.12.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4cddf4be3ef4d056d98c0b83c5c82ef63b7791a378fca3ac39d3b8c043f37c5e`
MD5	`f642df6acfd7798c0eda06ae038c4a49`
BLAKE2b-256	`3c3f8bff03e295b3fcca1bf7c87c434d0e3798845ec4b5b8fefe60a0776c2ca5`

See more details on using hashes here.

caterva2 2025.12.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Caterva2: On-demand access to Blosc2/HDF5 data repositories

What is it?

Caterva2 Clients

Installation

Testing

Quick start

User authentication

The command line client

Docs

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes