
A Python API wrapping services of the Superb Data Kraken (SDK)


superb-data-klient

superb-data-klient is a lightweight library for accessing services of the Superb Data Kraken (SDK) platform. It wraps the platform's services and resources in a single Python client object that handles authorization, data fetching, and indexing.

It is primarily intended for use in a JupyterHub environment within the platform itself, but it can be configured for other environments as well.

Installation and Supported Versions

$ python -m pip install superb-data-klient

superb-data-klient officially supports Python 3.7+.

Usage

Authentication

Before using the API, it is necessary to authenticate against the OIDC provider of the SDK. This happens when the client object is instantiated, and there are two ways to do it.

  1. Using system environment variables. This is the default and should be used within a Jupyter environment; simply instantiating the client object is enough.

    import superbdataklient as sdk
    client = sdk.SDKClient()
    

    This, however, assumes that access and refresh tokens are available via the environment variables SDK_ACCESS_TOKEN and SDK_REFRESH_TOKEN (for supplying them manually, see the sketch after this list).

  2. Using login credentials

    import superbdataklient as sdk
    sdk.SDKClient(username='hasslethehoff', password='lookingforfreedom')
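
When using option 1 outside of the Jupyter environment, the tokens can be exported manually before instantiating the client. A minimal sketch, assuming valid tokens have already been obtained (the values below are placeholders):

import os
import superbdataklient as sdk

# Placeholder tokens: in practice these come from the SDK's OIDC provider.
os.environ['SDK_ACCESS_TOKEN'] = '<access-token>'
os.environ['SDK_REFRESH_TOKEN'] = '<refresh-token>'

client = sdk.SDKClient()  # picks the tokens up from the environment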
    

Configuration

By default, the client is configured for the default SDK instance, but it ships with settings for several other instances as well.

Setting Environment

import superbdataklient as sdk
client = sdk.SDKClient(env='sdk-dev')
client = sdk.SDKClient(env='sdk')

Overwriting Settings

client = sdk.SDKClient(domain='mydomain.ai', realm='my-realm', client_id='my-client-id', api_version='v13.37')

Examples

Organizations

client.organization_get_all()
client.organization_get_by_id(1337)
client.organization_get_by_name('my-organization')

Spaces

Get all spaces of a given organization

organization_id = 1234
client.space_get_all(organization_id)
client.space_get_by_id(organization_id, space_id)
client.space_get_by_name(organization_id, space_name)

Index

List all indices accessible with the given credentials

indices = client.index_get_all()

Get a specific document

document = client.index_get_document(index_name, doc_id)

Get all documents of an index:

documents = client.index_get_all_documents("index_name")

Get documents of an index lazily with a generator:

documents = client.index_get_documents("index-name")
for document in documents:
   print(document)

Write documents to an index

documents = [
   {
      "_id": 123,
      "name": "document01",
      "value": "value"
   },
   {
      "_id": 1337,
      "name": "document02",
      "value": "value"
   }
]
index_name = "index"
client.index_documents(documents, index_name)

The optional field _id is parsed and used as the document ID when indexing to OpenSearch.
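
If _id is omitted, the documents are indexed without an explicit ID (presumably OpenSearch generates one in that case). A minimal sketch, reusing client and index_name from above:

documents_without_ids = [
   {"name": "document03", "value": "value"},
   {"name": "document04", "value": "value"}
]
# No "_id" field given, so the document IDs are assumed to be generated on the OpenSearch side.
client.index_documents(documents_without_ids, index_name)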

List all indices accessible with the given credentials, filtered by organization, space, and type

client.index_filter_by_space("my-organization", "my-space", "index-type")

Use .* instead of my-space to get indices from all spaces of the given organization.

index_type is either ANALYSIS or MEASUREMENTS
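
For example, a wildcard query over all spaces of an organization might look like this (assuming the type string is passed exactly as listed above):

client.index_filter_by_space("my-organization", ".*", "ANALYSIS")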

Create an application index

mapping = {
   ...
}
client.application_index_create("my-application-index", "my-organization", "my-space", mapping)

Delete an application index by name

client.application_index_delete("my-organization_my-space_analysis_my-application-index")
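
The index name in the example above appears to follow the pattern <organization>_<space>_<type>_<name>. A small sketch that builds such a name, assuming this pattern holds:

# Naming pattern inferred from the example above; not an official API guarantee.
organization, space, index_type, name = "my-organization", "my-space", "analysis", "my-application-index"
full_index_name = f"{organization}_{space}_{index_type}_{name}"
client.application_index_delete(full_index_name)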

Storage

List files in Storage

files = client.storage_list_blobs(org_name, space_name)

Download files from Storage to a local directory

files = [
   'file01.txt',
   'directory/file02.json'
]
client.storage_download_files(organization='my-organization', space='my-space', files=files, local_dir='tmp')

Download files from Storage to a local directory, filtered by a regular expression (regex)

files = [
   'file01.txt',
   'directory/file02.json'
]
client.storage_download_files_with_regex(organization='my-organization', space='my-space', files=files, local_dir='tmp', regex=r'.*json$')

Upload files from a local directory to Storage. A meta.json file must be included and will be validated against a schema.

files = [
   'meta.json',
   'file01.txt',
   'file02.txt'
]

client.storage_upload_files_to_loadingzone(organization='my-organization', space='my-space', files=files, local_dir='tmp')
