Datalake Catalog

Setup and start the API

Configure the parameters with a Python file.

For example, catalog.conf.py:

SECRET_KEY = b"changemenow"
DB_STRING = "sqlite://localhost/catalog.sqlite"
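
The example value of SECRET_KEY is a placeholder and should be replaced before deployment. The sketch below shows one way to produce a settings file with a random key, using only the Python standard library. The helper name render_settings and the 32-byte key length are illustrative choices, and it assumes the catalog accepts any bytes value for SECRET_KEY.

```python
import secrets


def render_settings(db_string: str, key_bytes: int = 32) -> str:
    """Return the text of a settings file with a freshly generated SECRET_KEY.

    Hypothetical helper: the setting names come from the example above,
    the key length is an assumption.
    """
    secret = secrets.token_bytes(key_bytes)
    # The !r conversion emits valid Python literals (a bytes literal and a string).
    return f"SECRET_KEY = {secret!r}\nDB_STRING = {db_string!r}\n"


if __name__ == "__main__":
    # Redirect the output to catalog.conf.py to create the settings file.
    print(render_settings("sqlite://localhost/catalog.sqlite"), end="")
```

Running the script with its output redirected to catalog.conf.py gives a file of the same shape as the example, with an unpredictable key.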

Start the catalog

docker run -d \
    -p '8080:8080' \
    -v "$(pwd)/catalog.conf.py:/etc/datacatalog/catalog.conf.py" \
    -e 'CATALOG_SETTINGS=/etc/datacatalog/catalog.conf.py' \
    public.ecr.aws/equancy-tech/datalake-catalog

Generate an API token

The catalog create-api-key command generates tokens that can be used with restricted endpoints.

  • admin role can access all restricted endpoints
  • author role can only access the restricted endpoints in /catalog

Usage: catalog create-api-key [OPTIONS] NAME

Options:
  -e, --expires INTEGER      number of days before token expires
  -r, --role [author|admin]  the role associated with the name
  --help                     Show this message and exit.
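
The role rule described above can be written as a small predicate. This is only an illustration of the documented behaviour, not the catalog's actual implementation: the function name can_access is hypothetical, and treating "in /catalog" as a path-prefix match is an assumption.

```python
def can_access(role: str, path: str) -> bool:
    """Illustrate which restricted endpoints a role may call.

    Assumption: "endpoints in /catalog" means /catalog itself
    or any path below it.
    """
    if role == "admin":
        # admin can access all restricted endpoints
        return True
    if role == "author":
        # author is limited to the restricted endpoints in /catalog
        return path == "/catalog" or path.startswith("/catalog/")
    # any other role gets no access to restricted endpoints
    return False
```

Under this reading, an author token would be rejected on /storage, which is why the storage update later in this document uses an admin token.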

Update the storages configuration

The storage configuration defines aliases that resolve to actual buckets (S3, Azure, local filesystem, etc.). Each alias consists of a bucket name and an optional prefix.

For example:

{
    "landing": {
        "bucket": "my-raw-bucket",
        "prefix": "landing"
    },
    "archive": {
        "bucket": "my-raw-bucket",
        "prefix": "archives"
    },
    "bronze": {
        "bucket": "my-bucket-bronze"
    },
    "silver": {
        "bucket": "my-bucket-silver"
    },
    "gold": {
        "bucket": "my-bucket-gold"
    }
}

Upload the configuration with an admin token:

curl -XPUT http://localhost:8080/storage \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer ${CATALOG_ADMIN_TOKEN}" \
    --data "@${STORAGE_CONFIG_FILE}"
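
A malformed configuration file (for instance one with a trailing comma) is easier to catch before it is sent. The sketch below checks the file locally and then builds the same PUT request as the curl command, using only the Python standard library. The function names and the validation rules (a string "bucket", an optional string "prefix") are assumptions drawn from the example above, not the catalog's own schema.

```python
import json
import urllib.request


def validate_storage_config(text: str) -> dict:
    """Parse the configuration and check the shape shown in the example."""
    # json.loads raises json.JSONDecodeError (a ValueError) on invalid JSON.
    config = json.loads(text)
    if not isinstance(config, dict):
        raise ValueError("storage configuration must be a JSON object")
    for alias, entry in config.items():
        if not isinstance(entry, dict) or not isinstance(entry.get("bucket"), str):
            raise ValueError(f"alias {alias!r} needs a string 'bucket'")
        if "prefix" in entry and not isinstance(entry["prefix"], str):
            raise ValueError(f"alias {alias!r} has a non-string 'prefix'")
    return config


def build_storage_request(
    config: dict, token: str, base_url: str = "http://localhost:8080"
) -> urllib.request.Request:
    """Build the PUT /storage request without sending it."""
    return urllib.request.Request(
        url=f"{base_url}/storage",
        data=json.dumps(config).encode("utf-8"),
        method="PUT",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )
```

Passing the returned request to urllib.request.urlopen sends it. Keeping the construction separate from the network call makes the validation step usable on its own.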
