Skip to main content

articat: data artifact catalog

Project description

articat

CI PYPI

Minimal metadata catalog to store and retrieve metadata about data artifacts.

High level features:

  • set of predefined metadata models
  • flexible metadata (arbitrary key-values)
  • no long running services (low maintenance)
  • IO/data format agnostic
  • immutable metadata

Example:

To publish a file system Artifact:

from articat.fs_artifact import FSArtifact
from pathlib import Path
from datetime import date

with FSArtifact.partitioned(id="foo", partition=date.today()) as fsa:
    tmp_file = fsa.joinpath("answer.txt")
    Path(tmp_file).write_text("42")
    fsa.metadata.description = "Answer to the Ultimate Question of Life, the Universe, and Everything"

To retrieve metadata about the Artifact above:

from articat.fs_artifact import FSArtifact
from datetime import date

FSArtifact.partitioned(id="foo", partition=date.today()).fetch()

Artifact flavours

Currently available Artifact flavours:

  • File System Artifact: FSArtifact
  • BigQuery Artifact: BQArtifact
  • Notebook Artifact: NotebookArtifact

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

articat-0.1.1a1.tar.gz (36.1 kB view hashes)

Uploaded Source

Built Distribution

articat-0.1.1a1-py3-none-any.whl (42.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page