Skip to main content

No project description provided

Project description

pyoso

WARNING: THIS IS A WORK IN PROGRESS

pyoso is a Python package for fetching models and metrics from OSO. This package provides an easy-to-use interface to interact with oso and retrieve valuable data for analysis and monitoring.

Features

  • Execute custom SQL queries for analyzing the OSO dataset.
  • Inspect data dependencies and freshness with an analytics tree.
  • Semantic modeling layer to build and execute complex queries (optional).

Installation

You can install pyoso using pip:

pip install pyoso

Optional Semantic Modeling

For semantic modeling capabilities, you can install with the semantic extra:

pip install pyoso[semantic]

This will include the oso_semantic package for building semantic models and queries.

Usage

Here is a basic example of how to use pyoso to fetch data directly into a pandas DataFrame:

import os
from pyoso import Client

# Initialize the client with an API key
os.environ["OSO_API_KEY"] = 'your_api_key'
client = Client()

# Fetch artifacts
query = "SELECT * FROM artifacts_v1 LIMIT 5"
artifacts = client.to_pandas(query)

print(artifacts)

Inspecting Data Dependencies

For more advanced use cases, the client.query() method returns a QueryResponse object that contains both the data and analytics metadata. This allows you to inspect the dependency tree of the data sources used in your query.

import os
from pyoso import Client

# Initialize the client
os.environ["OSO_API_KEY"] = "your_api_key"
client = Client()

# Execute a query to get a QueryResponse object
query = "SELECT * FROM artifacts_v1 LIMIT 5"
response = client.query(query)

# You can still get the DataFrame as before
df = response.to_pandas()
print("\n--- Query Data ---")
print(df)

# Now, inspect the analytics to see the dependency tree
print("\n--- Data Dependency Tree ---")
response.analytics.print_tree()

This will output a tree structure showing how the final artifacts_v1 table was constructed from its upstream dependencies, helping you understand the data's origin and freshness.

Using in notebooks

pyoso also has special support for notebooks. Specifically, we provide a helper function to create a database connection to the OSO dataset that can be used with marimo. To use this, we suggest you make a cell at the start of your notebook with the following code:

from pyoso.notebook import marimo_db
pyoso_db_conn = marimo_db()

Documentation

For detailed documentation about the OSO dataset, please refer to the official documentation.

Future Plans

  • Create DataFrame wrapper for creating SQL query from data transforms

Manually testing with pyodide

We need to add pyodide to CI, but for now to manually run tests do the following:

Get current pyodide version

You will need to do this from the pyoso directory.

PYODIDE_EMSCRIPTEN_VERSION=$(pyodide config get emscripten_version)

Install emscripten

Choose a place to store the code and git clone emsdk:

cd some/base/directory
git clone https://github.com/emscripten-core/emsdk
cd emsdk

./emsdk install ${PYODIDE_EMSCRIPTEN_VERSION}
./emsdk activate ${PYODIDE_EMSCRIPTEN_VERSION}
source emsdk_env.sh

Build pyodide wheel

Now go back to the pyoso directory

cd oso/warehouse/pyoso
uv run pyodide build

This will generate a .whl file in dist

Download pyodide version

Download the recent pyodide version (at the time of writing is 0.27.2):

cd dist/
wget https://github.com/pyodide/pyodide/releases/download/0.27.2/pyodide-0.27.2.tar.bz2
tar xjf pyodide-0.27.2.tar.bz2

This will now have generated a dist/pyodide directory.

Run pytest

uv run pytest --run-in-pyodide . --runtime node --dist-dir=./dist

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyoso-0.6.6.tar.gz (10.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyoso-0.6.6-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file pyoso-0.6.6.tar.gz.

File metadata

  • Download URL: pyoso-0.6.6.tar.gz
  • Upload date:
  • Size: 10.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for pyoso-0.6.6.tar.gz
Algorithm Hash digest
SHA256 1faf8c0668d9a2d4b1fce2f762b891423185ba7555ac11192ebdc3b61527e5ab
MD5 8d34bb15a3354dade961b18b76f658b7
BLAKE2b-256 07c9105d21951195378cf4ae15fc06e29a190d6dc5bcc8f93b63f5d50fc47b39

See more details on using hashes here.

File details

Details for the file pyoso-0.6.6-py3-none-any.whl.

File metadata

  • Download URL: pyoso-0.6.6-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.13

File hashes

Hashes for pyoso-0.6.6-py3-none-any.whl
Algorithm Hash digest
SHA256 aa444cdb7f2712e5a36115cf9baab068a09aa4bc60f61f5d6c0a0de2581c7155
MD5 4a834c68db90ee28150bd3302fad1314
BLAKE2b-256 afb07a1d5fee5441d703de8374798e5a8f27da36409c1b4fedd03bd8f0f1aee2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page