Official Python SDK for Unity Catalog
Project description
Unity Catalog Python Client SDK
Welcome to the official Python Client SDK for Unity Catalog!
Unity Catalog is the industry's only universal catalog for data and AI.
- Multimodal interface supports any format, engine, and asset
- Multi-format support: It is extensible and supports Delta Lake, Apache Iceberg and Apache Hudi via UniForm, Apache Parquet, JSON, CSV, and many others.
- Multi-engine support: With its open APIs, data cataloged in Unity can be read by many leading compute engines.
- Multimodal: It supports all your data and AI assets, including tables, files, functions, AI models.
- Open source API and implementation - OpenAPI spec and OSS implementation (Apache 2.0 license). It is also compatible with Apache Hive's metastore API and Apache Iceberg's REST catalog API. Unity Catalog is currently a sandbox project with LF AI and Data Foundation (part of the Linux Foundation).
- Unified governance for data and AI - Govern and secure tabular data, unstructured assets, and AI assets with a single interface.
Python Client SDK
The Unity Catalog Python SDK provides a convenient Python-native interface to all of the functionality of the Unity Catalog REST APIs. The library includes interfaces for all supported public modules.
This library is generated using the OpenAPI Generator toolkit, providing client interfaces using the aiohttp request handling library.
Installation
The Python Client SDK and associated shared namespace package for Unity Catalog use hatch as their supported build backend.
To ensure that you can install the package, install hatch via any of the listed options here.
To use the unitycatalog-client SDK, you can install directly from PyPI:
pip install unitycatalog-client
To install from source, you will need to fork and clone the unitycatalog repository locally.
To build the Python source locally, you will need to have JDK17 installed and activated.
Once your configuration supports the execution of sbt, you can run the following within the root of the repository to generate the Python Client SDK source:
build/sbt pythonClient/generate
The source code will be generated at unitycatalog/clients/python/target.
You can then install the package in editable mode from the repository root:
pip install -e clients/python/target
Usage
To get started with using the Python Client SDK, first ensure that you have a running Unity Catalog server to connect to. You can follow instructions here to quickly get started with setting up a local Unity Catalog server if needed.
Once the server is running, you can set up the Python client to make async requests to the Unity Catalog server.
For the examples listed here, we will be using a local unauthenticated server for simplicity's sake.
from unitycatalog.client import Configuration
config = Configuration()
config.host = "http://localhost:8080/api/2.1/unity-catalog"
Once we have our configuration, we can set our client that we will be using for each request:
from unitycatalog.client import ApiClient
client = ApiClient(configuration=config)
With our client configured and instantiated, we can use any of the Unity Catalog APIs by importing from the
client namespace directly and send requests to the server.
from unitycatalog.client import CatalogsApi
catalogs_api = CatalogsApi(api_client=client)
my_catalogs = await catalogs_api.list_catalogs()
Note: APIs that support pagination (such as
list_catalogs) should have continutation token logic for assembling the paginated return values into a single collection.
A simple example of consuming a paginated response is:
async def list_all_catalogs(catalog_api):
token = None
catalogs = []
while True:
results = await catalog_api.list_catalogs(page_token=token)
catalogs += results.catalogs
if next_token := results.next_page_token:
token = next_token
else:
break
return catalogs
my_catalogs = await list_all_catalogs(catalogs_api)
Creating a new catalog with the Python SDK is straight-forward:
from unitycatalog.client.models import CreateCatalog, CatalogInfo
async def create_catalog(catalog_name, catalog_api, comment=None):
new_catalog = CreateCatalog(
name=catalog_name,
comment=comment or ""
)
return await catalog_api.create_catalog(create_catalog=new_catalog)
await create_catalog("MyNewCatalog", catalog_api=catalogs_api, comment="This is a new catalog.")
Adding a new Schema to our created Catalog is similarly simple:
from unitycatalog.client import SchemasApi
from unitycatalog.client import CreateSchema, SchemaInfo
schemas_api = SchemasApi(api_client=client)
async def create_schema(schema_name, catalog_name, schema_api, comment=None):
new_schema = CreateSchema(
name=schema_name,
catalog_name=catalog_name,
comment=comment or ""
)
return await schema_api.create_schema(create_schema=new_schema)
await create_schema(schema_name="MyNewSchema", catalog_name="MyNewCatalog", schema_api=schemas_api, comment="This is a new schema.")
And listing the schemas within our newly created Catalog (note that if you expect paginated responses, ensure that you are passing
continuation tokens as shown above in list_all_catalogs):
await schemas_api.list_schemas(catalog_name="MyNewCatalog")
Feedback
Have requests for the Unity Catalog project? Interested in getting involved in the Open Source project?
See the repository on GitHub
Read the documentation for more guidance and examples!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file unitycatalog_client-0.3.1.tar.gz.
File metadata
- Download URL: unitycatalog_client-0.3.1.tar.gz
- Upload date:
- Size: 66.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d3c441f99ada77f0c72a9edcbbee0b32518c6691aefa1e9330a8b5df6e4de186
|
|
| MD5 |
4d07f81fe6ffbb5be0fe96da77e52ce7
|
|
| BLAKE2b-256 |
6d69af22d3d47dd6ca04bab100912f6bb7546ce47d85a330133c2b5a62491eb9
|
File details
Details for the file unitycatalog_client-0.3.1-py3-none-any.whl.
File metadata
- Download URL: unitycatalog_client-0.3.1-py3-none-any.whl
- Upload date:
- Size: 176.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
70a3d82014a396f90b87f7beac786da4d163d97d3762c05cc5bcb8421073c14c
|
|
| MD5 |
9cdf11e5ca78e5baa61e61abe30aa7f4
|
|
| BLAKE2b-256 |
c42ba0058fb9468e2971b542de339a75b576c5d0ef6009832af09e56e2641fc6
|