Skip to main content

Python Pachyderm Client

Project description

Pachyderm's Python SDK

Official Python client/SDK for Pachyderm. The successor to https://github.com/pachyderm/python-pachyderm.

This library provides the autogenerated gRPC/protobuf code for Pachyderm, generated using a fork of the betterproto package, along with higher-level functionality.

Installation

pip install pachyderm_sdk

A Small Taste

Here's an example that creates a repo and adds a file:

from pachyderm_sdk import Client
from pachyderm_sdk.api import pfs

# Connects to a pachyderm cluster using your local config
#   at ~/.pachyderm/config.json
client = Client.from_config()

# Creates a pachyderm repo called `test`
repo = pfs.Repo(name="test")
client.pfs.create_repo(repo=repo)

# Create a new commit in `test@master` and upload a file.
branch = pfs.Branch.from_uri("test@master")
with client.pfs.commit(branch=branch) as commit:
    file = commit.put_file_from_bytes(path="/data/file.dat", data=b"DATA")

# Retrieve the uploaded file.
with client.pfs.pfs_file(file) as f:
    print(f.readall())

How to load a CAST file into a pandas dataframe

from pachyderm_sdk import Client
from pachyderm_sdk.api import pfs
import pandas as pd

client = Client.from_config()
file = pfs.File.from_uri("test@master:/path/to/data.csv")
with client.pfs.pfs_file(file) as f:
    df = pd.read_csv(f)

Changes from Python-Pachyderm

This package is a successor to the python-pachyderm package. Listed below are some of the notable changes:

  1. Organization of the API
    • Methods and Message objects are now organized according to the service they are associated with, i.e. auth, pfs (pachyderm file-system), pps (pachyderm pipelining-system).
    • Message objects can be found within their respective submodule of the pachyder_sdk.api module, i.e. pachyderm_sdk.api.pfs.
    • Methods can be found within their respective attribute of the Client class, i.e. client.pps.create_pipeline.
      • Some methods have been renamed to remove redundancy due to this organization, i.e. python_pachyderm.Client.get_enterprise_state -> pachyderm_sdk.Client.enterprise.get_state
  2. The autogenerated code is generated using a fork of the betterproto compiler.
    • Messages are now python dataclasses.
    • Methods require keyword arguments.
    • Pachyderm resources are specified using types.
      • python-pachyderm (old): client.create_repo("test")
      • pachyderm_sdk (new): client.pfs.create_repo(repo=pfs.Repo(name="test"))

Contributing

Please see the contributing guide for more info (including testing instructions)

Developer Guide

Generate python APIs from protobuf:

./generate-protos.sh

Running Tests:

pytest -vvv tests

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pachyderm_sdk-2.8.0a3.tar.gz (60.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pachyderm_sdk-2.8.0a3-py3-none-any.whl (73.0 kB view details)

Uploaded Python 3

File details

Details for the file pachyderm_sdk-2.8.0a3.tar.gz.

File metadata

  • Download URL: pachyderm_sdk-2.8.0a3.tar.gz
  • Upload date:
  • Size: 60.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.1 CPython/3.10.13 Linux/5.15.0-1039-aws

File hashes

Hashes for pachyderm_sdk-2.8.0a3.tar.gz
Algorithm Hash digest
SHA256 9f776a4044f3f6a604c6ecf174d0c8b484d1338c2e12a1b135dbb7e7a1ff7195
MD5 1903f91c756f8ea02f3cc33685a19944
BLAKE2b-256 b43a70b926a42440fcc62eb2fff60798b09e154bbaa36e6f57920341f419e084

See more details on using hashes here.

File details

Details for the file pachyderm_sdk-2.8.0a3-py3-none-any.whl.

File metadata

  • Download URL: pachyderm_sdk-2.8.0a3-py3-none-any.whl
  • Upload date:
  • Size: 73.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.1 CPython/3.10.13 Linux/5.15.0-1039-aws

File hashes

Hashes for pachyderm_sdk-2.8.0a3-py3-none-any.whl
Algorithm Hash digest
SHA256 37647809124e7fedab203059e7deb3185377a5bc2f3b7494361b609d342c9be6
MD5 4d5fa97c13e288b45ff4a5c35f93e468
BLAKE2b-256 567aa44ad85109b71d3fea6cd0eb4961614473ec6aeb5f3f38b18705132a84b1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page