Cloud utilities for running Hail systematically.

pycloud / sparkhub

Overview

pycloud / sparkhub is a Python package that provides utilities for running Spark pipelines on Google Cloud Platform (GCP) and on on-premises clusters. It includes functions for generating Hail headers, running Hail pipelines on Dataproc clusters, and managing GCP resources.

Main Features

  • Generate Hail headers for use in Hail pipelines
  • Run Hail pipelines on Dataproc clusters
  • Manage GCP resources, such as Dataproc clusters and Google Cloud Storage buckets

Installation

The package is published on PyPI as sparkhub and imported as pycloud. To install it, use pip:

pip install sparkhub

Usage

To use pycloud, import the relevant functions and classes into your Python code:

from pycloud.hailrunner import get_hail_header, HailRunnerGC, RunnerMagics
from pycloud.submit import *

You can then call the imported functions and classes with the appropriate arguments to generate Hail headers, run pipelines, and manage GCP resources.
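This page does not document the arguments these names accept, so one way to find out is to inspect them after installation. The following is a minimal sketch that only uses the standard library; the helper name `describe` is hypothetical and not part of pycloud, and the module and object names are taken from the import lines above. If pycloud (or one of its dependencies) is not installed, the helper reports that instead of failing.

```python
import importlib
import inspect


def describe(module_name, names):
    """Return {name: signature string or short note} for names in a module.

    `describe` is an illustrative helper, not part of pycloud.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError as exc:
        # Package (or one of its dependencies) is not importable here.
        return {name: f"unavailable ({exc})" for name in names}

    report = {}
    for name in names:
        obj = getattr(module, name, None)
        if obj is None:
            report[name] = "not found"
            continue
        try:
            report[name] = str(inspect.signature(obj))
        except (TypeError, ValueError):
            # Some objects do not expose an introspectable signature.
            report[name] = "signature not introspectable"
    return report


if __name__ == "__main__":
    # Names below come from the import statement shown on this page.
    wanted = ["get_hail_header", "HailRunnerGC", "RunnerMagics"]
    for name, note in describe("pycloud.hailrunner", wanted).items():
        print(f"{name}: {note}")
```

For fuller documentation, calling `help()` on any of the imported objects in an interactive session prints its docstring, if the package provides one.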

Maintainer

pycloud / sparkhub is maintained by TJ Singh. If you have any questions or issues, please contact him at ts3475@cumc.columbia.edu.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sparkhub-0.2.6.tar.gz (36.3 kB view details)

Uploaded Source

Built Distribution

sparkhub-0.2.6-py3-none-any.whl (18.8 kB view details)

Uploaded Python 3

File details

Details for the file sparkhub-0.2.6.tar.gz.

File metadata

  • Download URL: sparkhub-0.2.6.tar.gz
  • Size: 36.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for sparkhub-0.2.6.tar.gz
Algorithm Hash digest
SHA256 e4e4d586ef22b16b23b5599f2a2f3b2cecb9e27e5bb5f95583044aa598e848be
MD5 910e6ac647be71c4867069a983ba290f
BLAKE2b-256 529b1e7f4b604140b302c021f172a66b35e0ffdca2ce69b799eab473b23a04f1
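A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `matches` are hypothetical, and it assumes the file has been saved to the current directory under its published name.

```python
import hashlib
import os


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # File name and digest are copied from this page; the location of the
    # download (current directory) is an assumption.
    archive = "sparkhub-0.2.6.tar.gz"
    expected = "e4e4d586ef22b16b23b5599f2a2f3b2cecb9e27e5bb5f95583044aa598e848be"
    if os.path.exists(archive):
        print("digest matches" if matches(archive, expected) else "digest MISMATCH")
    else:
        print(f"{archive} not found in the current directory")
```

The same helper works for the wheel by substituting its file name and the SHA256 digest listed for it.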

File details

Details for the file sparkhub-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: sparkhub-0.2.6-py3-none-any.whl
  • Size: 18.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for sparkhub-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 f96954ee1606e097997c2eb657175f38388aed32d162e238cda587ed0621cbd3
MD5 7cfeac0f69dd4ae279ecbbdb483de5a3
BLAKE2b-256 6dc2f362c350bea7471951971cab83114b976a994ad03fafd4ea66e33a7e3394
