client for sparksampling

Project description

This is a Python gRPC stub for sparksampling.

sparksampling

sparksampling is a PySpark-based gRPC service for sampling and data quality assessment. It supports containerized deployments and Spark on K8S.

Features

  • Common sampling methods: Random, Stratified, Simple
  • Relationship Sampling based on DAG and Topological sorting
  • Cloud Native and Spark on K8S support
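
To illustrate the idea behind relationship sampling, here is a minimal sketch in plain Python. It is not the sparksampling implementation and does not use its API: the function name `relationship_sample`, the table layout, and the foreign-key format are all hypothetical. Tables are ordered topologically so that parents are sampled before their children, root tables are sampled at random, and child rows are kept only when every parent row they reference was kept.

```python
import random
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def relationship_sample(tables, foreign_keys, fraction, seed=0):
    """Sample related tables while preserving referential integrity.

    tables:       {table_name: [row_dict, ...]}            (hypothetical layout)
    foreign_keys: {child: [(child_col, parent, parent_col), ...]}
    fraction:     probability of keeping each row of a root table
    """
    rng = random.Random(seed)

    # TopologicalSorter expects {node: predecessors}; a child's
    # predecessors are the parent tables it references.
    graph = {name: set() for name in tables}
    for child, refs in foreign_keys.items():
        for _child_col, parent, _parent_col in refs:
            graph[child].add(parent)

    sampled = {}
    # static_order() yields parents before children and raises
    # graphlib.CycleError if the relationships are not a DAG.
    for name in TopologicalSorter(graph).static_order():
        rows = tables[name]
        refs = foreign_keys.get(name, [])
        if not refs:
            # Root table: random sampling, row by row.
            sampled[name] = [r for r in rows if rng.random() < fraction]
            continue
        # Child table: keep rows whose keys survive in every sampled parent.
        kept = rows
        for child_col, parent, parent_col in refs:
            parent_keys = {p[parent_col] for p in sampled[parent]}
            kept = [r for r in kept if r[child_col] in parent_keys]
        sampled[name] = kept
    return sampled
```

For example, with a `users` table and an `orders` table whose `user_id` column references `users.id`, every order in the result points at a user that is also in the result.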

QUICK START

Installation

To try it out, install the package directly from PyPI:

pip install sparksampling

Then run:

sparksampling

The service starts and listens on port 8530.
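
A quick way to confirm that the service is up is to check that something accepts TCP connections on that port. The sketch below uses only the Python standard library; the helper name `is_listening` is made up for this example, and it assumes the service runs on the local machine with the default port. It verifies only that the port is open, not that the gRPC service behind it is healthy.

```python
import socket


def is_listening(host="127.0.0.1", port=8530, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        # create_connection raises OSError (refused, timed out, ...)
        # when nothing accepts connections on the given address.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("port 8530 open:", is_listening())
```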

Docker

docker run -p 8530:8530 wh1isper/pysparksampling:latest

MORE

For more information, see our GitHub page: https://github.com/Wh1isper/pyspark-sampling/

Download files

Source Distribution

sparksampling_client-0.3.0.tar.gz (9.3 kB)

Uploaded Source

Built Distribution

sparksampling_client-0.3.0-py3-none-any.whl (8.5 kB)

Uploaded Python 3

File details

Details for the file sparksampling_client-0.3.0.tar.gz.

File metadata

  • Download URL: sparksampling_client-0.3.0.tar.gz
  • Upload date:
  • Size: 9.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for sparksampling_client-0.3.0.tar.gz

Algorithm    Hash digest
SHA256       28d61a16b039521b7bc3c1edffb54d4df0a490dc9e34e644736537d209cce291
MD5          9094c25d71b798aae66969f17dc6db00
BLAKE2b-256  32c52473deee09fe58d7816ee9c14bb8bf098f6d89a7e8ccd9721a84910977fc

File details

Details for the file sparksampling_client-0.3.0-py3-none-any.whl.

File hashes

Hashes for sparksampling_client-0.3.0-py3-none-any.whl

Algorithm    Hash digest
SHA256       7e75bfadbc1ea3ac4d793a21c96b9a6fd915bc89ee2f651c19e8c2636f5ff897
MD5          426afbf4963670deb7c99d1afd357bb8
BLAKE2b-256  27a8693c04b4b8badd686ae9a41a6651b5f80803ef1eaa4354506eb8c79550fc
