Skip to main content

client for sparksampling

Project description

This is a Python Grpc Stub for sparksampling

sparksampling

sparksampling is a PySpark-based sampling and data quality assessment GRPC service that supports containerized deployments and Spark on K8S

Feature

  • Common sampling methods: Random, Stratified, Simple
  • Relationship Sampling based on DAG and Topological sorting
  • Cloud Native and Spark on K8S support

QUICK START

Installation

The trial only requires direct installation using pypi

pip install sparksampling

run as

sparksampling

The service will start and listen on port 8530

Docker

docker run -p 8530:8530 wh1isper/pysparksampling:latest

MORE

For more, see our github page: https://github.com/Wh1isper/pyspark-sampling/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sparksampling_client-0.2.2.tar.gz (9.3 kB view details)

Uploaded Source

Built Distribution

sparksampling_client-0.2.2-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file sparksampling_client-0.2.2.tar.gz.

File metadata

  • Download URL: sparksampling_client-0.2.2.tar.gz
  • Upload date:
  • Size: 9.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for sparksampling_client-0.2.2.tar.gz
Algorithm Hash digest
SHA256 408fba97913901ce56c79a9c347777e0aca1e6e37679f2711581e7ef154f83d1
MD5 dbe17d3aafe7f7123b565fc8484c8e45
BLAKE2b-256 31619824d60d46cae0d5a52a7a05b8e3b09d6979fdef6363b82c7b9f1d5f7215

See more details on using hashes here.

File details

Details for the file sparksampling_client-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for sparksampling_client-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a7e5986a5b0e59848c199c116bff46eefb28a0ea5a2e6cae5c556cf0eb6fd7e4
MD5 b822cdd69a02219dd073f982f21e7294
BLAKE2b-256 0d659738e4b8926c9791e26ff63a944388761b5be855929067a75eb58d660c03

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page