Skip to main content

pyspark-sampling

Project description

This is a Python Grpc Stub for sparksampling

sparksampling

sparksampling is a PySpark-based sampling and data quality assessment GRPC service that supports containerized deployments and Spark on K8S

Feature

  • Common sampling methods: Random, Stratified, Simple

  • Relationship Sampling based on DAG and Topological sorting

  • Cloud Native and Spark on K8S support

QUICK START

Installation

The trial only requires direct installation using pypi

pip install sparksampling

run as

sparksampling

The service will start and listen on port 8530

Docker

docker run -p 8530:8530 wh1isper/pysparksampling:latest

MORE

For more, see our github page: https://github.com/Wh1isper/pyspark-sampling/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sparksampling_client-0.1.0.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

sparksampling_client-0.1.0-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file sparksampling_client-0.1.0.tar.gz.

File metadata

  • Download URL: sparksampling_client-0.1.0.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/37.0 requests/2.28.1 requests-toolbelt/0.9.1 urllib3/1.26.12 tqdm/4.64.0 importlib-metadata/4.12.0 keyring/23.8.2 rfc3986/2.0.0 colorama/0.4.5 CPython/3.8.13

File hashes

Hashes for sparksampling_client-0.1.0.tar.gz
Algorithm Hash digest
SHA256 944958b7deaf5fd5f6edd2610ea78f114a6c83a29faa5d0950c50cd7015ac52b
MD5 d37c8ced96669b3eb3392112be3d6a89
BLAKE2b-256 d85b8ff814e959bd5d2d8651a228bd4630a7b0949cfe89763cbeef10b00afc97

See more details on using hashes here.

File details

Details for the file sparksampling_client-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: sparksampling_client-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/37.0 requests/2.28.1 requests-toolbelt/0.9.1 urllib3/1.26.12 tqdm/4.64.0 importlib-metadata/4.12.0 keyring/23.8.2 rfc3986/2.0.0 colorama/0.4.5 CPython/3.8.13

File hashes

Hashes for sparksampling_client-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 065437451dc817359220c5d00028ba227963bd3c61065c79cc936e77c1190b26
MD5 597120bb847624c5b1b12ad6343243b8
BLAKE2b-256 788fb8345765f8fd629c7ab8ba87ac75447bfde6ce51dbb058f4b2cd1da46fb4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page