client for sparksampling
Project description
This is a Python Grpc Stub for sparksampling
sparksampling
sparksampling
is a PySpark-based sampling and data quality assessment GRPC service that supports containerized deployments and Spark on K8S
Feature
- Common sampling methods: Random, Stratified, Simple
- Relationship Sampling based on DAG and Topological sorting
- Cloud Native and Spark on K8S support
QUICK START
Installation
The trial only requires direct installation using pypi
pip install sparksampling
run as
sparksampling
The service will start and listen on port 8530
Docker
docker run -p 8530:8530 wh1isper/pysparksampling:latest
MORE
For more, see our github page: https://github.com/Wh1isper/pyspark-sampling/
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file sparksampling_client-0.2.2.tar.gz
.
File metadata
- Download URL: sparksampling_client-0.2.2.tar.gz
- Upload date:
- Size: 9.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 408fba97913901ce56c79a9c347777e0aca1e6e37679f2711581e7ef154f83d1 |
|
MD5 | dbe17d3aafe7f7123b565fc8484c8e45 |
|
BLAKE2b-256 | 31619824d60d46cae0d5a52a7a05b8e3b09d6979fdef6363b82c7b9f1d5f7215 |
File details
Details for the file sparksampling_client-0.2.2-py3-none-any.whl
.
File metadata
- Download URL: sparksampling_client-0.2.2-py3-none-any.whl
- Upload date:
- Size: 8.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.11
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7e5986a5b0e59848c199c116bff46eefb28a0ea5a2e6cae5c556cf0eb6fd7e4 |
|
MD5 | b822cdd69a02219dd073f982f21e7294 |
|
BLAKE2b-256 | 0d659738e4b8926c9791e26ff63a944388761b5be855929067a75eb58d660c03 |