Data sampler from streaming data

StreamSampler

The StreamSampler package lets you sample a fixed number of elements from a data stream whose length is very large or unknown.

StreamSampler is provided both as an executable command and as a library. It uses the reservoir sampling algorithm [Vitter85].
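To illustrate the idea behind reservoir sampling, here is a minimal sketch of the simplest textbook variant (often called Algorithm R). It is not taken from StreamSampler's source and may differ from what the package actually implements; the function name `reservoir_sample` and its parameters are hypothetical and are not part of StreamSampler's API.

```python
import random


def reservoir_sample(stream, k, rng=None):
    """Return up to k items drawn uniformly at random from an iterable.

    Hypothetical helper for illustration only; not StreamSampler's API.
    The iterable is consumed once and its length need not be known.
    """
    rng = rng or random.Random()
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Draw a slot index from 0..i inclusive. The new item is kept
            # only if the index falls inside the reservoir, which happens
            # with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir
```

For example, `reservoir_sample(open("big.log"), 10)` would keep 10 lines in memory at a time while reading a file of any size (the file name is only a placeholder). If the stream holds fewer than `k` items, all of them are returned.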

License

MIT License

See Also

  • sample-cli by Paul Butler is a command-line tool providing almost the same functionality. StreamSampler is intended primarily as a library, although it also has a command-line interface, so that it can be used as part of other packages, including my future projects.

News

0.1.1

  • Tested on Python 2.6, 2.7, 3.1, 3.2, and 3.3

0.1.0

  • First public version

Source Distribution

StreamSampler-0.1.1.tar.gz (4.3 kB)

Hashes for StreamSampler-0.1.1.tar.gz

Algorithm    Hash digest
SHA256       f4b3ace37a0d09cbbe7a36f40002551eedf12e451b349cdcd6640e1b96ece30f
MD5          3a11ecc5a75923376a0b100c809a81ec
BLAKE2b-256  e9bed34681061c6b0e30edb3a27bfbfe7ca808e53227d8741d06a43a2ac98c39
