Skip to main content

Data sampler from streaming data

Project description

StreamSampler

StreamSampler package allows you to sample a particular number of elements from a stream of data of which length is very large or unknown.

StreamSampler is provided in both forms of an executable command and library. It utilizes Reservoir sampling algorithm [Vitter85]

You can take a look at the README.txt of other projects, such as repoze.bfg (http://bfg.repoze.org/trac/browser/trunk/README.txt) for some ideas.

License

MIT License

See Also

  • sample-cli by Paul Butler is a command line tool providing almost the same feature. StreamSampler is intended to be a library, although it has a command line interface, so that it can be a part of other packages including my future projects.

News

0.1.1

  • Tests in Python 2.6, 2.7, 3.1, 3.2, 3.3

0.1.0

First public version

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for StreamSampler, version 0.1.1
Filename, size File type Python version Upload date Hashes
Filename, size StreamSampler-0.1.1.tar.gz (4.3 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page