Skip to main content

Data sampler from streaming data

Project description


StreamSampler package allows you to sample a particular number of elements from a stream of data of which length is very large or unknown.

StreamSampler is provided in both forms of an executable command and library. It utilizes Reservoir sampling algorithm [Vitter85]

You can take a look at the README.txt of other projects, such as repoze.bfg ( for some ideas.


MIT License

See Also

  • sample-cli by Paul Butler is a command line tool providing almost the same feature. StreamSampler is intended to be a library, although it has a command line interface, so that it can be a part of other packages including my future projects.



First public version

Project details

Release history Release notifications

History Node


This version
History Node


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
StreamSampler-0.1.0.tar.gz (4.2 kB) Copy SHA256 hash SHA256 Source None Jan 7, 2014

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page