Data sampler from streaming data
Project description
StreamSampler
StreamSampler package allows you to sample a particular number of elements from a stream of data of which length is very large or unknown.
StreamSampler is provided in both forms of an executable command and library. It utilizes Reservoir sampling algorithm [Vitter85]
You can take a look at the README.txt of other projects, such as repoze.bfg (http://bfg.repoze.org/trac/browser/trunk/README.txt) for some ideas.
License
MIT License
See Also
sample-cli by Paul Butler is a command line tool providing almost the same feature. StreamSampler is intended to be a library, although it has a command line interface, so that it can be a part of other packages including my future projects.
News
0.1.1
Tests in Python 2.6, 2.7, 3.1, 3.2, 3.3
0.1.0
First public version
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file StreamSampler-0.1.1.tar.gz
.
File metadata
- Download URL: StreamSampler-0.1.1.tar.gz
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f4b3ace37a0d09cbbe7a36f40002551eedf12e451b349cdcd6640e1b96ece30f |
|
MD5 | 3a11ecc5a75923376a0b100c809a81ec |
|
BLAKE2b-256 | e9bed34681061c6b0e30edb3a27bfbfe7ca808e53227d8741d06a43a2ac98c39 |