Unix Pipe Fittings For Data Science
UNIX pipe fittings for statistics
In the quest for command-line data science, this kit contains three command-line utilities intended for use in UNIX pipes.
All three read from STDIN, write to STDOUT, and print their docstrings if run without parameters.
Python 3 is required.
sd_c (smalldata count)
A regular expression counter filter, contained in smalldata/counter.py. Please see the docstring for further help.
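To make the idea of a regular expression counter concrete, here is a minimal sketch in Python. It is not the code in smalldata/counter.py, and it does not reproduce the options of sd_c; the function name count_matches and the rule "use the first capture group if there is one" are assumptions made for this illustration.

```python
import re
from collections import Counter


def count_matches(lines, pattern):
    """Count how often each distinct match of `pattern` occurs in `lines`.

    Hypothetical helper for illustration only; not part of smalldata.
    """
    regex = re.compile(pattern)
    counts = Counter()
    for line in lines:
        for match in regex.finditer(line):
            # Assumption: count the first capture group when the pattern
            # defines one, otherwise count the whole match.
            key = match.group(1) if regex.groups else match.group(0)
            counts[key] += 1
    return counts
```

Used as a filter, such a function could be fed sys.stdin directly, for example count_matches(sys.stdin, r"\b(\d{3})\b") to tally three-digit status codes in a log file.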
sd_g (smalldata groupby)
Concatenates lines from STDIN that match a regular expression. It is contained in smalldata/groupby.py. Please see the docstring for further help.
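One plausible reading of "groupby" is that lines sharing the same regular expression match are joined into a single output line. The sketch below illustrates that reading only; it is not the code in smalldata/groupby.py, and the function name group_lines, the separator, and the decision to drop non-matching lines are all assumptions.

```python
import re


def group_lines(lines, pattern, sep=" "):
    """Join lines that share the same regex match into one string per key.

    Hypothetical helper for illustration only; not part of smalldata.
    """
    regex = re.compile(pattern)
    groups = {}
    for line in lines:
        line = line.rstrip("\n")
        match = regex.search(line)
        if match is None:
            # Assumption: lines that do not match are skipped.
            continue
        # Assumption: group by the first capture group if the pattern
        # defines one, otherwise by the whole match.
        key = match.group(1) if regex.groups else match.group(0)
        groups.setdefault(key, []).append(line)
    # Keys appear in the order in which they were first seen.
    return {key: sep.join(members) for key, members in groups.items()}
```

For example, grouping on a leading session ID with a pattern such as r"^(\w+)" would collect every line of a session onto one line, which is convenient for later per-session processing in the same pipe.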
sd_e (smalldata extract)
In the spirit of RegExSerDe, this tool uses regular expressions to generate a CSV file from a free-form text file. It is contained in smalldata/extract.py and has a docstring.
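The general technique of turning free-form text into CSV with a regular expression can be sketched as follows. This is not the code in smalldata/extract.py; the function name extract_csv and the convention that named groups become the CSV columns are assumptions chosen for the example.

```python
import csv
import io
import re


def extract_csv(lines, pattern):
    """Return CSV text with one row per line that matches `pattern`.

    Hypothetical helper for illustration only; not part of smalldata.
    Assumption: the named groups of `pattern` define the CSV columns.
    """
    regex = re.compile(pattern)
    # Order the columns by the position of each named group in the pattern.
    fields = sorted(regex.groupindex, key=regex.groupindex.get)
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for line in lines:
        match = regex.search(line)
        if match:
            # Lines that do not match are silently skipped.
            writer.writerow(match.groupdict())
    return out.getvalue()
```

In a pipe, the result could be written out with sys.stdout.write(extract_csv(sys.stdin, pattern)). Named groups keep the pattern self-documenting, since each column header comes straight from the expression.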
Other Useful Tools
If you've got CSV files, you should definitely check out q.
To Do
A cookbook would be nice, showing how to analyze log files, etc.
History
This project used to live in a gist: https://gist.github.com/martinvirtel/94cf47f64bf304e1c66598e93cd565c4
File details
Details for the file smalldata-0.0.2.tar.gz.
File metadata
- Download URL: smalldata-0.0.2.tar.gz
- Upload date:
- Size: 5.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.1 requests-toolbelt/0.9.0 tqdm/4.30.0 CPython/3.6.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 344a04a82c3395cc620dd9b4fe8e6c4a267d4c7a5312956bde2fd477581fe9ac
MD5 | 8b5a20b2a09c92e0112286a3236c1047
BLAKE2b-256 | d26016b9175791c56106aac25b4f808ea643f20bad4fcaa50bfec90aa6660401
File details
Details for the file smalldata-0.0.2-py3-none-any.whl.
File metadata
- Download URL: smalldata-0.0.2-py3-none-any.whl
- Upload date:
- Size: 8.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.1 requests-toolbelt/0.9.0 tqdm/4.30.0 CPython/3.6.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 8d42b19959d1d0198a7715ea9455247ce2b9c7ad1d29e3d13923b57d6f0e69e0
MD5 | 89a570eccf419d6763f0cc8904626a40
BLAKE2b-256 | d0e36925a57c2e3105de9437db6197f26ae757bd3bb5eb1dcb04d9f82d33f7b4