Skip to main content

Unix Pipe Fittings For Data Science

Project description

UNIX pipe fittings for statistics

In the quest for command line data science, this kit contains three command line utilities intended to be used in UNIX pipes.

All three process STDIN to STDOUT output their docstrings if run without parameters.

Python 3 is required.

sd_c (smalldata count)

Is a regular expression counter filter, contained in smalldata/counter.py. Please see docstring for further help.

sd_g (smalldata groupby)

Concatenates lines from stdin that match a regular expression, contained in smalldata/groupby.py. Please see docstring.

sd_e (smalldata extract)

In the spirit of RegExSerDe, this tool uses regular expressions to generate a CSV file from a free-form text file. It is contained in smalldata/extract.py and has a docstring.

Other Useful Tools

If you've got CSV files, you should definitively check out q.

To Do

A cookbook would be nice. Showing how to analyze log files etc.

History

Used to live in a gist: https://gist.github.com/martinvirtel/94cf47f64bf304e1c66598e93cd565c4

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smalldata-0.0.2.tar.gz (5.9 kB view details)

Uploaded Source

Built Distribution

smalldata-0.0.2-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file smalldata-0.0.2.tar.gz.

File metadata

  • Download URL: smalldata-0.0.2.tar.gz
  • Upload date:
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.1 requests-toolbelt/0.9.0 tqdm/4.30.0 CPython/3.6.6

File hashes

Hashes for smalldata-0.0.2.tar.gz
Algorithm Hash digest
SHA256 344a04a82c3395cc620dd9b4fe8e6c4a267d4c7a5312956bde2fd477581fe9ac
MD5 8b5a20b2a09c92e0112286a3236c1047
BLAKE2b-256 d26016b9175791c56106aac25b4f808ea643f20bad4fcaa50bfec90aa6660401

See more details on using hashes here.

File details

Details for the file smalldata-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: smalldata-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.1 requests-toolbelt/0.9.0 tqdm/4.30.0 CPython/3.6.6

File hashes

Hashes for smalldata-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8d42b19959d1d0198a7715ea9455247ce2b9c7ad1d29e3d13923b57d6f0e69e0
MD5 89a570eccf419d6763f0cc8904626a40
BLAKE2b-256 d0e36925a57c2e3105de9437db6197f26ae757bd3bb5eb1dcb04d9f82d33f7b4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page