Skip to main content

Unix Pipe Fittings For Data Science

Project description

UNIX pipe fittings for statistics

In the quest for command line data science, this kit contains three command line utilities intended to be used in UNIX pipes.

All three process STDIN to STDOUT output their docstrings if run without parameters.

Python 3 is required.

sd_c (smalldata count)

Is a regular expression counter filter, contained in smalldata/counter.py. Please see docstring for further help.

sd_g (smalldata groupby)

Concatenates lines from stdin that match a regular expression, contained in smalldata/groupby.py. Please see docstring.

sd_e (smalldata extract)

In the spirit of RegExSerDe, this tool uses regular expressions to generate a CSV file from a free-form text file. It is contained in smalldata/extract.py and has a docstring.

Other Useful Tools

If you've got CSV files, you should definitively check out q.

To Do

A cookbook would be nice. Showing how to analyze log files etc.

History

Used to live in a gist: https://gist.github.com/martinvirtel/94cf47f64bf304e1c66598e93cd565c4

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smalldata-0.0.3.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

smalldata-0.0.3-py3-none-any.whl (9.1 kB view details)

Uploaded Python 3

File details

Details for the file smalldata-0.0.3.tar.gz.

File metadata

  • Download URL: smalldata-0.0.3.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.1 requests-toolbelt/0.9.0 tqdm/4.30.0 CPython/3.6.6

File hashes

Hashes for smalldata-0.0.3.tar.gz
Algorithm Hash digest
SHA256 eb12da44ec61555e4f7c308611428b336bed438b2e9a740ed5945dc0a5133737
MD5 2a85b8bcef23403c0638e28358a61e88
BLAKE2b-256 39dadd0328cfdfcd1784656782d417fdbad5f9a020e4b6f84dce78c936d03c64

See more details on using hashes here.

File details

Details for the file smalldata-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: smalldata-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 9.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.7.1 requests-toolbelt/0.9.0 tqdm/4.30.0 CPython/3.6.6

File hashes

Hashes for smalldata-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 dc497f8307fa457ff202a47e628700b3704808a9381759929a65351fe9385380
MD5 0ea4e5018527b41a3e5c74670d583fa6
BLAKE2b-256 66cf4a95042fe460e38384f25b3da7744cb2a719191d044d93a09fea1d10fce4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page