Skip to main content

Pluck columns from CSV files in the command line. Like cut or awk, but doesn't choke on delimiter escaping.

Project description

Pluck columns from csv files in the commandline. Like cut or awk, but without choking on delimiter escaping.

install

pip install pluckr. Done.

usage

Grab columns 2 and 3 from stdin (-f is zero-indexed):

pluckr -f 1,2 < sample1.csv

Or straight from a file:

pluckr -f 2,1 sample1.csv

Your requested order is retained:

pluckr -f 1,2 sample1.csv != pluckr -f 2,1 sample1.csv

You can also use Python’s negative indices… grab the first and last cols:

pluckr -f 0,-1 sample1.csv

Fields that don’t exist (e.g. -f11111) will be ignored.

Skip header row(s):

cat sample1.csv | pluckr -s1

Grab all but the 2nd column (drop columns instead of selecting them):

pluckr -f 1 -i < sample1.csv

Read (and write) various delimiters:

pluckr -d, –out-delimiter=| < with-commas.csv > with-pipes.csv

Note that passing tabs as arguments can be awkward; in a pinch, use:

pluckr -d $’t’

help

Via –help:

usage: pluckr [-h] [-f FIELDS] [-i] [-d DELIMITER] [-q QUOTECHAR] [-s SKIP]
[infile]

Grab columns from csv input. http://github.com/philadams/pluckr

positional arguments:
infile input file (.csv)
optional arguments:
-h, --help show this help message and exit
-f FIELDS, --fields FIELDS
 ordered list of columns to retain; zero-indexed
-i, --inverse invert column retention: drop those in -f
-d DELIMITER, --delimiter DELIMITER
 field delimiter when reading infile
-q QUOTECHAR, --quotechar QUOTECHAR
 field quotechar when reading infile
-s SKIP, --skip SKIP
 number of rows to skip

future

  • allow -f to take columns by name when there’s a header row?
  • implement –out-** for output delimiters etc.
  • add out delimiter support
  • add out quotechar support
  • tests!!!

Project details


Release history Release notifications

History Node

0.0.9

History Node

0.0.8

History Node

0.0.7

History Node

0.0.6

This version
History Node

0.0.5

History Node

0.0.4

History Node

0.0.3

History Node

0.0.2

History Node

0.0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
pluckr-0.0.5.tar.gz (2.7 kB) Copy SHA256 hash SHA256 Source None Jan 17, 2014

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging CloudAMQP CloudAMQP RabbitMQ AWS AWS Cloud computing Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page