Skip to main content

Parquet from and to CSV format converter

Project description

Parquet_CSV

CI | PyPI

A Parquet to and from CSV converter that is based on Apache Arrow for its speed and memory efficiency.

How to install

pip install parquet_csv

Use pip3 if both Python2 and Python3 are installed. This application only works with Python3.

How to use

Converting Parquet

parquet_to_csv converts parquet files to csv files. By default it prints to the standard output, but can be directed via pipe or -o flag to write to a file.

Usage: parquet_to_csv.py [OPTIONS] INPUT_FILE

Options:
  -o, --output-path FILE  [default: (standard output)]
  --header / --no-header
  --verbose BOOLEAN
  --help                  Show this message and exit.

Selecting columns, gzip-ing output

Following UNIX principle, you should be using xsv for selecting columns from the csv or do other transformations: just pipe the output to xsv and you're all set.

Similarly if you'd want the file to be compressed, pipe the result to gzip and direct to a local file ending in .csv.gz.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parquet_csv-0.2.0.tar.gz (3.4 kB view hashes)

Uploaded Source

Built Distribution

parquet_csv-0.2.0-py3-none-any.whl (3.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page