Skip to main content

Easy install parquet-tools

Project description

parquet-tools

Run Unittest Run CLI test

This is a pip installable parquet-tools. In other words, parquet-tools is a CLI tools of Apache Arrow or easy install parquet-tools. You can show parquet file content/schema on local disk or on Amazon S3.

Features

  • Read Parquet data (local file or file on S3)
  • Read Parquet metadata/schema (local file or file on S3)

Installation

$ pip install parquet-tools

Usage

$ parquet-tools --help
usage: parquet-tools [-h] {show,csv,inspect} ...

parquet CLI tools

positional arguments:
  {show,csv,inspect}
    show              Show human readble format. see `show -h`
    csv               Cat csv style. see `csv -h`
    inspect           Inspect parquet file. see `inspect -h`

optional arguments:
  -h, --help          show this help message and exit

Usage Examples

Show local parquet file

$ parquet-tools show test.parquet
+-------+-------+---------+
|   one | two   | three   |
|-------+-------+---------|
|  -1   | foo   | True    |
| nan   | bar   | False   |
|   2.5 | baz   | True    |
+-------+-------+---------+

Show parquet file on S3

$ parquet-tools show s3://bucket-name/prefix/*
+-------+-------+---------+
|   one | two   | three   |
|-------+-------+---------|
|  -1   | foo   | True    |
| nan   | bar   | False   |
|   2.5 | baz   | True    |
+-------+-------+---------+

Cat CSV parquet and transform csvq

$ parquet-tools csv s3://bucket-name/test.parquet |csvq "select one, three where three"
+-------+-------+
|  one  | three |
+-------+-------+
| -1.0  | True  |
| 2.5   | True  |
+-------+-------+

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parquet_tools-0.2.1.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

parquet_tools-0.2.1-py3-none-any.whl (8.6 kB view details)

Uploaded Python 3

File details

Details for the file parquet_tools-0.2.1.tar.gz.

File metadata

  • Download URL: parquet_tools-0.2.1.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.2.0 requests-toolbelt/0.8.0 tqdm/4.46.1 CPython/3.8.2

File hashes

Hashes for parquet_tools-0.2.1.tar.gz
Algorithm Hash digest
SHA256 b0c37ece0f529ce133f47f2392cc2e4abfe2d5ef262dfe3021e35132f9eece7d
MD5 0151f87fd71781e8524ba1d38b005e84
BLAKE2b-256 acc15c657f919944aa6af6fceb9a9e4b65771f24c54111c1cf1876bb3f5024d1

See more details on using hashes here.

File details

Details for the file parquet_tools-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: parquet_tools-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 8.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.2.0 requests-toolbelt/0.8.0 tqdm/4.46.1 CPython/3.8.2

File hashes

Hashes for parquet_tools-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b1ca911a26841d4ac7daab68e65740eb52be6be38f7af7816928f0bc07f264b6
MD5 f710415ae6809c870d7af7416b524001
BLAKE2b-256 4fd9d367f52d6038615bfe33874d7c71659616624d01049ea4ca5b16115f7a0b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page