A tool to show metadata about a Parquet file
Project description
parquet-metadata
Dump metadata about a Parquet file. You may also be interested in csv2parquet.
sudo pip install parquet-metadata
parquet-metadata parquet.file
Sample output:
file created_by parquet-cpp version 1.4.1-SNAPSHOT
file columns 9
file row_groups 1
file rows 2
row_group 0 size 634
row_group 0 rows 2
row_group 0 columns 9
row_group 0 bool type BOOLEAN
row_group 0 bool num_values 2
row_group 0 bool compression SNAPPY
row_group 0 bool encodings PLAIN,RLE
row_group 0 bool compressed_size 36
row_group 0 bool uncompressed_size 34
row_group 0 bool stats:min False
row_group 0 bool stats:max True
row_group 0 float32 type FLOAT
row_group 0 float32 num_values 2
row_group 0 float32 compression SNAPPY
row_group 0 float32 encodings PLAIN_DICTIONARY,PLAIN,RLE
row_group 0 float32 compressed_size 68
row_group 0 float32 uncompressed_size 64
row_group 0 float32 stats:min 0.5
row_group 0 float32 stats:max 0.6
[...]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
parquet-metadata-0.0.1.tar.gz
(2.3 kB
view hashes)
Built Distribution
Close
Hashes for parquet_metadata-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9801e777eedccc0ecfa04f575db1a774f9e48472bc4e1eb6b4e790449b026d7b |
|
MD5 | b76e7ec63d4a1ee973ce35d3c6c0270b |
|
BLAKE2b-256 | 9eb62fc796a8fbc8f8359e0ccae4e5274b083477368749591fa93d54042121c3 |