Skip to main content

A tool to show metadata about a Parquet file

Project description

parquet-metadata

Build Status

Dump metadata about a Parquet file. You may also be interested in csv2parquet.

sudo pip install parquet-metadata
parquet-metadata parquet.file

Sample output:

file	created_by	parquet-cpp version 1.4.1-SNAPSHOT
file	columns	9
file	row_groups	1
file	rows	2
row_group	0		size	634
row_group	0		rows	2
row_group	0		columns	9
row_group	0	bool	type	BOOLEAN
row_group	0	bool	num_values	2
row_group	0	bool	compression	SNAPPY
row_group	0	bool	encodings	PLAIN,RLE
row_group	0	bool	compressed_size	36
row_group	0	bool	uncompressed_size	34
row_group	0	bool	stats:min	False
row_group	0	bool	stats:max	True
row_group	0	float32	type	FLOAT
row_group	0	float32	num_values	2
row_group	0	float32	compression	SNAPPY
row_group	0	float32	encodings	PLAIN_DICTIONARY,PLAIN,RLE
row_group	0	float32	compressed_size	68
row_group	0	float32	uncompressed_size	64
row_group	0	float32	stats:min	0.5
row_group	0	float32	stats:max	0.6
[...]

Project details


Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
parquet_metadata-0.0.1-py3-none-any.whl (3.4 kB) Copy SHA256 hash SHA256 Wheel py3
parquet-metadata-0.0.1.tar.gz (2.3 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page