A command-line application that helps convert raw data into highly structured Parquet data.
Project description
Parquest
Parquest is a project that converts raw data into a structured form built on the Parquet file format. The name "Parquest" is a portmanteau of "Parquet" and "quest," symbolizing the journey from raw data to structured data.
Introduction
Large volumes of raw data are hard to store and query efficiently. Parquest addresses this by using the Parquet file format to structure and organize raw data.
Features
- Data Structuring: Parquest enables you to convert raw data into a structured format based on the Parquet file format.
- Efficient Storage: The Parquet file format is designed for efficient storage and retrieval of structured data, making it ideal for big data applications.
- Columnar Storage: Because Parquest writes Parquet, data is stored column by column, which allows faster queries that touch only some columns and better compression ratios.
- Schema Evolution: Parquest supports schema evolution, allowing you to easily modify the structure of your data over time without breaking compatibility.
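The columnar layout and schema evolution described above can be illustrated with a minimal standard-library sketch. It does not use Parquest or write Parquet. The helper names `rows_to_columns` and `compressed_size` and the sample records are hypothetical, chosen only to show the idea of pivoting row records into columns and filling a later-added field with nulls.

```python
import json
import zlib


def rows_to_columns(rows):
    """Pivot a list of row dicts into a dict mapping column name -> list of values."""
    # Collect every field name in first-seen order, so fields that only
    # appear in later rows (an evolved schema) still get a column.
    names = []
    for row in rows:
        for key in row:
            if key not in names:
                names.append(key)
    # Rows that lack a field get None, similar to a null in a nullable column.
    return {name: [row.get(name) for row in rows] for name in names}


def compressed_size(obj):
    """Size in bytes of the zlib-compressed JSON encoding of obj."""
    return len(zlib.compress(json.dumps(obj).encode("utf-8")))


# Hypothetical raw records; the last one carries an extra "score" field.
rows = [{"id": i, "country": "DE" if i % 2 else "FR"} for i in range(1000)]
rows.append({"id": 1000, "country": "FR", "score": 7})

columns = rows_to_columns(rows)
print("row-wise bytes:   ", compressed_size(rows))
print("column-wise bytes:", compressed_size(columns))
```

Grouping similar values together often helps a general-purpose compressor, though the exact sizes depend on the data, and real Parquet files use their own encodings rather than JSON plus zlib.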
Download files
Source Distribution
parquest-0.0.11.tar.gz (2.6 kB)
Built Distribution
Hashes for parquest-0.0.11-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 05b61e0b42266daaf053bd75cdc003cf716f0be6b7ddb1df92b50fbdf9a784e0
MD5 | ff3f9e8c10bb52d2b47cf4d1c6451ebe
BLAKE2b-256 | f9d95952a81d6a4636c5efebfcbccd0c7cae6922ad510395fbbcfb151af9ba25
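A downloaded file can be checked against the digests listed above. The following is a minimal sketch using Python's standard `hashlib` module. The function names `file_digests` and `matches` are hypothetical helpers, and the file path passed to them is assumed to be wherever the wheel was saved locally.

```python
import hashlib


def file_digests(path, chunk_size=65536):
    """Return SHA256, MD5 and BLAKE2b-256 hex digests of the file at path."""
    hashers = {
        "SHA256": hashlib.sha256(),
        "MD5": hashlib.md5(),
        # BLAKE2b-256 is BLAKE2b with a 32-byte (256-bit) digest.
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as f:
        # Read in chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            for h in hashers.values():
                h.update(chunk)
    return {name: h.hexdigest() for name, h in hashers.items()}


def matches(path, algorithm, expected_hex):
    """True if the file's digest under `algorithm` equals `expected_hex`."""
    return file_digests(path)[algorithm] == expected_hex.strip().lower()
```

For example, `matches("parquest-0.0.11-py3-none-any.whl", "SHA256", expected)` would compare a local copy of the wheel with the SHA256 value from the listing, where `expected` holds that hex string.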