Skip to main content

A command line application that helps converting raw data into highly-structured data in Parquet.

Project description

Parquest

Parquest is a project that aims to structure raw data into a structured format based on the Parquet file format. The name "Parquest" is a portmanteau of "Parquet" and "quest," symbolizing the journey of transforming raw data into a structured format.

Introduction

In today's data-driven world, dealing with large volumes of raw data can be challenging. The Parquest project provides a solution by leveraging the power of the Parquet file format to structure and organize raw data efficiently.

Features

  • Data Structuring: Parquest enables you to convert raw data into a structured format based on the Parquet file format.
  • Efficient Storage: The Parquet file format is designed for efficient storage and retrieval of structured data, making it ideal for big data applications.
  • Columnar Storage: Parquest stores data in a columnar format, which allows for faster query performance and better compression ratios.
  • Schema Evolution: Parquest supports schema evolution, allowing you to easily modify the structure of your data over time without breaking compatibility.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parquest-0.0.15.tar.gz (10.1 kB view hashes)

Uploaded Source

Built Distribution

parquest-0.0.15-py3-none-any.whl (12.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page