# parquet2hive [![Build Status](https://travis-ci.org/mozilla/parquet2hive.svg?branch=master)](https://travis-ci.org/mozilla/parquet2hive)
Hive import statement generator for Parquet datasets. Supports versioned datasets and schema evolution.
## Installing from PyPI
To install this package from PyPI, run:
```bash
pip install parquet2hive
```
## Updating the Package on PyPI
To build a source distribution of the current version and upload it, run:
```bash
python setup.py sdist upload
```
## Using the TestPyPI Servers
You will need a separate account on https://testpypi.python.org.
To upload the package to the TestPyPI servers, ensure your ```~/.pypirc``` contains the following:
```ini
[distutils]
index-servers =
    pypi
    pypitest

[pypitest]
repository = https://testpypi.python.org/pypi
username = testpypi_username
password = testpypi_password

[pypi]
repository = https://pypi.python.org/pypi
username = pypi_username
password = pypi_password
```
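A malformed `~/.pypirc` is easy to miss until an upload fails, so it can help to check the file before uploading. The following is a minimal sketch, not part of parquet2hive: it uses Python's standard `configparser` to read a configuration like the one above and list the repository URL of each declared index server. The helper name `list_index_servers` and the inline `PYPIRC_TEXT` sample are assumptions made for illustration; the credentials are the same placeholders used above.

```python
import configparser

# Sample contents mirroring the ~/.pypirc layout above (placeholder credentials).
PYPIRC_TEXT = """\
[distutils]
index-servers =
    pypi
    pypitest

[pypitest]
repository = https://testpypi.python.org/pypi
username = testpypi_username
password = testpypi_password

[pypi]
repository = https://pypi.python.org/pypi
username = pypi_username
password = pypi_password
"""


def list_index_servers(text):
    """Return {server_name: repository_url} for each declared index server.

    Hypothetical helper for illustration; raises a configparser error if a
    declared server has no matching section or no `repository` entry.
    """
    # RawConfigParser avoids interpolation errors if a password contains '%'.
    parser = configparser.RawConfigParser()
    parser.read_string(text)
    # index-servers is a multi-line value: one server name per indented line.
    names = parser.get("distutils", "index-servers").split()
    return {name: parser.get(name, "repository") for name in names}


if __name__ == "__main__":
    for name, url in list_index_servers(PYPIRC_TEXT).items():
        print(name, url)
```

To check a real file, read it with `open(os.path.expanduser("~/.pypirc")).read()` and pass the text to the helper.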
Upload the code using:
```bash
python setup.py sdist upload -r https://testpypi.python.org/pypi
```
Finally, install the most recent package from the test repository on any machine using:
```bash
pip install parquet2hive -i https://testpypi.python.org/pypi
```
## Example usage
```bash
parquet2hive s3://telemetry-parquet/longitudinal | bash
```
To see the available command-line arguments, run ```parquet2hive -h```.
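Because the example pipes the generated statements straight into `bash`, it may be useful to look at them first. The sketch below is one way to do that from Python, assuming `parquet2hive` is on the `PATH` and the dataset location is readable. The helper name `generate_statements` and its `run` parameter are hypothetical and exist only for this illustration; the only CLI usage assumed is the positional dataset argument shown above.

```python
import subprocess


def generate_statements(dataset, run=subprocess.run):
    """Run parquet2hive for `dataset` and return its output without executing it.

    `run` defaults to subprocess.run; it is a parameter only so the helper
    can be exercised without the real CLI installed.
    """
    result = run(
        ["parquet2hive", dataset],
        capture_output=True,  # collect stdout instead of piping it to bash
        text=True,            # decode the output as str
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    return result.stdout


if __name__ == "__main__":
    # Requires parquet2hive on PATH and access to the bucket.
    print(generate_statements("s3://telemetry-parquet/longitudinal"))
```

Once the printed statements look right, the original `parquet2hive ... | bash` pipeline can be run as before.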