Skip to main content

Hive import statement generator for Parquet datasets

Project description

# parquet2hive [![Build Status](https://travis-ci.org/mozilla/parquet2hive.svg?branch=master)](https://travis-ci.org/mozilla/parquet2hive)
Hive import statement generator for Parquet datasets. Supports versioned datasets and schema evolution.

## Installing from Pypi
To install this package from Pypi, run:

```bash
pip install parquet2hive
```

## Updating the Package on PyPi
To upload the most recent version, run:

```bash
python setup.py sdist upload
```

## Using the TestPypi Servers
You will need a separate account on https://testpypi.python.org.
To upload the file to the pypi test servers, ensure your ```~/.pypirc``` contains the following:

```bash
[distutils]
index-servers=
pypi
pypitest

[pypitest]
repository = https://testpypi.python.org/pypi
username = testpypi_username
password = testpypi_password

[pypi]
repository = https://pypi.python.org/pypi
username = pypi_username
password = pypi_password
```

Upload the code using:
```bash
python setup.py sdist upload -r https://testpypi.python.org/pypi
```

Finally, pull the most recent package from the test-repository on any machine using:
```bash
pip install parquet2hive -i https://testpypi.python.org/pypi
```

## Example usage
```bash
parquet2hive s3://telemetry-parquet/longitudinal | bash
```

To see the allowed command line interface arguments, run ```parquet2hive -h```

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parquet2hive-0.3.0.tar.gz (22.6 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page