Skip to main content

Python DB API 2.0 (PEP 249) compliant wrapper for Amazon Athena JDBC driver

Project description

https://img.shields.io/pypi/pyversions/PyAthenaJDBC.svg https://circleci.com/gh/laughingman7743/PyAthenaJDBC.svg?style=shield https://codecov.io/gh/laughingman7743/PyAthenaJDBC/branch/master/graph/badge.svg https://img.shields.io/pypi/l/PyAthenaJDBC.svg

PyAthenaJDBC

PyAthenaJDBC is a python DB API 2.0 (PEP 249) compliant wrapper for Amazon Athena JDBC driver.

Requirements

  • Python

    • CPython 2.6, 2,7, 3,4, 3.5

  • Java

    • Java 7+

Installation

$ pip install PyAthenaJDBC

Usage

Basic usage:

from pyathenajdbc import connect

conn = connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')
try:
    with conn.cursor() as cursor:
        cursor.execute("""
        SELECT * FROM test_pyathena_jdbc.one_row
        """)
        print(cursor.description)
        print(cursor.fetchall())
finally:
    conn.close()

Cursor iteration:

from pyathenajdbc import connect

conn = connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')
try:
    with conn.cursor() as cursor:
        cursor.execute("""
        SELECT * FROM test_pyathena_jdbc.many_rows LIMIT 10
        """)
        for row in cursor:
            print(row)
finally:
    conn.close()

Query with parameter:

from pyathenajdbc import connect

conn = connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')
try:
    with conn.cursor() as cursor:
        cursor.execute("""
        SELECT col_int FROM test_pyathena_jdbc.one_row_complex where col_int = {0}
        """, 2147483647)
        print(cursor.fetchall())

        cursor.execute("""
        SELECT col_string FROM test_pyathena_jdbc.one_row_complex where col_string = {param}
        """, param='a string')
        print(cursor.fetchall())
finally:
    conn.close()

Minimal example for Pandas DataFrame:

from pyathenajdbc import connect
import pandas as pd

conn = connect(access_key=<access key>,
               secret_key=<secret key>,
               s3_staging_dir=<staging dir>,
               region_name=<region name>,
               jvm_path=<jvm path>) # optional, as used by jpype
df = pd.read_sql("SELECT * FROM <table name> LIMIT 10", conn)

As Pandas DataFrame:

import contextlib
from pyathenajdbc import connect
from pyathenajdbc.util import as_pandas

with contextlib.closing(
        connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')) as conn:
    with conn.cursor() as cursor:
        cursor.execute("""
        SELECT * FROM test_pyathena_jdbc.many_rows
        """)
        df = as_pandas(cursor)
print(df.describe())

Credential

Support AWS CLI credentials configuration.

Credential Files

~/.aws/credentials

[default]
aws_access_key_id=YOUR_ACCESS_KEY_ID
aws_secret_access_key=YOUR_SECRET_ACCESS_KEY

~/.aws/config

[default]
region=us-west-2
output=json

Environment variables

$ export AWS_ACCESS_KEY_ID=YOUR_ACCESS_KEY_ID
$ export AWS_SECRET_ACCESS_KEY=YOUR_SECRET_ACCESS_KEY
$ export AWS_DEFAULT_REGION=us-west-2

Additional environment variable:

$ export AWS_ATHENA_S3_STAGING_DIR=s3://YOUR_S3_BUCKET/path/to/

Testing

Depends on the following environment variables:

$ export AWS_ACCESS_KEY_ID=YOUR_ACCESS_KEY_ID
$ export AWS_SECRET_ACCESS_KEY=YOUR_SECRET_ACCESS_KEY
$ export AWS_DEFAULT_REGION=us-west-2
$ export AWS_ATHENA_S3_STAGING_DIR=s3://YOUR_S3_BUCKET/path/to/

Run test:

$ pip install pytest
$ py.test

Run test multiple Python versions:

$ pip install tox
$ pyenv local 2.6.9 2.7.12 3.4.5 3.5.2
$ tox

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PyAthenaJDBC-1.0.3.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

PyAthenaJDBC-1.0.3-py2.py3-none-any.whl (8.5 MB view details)

Uploaded Python 2 Python 3

File details

Details for the file PyAthenaJDBC-1.0.3.tar.gz.

File metadata

File hashes

Hashes for PyAthenaJDBC-1.0.3.tar.gz
Algorithm Hash digest
SHA256 0de344a03e2a76bfcf81993bac8a3d9882d18f89965b33b3cd2d00ea34178a2a
MD5 14d66a4073d86be78655b156171849aa
BLAKE2b-256 6eb349ed87ee52d2ead7e0670a8cd51842cac2cbcefff428fbe152356dd84c78

See more details on using hashes here.

Provenance

File details

Details for the file PyAthenaJDBC-1.0.3-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for PyAthenaJDBC-1.0.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 a955350aadeffac60bf060ed13912fcb3c3500f104a965852837a1c0668017e3
MD5 9adcec09c1d0d586041256b23d506b99
BLAKE2b-256 cc5e940a5e079a592243f6509b921bc5f6a303cb27d540544e541c24af419934

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page