Python DB API 2.0 (PEP 249) compliant wrapper for Amazon Athena JDBC driver
Project description
PyAthenaJDBC
PyAthenaJDBC is a python DB API 2.0 (PEP 249) compliant wrapper for Amazon Athena JDBC driver.
Requirements
Python
CPython 2.6, 2,7, 3,4, 3.5
Java
Java 7+
Installation
$ pip install PyAthenaJDBC
Credential
Support AWS CLI credentials configuration.
Credential Files
~/.aws/credentials
[default]
aws_access_key_id=YOUR_ACCESS_KEY_ID
aws_secret_access_key=YOUR_SECRET_ACCESS_KEY
~/.aws/config
[default]
region=us-west-2
output=json
Environment variables
$ export AWS_ACCESS_KEY_ID=YOUR_ACCESS_KEY_ID
$ export AWS_SECRET_ACCESS_KEY=YOUR_SECRET_ACCESS_KEY
$ export AWS_DEFAULT_REGION=us-west-2
Additional environment variable:
$ export AWS_ATHENA_S3_STAGING_DIR=s3://YOUR_S3_BUCKET/path/to/
Usage
Basic usage:
from pyathenajdbc import connect
conn = connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')
try:
with conn.cursor() as cursor:
cursor.execute("""
SELECT * FROM test_pyathena_jdbc.one_row
""")
print(cursor.description)
print(cursor.fetchall())
finally:
conn.close()
Cursor iteration:
from pyathenajdbc import connect
conn = connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')
try:
with conn.cursor() as cursor:
cursor.execute("""
SELECT * FROM test_pyathena_jdbc.many_rows LIMIT 10
""")
for row in cursor:
print(row)
finally:
conn.close()
Query with parameter:
from pyathenajdbc import connect
conn = connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')
try:
with conn.cursor() as cursor:
cursor.execute("""
SELECT col_int FROM test_pyathena_jdbc.one_row_complex where col_int = {0}
""", 2147483647)
print(cursor.fetchall())
cursor.execute("""
SELECT col_string FROM test_pyathena_jdbc.one_row_complex where col_string = {param}
""", param='a string')
print(cursor.fetchall())
finally:
conn.close()
Pandas DataFrame:
import contextlib
from pyathenajdbc import connect
from pyathenajdbc.util import as_pandas
with contextlib.closing(
connect(s3_staging_dir='s3://YOUR_S3_BUCKET/path/to/')) as conn:
with conn.cursor() as cursor:
cursor.execute("""
SELECT * FROM test_pyathena_jdbc.many_rows
""")
df = as_pandas(cursor)
print(df.describe())
Testing
Depends on the following environment variables:
$ export AWS_ACCESS_KEY_ID=YOUR_ACCESS_KEY_ID
$ export AWS_SECRET_ACCESS_KEY=YOUR_SECRET_ACCESS_KEY
$ export AWS_DEFAULT_REGION=us-west-2
$ export AWS_ATHENA_S3_STAGING_DIR=s3://YOUR_S3_BUCKET/path/to/
Run test:
$ pip install pytest
$ py.test
Run test multiple Python versions:
$ pip install tox
$ pyenv local 2.6.9 2.7.12 3.4.5 3.5.2
$ tox
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
PyAthenaJDBC-1.0.1.tar.gz
(10.1 kB
view hashes)
Built Distribution
Close
Hashes for PyAthenaJDBC-1.0.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7459c4d57561502fbc1f2a4e9dc364d9f2c860d6a8276b30ac91b08af2622eec |
|
MD5 | 911dc2f1b26adb862745a7eb1cf016e9 |
|
BLAKE2b-256 | 0ed25cf11307d09586eb630415c67c24c90ac3e8bd3bfb77c70ce8d17120f73c |