===============================
pydrill
===============================

.. image:: https://img.shields.io/travis/PythonicNinja/pydrill.svg
        :target: https://travis-ci.org/PythonicNinja/pydrill

.. image:: https://img.shields.io/pypi/v/pydrill.svg
        :target: https://pypi.python.org/pypi/pydrill

.. image:: https://readthedocs.org/projects/pydrill/badge/?version=latest
        :target: https://readthedocs.org/projects/pydrill/?badge=latest
        :alt: Documentation Status

.. image:: https://coveralls.io/repos/PythonicNinja/pydrill/badge.svg?branch=master&service=github
        :target: https://coveralls.io/github/PythonicNinja/pydrill?branch=master

Python Driver for `Apache Drill <https://drill.apache.org/>`_.

*Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage*

* Free software: MIT license
* Documentation: https://pydrill.readthedocs.org.

Features
--------

* Python 2/3 compatibility
* Support for all REST API calls, including profiles/options/metrics (`docs with full list <http://pydrill.readthedocs.io/en/latest/readme.html#supported-api-calls>`_)
* Mapping of results to built-in Python types
* Compatibility with pandas DataFrames

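
For orientation, the sketch below shows roughly what a SQL call over Drill's REST interface looks like without the driver. It only builds the request and parses a response body; it does not contact a server. The ``/query.json`` endpoint and the ``queryType``/``query`` payload keys come from Drill's REST API. The helper names ``build_drill_query_request`` and ``rows_from_response`` are hypothetical, written for illustration with the Python 3 standard library, and are not part of pydrill.

```python
import json
from urllib.request import Request


def build_drill_query_request(sql, host="localhost", port=8047):
    """Build (but do not send) a POST request for Drill's REST query endpoint.

    Hypothetical helper for illustration; pydrill wraps this for you.
    """
    payload = {"queryType": "SQL", "query": sql}
    return Request(
        url="http://%s:%d/query.json" % (host, port),
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def rows_from_response(body):
    """Return the list of row dicts from a JSON query response body."""
    return json.loads(body).get("rows", [])


request = build_drill_query_request("SELECT 1")
print(request.get_method(), request.full_url)
```

Passing the request to ``urllib.request.urlopen`` would send it to a running Drill instance. In practice ``PyDrill.query`` is the more convenient route, since it also handles result mapping.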
Installation
------------

Version from https://pypi.python.org/pypi/pydrill::

    $ pip install pydrill

Latest version from git::

    $ pip install git+https://github.com/PythonicNinja/pydrill.git

Sample usage
------------
::

    from pydrill.client import PyDrill
    # Exception class is assumed to live in pydrill.exceptions.
    from pydrill.exceptions import ImproperlyConfigured

    drill = PyDrill(host='localhost', port=8047)

    # Fail early if the Drill REST endpoint is not reachable.
    if not drill.is_active():
        raise ImproperlyConfigured('Please run Drill first')

    yelp_reviews = drill.query('''
      SELECT * FROM
      `dfs.root`.`./Users/macbookair/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
      LIMIT 5
    ''')

    # Each result row behaves like a dict keyed by column name.
    for result in yelp_reviews:
        print("%s: %s" % (result['type'], result['date']))

    # pandas dataframe
    df = yelp_reviews.to_dataframe()
    print(df[df['stars'] > 3])

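
The sample above raises immediately when Drill is not running. If Drill is still starting up (for example inside a container), it can be friendlier to poll for a short while first. The helper below is a minimal sketch of that idea. ``wait_until_active`` is a hypothetical name, not a pydrill function; it accepts any zero-argument callable, such as ``drill.is_active``.

```python
import time


def wait_until_active(is_active, attempts=5, delay=1.0):
    """Poll is_active() until it returns a truthy value.

    Returns True as soon as the check succeeds, or False once all
    attempts are used up. Sleeps `delay` seconds between attempts.
    """
    for attempt in range(attempts):
        if is_active():
            return True
        # Do not sleep after the final failed attempt.
        if attempt < attempts - 1:
            time.sleep(delay)
    return False
```

With a client from the sample, this could be used as ``if not wait_until_active(drill.is_active): raise ImproperlyConfigured('Please run Drill first')``.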

=======
History
=======

0.1.1 (2016-05-21)
------------------

* Anaconda requirements fixed

0.1.0 (2016-05-19)
------------------

* First minor release
* Updated docs

0.0.2 (2016-04-24)
------------------

* First release on PyPI
* Implementation of metrics/storage/options/stats
* Builds are tested in a Docker container with Apache Drill running
* Support for pandas with ``ResultQuery.to_dataframe``

0.0.1 (2015-12-28)
------------------

* Project start