Python Driver for Apache Drill.
Project description
===============================
pydrill
===============================
.. image:: https://img.shields.io/travis/PythonicNinja/pydrill.svg
:target: https://travis-ci.org/PythonicNinja/pydrill
.. image:: https://img.shields.io/pypi/v/pydrill_dsl.svg
:target: https://pypi.python.org/pypi/pydrill_dsl
.. image:: https://readthedocs.org/projects/pydrill/badge/?version=latest
:target: https://readthedocs.org/projects/pydrill/?badge=latest
:alt: Documentation Status
.. image:: https://coveralls.io/repos/PythonicNinja/pydrill/badge.svg?branch=master&service=github
:target: https://coveralls.io/github/PythonicNinja/pydrill?branch=master
Python Driver for `Apache Drill <https://drill.apache.org/>`_.
*Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage*
* Free software: MIT license
* Documentation: https://pydrill.readthedocs.org.
Features
--------
* Python 2/3 compatibility,
* Support for all rest API calls inluding profiles/options/metrics `docs with full list <http://pydrill.readthedocs.io/en/latest/readme.html#supported-api-calls>`_.
* Mapping Results to internal python types,
* Compatibility with Pandas data frame,
Installation
------------
::
Version from https://pypi.python.org/pypi/pydrill::
$ pip install pydrill
Latest version from git::
$ pip install git+git://github.com/PythonicNinja/pydrill.git
Sample usage
------------
::
from pydrill.client import PyDrill
drill = PyDrill(host='localhost', port=8047)
if not drill.is_active():
raise ImproperlyConfigured('Please run Drill first')
yelp_reviews = drill.query('''
SELECT * FROM
`dfs.root`.`./Users/macbookair/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
LIMIT 5
''')
for result in yelp_reviews:
print("%s: %s" %(result['type'], result['date']))
# pandas dataframe
df = yelp_reviews.to_dataframe()
print(df[df['stars'] > 3])
=======
History
=======
0.1.0 (2016-05-19)
------------------
* First minor release
* Updated docs
0.0.2 (2016-04-24)
------------------
* First release on PyPI.
* Implementation of metrics/storage/options/stats
* Builds are tested by docker container with Apache Drill running
* support for pandas with ResultQuery.to_dataframe
0.0.1 (2015-12-28)
------------------
* Project start
pydrill
===============================
.. image:: https://img.shields.io/travis/PythonicNinja/pydrill.svg
:target: https://travis-ci.org/PythonicNinja/pydrill
.. image:: https://img.shields.io/pypi/v/pydrill_dsl.svg
:target: https://pypi.python.org/pypi/pydrill_dsl
.. image:: https://readthedocs.org/projects/pydrill/badge/?version=latest
:target: https://readthedocs.org/projects/pydrill/?badge=latest
:alt: Documentation Status
.. image:: https://coveralls.io/repos/PythonicNinja/pydrill/badge.svg?branch=master&service=github
:target: https://coveralls.io/github/PythonicNinja/pydrill?branch=master
Python Driver for `Apache Drill <https://drill.apache.org/>`_.
*Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage*
* Free software: MIT license
* Documentation: https://pydrill.readthedocs.org.
Features
--------
* Python 2/3 compatibility,
* Support for all rest API calls inluding profiles/options/metrics `docs with full list <http://pydrill.readthedocs.io/en/latest/readme.html#supported-api-calls>`_.
* Mapping Results to internal python types,
* Compatibility with Pandas data frame,
Installation
------------
::
Version from https://pypi.python.org/pypi/pydrill::
$ pip install pydrill
Latest version from git::
$ pip install git+git://github.com/PythonicNinja/pydrill.git
Sample usage
------------
::
from pydrill.client import PyDrill
drill = PyDrill(host='localhost', port=8047)
if not drill.is_active():
raise ImproperlyConfigured('Please run Drill first')
yelp_reviews = drill.query('''
SELECT * FROM
`dfs.root`.`./Users/macbookair/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
LIMIT 5
''')
for result in yelp_reviews:
print("%s: %s" %(result['type'], result['date']))
# pandas dataframe
df = yelp_reviews.to_dataframe()
print(df[df['stars'] > 3])
=======
History
=======
0.1.0 (2016-05-19)
------------------
* First minor release
* Updated docs
0.0.2 (2016-04-24)
------------------
* First release on PyPI.
* Implementation of metrics/storage/options/stats
* Builds are tested by docker container with Apache Drill running
* support for pandas with ResultQuery.to_dataframe
0.0.1 (2015-12-28)
------------------
* Project start
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for pydrill-0.1.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 923dfeb6fc1f4f20a135dd26f601076eef83f8b6ea8b19e3bc063a8557a74035 |
|
MD5 | c7d97fce5af9d82c377890ef6eccbcd1 |
|
BLAKE2b-256 | 20a6639062f7b6d05094aa4254034d916161a73aadc02eccc7cadd56b571bcde |