Codenize your data sources
Project description
akagi
Free software: MIT license
Features
akagi supports iter and save interface for various data sources such as Amazon Redshift, Amazon S3 (more in future).
Installation
Install via pip:
pip install akagi
or from source:
$ git clone https://github.com/ayemos/akagi akagi $ cd akagi $ python setup.py install
Example
RedshiftDataSource
with RedshiftDataSource.for_query(
'log-redshift-unload.ap-northeast-1', # S3 Bucket for intermediate storage
'select * from (select user_id, path from logs.imp limit 10000)', # Your Query here
'logs', # schema
'imp' # table (Those two are used to generate unique prefix for S3 object (e.g. logs/imp/20170312_081527)
) as ds:
ds.save('./akagi_test') # save results to local
for d in ds:
print(d) # iterate on result
S3DataSource
with S3DataSource.for_prefix(
'image-data.ap-northeast-1',
'data/image_net/zebra',
FileFormat.BINARY) as ds:
...
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
akagi-0.1.6.tar.gz
(6.0 kB
view hashes)
Built Distribution
akagi-0.1.6-py2.py3-none-any.whl
(11.3 kB
view hashes)
Close
Hashes for akagi-0.1.6-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e069a64969f34b17a995eec6247d92d41129fb45111b975cf09c711b370cd1cf |
|
MD5 | df240c66b7bb1af1969227f9280d4c8a |
|
BLAKE2b-256 | b57d424f41443db21e7426dfb47b317e9beb6b358ecfec542708a0067c4808ad |