Skip to main content

An extension for storing data in HDFS

Project description

ckanext-hdfs - HDFS storing extension

ckanext-hdfs is an extension for enabling the file storage in HDFS - Hadoop Distributed File System.

This extension provides an ability to let users store a certain resource in HDFS, instead of the local file system.

Notes:

  • JAVA_HOME and HADOOP_HOME need to be set correctly.

Requirements

This extension was developed and tested under CKAN-2.7.3 and HADOOP-3.0.0

Installation

To install ckanext-hdfs:

  1. Activate your CKAN virtual environment, for example:

    . /usr/lib/ckan/default/bin/activate
  2. Install the ckanext-hdfs Python package into your virtual environment:

    pip install ckanext-hdfs
  3. Add hdfs setting in your CKAN config file (by default the config file is located at /etc/ckan/default/production.ini) as follows:

    ckan.plugins = hdfs <other-plugins>
    ckan.hdfs.storage_path = /ckan/data
  4. Restart CKAN. For example if you’ve deployed CKAN with Apache on Ubuntu:

    sudo service apache2 reload

Development Installation

To install ckanext-hdfs for development, activate your CKAN virtualenv and do:

git clone https://github.com/etri-odp/ckanext-hdfs.git
cd ckanext-hdfs
python setup.py develop
pip install -r dev-requirements.txt

Running the Tests

To run the tests, do:

nosetests --nologcapture --with-pylons=test.ini

To run the tests and produce a coverage report, first make sure you have coverage installed in your virtualenv (pip install coverage) then run:

nosetests --nologcapture --with-pylons=test.ini --with-coverage --cover-package=ckanext.hdfs --cover-inclusive --cover-erase --cover-tests

Registering ckanext-hdfs on PyPI

ckanext-hdfs should be available on PyPI as https://pypi.python.org/pypi/ckanext-hdfs. If that link doesn’t work, then you can register the project on PyPI for the first time by following these steps:

  1. Create a source distribution of the project:

    python setup.py sdist
  2. Register the project:

    python setup.py register
  3. Upload the source distribution to PyPI:

    python setup.py sdist upload
  4. Tag the first release of the project on GitHub with the version number from the setup.py file. For example if the version number in setup.py is 0.0.1 then do:

    git tag 0.0.1
    git push --tags

Acknowledgements

This work was supported by Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No.2017-00253, Development of an Advanced Open Data Distribution Platform based on International Standards)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ckanext-hdfs-0.0.1.tar.gz (20.2 kB view details)

Uploaded Source

Built Distribution

ckanext_hdfs-0.0.1-py2-none-any.whl (19.6 kB view details)

Uploaded Python 2

File details

Details for the file ckanext-hdfs-0.0.1.tar.gz.

File metadata

  • Download URL: ckanext-hdfs-0.0.1.tar.gz
  • Upload date:
  • Size: 20.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.11.1 setuptools/20.4 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/2.7.12

File hashes

Hashes for ckanext-hdfs-0.0.1.tar.gz
Algorithm Hash digest
SHA256 04af740b27640c046e174854ffa1bd5b2dbd59e0517574f786254ac923e41c5b
MD5 24ded8171f6a26f83139a7d5b5eb4f82
BLAKE2b-256 f20f63c98aba00687d05be4d47202307acd2bb61c9d282b8f73b531f481afded

See more details on using hashes here.

File details

Details for the file ckanext_hdfs-0.0.1-py2-none-any.whl.

File metadata

  • Download URL: ckanext_hdfs-0.0.1-py2-none-any.whl
  • Upload date:
  • Size: 19.6 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.11.1 setuptools/20.4 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/2.7.12

File hashes

Hashes for ckanext_hdfs-0.0.1-py2-none-any.whl
Algorithm Hash digest
SHA256 5eb48caa61d5af82f556a95dd6c18805a7ddc1f5099a70894525ab06601f02ac
MD5 3f1884cf316f42a64b78f88cb2ccf8b7
BLAKE2b-256 e883a74fb96cb47a9c5fd678f1b200db63596d379ebf6932354e79ed57267179

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page