Skip to main content

A py4j based hdfs client for python for native hdfs CLI performance.

Project description

pyhdfs-client : Powerful HDFS Client for python

https://pypi.python.org/pypi/pyhdfs_client

Why it's fast powerful?

Native hdfs client offers much better performance than webhdfs clients. However calling native client for hadoop operations have an additional overhead of starting jvm. pyhdfs-client brings the performance of native hdfs client without any overhead of starting jvm on every command execution.

Features

  • HDFS client for python
  • Easy to integrate with python applications
  • Better Performance than webhdfs clients
  • Provide native hadoop client performance without any overhead
  • Support both UNIX and Windows

Whats new in 0.1.3?

  • Multiple instances of HDFS client enabled.
  • [fix] Temporary folder deletion
  • [fix] Java process shutdown issues on UNIX

Installation

pip install pyhdfs-client

Requirements: hadoop binaries and py4j installed

Sample Usage

>>> from pyhdfs_client.pyhdfs_client import HDFSClient
>>> hdfs_client = HDFSClient()
>>> ret, out, err = hdfs_client.run(['-ls', '/'])
>>> print(out)
Found 1 items
drwxr-xr-x   - gp supergroup          0 2021-03-21 01:10 /f1
>>> hdfs_client.stop() # to terminate hdfs client

Contribution

  • Any contribution for enhancements and bug fixes is welcome.

Credits

History

  • 0.1.2 (2021-03-23)
    • added UNIX Support
  • 0.1.1 (2021-03-22)
    • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyhdfs_client-0.1.3.tar.gz (13.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyhdfs_client-0.1.3-py2.py3-none-any.whl (6.4 kB view details)

Uploaded Python 2Python 3

File details

Details for the file pyhdfs_client-0.1.3.tar.gz.

File metadata

  • Download URL: pyhdfs_client-0.1.3.tar.gz
  • Upload date:
  • Size: 13.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for pyhdfs_client-0.1.3.tar.gz
Algorithm Hash digest
SHA256 605994f5b42c2a4f5b0dc3808af04d95d8efbab3fe2e366120b8699e47db09be
MD5 5c494ee8ebd52d1fb2688e1893ae3a77
BLAKE2b-256 30d8867697b5b870ced9001a84cf025e0a981b2dc3c3d7727cae51d7b24626d6

See more details on using hashes here.

File details

Details for the file pyhdfs_client-0.1.3-py2.py3-none-any.whl.

File metadata

  • Download URL: pyhdfs_client-0.1.3-py2.py3-none-any.whl
  • Upload date:
  • Size: 6.4 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for pyhdfs_client-0.1.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 5491c5f9070951afb21468bb434c6a0d3ab0258c9f023547f3d51c2ea6df4b36
MD5 48c15495b4c3c72f5d611e8829355b59
BLAKE2b-256 f8b3623f9fee236d1a5a7ebbc0fd8186d8e45b3a51a6758024d0a0663db781f7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page