Jupyter Server extension to browse HDFS filesystem

HdfsBrowser

Hadoop JupyterLab Extension

This extension consists of two parts: a Python package named hdfsbrowser, which installs the server extension and the nbextension, and an NPM package named @swan-cern/hdfsbrowser, which provides the JupyterLab extension.

Requirements

  • JupyterLab >= 2.1

Install

Note: You will need NodeJS to install the extension.

pip install hdfsbrowser
jupyter nbextension install hdfsbrowser --py
jupyter nbextension enable  hdfsbrowser --py
jupyter lab build

Configure extension to work with Hadoop cluster through hdfs-site.xml

Add the following to the notebook configuration file jupyter_notebook_config.py:

c.HDFSBrowserConfig.hdfs_site_path = "/cvmfs/sft.cern.ch/lcg/etc/hadoop-confext/conf/etc/analytix/hadoop.analytix/hdfs-site.xml"
c.HDFSBrowserConfig.hdfs_site_namenodes_property = "dfs.ha.namenodes.analytix"
c.HDFSBrowserConfig.hdfs_site_namenodes_port = "50070"
c.HDFSBrowserConfig.webhdfs_token = "dummy"
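
To sanity-check these settings before starting the server, the sketch below reads a Hadoop site file, looks up the namenode IDs listed under the configured property, and builds a WebHDFS LISTSTATUS URL for each host on the configured port. It is only an illustration and not the extension's own code: the sample XML, the host names, and the helper names (read_properties, namenode_hosts, liststatus_url) are hypothetical, and it assumes each host can be taken from the standard dfs.namenode.rpc-address.<nameservice>.<id> property, which may differ from how the extension resolves namenodes.

```python
import xml.etree.ElementTree as ET
from urllib.parse import quote

# Hypothetical hdfs-site.xml content; the host names are placeholders.
SAMPLE_HDFS_SITE = """<configuration>
  <property>
    <name>dfs.ha.namenodes.analytix</name>
    <value>nn1,nn2</value>
  </property>
  <property>
    <name>dfs.namenode.rpc-address.analytix.nn1</name>
    <value>namenode1.example.org:8020</value>
  </property>
  <property>
    <name>dfs.namenode.rpc-address.analytix.nn2</name>
    <value>namenode2.example.org:8020</value>
  </property>
</configuration>
"""


def read_properties(xml_text):
    """Return the <property> entries of a Hadoop site file as a dict."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None and value is not None:
            props[name.strip()] = value.strip()
    return props


def namenode_hosts(props, namenodes_property):
    """Resolve the namenode IDs under namenodes_property to host names.

    Assumption: hosts come from dfs.namenode.rpc-address.<nameservice>.<id>.
    """
    # "dfs.ha.namenodes.analytix" -> "analytix"
    nameservice = namenodes_property.rsplit(".", 1)[-1]
    hosts = []
    for nn_id in props[namenodes_property].split(","):
        key = f"dfs.namenode.rpc-address.{nameservice}.{nn_id.strip()}"
        hosts.append(props[key].split(":")[0])
    return hosts


def liststatus_url(host, port, path):
    """Build a WebHDFS REST URL that lists the given HDFS directory."""
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?op=LISTSTATUS"


props = read_properties(SAMPLE_HDFS_SITE)
for host in namenode_hosts(props, "dfs.ha.namenodes.analytix"):
    print(liststatus_url(host, "50070", "/user"))
```

To check a real cluster configuration, read the file referenced by hdfs_site_path and pass its contents to read_properties in place of the sample string. A missing property raises KeyError, which usually points at a wrong hdfs_site_namenodes_property value.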

Troubleshoot

If you are not seeing the frontend, check if it's installed:

jupyter labextension list

If it is installed, try:

jupyter lab clean
jupyter lab build
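
The installation check from the first troubleshooting step can also be scripted, for example in a deployment test. The following is a minimal sketch under a few assumptions: the helper names are hypothetical, the check is a plain substring match on the extension name, and both stdout and stderr are captured because some JupyterLab versions print the listing to stderr.

```python
import subprocess

EXTENSION_NAME = "@swan-cern/hdfsbrowser"


def is_listed(listing, name=EXTENSION_NAME):
    """Return True if any line of the listing mentions the extension name."""
    return any(name in line for line in listing.splitlines())


def labextension_listing():
    """Run `jupyter labextension list` and return its combined output."""
    result = subprocess.run(
        ["jupyter", "labextension", "list"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Combine both streams, since the listing may go to either one.
    return result.stdout + result.stderr
```

Calling is_listed(labextension_listing()) returns True when the extension name appears in the listing. The substring match does not distinguish an enabled extension from a disabled one, so read the full listing if the frontend is still missing.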

Contributing

Install

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Move to hdfsbrowser directory

# Install server extension
# This will also build the js code
pip install -e .

# Install and enable the nbextension
jupyter nbextension install hdfsbrowser --py --sys-prefix
jupyter nbextension enable  hdfsbrowser --py --sys-prefix

# Link your development version of the extension with JupyterLab
jupyter labextension link .
# Rebuild JupyterLab after making any changes
jupyter lab build

# Rebuild Typescript source after making changes
jlpm build
# Rebuild JupyterLab after making any changes
jupyter lab build

You can watch the source directory and run JupyterLab in watch mode to watch for changes in the extension's source and automatically rebuild the extension and application.

# Watch the source directory in one terminal tab
jlpm watch
# Run JupyterLab in watch mode in another terminal tab
jupyter lab --watch

Uninstall

pip uninstall hdfsbrowser
jupyter labextension uninstall @swan-cern/hdfsbrowser
