Skip to main content

Jupyter Server extension to browse HDFS filesystem

Project description

HdfsBrowser

Hadoop JupyterLab Extension

This extension is composed of a Python package named hdfsbrowser, which installs the server+nbextension and a NPM package named @swan-cern/hdfsbrowser for the JupyterLab extension.

Hadoop JupyterLab Extension

Requirements

  • JupyterLab >= 2.1

Install

Note: You will need NodeJS to install the extension.

pip install hdfsbrowser
jupyter nbextension install hdfsbrowser --py
jupyter nbextension enable  hdfsbrowser --py
jupyter lab build

Configure extension to work with Hadoop cluster through hdfs-site.xml

Configure notebook jupyter_notebook_config.py:

c.HDFSBrowserConfig.hdfs_site_path = "/cvmfs/sft.cern.ch/lcg/etc/hadoop-confext/conf/etc/analytix/hadoop.analytix/hdfs-site.xml"
c.HDFSBrowserConfig.hdfs_site_namenodes_property = "dfs.ha.namenodes.analytix"
c.HDFSBrowserConfig.hdfs_site_namenodes_port = "50070"
c.HDFSBrowserConfig.webhdfs_token = "dummy"

Troubleshoot

If you are not seeing the frontend, check if it's installed:

jupyter labextension list

If it is installed, try:

jupyter lab clean
jupyter lab build

Contributing

Install

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Move to hdfsbrowser directory

# Install server extension
# This will also build the js code
pip install -e .

# Install and enable the nbextension
jupyter nbextension install hdfsbrowser --py --sys-prefix
jupyter nbextension enable  hdfsbrowser --py --sys-prefix

# Link your development version of the extension with JupyterLab
jupyter labextension link .
# Rebuild JupyterLab after making any changes
jupyter lab build

# Rebuild Typescript source after making changes
jlpm build
# Rebuild JupyterLab after making any changes
jupyter lab build

You can watch the source directory and run JupyterLab in watch mode to watch for changes in the extension's source and automatically rebuild the extension and application.

# Watch the source directory in another terminal tab
jlpm watch
# Run jupyterlab in watch mode in one terminal tab
jupyter lab --watch

Uninstall

pip uninstall hdfsbrowser
jupyter labextension uninstall @swan-cern/hdfsbrowser

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hdfsbrowser-1.1.1.tar.gz (19.0 kB view details)

Uploaded Source

Built Distribution

hdfsbrowser-1.1.1-py3-none-any.whl (74.0 kB view details)

Uploaded Python 3

File details

Details for the file hdfsbrowser-1.1.1.tar.gz.

File metadata

  • Download URL: hdfsbrowser-1.1.1.tar.gz
  • Upload date:
  • Size: 19.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.6

File hashes

Hashes for hdfsbrowser-1.1.1.tar.gz
Algorithm Hash digest
SHA256 7d90d858a0eb560bd5b47e6481cd4a268c91bc988eb9f660ebfffa5b3254c236
MD5 9e886fce8c6be12603807799174105db
BLAKE2b-256 dbd6877d79a724bd3289425def62195e8332b6c19478b722e3b414fde693a717

See more details on using hashes here.

File details

Details for the file hdfsbrowser-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: hdfsbrowser-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 74.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.6

File hashes

Hashes for hdfsbrowser-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f41034a7bca15106e2e1b4eb7ac1e31a9aca9bc1f63e44a25d3e908ad94ec111
MD5 f012571c3bd3c6f9173853495bec187c
BLAKE2b-256 4f1a81fa3c1e5b490a61605e69c3ce04f5ba76da19e54d76f41825588f098ab2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page