Skip to main content

Jupyter Server extension to browse HDFS filesystem

Project description

HdfsBrowser

Hadoop JupyterLab Extension to browse an HDFS filesystem.

This extension is composed of a Python package named hdfsbrowser for the server extension, a NPM package named @swan-cern/hdfsbrowser for the frontend extension and an nbextension for nbclassic.

Requirements

  • JupyterLab >= 4.0.0

Install

To install the extension, execute:

pip install hdfsbrowser

Configure extension to work with Hadoop cluster through hdfs-site.xml

You can set some configurations (check the serverextension.py for the full reference) in you jupyter-server or notebook config files (i.e jupyter_lab_config.py):

c.HDFSBrowserConfig.hdfs_site_path = "/cvmfs/sft.cern.ch/lcg/etc/hadoop-confext/conf/etc/analytix/hadoop.analytix/hdfs-site.xml"
c.HDFSBrowserConfig.hdfs_site_namenodes_property = "dfs.ha.namenodes.analytix"
c.HDFSBrowserConfig.hdfs_site_namenodes_port = "50070"
c.HDFSBrowserConfig.webhdfs_token = "dummy"

Uninstall

To remove the extension, execute:

pip uninstall hdfsbrowser

Troubleshoot

If you are seeing the frontend extension, but it is not working, check that the server extension is enabled:

jupyter server extension list

If the server extension is installed and enabled, but you are not seeing the frontend extension, check the frontend extension is installed:

jupyter labextension list

Contributing

Development install

Note: You will need NodeJS to build the extension package.

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Change directory to the hdfsbrowser directory
# Install package in development mode
pip install -e "."
# Link your development version of the extension with JupyterLab
jupyter labextension develop . --overwrite
# Server extension must be manually installed in develop mode
jupyter server extension enable hdfsbrowser
# Rebuild extension Typescript source after making changes
jlpm build

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

# Watch the source directory in one terminal, automatically rebuilding when needed
jlpm watch
# Run JupyterLab in another terminal
jupyter lab

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

jupyter lab build --minimize=False

Development uninstall

# Server extension must be manually disabled in develop mode
jupyter server extension disable hdfsbrowser
pip uninstall hdfsbrowser

In development mode, you will also need to remove the symlink created by jupyter labextension develop command. To find its location, you can run jupyter labextension list to figure out where the labextensions folder is located. Then you can remove the symlink named @swan-cern/hdfsbrowser within that folder.

Packaging the extension

See RELEASE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hdfsbrowser-1.2.1.tar.gz (132.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hdfsbrowser-1.2.1-py3-none-any.whl (30.8 kB view details)

Uploaded Python 3

File details

Details for the file hdfsbrowser-1.2.1.tar.gz.

File metadata

  • Download URL: hdfsbrowser-1.2.1.tar.gz
  • Upload date:
  • Size: 132.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for hdfsbrowser-1.2.1.tar.gz
Algorithm Hash digest
SHA256 6ef0e694810b32e081c573e041c1d720732905ae565a5c9f6211103db8ec44bc
MD5 e280b3477266e8ff41ad2df4bdb64c9c
BLAKE2b-256 55aa10c5b87843372e9c8f3fd4df4c2768d210dbd0a93e8da613427199bf7128

See more details on using hashes here.

Provenance

The following attestation bundles were made for hdfsbrowser-1.2.1.tar.gz:

Publisher: release.yml on swan-cern/jupyter-extensions

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file hdfsbrowser-1.2.1-py3-none-any.whl.

File metadata

  • Download URL: hdfsbrowser-1.2.1-py3-none-any.whl
  • Upload date:
  • Size: 30.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for hdfsbrowser-1.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 aefdfb4ecfeb83dfad2c929962a360b136a79a9f231dc576952db2792f8c3c9c
MD5 2342e5ab442d93a01262d7dc053de622
BLAKE2b-256 ba8f63140d623844ecfec7c2805e6209b6123831b72a88e982564d847e6c7b71

See more details on using hashes here.

Provenance

The following attestation bundles were made for hdfsbrowser-1.2.1-py3-none-any.whl:

Publisher: release.yml on swan-cern/jupyter-extensions

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page