Skip to main content

Helper to connect to CERN's Spark Clusters

Project description

SparkConnector

Helper to connect to CERN's Spark Clusters

This extension is built as a Python module named sparkconnector, which simplifies the connection to Spark clusters.

It installs:

  1. an nbclassic-extension
  2. a Jupyterlab extension
  3. an iPython extension

Requirements

  • JupyterLab >= 4.0.0
  • pyspark (not installed by default)

Install

To install the extension, execute:

pip install sparkconnector
jupyter nbclassic-extension install sparkconnector --py
jupyter nbclassic-extension enable  sparkconnector --py

It is also necessary to enable the iPython code. Append the following code to the config file (usually in ~/.ipython/profile_default/ipython_kernel_config.py, check here):

c.InteractiveShellApp.extensions.append('sparkconnector.connector')

Uninstall

To remove the extension, execute:

pip uninstall sparkconnector

Contributing

Development install

Note: You will need NodeJS to build the extension package.

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Change directory to the sparkconnector directory
# Install package in development mode
pip install -e "."
# Link your development version of the extension with JupyterLab
jupyter labextension develop . --overwrite
# Rebuild extension Typescript source after making changes
jlpm build

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

# Watch the source directory in one terminal, automatically rebuilding when needed
jlpm watch
# Run JupyterLab in another terminal
jupyter lab

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

jupyter lab build --minimize=False

Development uninstall

pip uninstall sparkconnector

In development mode, you will also need to remove the symlink created by jupyter labextension develop command. To find its location, you can run jupyter labextension list to figure out where the labextensions folder is located. Then you can remove the symlink named @swan-cern/sparkconnector within that folder.

Packaging the extension

See RELEASE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sparkconnector-3.0.9.tar.gz (385.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sparkconnector-3.0.9-py3-none-any.whl (401.4 kB view details)

Uploaded Python 3

File details

Details for the file sparkconnector-3.0.9.tar.gz.

File metadata

  • Download URL: sparkconnector-3.0.9.tar.gz
  • Upload date:
  • Size: 385.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.22

File hashes

Hashes for sparkconnector-3.0.9.tar.gz
Algorithm Hash digest
SHA256 f2b85affaf5a68e5e3fb2593cd92f65c0b293e0d4c3c33720f310b0846faabdf
MD5 43dc9c6b87a35981495be974c29c84c7
BLAKE2b-256 687901e373daa17918546c2ad9a7243ffa0776bec1aff67edaba938000917e7b

See more details on using hashes here.

File details

Details for the file sparkconnector-3.0.9-py3-none-any.whl.

File metadata

  • Download URL: sparkconnector-3.0.9-py3-none-any.whl
  • Upload date:
  • Size: 401.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.22

File hashes

Hashes for sparkconnector-3.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 9006bba9ccf664f50e3d5cbb74d799db92f99f02424d2f6b263d93d402878e75
MD5 ea65c5228ff6f4d0d2b5bc63cd113d17
BLAKE2b-256 26d6b724d1b9d12dca07bcca39888ccc67be62cf076c8f271b286d2c923a0501

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page