Skip to main content

Helper to connect to CERN's Spark Clusters

Project description

SparkConnector

Helper to connect to CERN's Spark Clusters

This extension is built as a Python module named sparkconnector, which simplifies the connection to Spark clusters.

It installs:

  1. an nbclassic-extension
  2. a Jupyterlab extension
  3. an iPython extension

Requirements

  • JupyterLab >= 4.0.0
  • pyspark (not installed by default)

Install

To install the extension, execute:

pip install sparkconnector
jupyter nbclassic-extension install sparkconnector --py
jupyter nbclassic-extension enable  sparkconnector --py

It is also necessary to enable the iPython code. Append the following code to the config file (usually in ~/.ipython/profile_default/ipython_kernel_config.py, check here):

c.InteractiveShellApp.extensions.append('sparkconnector.connector')

Uninstall

To remove the extension, execute:

pip uninstall sparkconnector

Contributing

Development install

Note: You will need NodeJS to build the extension package.

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Change directory to the sparkconnector directory
# Install package in development mode
pip install -e "."
# Link your development version of the extension with JupyterLab
jupyter labextension develop . --overwrite
# Rebuild extension Typescript source after making changes
jlpm build

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

# Watch the source directory in one terminal, automatically rebuilding when needed
jlpm watch
# Run JupyterLab in another terminal
jupyter lab

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

jupyter lab build --minimize=False

Development uninstall

pip uninstall sparkconnector

In development mode, you will also need to remove the symlink created by jupyter labextension develop command. To find its location, you can run jupyter labextension list to figure out where the labextensions folder is located. Then you can remove the symlink named @swan-cern/sparkconnector within that folder.

Packaging the extension

See RELEASE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sparkconnector-3.0.4.tar.gz (384.0 kB view details)

Uploaded Source

Built Distribution

sparkconnector-3.0.4-py3-none-any.whl (400.3 kB view details)

Uploaded Python 3

File details

Details for the file sparkconnector-3.0.4.tar.gz.

File metadata

  • Download URL: sparkconnector-3.0.4.tar.gz
  • Upload date:
  • Size: 384.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.9.19

File hashes

Hashes for sparkconnector-3.0.4.tar.gz
Algorithm Hash digest
SHA256 ed92c2f942b1f473359a42e540149fcaa058034611658fa7c722cbde7b2e45c3
MD5 758bb980223b589bdb3337b9c1c636ee
BLAKE2b-256 88fcfc04f758e567e9e4f47d61de6ec2aa51e14b3aad4778f85315e370d52f94

See more details on using hashes here.

File details

Details for the file sparkconnector-3.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for sparkconnector-3.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 3540c2ce15b5ad83bc3ca11edc2ff17de7ab59cd4eb4c4ffb884fdda843ebeee
MD5 cda5168c8b4bc989323c54128c280749
BLAKE2b-256 1bc192ddad5112ce6f5c5d56a6828dc73e54c49d24d71bcf8156a35c26eae30f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page