Skip to main content

A JupyterLab extension for running dataflow notebooks

Project description

Dataflow Notebook Extension for Jupyter/Python

License

This package is part of the Dataflow Notebooks project and provides the Dataflow Notebook interface for JupyterLab, and is intended to be used with the dfkernel kernel. Dataflow notebooks seek to elevate outputs as memorable waypoints during exploratory computation. To that end,

  • Cell identifiers are persistent across sessions and are random UUIDs to signal they do not depend on top-down order.
  • As with standard IPython, outputs are designated by being written as expressions or assignments on the last line of a cell.
  • Each output is identified by its variable name if one is specified (e.g. a, c,d = 4,5), and the cell identifier if not (e.g. 4 + c)
  • Variable names can be reused across cells.
  • Cells are executed as closures so only the outputs are accessible from other cells.
  • An output can then be referenced in three ways:
    1. unscoped: foo refers to the most recent execution output named foo
    2. persistent: foo$ba012345 refers to output foo from cell ba012345
    3. tagged: foo$bar refers to output foo from the cell tagged as bar
  • All output references are transformed to persistent names upon execution.
  • Output references implicitly define a dataflow in a directed acyclic graph, and the kernel automatically executes dependencies.

Example Notebook

Dataflow Notebook Example

Requirements

  • JupyterLab >= 4.0.0

Install

This extension uses a Jupyter kernel named dfkernel for the backend and a Jupyter extension named dfnotebook for the frontend.

To install the kernel, kernel:

pip install dfkernel

To install the extension, execute:

pip install dfnotebook

Uninstall

To remove the extension, execute:

pip uninstall dfnotebook

Contributing

Development install

Note: You will need NodeJS to build the extension package.

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Change directory to the dfnotebook directory
# Install package in development mode
pip install -e "."
# Link your development version of the extension with JupyterLab
jupyter labextension develop . --overwrite
# Rebuild extension Typescript source after making changes
jlpm build

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

# Watch the source directory in one terminal, automatically rebuilding when needed
jlpm watch
# Run JupyterLab in another terminal
jupyter lab

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

jupyter lab build --minimize=False

Development uninstall

pip uninstall dfnotebook

In development mode, you will also need to remove the symlink created by jupyter labextension develop command. To find its location, you can run jupyter labextension list to figure out where the labextensions folder is located. Then you can remove the symlink named @dfnotebook/dfnotebook-extension within that folder.

Testing the extension

Frontend tests

This extension is using Jest for JavaScript code testing.

To execute them, execute:

jlpm
jlpm test

Integration tests

This extension uses Playwright for the integration tests (aka user level tests). More precisely, the JupyterLab helper Galata is used to handle testing the extension in JupyterLab.

More information are provided within the ui-tests README.

Packaging the extension

See RELEASE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dfnotebook-0.4.0a2.tar.gz (4.6 MB view details)

Uploaded Source

Built Distribution

dfnotebook-0.4.0a2-py3-none-any.whl (4.4 MB view details)

Uploaded Python 3

File details

Details for the file dfnotebook-0.4.0a2.tar.gz.

File metadata

  • Download URL: dfnotebook-0.4.0a2.tar.gz
  • Upload date:
  • Size: 4.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.1

File hashes

Hashes for dfnotebook-0.4.0a2.tar.gz
Algorithm Hash digest
SHA256 1a7c819b33f26e1ab23ef056ca3a1794a1f8c6f3166a843d40dd71d14884cac8
MD5 a7a09bd9a19904dd33b9755fa30a9edf
BLAKE2b-256 d0b2cf187e308730d5f95d52e0b89fff55458a436e57ea05b4a252bdc37027be

See more details on using hashes here.

File details

Details for the file dfnotebook-0.4.0a2-py3-none-any.whl.

File metadata

File hashes

Hashes for dfnotebook-0.4.0a2-py3-none-any.whl
Algorithm Hash digest
SHA256 da12b8bfc0e2a9d52f90931491fca7d0c62c9eeb7267533b901471f60d063d0f
MD5 4b4d831997ef9eff9e48c907873ee2a5
BLAKE2b-256 8c26db990f45fe0c5505879d4eabdc66e73cc8cde189e9574a08618d02f044e1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page