Skip to main content

Scope Boundaries for Jupyter Notebooks

Project description

Pagebreaks : Scope Boundaries for Jupyter Notebooks

Pagebreaks is a Jupyter Notebooks extension (with a supporting IPython plugin) which creates scope boundaries between groups of cells, allowing cells within a pagebreak to share state as usual, but keeping state isolated to that group. To use variables between Pagebreaks, they can be "exported" at the footer of the pagebreak in a read-only format to be used in all later cells.

The goal is to make it easier to keep variables organized in notebooks while changing as little as possible about how notebook programmers like to work. To do that, Pagebreaks allows users to organize their notebook state by organizing their cells within the notebook.

Participate in Research

My name is Eric Rawn. I’m a PhD student at UC Berkeley. To build the best system we can (and do some research along the way), we’re evaluating how our extension aids how real users program, and so we’re running a 4-6 week study with folks who use Jupyter notebooks regularly in their everyday programming. We're especially interested in how Pagebreaks might help data science programmers, so if that's you, we would really appreciate your insight!

The extension will log some usage data locally on your machine, which you'll send to me at the end of the study. We'll then spend about an hour chatting about your experience, the kind of work you do daily, how you use notebooks, and any other thoughts or feedback you have. Participants will be compensated for their time spent interviewing, at $30/hour, so expect $60-$90 overall. The consent form has detailed information about the study if you’re interested. I care a lot about protecting the privacy of my participants, and so a lot of the consent form (and our first chat together at the beginning of the study) is dedicated to explaining exactly what data will be collected, how it will be used, and addressing any questions or concerns.

A full open-source release will be out on PyPi after we can incorporate what we learned from the study, but you're welcome to keep using the extension after the study if you participate and find it helps you work!

If you’re interested in participating, please fill out this interest form

If you have any questions at all, feel free to send me an email at erawn@berkeley.edu, and feel free to forward this to anyone you think might be interested! Thanks!

Install

To install the extension, execute:

python -m pip install pagebreaks

Then, open Jupyter Lab jupyter lab and open a Notebook. You should see a warning banner telling you to run %load_ext pagebreaksip, at which point the warning will disappear, and you'll be good to go. (You might have to restart the kernel and reload the webpage, just to be safe)

Overview

Each "pagebreak" keeps top-level variables isolated, so that within a pagebreak you can reference variables between cells normally, but in the rest of the notebook those variables will be inaccessible: showscopebound
To reference a variable defined in an earlier Pagebreak, add the variable to the “export” list at the bottom of the Pagebreak in which it was defined. Once it’s exported, later cells can read its value, but they can’t overwrite it. Screenshot 2024-07-01 at 2 37 02 PM
Exported variables continue to be readable and writeable in their own Pagebreak, as usual. After that Pagebreak, later cells can read but not write the exported variable. Before that Pagebreak, cells can neither read nor write the exported variable. Screenshot 2024-07-01 at 2 45 00 PM
To check out the current state of the notebook, you can use the %who_pb IPython magic. If the magic is run in a Pagebreak in which an exported variable is available to be called (i.e. the variable is exported from a previous pagebreak, but not a later one), it will list under Export Exist? as True. who_pb
Modules remain global, so you only have to import them once: packages_are_global

*Because Python doesn't have a built-in way to ensure read-only variables, we check for redefinitions at the AST level and dynamically after each cell run, checking to see if the value has changed.

Pagebreak Actions

Making a New Pagebreak

New pagebreaks are made by pressing this button: Screenshot 2024-07-01 at 1 50 30 PM on the "Export" cell of a pagebreak at the bottom.

makenewpb

Merging Pagebreaks

Instead of deleting Pagebreaks, you can merge the cells of a pagebreak into the one above it with:Screenshot 2024-07-01 at 1 50 56 PM

mergepb

For example:

The bottom Pagebreak ----> Merges with the top
Screenshot 2024-07-01 at 1 52 36 PM ----> Screenshot 2024-07-01 at 1 52 44 PM

%who_pb

We've added the IPython magic %who_pb", which is a pagebreaks-specific version of %who_ls. %who_pb" prints out your notebook state by its pagebreak, listing whether each variable is currently being exported. Pagebreaks only generates the export variables it needs for each cell, so you won't see variables that are exported in later pagebreaks, because those are currently out of scope!

Autoloading the IPython Plugin

To avoid having to run %load_ext pagebreaksip each time you start your kernel, you can start it automatically by adding: c.InteractiveShellApp.extensions.append('pagebreaksip') to your .ipython/profile/ipython_config.py file.

How it works

You shouldn't need to know what's going on under the hood to use Pagebreaks, but if you're curious, read on!

Rather than dynamically storing and reloading different global variables in your kernel, Pagebreaks manipulates the programs you write before they go to the interpreter, changing the names of variables under the hood.

For example, the variable a is actually stored as pb_0_a in the global state, because it is in Pagebreak 0. Screenshot 2024-07-01 at 2 47 14 PM

When a variable is exported to be used between pagebreaks, a new variable pb_export_b is generated for each cell run (as a user, you don't have to worry about any of this, you can just use a and b as normal!). Because Python doesn't have a way to enforce that variables are read-only at compile time, Pagebreaks will check after your cell has run that the pb_export_b variable still matches the original pb_0_b variable. If it doesn't, Pagebreaks will revert the variables in your current pagebreak back to what they were before you ran the cell.

Because b is accessible in the second pagebreak because it's been exported, a pb_export_b varaible is generated for later cells to reference, preventing those cells from modifying our real b variable, which is pb_0_b Screenshot 2024-07-01 at 2 49 40 PM

Requirements

  • JupyterLab >= 4.0.0
  • Pagebreaks is currently only available for IPython notebooks in Jupyter.

Uninstall

To remove the extension, execute:

pip uninstall pagebreaks

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pagebreaks-0.1.13.tar.gz (264.1 kB view details)

Uploaded Source

Built Distribution

pagebreaks-0.1.13-py3-none-any.whl (60.4 kB view details)

Uploaded Python 3

File details

Details for the file pagebreaks-0.1.13.tar.gz.

File metadata

  • Download URL: pagebreaks-0.1.13.tar.gz
  • Upload date:
  • Size: 264.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for pagebreaks-0.1.13.tar.gz
Algorithm Hash digest
SHA256 b04a8de6ddadb1f88d9db51da76c608d062525b8bda60b828b5e3fcb97a99764
MD5 67f7649cb9023efecda4641f130824e0
BLAKE2b-256 08a67e60338d16e2dadee36c34d6ad48f1476178bea473cddfd1c1983c32ce04

See more details on using hashes here.

File details

Details for the file pagebreaks-0.1.13-py3-none-any.whl.

File metadata

  • Download URL: pagebreaks-0.1.13-py3-none-any.whl
  • Upload date:
  • Size: 60.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for pagebreaks-0.1.13-py3-none-any.whl
Algorithm Hash digest
SHA256 89a077c845e1530e3ce13cfae2deb2c7651d5b6cfcf4b8add4049bac820e99da
MD5 3302b62acb864d754ada61c780aa6424
BLAKE2b-256 eb543f8dbf10d2308c7389e4c3abc6c72918814156bc084b074cfc576f7038f0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page