Skip to main content

Scope Boundaries for Jupyter Notebooks

Project description

Pagebreaks : Scope Boundaries for Jupyter Notebooks

Pagebreaks is a Jupyter Notebooks extension (with a supporting IPython plugin) which creates scope boundaries between groups of cells, allowing cells within a pagebreak to share state as usual, but keeping state isolated to that group. To use variables between Pagebreaks, they can be "exported" at the footer of the pagebreak in a read-only format to be used in all later cells.

The goal is to make it easier to keep variables organized in notebooks while changing as little as possible about how notebook programmers like to work. To do that, Pagebreaks allows users to organize their notebook state by organizing their cells within the notebook.

Participate in Research!

My name is Eric Rawn. I’m a PhD student at UC Berkeley. To build the best system we can (and do some research along the way), we’re evaluating how our extension aids how real users program, and so we’re running a 4-6 week study with folks who use Jupyter notebooks regularly in their everyday programming. We're especially interested in how Pagebreaks might help data science programmers, so if that's you, we would really appreciate your insight!

The extension will log some usage data locally on your machine, which you'll send to me at the end of the study. We'll then spend about an hour chatting about your experience, the kind of work you do daily, how you use notebooks, and any other thoughts or feedback you have. Participants will be compensated for their time spent interviewing, at $30/hour, so expect $60-$90 overall. The consent form has detailed information about the study if you’re interested. I care a lot about protecting the privacy of my participants, and so a lot of the consent form (and our first chat together at the beginning of the study) is dedicated to explaining exactly what data will be collected, how it will be used, and addressing any questions or concerns.

A full open-source release will be out on PyPi after we can incorporate what we learned from the study, but you're welcome to keep using the extension after the study if you participate and find it helps you work!

If you’re interested in participating, please fill out this interest form

If you have any questions at all, feel free to send me an email at erawn@berkeley.edu, and feel free to forward this to anyone you think might be interested! Thanks!

Install

The extension will be on PyPi when the study begins. Hang tight until then! Thanks!

Overview

Each "pagebreak" keeps top-level variables isolated, so that within a pagebreak you can reference variables between cells normally, but in the rest of the notebook those variables will be inaccessible: showscopebound
To reference a variable defined in an earlier Pagebreak, add the variable to the “export” list at the bottom of the Pagebreak in which it was defined. Once it’s exported, later cells can read its value, but they can’t overwrite it. Screenshot 2024-07-01 at 2 37 02 PM
Exported variables continue to be readable and writeable in their own Pagebreak, as usual. After that Pagebreak, later cells can read but not write the exported variable. Before that Pagebreak, cells can neither read nor write the exported variable. Screenshot 2024-07-01 at 2 45 00 PM
To check out the current state of the notebook, you can use the %who_pb IPython magic. If the magic is run in a Pagebreak in which an exported variable is available to be called (i.e. the variable is exported from a previous pagebreak, but not a later one), it will list under Export Exist? as True. who_pb
Modules remain global, so you only have to import them once: packages_are_global

*Because Python doesn't have a built-in way to ensure read-only variables, we check for redefinitions at the AST level and dynamically after each cell run, checking to see if the value has changed.

Pagebreak Actions

Making a New Pagebreak

New pagebreaks are made by pressing this button: Screenshot 2024-07-01 at 1 50 30 PM on the "Export" cell of a pagebreak at the bottom.

makenewpb

Merging Pagebreaks

Instead of deleting Pagebreaks, you can merge the cells of a pagebreak into the one above it with:Screenshot 2024-07-01 at 1 50 56 PM

mergepb

For example:

The bottom Pagebreak ----> Merges with the top
Screenshot 2024-07-01 at 1 52 36 PM ----> Screenshot 2024-07-01 at 1 52 44 PM

%who_pb

We've added the IPython magic %who_pb", which is a pagebreaks-specific version of %who_ls. %who_pb" prints out your notebook state by its pagebreak, listing whether each variable is currently being exported. Pagebreaks only generates the export variables it needs for each cell, so you won't see variables that are exported in later pagebreaks, because those are currently out of scope!

How it works

You shouldn't need to know what's going on under the hood to use Pagebreaks, but if you're curious, read on!

Rather than dynamically storing and reloading different global variables in your kernel, Pagebreaks manipulates the programs you write before they go to the interpreter, changing the names of variables under the hood.

For example, the variable a is actually stored as pb_0_a in the global state, because it is in Pagebreak 0. Screenshot 2024-07-01 at 2 47 14 PM

When a variable is exported to be used between pagebreaks, a new variable pb_export_b is generated for each cell run (as a user, you don't have to worry about any of this, you can just use a and b as normal!). Because Python doesn't have a way to enforce that variables are read-only at compile time, Pagebreaks will check after your cell has run that the pb_export_b variable still matches the original pb_0_b variable. If it doesn't, Pagebreaks will revert the variables in your current pagebreak back to what they were before you ran the cell.

Because b is accessible in the second pagebreak because it's been exported, a pb_export_b varaible is generated for later cells to reference, preventing those cells from modifying our real b variable, which is pb_0_b Screenshot 2024-07-01 at 2 49 40 PM

Requirements

  • JupyterLab >= 4.0.0
  • Pagebreaks is currently only available for IPython notebooks in Jupyter.

Uninstall

To remove the extension, execute:

pip uninstall pagebreaks

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pagebreaks-0.1.2.tar.gz (260.0 kB view details)

Uploaded Source

Built Distribution

pagebreaks-0.1.2-py3-none-any.whl (59.3 kB view details)

Uploaded Python 3

File details

Details for the file pagebreaks-0.1.2.tar.gz.

File metadata

  • Download URL: pagebreaks-0.1.2.tar.gz
  • Upload date:
  • Size: 260.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for pagebreaks-0.1.2.tar.gz
Algorithm Hash digest
SHA256 0c50544bd11aa80ed5a6aee2d1ad1387ff3869cfb14266f4d2d0ffbbbf56aafa
MD5 754c6e6c3719ec167448ae626907a1e4
BLAKE2b-256 18577c917e61e2eef0481afc8076a9c87342e2d30c3db2390ba8ab3e7cc67914

See more details on using hashes here.

File details

Details for the file pagebreaks-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: pagebreaks-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 59.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for pagebreaks-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 af629aa5425a695520420dab6b2e32672404d2267ddbe0fdd1f7b46eb04df71e
MD5 a3e24a5502b7c288bcd7360b453419cc
BLAKE2b-256 2b5eaef43ca2e913ed0bf6a31c2c059d40dc70e76cc417754a2a6589d3d0e972

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page