Remove noise from notebooks for cleaner version control diffs
Project description
Squeaky
✨ Make your Jupyter notebooks squeaky clean ✨
Squeaky removes unwanted changes to your notebooks to make your Git diffs cleaner. Tools like nbdime and ReviewNB are awesome, but can be clunky and difficult to learn. Instead, Squeaky makes it easier to manage notebook diffs with standard Git tools.
Usage
First, install:
pip install squeaky
Then use Squeaky to clean a notebook:
squeaky path/to/notebook(s).ipynb
To check notebooks without modifying them, use the --check
flag.
squeaky path/to/notebook.ipynb --check
Jupyter pre-save hook
You can automatically run your notebooks through Squeaky before saving them by editing your Jupyter config file. First, find the location of your config directory by running
jupyter --config-dir
Your config directory may contain either a jupyter_lab_config.py
or a
jupyter_server_config.py
(if neither exists, create an empty file with either
of these names).
:warning: If you're using Jupyter notebook < 7, then the file will be called
jupyter_notebook_config.py
.
Then, add the following lines to that config file.
from squeaky import squeaky_clean_hook
c.FileContentsManager.pre_save_hook = squeaky_clean_hook
Pre-commit hook
To use with pre-commit, add the following to your
.pre-commit-config.yaml
.
repos:
- repo: local
hooks:
- name: Clean notebooks
id: clean-notebooks
entry: squeaky --check
language: python
files: '(.*?).ipynb$'
Features
-
Reset metadata
If you're running your notebooks in CI, then you probably don't care about the last environment a notebook was edited in. Squeaky resets notebook metadata to a standard metadata.
-
Minify SVGs
Setting code image outputs to SVGs makes images look great, but comes with horrendous diffs. Squeaky minifies SVG outputs to a single line, which also reduces file size.
-
Reset IDs in SVG outputs
SVG outputs have another problem: Randomized IDs in the source. Squeaky re-generates these IDs deterministically using on the cell's unique ID. This means re-running the notebook won't change the output unless the image actually changes.
-
Remove trailing whitespace
Often missed in markdown (but not in version control), Squeaky removes trailing whitespace from ends of lines and empty lines from ends and beginnings of cells.
-
Remove empty cells
Squeaky removes empty cells, including that annoying empty code cell at the bottom of every other notebook.
-
Add missing cell IDs
Cells without IDs now raise a warning in
nbformat
. Squeaky adds these IDs for you so you don't need to think about it.
Contributing
To install requirements and add pre-commit hooks:
pip install -r requirements-dev.txt
pre-commit install
To run unit tests (also run on commit):
tox
Wishlist
- Turn off features through config file
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file squeaky-0.7.0.tar.gz
.
File metadata
- Download URL: squeaky-0.7.0.tar.gz
- Upload date:
- Size: 22.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 404c79ed32be0bdcf031998aabd65011a22af48ccb2905c298e61081aa5effc1 |
|
MD5 | 296a17756c0db80aab66fd1970b21e7a |
|
BLAKE2b-256 | 6a67afc4ced0e3bffa871ba66b656a7f63d7b72851a9d1f99daf6304230f0b54 |
File details
Details for the file squeaky-0.7.0-py3-none-any.whl
.
File metadata
- Download URL: squeaky-0.7.0-py3-none-any.whl
- Upload date:
- Size: 12.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f9810e459e394dc0a66115fd2a7a508ded2c8f11fe2e856bc5c48339739fd48a |
|
MD5 | ca3f28c43eea2547a5fa9c85e3714cdb |
|
BLAKE2b-256 | b63a3ca6127c4987482f0cfc3e9f8ca71b6de85647979a3f115bb12c261558a8 |