Helpers to create summary Jupyter notebooks.
Project description
Jupyter Summary Notebooks
How do you show off your latest plots and tables when you meet with collaborators?
I used to drag my figures and tables into Powerpoint or a Google Doc. But it's tedious to import and position each item one-by-one. And it's even more painful to delete and re-import new versions after changing some code (are you positive you updated all your figures to the latest version?)
Or you could scroll through your original Jupyter notebooks, live on screen share. Admit it, those notebooks are messy! Do you want to be switching tabs and scrolling through all your intermediate results during your meeting? What about the results you generated with scripts, not with notebooks? Most importantly, you can't look at two related figures side-by-side if they come from different sources.
Present your results easily in a Jupyter "summary notebook".
A summary notebook is just a plain Jupyter notebook:
- A plain-English description of your analysis
- Shows important figures and tables inline with your text, imported by their filenames
- Committed and versioned with your code — meaning the summary notebook always reflects your analysis at that point in time, because it imports your latest result files.
Write out the analysis as you go along, and incorporate relevant figures and tables inline.
Use summarynb
to render any plot or table alongside your text, by its filename:
from summarynb import show
show("plot.png")
-
summarynb knows what to do for common file extensions
-
summarynb uses sane defaults for figure sizes. You won't get ginormous figures like you'd see if showing an image with plain Markdown.
Look at related figures side-by-side:
from summarynb import show
show([ "plot1.png", "plot2.png", "results.csv" ])
-
Review two visualizations of the same experiment, produced by different scripts and notebooks, in the same row of my summary notebook.
-
Pull in a results table alongside a figure. Imagine a linear regression: With a one-liner call to
show()
, review the scatterplot and a table of regression coefficients side-by-side. -
If you've generated figures for every data point, review them all easily in an auto-layout grid. Just pass an array of entries to
chunks()
and then toshow()
, docs below.
Screen share your summary notebook or send a Github link to collaborators. Be concise. Only include the best figures and tables, not the intermediate plots.
The presented results are up-to-date with the code version checked out.
Auto-regenerate your summary notebook on every Git commit by installing the optional git commit hook.
Easily go back to the exact source. What generated that plot or table? The filename is right in the notebook. Grep for that filename to track down which script or notebook wrote it. Or link in your summary notebook to the source scripts/notebooks that generated your results.
Since 2015, every project of mine has included a summary notebook, thanks to a tip from my former colleague Nick. This documentation practice saves so much time when returning to old projects.
Example summarynb usage.
Your first show()
Install: pip install summarynb
Run this in a Jupyter notebook:
from summarynb import show
show('myimage.png')
Arrange in rows and columns
Here is a plot with an associated table, side-by-side:
show(["run_1.png", "run_1.csv"])
Let's add a second row, as well as some headers:
show(
[
["run_1.png", "run_1.csv"],
["run_2.png", "run_2.csv"],
],
headers=['Images', 'Tables']
)
Customize the display
show()
automatically decides how to display the given file path based on the file extension. You can customize this behavior.
Here are a couple examples:
from summarynb import show, image, csv, indexed_csv
show(csv("run_summary.tsv", sep="\t", index_col=0))
show(
[
["run_1.png", indexed_csv("run_1.csv")],
[image("run_2.png"), indexed_csv("run_2.csv")],
],
max_width=1200,
)
See the docs for the full reference.
Auto Layout
Let's say you've generated 16 plots + tables. summarynb will automatically lay those out in rows and columns for you.
from summarynb import show, image, indexed_csv, chunks
show(
chunks(
entries=[
[
image("run_%d.png" % (i + 1)),
indexed_csv("run_%d.csv" % (i + 1)),
]
for i in range(16)
],
shape=4,
)
)
Automatically update on commit
Let's say you have a summary notebook named summary.ipynb
. You can install a git pre-commit hook to run the notebook automatically and incorporate it within your commit:
# install the git hook
summarynb install
# mark for automatic execution on every commit
summarynb mark summary.ipynb
When you run git commit
, summarynb will automatically re-execute summary.ipynb
and add the changes to your commit.
Customize this further:
# view help
summarynb --help
# view list of auto-updated notebooks
summarynb list
# run manually
summarynb run
# unmark
summarynb unmark summary.ipynb
# uninstall the git hook
summarynb uninstall
Other tips to make your notebooks beautiful
Adding a table of contents really makes summary notebooks shine. Here's how to install a Jupyter extension for this, and for code formatting. (Note: requires nodejs.)
# if using conda:
conda upgrade -c conda-forge -y jupyterlab nodejs
# code formatting and table of contents
pip install jupyterlab_code_formatter black
jupyter serverextension enable --py jupyterlab_code_formatter
jupyter labextension install -y --no-build @jupyterlab/toc @ryantam626/jupyterlab_code_formatter
jupyter labextension update -y --no-build --all # important that code formatter server and lab extension versions match
mkdir -p $HOME/.cache/black/19.10b0 # create directory for black grammar tables
# notebook diffing (nbdiff)
pip install nbdime
nbdime config-git --enable --global
# force a build to include code formatting, TOC, and nbdime-jupyterlab
jupyter lab build
May also need to set default formatter to black following these instructions. This involves adding the following in Jupyterlab Advanced Settings Editor:
{
"preferences": {
"default_formatter": {
"python": "black"
}
}
}
To update later:
# first update lab extension
jupyter labextension update --all
# then update server extension to matching version
pip install jupyterlab_code_formatter==x.x.x
Development
pip install -r requirements_dev.txt
pip install -e .
make test
# bump version before submitting a PR against master (all master commits are deployed)
bump2version patch # possible: major / minor / patch
# also ensure CHANGELOG.md updated
TODOs:
- Lint
- Pre-commit support
- Accept pdf and other image formats
Changelog
0.1.0
- First release on PyPI.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file summarynb-0.1.3.tar.gz
.
File metadata
- Download URL: summarynb-0.1.3.tar.gz
- Upload date:
- Size: 598.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.3.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d39c2bd178606febb4be52f02d030b1dbb86fb92750e090411ec764755824926 |
|
MD5 | 99aa279586acaa9d8a47d3d972f241ca |
|
BLAKE2b-256 | 9d3cd34af527de546c83c51b9d05bb32a88571d8d998caae0007c4342e32ba1c |
File details
Details for the file summarynb-0.1.3-py2.py3-none-any.whl
.
File metadata
- Download URL: summarynb-0.1.3-py2.py3-none-any.whl
- Upload date:
- Size: 10.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.3.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6c3b7c998e69a8337bc8c89b6eaee12478bc0a64be837781f1d22b4254366575 |
|
MD5 | dc05c51e36caf226fa607525d2eab153 |
|
BLAKE2b-256 | a016df11cb210a5d85a53f6809ae38e7d546a00ad52ba33492de28cf68c27372 |