
Library for plotting interactive graphs on Jupyter Notebook


Croquis: plot graphs 100x faster on Jupyter Notebook

Croquis is a lightweight Python library for drawing interactive graphs really fast on Jupyter Notebook. It lets you effortlessly browse and examine much larger data than other similar libraries.

As an example, here's hourly ground temperature data for 2020 from weather stations around the world, downloaded from the NOAA website. The data set contains 127 million points.

https://user-images.githubusercontent.com/31876421/123535161-0b402d00-d6d7-11eb-9486-6218279eda9d.mp4

The word "croquis" means a quick, sketchy drawing - it's from French croquis which simply means "sketch." (The final -s is silent: it's French, after all.)

Requirements

  • 64-bit Linux or macOS running on x86 with AVX2 instruction set support. (Intel: Haswell (2013) or later; AMD: Excavator (2015) or later.)
    • Windows support is in progress.
    • Sorry, other architectures aren't supported yet.
  • Python 3.6 or later.
  • Jupyter Notebook.
  • A modern browser (if it can run Jupyter Notebook, it's probably fine).
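
The platform requirements above can be checked up front with a few lines of standard-library Python. This is only a rough sketch, not part of croquis: the function names `croquis_requirement_problems` and `read_linux_cpu_flags` are made up for illustration, and the AVX2 check reads `/proc/cpuinfo`, so it only works on Linux (elsewhere that check is skipped).

```python
import platform
import sys


def croquis_requirement_problems(system, machine, is_64bit, py_version, cpu_flags=None):
    """Return a list of problems; an empty list means every check passed.

    cpu_flags is a set of CPU feature names, or None if they could not be read
    (in which case the AVX2 check is skipped).
    """
    problems = []
    if system not in ("Linux", "Darwin"):  # "Darwin" is what macOS reports
        problems.append(f"unsupported OS: {system}")
    if machine.lower() not in ("x86_64", "amd64") or not is_64bit:
        problems.append(f"unsupported architecture: {machine}")
    if tuple(py_version[:2]) < (3, 6):
        problems.append("Python 3.6 or later is required")
    if cpu_flags is not None and "avx2" not in cpu_flags:
        problems.append("CPU does not report AVX2 support")
    return problems


def read_linux_cpu_flags(path="/proc/cpuinfo"):
    """Best-effort read of the CPU feature flags on Linux; None if unavailable."""
    try:
        with open(path) as f:
            for line in f:
                if line.startswith("flags"):
                    return set(line.split(":", 1)[1].split())
    except OSError:
        pass
    return None


problems = croquis_requirement_problems(
    platform.system(),
    platform.machine(),
    sys.maxsize > 2**32,  # True on a 64-bit Python build
    sys.version_info,
    read_linux_cpu_flags(),
)
print(problems or "Basic requirements look satisfied.")
```

An empty result does not guarantee that croquis will run; it only rules out the obvious mismatches listed above.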

How to install

pip install croquis

To build from source, see DEVELOPMENT.md.

To test if it's working correctly, try this inside Jupyter Notebook:

# Paste into a Jupyter cell.

import croquis
import numpy as np

# One million points with normally distributed coordinates, as N x 1 arrays.
N = 1000000
X = np.random.normal(size=(N, 1))
Y = np.random.normal(size=(N, 1))
# One label per point: 'pt 0', 'pt 1', ...
labels = ['pt %d' % i for i in range(N)]

fig = croquis.plot()
fig.add(X, Y, marker_size=3, labels=labels)
fig.show()

It should generate a plot like this:

[Image: Gaussian distribution example]

See below for documentation.

By the way, this library is of course open source (MIT License) and totally free to use, but just in case you really liked it for some reason, the author could use a cup of coffee or two... :)


Limitations

Croquis is still experimental: as of version 0.1, we only support the absolute bare minimum functionality. In particular:

  • Only line plots are supported, nothing else: no bars, pie charts, heatmaps, etc.
  • All lines are solid: no dotted/dashed lines.
  • All markers are solid circles: no other shapes are currently supported.
  • No subplots: each Jupyter cell can contain only one graph.
  • Very few options to customize the plot. No titles, axis labels, or secondary axes.
  • No support for mobile browsers.
  • No dark mode.
  • As you can see, the UI is rather primitive.

If croquis seems useful to you, but some features are missing for your use case, then please feel free to file an issue. (Of course I can't guarantee anything, but it would be nice to know that someone's interested.)

Documentation

See the tutorial and the (very short) reference.

FAQ

Is it really 100 times faster?

With large data, croquis can be several hundred times faster than other popular libraries. With very small data, there's less difference, as fixed-size overheads start to dominate.

Can we use it outside of Jupyter Notebook?

No, croquis is currently tied to Jupyter's message passing architecture, and all computation is done in the backend, so it needs an active Jupyter Python kernel.

How does it work?

Unlike most other similar libraries, croquis works by running a C++ "tile server," which computes fixed-size "tiles" that are then sent back to the browser. If you have used Google Maps, the idea should be familiar. This has an important advantage:

  • The browser only has to know about tiles. Hence, the size of the data the browser needs to know is independent of the data set size.

As a result, the browser stays lean and "snappy" even with massive data. (As explained in the reference, we support a copy_data=False option that even eliminates data copies altogether.)
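
To make the tile idea concrete, here is a minimal sketch of how a viewport maps to a fixed set of tiles. It is not croquis's actual code: the function `visible_tiles` and the 256-pixel `TILE_SIZE` are assumptions chosen for illustration. The point is that the number of tiles depends only on the viewport size, never on how many data points were plotted.

```python
import math

TILE_SIZE = 256  # hypothetical tile edge in pixels; croquis's real value may differ


def visible_tiles(view_x, view_y, view_width, view_height, tile_size=TILE_SIZE):
    """Return (row, col) indices of the tiles overlapping a viewport given in pixels."""
    first_col = math.floor(view_x / tile_size)
    last_col = math.floor((view_x + view_width - 1) / tile_size)
    first_row = math.floor(view_y / tile_size)
    last_row = math.floor((view_y + view_height - 1) / tile_size)
    return [
        (row, col)
        for row in range(first_row, last_row + 1)
        for col in range(first_col, last_col + 1)
    ]


# An 800x600 viewport scrolled 100 pixels to the right touches 4 columns and 3 rows.
tiles = visible_tiles(view_x=100, view_y=0, view_width=800, view_height=600)
print(len(tiles))  # 12
```

Whether the underlying data set has a thousand points or a hundred million, the browser in this model still asks for the same 12 tiles.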

Moreover, unlike the browser's single-threaded JavaScript code, the C++-based tile server can draw multiple tiles in parallel, which allows even more speedup.
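
The structure of parallel tile drawing can be sketched in Python as follows. This is an illustration only, not how croquis is implemented: the real tile server is written in C++, while this sketch uses a thread pool and NumPy's `histogram2d` as a stand-in rasterizer, and the names `render_tile`, `render_tiles`, and `TILE_SIZE` are hypothetical. Because of Python's global interpreter lock, expect little or no actual speedup here; the sketch only shows that tiles are independent units of work that can be handed to separate workers.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np

TILE_SIZE = 256  # hypothetical tile edge in pixels


def render_tile(x, y, row, col, tile_size=TILE_SIZE):
    """Rasterize one tile: True where at least one point lands on that pixel.

    x and y are point coordinates already converted to pixel units.
    """
    x0, y0 = col * tile_size, row * tile_size
    # The first histogram axis is y so that mask[r, c] follows image (row, column) order.
    counts, _, _ = np.histogram2d(
        y, x,
        bins=tile_size,
        range=[[y0, y0 + tile_size], [x0, x0 + tile_size]],
    )
    return counts > 0


def render_tiles(x, y, tile_ids, max_workers=4):
    """Render several tiles concurrently and return {(row, col): mask}."""
    tile_ids = list(tile_ids)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        masks = pool.map(lambda rc: render_tile(x, y, *rc), tile_ids)
        return dict(zip(tile_ids, masks))


rng = np.random.default_rng(0)
x = rng.uniform(0, 512, size=100_000)
y = rng.uniform(0, 512, size=100_000)
masks = render_tiles(x, y, [(0, 0), (0, 1), (1, 0), (1, 1)])
print({tile: mask.shape for tile, mask in masks.items()})
```

Each tile only reads the shared point arrays and writes its own output, which is what makes this kind of work easy to parallelize.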

(On the other hand, there is a drawback: we basically have to re-implement every graph drawing algorithm inside the tile server, since we cannot use any JavaScript API except for very trivial things like coordinate grids.)


Built Distributions

  • croquis-0.1.0-cp39-cp39-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (260.6 kB): CPython 3.9, manylinux (glibc 2.12+), x86-64
  • croquis-0.1.0-cp39-cp39-macosx_10_9_x86_64.whl (218.3 kB): CPython 3.9, macOS 10.9+, x86-64
  • croquis-0.1.0-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (259.9 kB): CPython 3.8, manylinux (glibc 2.12+), x86-64
  • croquis-0.1.0-cp38-cp38-macosx_10_9_x86_64.whl (218.2 kB): CPython 3.8, macOS 10.9+, x86-64
  • croquis-0.1.0-cp37-cp37m-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (264.3 kB): CPython 3.7m, manylinux (glibc 2.12+), x86-64
  • croquis-0.1.0-cp37-cp37m-macosx_10_9_x86_64.whl (216.5 kB): CPython 3.7m, macOS 10.9+, x86-64
  • croquis-0.1.0-cp36-cp36m-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (264.6 kB): CPython 3.6m, manylinux (glibc 2.12+), x86-64
  • croquis-0.1.0-cp36-cp36m-macosx_10_9_x86_64.whl (216.5 kB): CPython 3.6m, macOS 10.9+, x86-64
