Skip to main content

draw a dataset from inside Jupyter

Project description

drawdata

This small Python library contains Jupyter widgets that allow you to draw a dataset in a Jupyter notebook. This should be very useful when teaching machine learning algorithms.

The project uses anywidget under the hood so our tools should work in Jupyter, VSCode and Colab.

Installation

Installation occurs via pip.

python -m pip install drawdata

To read the data, polars is useful, but this library also suppots pandas:

python -m pip install pandas polars

Usage

You can load the scatter widget to start drawing immediately.

from drawdata import ScatterWidget

widget = ScatterWidget()
widget

If you want to use the dataset that you've just drawn you can do so via:

# Get the drawn data as a list of dictionaries
widget.data

# Get the drawn data as a dataframe
widget.data_as_pandas
widget.data_as_polars

Shoutout

This work was originally part of my work over at calmcode labs but my employer probabl has been very supportive and has allowed me to work on this project during my working hours. This was super cool and I wanted to make sure I recognise them for it.





Old Features

The original implementation of our widget would use an iframe to load a site in order to be able to draw from a Jupyter notebook. This works, but requires more manual effort, only works with pandas via the clipboard feature and needs an internet connection. Here's what that widget looks like:

It will be kept around, but the way forward for this library is to build on top of anywidget.

Old Feature Usage

When you run this from jupyter, you should load in an iframe.

from drawdata import draw_scatter

draw_scatter()

Once you're done drawing you can copy the data to the clipboard. After this you can use pandas to read the clipboard to get your drawn data into a dataframe.

import pandas as pd 
pd.read_clipboard(sep=",")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

drawdata-0.3.0.tar.gz (118.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

drawdata-0.3.0-py2.py3-none-any.whl (118.6 kB view details)

Uploaded Python 2Python 3

File details

Details for the file drawdata-0.3.0.tar.gz.

File metadata

  • Download URL: drawdata-0.3.0.tar.gz
  • Upload date:
  • Size: 118.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.6

File hashes

Hashes for drawdata-0.3.0.tar.gz
Algorithm Hash digest
SHA256 03911aac873d6d7fff26c2f98cb6754d11c5936ffa0dc969d2b07549bde891b7
MD5 0189bb17d0bda89e035c046e44c3ceff
BLAKE2b-256 c3e9a3ff4229eb5e0e160c8fe47dbe730e72cc98203c1704a1279ba7cfa48ed9

See more details on using hashes here.

File details

Details for the file drawdata-0.3.0-py2.py3-none-any.whl.

File metadata

  • Download URL: drawdata-0.3.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 118.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.6

File hashes

Hashes for drawdata-0.3.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 63c60c245f2f98a78123d46e25d7d74049aa8b898bf5d444bc7e71b35bfee185
MD5 bf90036971e86f2bf61abe8a51d5cad2
BLAKE2b-256 5c583a258c99977503a5ed8f42f924c02fdd2f2d5d7f046cbd6f3e9628527fc4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page