Label data at scale. Fun and precision included.
Project description
Explore and label on a map of raw data.
Get enough to feed your model in no time.
hover
speeds up data labeling through embedding + visualization + callbacks
.
- You just need raw data and an embedding to get a map.
- Explore the map to find "zones" that are easy and those that are tricky.
- Start the conquest of your data by coloring the zones through wisdom!
:rocket: Live Demos
With code
- edit & run code in your browser to get a labeling interface, with guides along the way.
Without code
- go directly to a labeling interface, pre-coded and hosted on Binder.
:sparkles: Features (click bullets to show GIF)
It's fast because it labels data in bulk.
:telescope: A semantic scatter plot of your data for labeling, equipped with
-
Tooltip for each point on mouse hover
-
Table view for inspecting all selected points
-
Toggle buttons that clearly distinguish data subsets
-
Search widgets for ad-hoc data highlight
It's accurate because multiple angles work together.
:microscope: Supplementary views to use in conjunction with the annotator, including
-
Finder: filter data by search criteria
-
Softlabel: active learning by in-the-loop model prediction score
-
Snorkel: custom functions for labeling and filtering
It's flexible (and fun!) because the process never gets old.
:toolbox: Additional tools and options that allow you to
-
Go to higher dimensions (3D? 4D?) and choose your xy-axes
-
Consecutively select across areas, dimensions, and views
-
Kick outliers and fix mistakes
:bulb: Click on the bullet points above to view GIFs!
:package: Install
Python: 3.8+
OS: Linux & Mac & Windows
PyPI (for all releases): pip install hover
Conda-forge (for 0.6.0 and above): conda install -c conda-forge hover
For Windows users, we recommend Windows Subsystem for Linux.
- On Windows itself you will need C++ build tools for dependencies.
:book: Resources
:flags: Announcements
- Jan 21, 2023 version 0.8.0 is now available. Check out the changelog for details :partying_face:.
:bell: Remarks
Shoutouts
- Thanks to
Bokeh
becausehover
would not exist without linked plots and callbacks, or be nearly as good without embeddable server apps. - Thanks to Philip Vollet for sharing
hover
with the community even when it was really green.
Contributing
- All feedbacks are welcome, especially what you find lacking and want it fixed!
./requirements-dev.txt
lists required packages for development.- Pull requests are advised to use a superset of the pre-commit hooks listed in .pre-commit-config.yaml.
Citation
If you have found hover
useful to your work, please let us know :hugs:
@misc{hover,
title={{hover}: label data at scale},
url={https://github.com/phurwicz/hover},
note={Open software from https://github.com/phurwicz/hover},
author={
Pavel Hurwicz and
Haochuan Wei},
year={2021},
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file hover-0.8.1.tar.gz
.
File metadata
- Download URL: hover-0.8.1.tar.gz
- Upload date:
- Size: 57.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3852ea8ab36f1f4d50f9ab649e2bda3ec2b78ffda8d7a0ba0e3590b5fa2a35fc |
|
MD5 | 4a1db05ae88c79b1c734f2ad292c6ca1 |
|
BLAKE2b-256 | c6f0abd2eaaca4538dcab4a9cc7502031079bf6f1082112cdc92a25e7713125a |
File details
Details for the file hover-0.8.1-py3-none-any.whl
.
File metadata
- Download URL: hover-0.8.1-py3-none-any.whl
- Upload date:
- Size: 64.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f9ab3ff6ce77d328302d32de9c755858b1c23f93550a973e69c5572196de39db |
|
MD5 | 36442a5d3246b691f8188ccaee7b5dc5 |
|
BLAKE2b-256 | f973c882a4d900947955627af28cbab94d1312f38240aabdaf739ea6a936b6ba |