Skip to main content

Label data at scale. Fun and precision included.

Project description

Hover

Explore and label on a map of raw data.

Get enough to feed your model in no time.

PyPI Version Conda Version Downloads Main Build Status Nightly Build Status Codacy Grade Codacy Coverage

hover speeds up data labeling through embedding + visualization + callbacks.

  • You just need raw data and an embedding to get a map.
  • Explore the map to find "zones" that are easy and those that are tricky.
  • Start the conquest of your data by coloring the zones through wisdom!

GIF Demo

:rocket: Live Demos

With code

  • edit & run code in your browser to get a labeling interface, with guides along the way.

Without code

  • go directly to a labeling interface, pre-coded and hosted on Binder.

:sparkles: Features (click bullets to show GIF)

It's fast because it labels data in bulk.

:telescope: A semantic scatter plot of your data for labeling, equipped with

  • Tooltip for each point on mouse hover
  • Table view for inspecting all selected points
  • Toggle buttons that clearly distinguish data subsets
  • Search widgets for ad-hoc data highlight

It's accurate because multiple angles work together.

:microscope: Supplementary views to use in conjunction with the annotator, including

  • Finder: filter data by search criteria
  • Softlabel: active learning by in-the-loop model prediction score
  • Snorkel: custom functions for labeling and filtering

It's flexible (and fun!) because the process never gets old.

:toolbox: Additional tools and options that allow you to

  • Go to higher dimensions (3D? 4D?) and choose your xy-axes
  • Consecutively select across areas, dimensions, and views
  • Kick outliers and fix mistakes

:bulb: Click on the bullet points above to view GIFs!

:package: Install

Python: 3.8+

OS: Linux & Mac & Windows

PyPI (for all releases): pip install hover

Conda-forge (for 0.6.0 and above): conda install -c conda-forge hover

For Windows users, we recommend Windows Subsystem for Linux.

:book: Resources

:flags: Announcements

  • Jan 21, 2023 version 0.8.0 is now available. Check out the changelog for details :partying_face:.

:bell: Remarks

Shoutouts

  • Thanks to Bokeh because hover would not exist without linked plots and callbacks, or be nearly as good without embeddable server apps.
  • Thanks to Philip Vollet for sharing hover with the community even when it was really green.

Contributing

  • All feedbacks are welcome, especially what you find lacking and want it fixed!
  • ./requirements-dev.txt lists required packages for development.
  • Pull requests are advised to use a superset of the pre-commit hooks listed in .pre-commit-config.yaml.

Citation

If you have found hover useful to your work, please let us know :hugs:

@misc{hover,
  title={{hover}: label data at scale},
  url={https://github.com/phurwicz/hover},
  note={Open software from https://github.com/phurwicz/hover},
  author={
    Pavel Hurwicz and
    Haochuan Wei},
  year={2021},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hover-0.8.0.tar.gz (57.7 kB view details)

Uploaded Source

Built Distribution

hover-0.8.0-py3-none-any.whl (64.6 kB view details)

Uploaded Python 3

File details

Details for the file hover-0.8.0.tar.gz.

File metadata

  • Download URL: hover-0.8.0.tar.gz
  • Upload date:
  • Size: 57.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for hover-0.8.0.tar.gz
Algorithm Hash digest
SHA256 6d62b7b9bd20c0cef1893e6d6374dc2d2f180c500c9dc8e5c9f295c06bd43a48
MD5 3df4bc81cd6c5021c6504177e502b566
BLAKE2b-256 efde6b682947f8639fbf560639d8b51ae148cd0e5664afd5ef5d94b3a8b6fcf7

See more details on using hashes here.

File details

Details for the file hover-0.8.0-py3-none-any.whl.

File metadata

  • Download URL: hover-0.8.0-py3-none-any.whl
  • Upload date:
  • Size: 64.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.15

File hashes

Hashes for hover-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bc765834db19263a4bf646e033c3f63ba2450a247d8e725517e2496ccf78239c
MD5 28c681316ede92f767eb2fcc2ff301b0
BLAKE2b-256 caad8e4b4035b758762bc69d5b8126a6879ea0754b4b0cb15c501b84cc4f3f14

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page