Skip to main content

Label data at scale. Fun and precision included.

Project description

Hover

Explore and label on a map of raw data.

Get enough to feed your model in no time.

PyPI Version Conda Version Downloads Build Status Codacy Grade Codacy Coverage

hover speeds up data labeling through embedding + visualization + callbacks.

  • You just need raw data and an embedding to start.

Demo

:sparkles: Features

It's fast because it labels in bulk.

:telescope: A 2D-embedded view of your dataset for labeling, equipped with

  • Tooltip for each point and table view for groups of points.
  • Search widgets for ad-hoc highlight of data matching search criteria.
  • Toggle buttons that clearly distinguish data subsets ("raw"/"train"/"dev"/"test").

It's accurate because you can filter and extend.

:microscope: Supplementary views to provide further labeling precision, such as

  • Advanced search view which can filter points by search criteria and provides stronger highlight.
  • Active learning view which puts a model in the loop and can filter by confidence score.
  • Function-based view which can leverage custom functions for labeling and filtering.

It's fun because the process never gets old.

  • Explore the map to find out which "zones" are easy and which ones are tricky.
  • Join the conquest of your data by coloring all of those zones through wisdom!

Check out @phurwicz/hover-binder for a list of demo apps.

:rocket: Quickstart

Code + Walkthrough -> Labeling App

  • edit & run code right in your browser, with guides along the way.

Jump to Labeling App

  • interactive plot for labeling data, pre-built and hosted on Binder.

:package: Install

Python: 3.7+

OS: Linux & Mac & Windows

PyPI (for all releases): pip install hover

Conda-forge (for 0.6.0 and above): conda install -c conda-forge hover

For Windows users, we recommend Windows Subsystem for Linux.

:book: Resources

:flags: Project News

  • Feb 25, 2022 version 0.7.0 is now available. Check out the changelog for details :partying_face:. Some tl-dr for the impatient:
    • audio and image support supply audio/image files through URLs to label with hover!
      • any type supported by HTML (and your browser) will be supported here.
    • high-dimensional support you can now use higher-than-2D embeddings.
      • hover still plots in 2D, but you can dynamically choose which two dimension to use.

:bell: Remarks

Shoutouts

  • Thanks to Bokeh because hover would not exist without linked plots and callbacks, or be nearly as good without embeddable server apps.
  • Thanks to Philip Vollet for sharing hover with the community even when it was really green.

Contributing

  • All feedbacks are welcome, especially what you find lacking and want it fixed!
  • ./requirements-dev.txt lists required packages for development.
  • Pull requests are advised to use a superset of the pre-commit hooks listed in .pre-commit-config.yaml.

Citation

If you have found hover useful to your work, please let us know :hugs:

@misc{hover,
  title={{hover}: label data at scale},
  url={https://github.com/phurwicz/hover},
  note={Open software from https://github.com/phurwicz/hover},
  author={
    Pavel Hurwicz and
    Haochuan Wei},
  year={2021},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hover-0.7.0.tar.gz (54.9 kB view details)

Uploaded Source

Built Distribution

hover-0.7.0-py3-none-any.whl (61.6 kB view details)

Uploaded Python 3

File details

Details for the file hover-0.7.0.tar.gz.

File metadata

  • Download URL: hover-0.7.0.tar.gz
  • Upload date:
  • Size: 54.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for hover-0.7.0.tar.gz
Algorithm Hash digest
SHA256 1aae054f90ec869e898affa2f06bed08c1b00531d21f02f1faeafefc19ff6d98
MD5 96d52b8a54c292f23401b49971c76ea0
BLAKE2b-256 fcd43651320e9281b62ca868574a3fcf5082a581f24876c7d50e667d65bd4f04

See more details on using hashes here.

File details

Details for the file hover-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: hover-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 61.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for hover-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ec647d66d58f923b278d218b7a4330ab115272f0e02cb2d28b3e6268b4c30746
MD5 dcdee3d4097d929b9466319fcfc0d83d
BLAKE2b-256 fe8b63ba4debb2401f64ad8921a2ebc720238a15e016fc7cddde2a8e8e53b8c8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page