Skip to main content

Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era.

Project description

docs

Documentation Status

tests

Travis-CI Build Status Code Coverage

package

PyPI Package latest release PyPI Wheel Conda-Forge Latest Version
Supported versions Supported implementations
GitHub license

Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era.

  • Free software: Apache 2.0 license

What is Hangar?

Hangar is based off the belief that too much time is spent collecting, managing, and creating home-brewed version control systems for data. At it’s core Hangar is designed to solve many of the same problems faced by traditional code version control system (ie. Git), just adapted for numerical data:

  • Time travel through the historical evolution of a dataset.

  • Zero-cost Branching to enable exploratory analysis and collaboration

  • Cheap Merging to build datasets over time (with multiple collaborators)

  • Completely abstracted organization and management of data files on disk

  • Ability to only retrieve a small portion of the data (as needed) while still maintaining complete historical record

  • Ability to push and pull changes directly to collaborators or a central server (ie a truly distributed version control system)

The ability of version control systems to perform these tasks for codebases is largely taken for granted by almost every developer today; However, we are in-fact standing on the shoulders of giants, with decades of engineering which has resulted in these phenomenally useful tools. Now that a new era of “Data-Defined software” is taking hold, we find there is a strong need for analogous version control systems which are designed to handle numerical data at large scale… Welcome to Hangar!

The Hangar Workflow:

   Checkout Branch
          |
          ▼
 Create/Access Data
          |
          ▼
Add/Remove/Update Samples
          |
          ▼
       Commit

Log Style Output:

*   5254ec (master) : merge commit combining training updates and new validation samples
|\
| * 650361 (add-validation-data) : Add validation labels and image data in isolated branch
* | 5f15b4 : Add some metadata for later reference and add new training samples received after initial import
|/
*   baddba : Initial commit adding training images and labels

Learn more about what Hangar is all about at https://hangar-py.readthedocs.io/

Installation

Hangar is in early alpha development release!

pip install hangar

Documentation

https://hangar-py.readthedocs.io/

Development

To run the all tests run:

tox

Note, to combine the coverage data from all the tox environments run:

Windows

set PYTEST_ADDOPTS=--cov-append
tox

Other

PYTEST_ADDOPTS=--cov-append tox

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hangar-0.4.0b0.tar.gz (645.6 kB view details)

Uploaded Source

Built Distribution

hangar-0.4.0b0-py2.py3-none-any.whl (190.4 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file hangar-0.4.0b0.tar.gz.

File metadata

  • Download URL: hangar-0.4.0b0.tar.gz
  • Upload date:
  • Size: 645.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.3

File hashes

Hashes for hangar-0.4.0b0.tar.gz
Algorithm Hash digest
SHA256 c8ce744574fadcb937826d41c22c0881891ce1ac6fd1b9d9ebc207f297d7fc74
MD5 a894d519ee247d5607ad42166e039e7a
BLAKE2b-256 cece780fc50336084e65a7c05b63154a2f4e657c9a36029b29cb9f47e1475ab4

See more details on using hashes here.

File details

Details for the file hangar-0.4.0b0-py2.py3-none-any.whl.

File metadata

  • Download URL: hangar-0.4.0b0-py2.py3-none-any.whl
  • Upload date:
  • Size: 190.4 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.3

File hashes

Hashes for hangar-0.4.0b0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 1b247642382a785752ec37fcaccbd2eb55fc9a39750bda9b4ee810f74a8a1ca9
MD5 7f1a1dce6c53a9c49314c1dc6f23b779
BLAKE2b-256 e47f58e09d9f1b9cc9ac3fbbef70413539a32a0a150f48164887941a6c5a2a4f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page