DVC's data management subsystem

Features

  • TODO

Requirements

  • TODO

Installation

You can install DVC data via pip from PyPI:

$ pip install dvc-data

Usage

HashFile

HashFile

Based on dvc-objects' Object, this is an object with a known hash that can be used to verify its contents. Similar to Git's ShaFile.

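from dvc_data.hashfile import HashFile
from dvc_data.hashfile.hash_info import HashInfo

# fs is a filesystem object that can read the path (not defined here)
obj = HashFile("/path/to/file", fs, HashInfo("md5", "36eba1e1e343279857ea7f69a597324e"))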
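To illustrate what "verifying contents by hash" means, here is a minimal standard-library sketch. It does not use dvc-data: file_md5 and verify are hypothetical helper names, and the real HashFile may perform its check differently.

```python
import hashlib
import os
import tempfile


def file_md5(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fobj:
        for chunk in iter(lambda: fobj.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_md5):
    """Return True if the file's contents match the expected digest."""
    return file_md5(path) == expected_md5


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "file")
    with open(path, "wb") as fobj:
        fobj.write(b"hello")
    expected = hashlib.md5(b"hello").hexdigest()

    ok_before = verify(path, expected)  # contents match the recorded hash

    with open(path, "ab") as fobj:
        fobj.write(b"!")  # simulate corruption after the hash was recorded

    ok_after = verify(path, expected)  # contents no longer match
```

Reading in chunks keeps memory use flat for large files, which matters for the kind of data files a tool like this tracks.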

HashFileDB

Based on dvc-objects' ObjectDB, but it stores HashFile objects, so it can verify their contents against their hash_info. Similar to Git's ObjectStore.

from dvc_data.hashfile import HashFileDB

odb = HashFileDB(fs, "/path/to/odb")
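The idea of a database that can verify its own objects can be sketched with the standard library alone. TinyHashDB below is a hypothetical class, not part of dvc-data, and its on-disk layout (a two-character prefix directory) is an assumption made for illustration.

```python
import hashlib
import os
import tempfile


class TinyHashDB:
    """Hypothetical content-addressed store: objects are named by their MD5."""

    def __init__(self, root):
        self.root = root

    def path_for(self, md5):
        # Assumed layout: the first two hex characters form a subdirectory.
        return os.path.join(self.root, md5[:2], md5[2:])

    def add(self, data):
        """Store `data` under its own digest and return that digest."""
        md5 = hashlib.md5(data).hexdigest()
        path = self.path_for(md5)
        os.makedirs(os.path.dirname(path), exist_ok=True)
        with open(path, "wb") as fobj:
            fobj.write(data)
        return md5

    def check(self, md5):
        """Re-hash the stored object and compare the result with its name."""
        with open(self.path_for(md5), "rb") as fobj:
            return hashlib.md5(fobj.read()).hexdigest() == md5


with tempfile.TemporaryDirectory() as tmp:
    db = TinyHashDB(tmp)
    key = db.add(b"some data")
    intact = db.check(key)

    # Overwrite the stored object to simulate on-disk corruption.
    with open(db.path_for(key), "wb") as fobj:
        fobj.write(b"tampered")
    corrupted = not db.check(key)
```

Because each object's name is derived from its contents, the store needs no separate checksum file: the path itself is the expected hash.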

Index

Index

A trie-like structure that represents data files and directories.

from dvc_data.index import DataIndex, DataIndexEntry

index = DataIndex()
index[("foo",)] = DataIndexEntry(hash_info=hash_info, meta=meta)
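The tuple keys above suggest how a trie-like index works: each element of the key is one path component, and entries that share a prefix share a branch. The following sketch shows that idea with nested dictionaries. TinyTrie is a hypothetical class for illustration and is not how DataIndex is implemented.

```python
class TinyTrie:
    """Hypothetical trie keyed by tuples of path parts."""

    def __init__(self):
        self._root = self._new_node()

    @staticmethod
    def _new_node():
        return {"children": {}, "value": None, "has_value": False}

    def __setitem__(self, key, value):
        node = self._root
        for part in key:
            node = node["children"].setdefault(part, self._new_node())
        node["value"] = value
        node["has_value"] = True

    def __getitem__(self, key):
        node = self._root
        for part in key:
            node = node["children"][part]  # KeyError if the path is missing
        if not node["has_value"]:
            raise KeyError(key)
        return node["value"]

    def items(self, prefix=()):
        """Yield (key, value) pairs at or below `prefix`, in sorted order."""
        node = self._root
        for part in prefix:
            node = node["children"].get(part)
            if node is None:
                return
        stack = [(tuple(prefix), node)]
        while stack:
            key, node = stack.pop()
            if node["has_value"]:
                yield key, node["value"]
            # Push children in reverse so the smallest name is popped first.
            for part in sorted(node["children"], reverse=True):
                stack.append((key + (part,), node["children"][part]))


trie = TinyTrie()
trie[("foo",)] = "file"
trie[("dir",)] = "directory"
trie[("dir", "a")] = "file"
trie[("dir", "b")] = "file"

under_dir = [key for key, _ in trie.items(("dir",))]
```

Listing everything under a directory only walks that directory's branch, which is the main practical advantage over a flat mapping of full paths.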

Storage

A mapping that describes where to find data contents for index entries. Each value is either an ObjectStorage, for HashFileDB-based storage, or a FileStorage, for backup-like plain file storage.

index.storage_map[("foo",)] = ObjectStorage(...)
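One plausible way to think about such a mapping is that an entry inherits the storage registered for its nearest enclosing key. The sketch below shows that longest-prefix lookup. TinyStorageMap and its resolve method are hypothetical, the storage values are plain strings standing in for real storage objects, and whether dvc-data resolves storage this way is an assumption.

```python
class TinyStorageMap:
    """Hypothetical mapping from key prefixes to storage descriptions."""

    def __init__(self):
        self._map = {}

    def __setitem__(self, prefix, storage):
        self._map[tuple(prefix)] = storage

    def resolve(self, key):
        """Return the storage registered for the longest prefix of `key`."""
        key = tuple(key)
        # Try the full key first, then progressively shorter prefixes.
        for end in range(len(key), -1, -1):
            storage = self._map.get(key[:end])
            if storage is not None:
                return storage
        raise KeyError(key)


storage_map = TinyStorageMap()
storage_map[("data",)] = "object storage at /path/to/odb"
storage_map[("data", "raw")] = "plain files at /backup/raw"

nested = storage_map.resolve(("data", "raw", "a.csv"))
fallback = storage_map.resolve(("data", "b.csv"))
```

Here ("data", "raw", "a.csv") picks up the more specific ("data", "raw") storage, while ("data", "b.csv") falls back to the one registered for ("data",).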

Contributing

Contributions are very welcome. To learn more, see the Contributor Guide.

License

Distributed under the terms of the Apache 2.0 license, DVC data is free and open source software.

Issues

If you encounter any problems, please file an issue along with a detailed description.
