Skip to main content

DVC's data management subsystem

Project description

PyPI Status Python Version License

Tests Codecov pre-commit Black

Features

  • TODO

Requirements

  • TODO

Installation

You can install DVC data via pip from PyPI:

$ pip install dvc-data

Usage

HashFile

HashFile

Based on dvc-object’s Object, this is an object that has a particular hash that can be used to verify its contents. Similar to git’s ShaFile.

from dvc_data.hashfile import HashFile

obj = HashFile("/path/to/file", fs, HashInfo("md5", "36eba1e1e343279857ea7f69a597324e")

HashFileDB

Based on dvc-object’s ObjectDB, but stores HashFile objects and so is able to verify their contents by their hash_info. Similar to git’s ObjectStore.

from dvc_data.hashfile import HashFileDB

odb = HashFileDB(fs, "/path/to/odb")

Index

Index

A trie-like structure that represents data files and directories.

from dvc_data.index import DataIndex, DataIndexEntry

index = DataIndex()
index[("foo",)] = DataIndexEntry(hash_info=hash_info, meta=meta)

Storage

A mapping that describes where to find data contents for index entries. Can be either ObjectStorage for HashFileDB-based storage or FileStorage for backup-like plain file storage.

index.storage_map[("foo",)] = ObjectStorage(...)

Contributing

Contributions are very welcome. To learn more, see the Contributor Guide.

License

Distributed under the terms of the Apache 2.0 license, DVC data is free and open source software.

Issues

If you encounter any problems, please file an issue along with a detailed description.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dvc_data-3.16.7.tar.gz (82.3 kB view details)

Uploaded Source

Built Distribution

dvc_data-3.16.7-py3-none-any.whl (78.4 kB view details)

Uploaded Python 3

File details

Details for the file dvc_data-3.16.7.tar.gz.

File metadata

  • Download URL: dvc_data-3.16.7.tar.gz
  • Upload date:
  • Size: 82.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dvc_data-3.16.7.tar.gz
Algorithm Hash digest
SHA256 03272532ec538277e341ae29b63884de0b4043ed47bdfbe4f3aa01ea01cd8e2c
MD5 1316049cd4fc705fad5569010d50cf21
BLAKE2b-256 678acf0306b3535236055043a80ceedfc27b4d5d67fc359d413991c24fb93b20

See more details on using hashes here.

Provenance

The following attestation bundles were made for dvc_data-3.16.7.tar.gz:

Publisher: release.yml on iterative/dvc-data

Attestations:

File details

Details for the file dvc_data-3.16.7-py3-none-any.whl.

File metadata

  • Download URL: dvc_data-3.16.7-py3-none-any.whl
  • Upload date:
  • Size: 78.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dvc_data-3.16.7-py3-none-any.whl
Algorithm Hash digest
SHA256 185c5b5e5c0a97b654c9d1c9129da5af4ac26dd66af97818a5f7590882ae8432
MD5 89242ed8ed3fa4cd163c20c07ab99a2b
BLAKE2b-256 c442e23f1d2041be6e327b68587597c311dfd8e3ce7b8bb28874055bb214ed4b

See more details on using hashes here.

Provenance

The following attestation bundles were made for dvc_data-3.16.7-py3-none-any.whl:

Publisher: release.yml on iterative/dvc-data

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page